Model parameters: d_model 2560 ffw_size 10240 kv_size 128 n_heads 20 n_layers 34 Megatron-DeepSpeed/pretrain_gpt.py --tensor-model-parallel-size 1 --pipeline-model-parallel-size 1 --num-layers 34 --hidden-size 2560 --num-attention-heads 20 --kv-channels 128 --ffn-hidden-size 10240 --seq-length 2048 --max-position-embeddings 2048 --micro-batch-size 2 --global-batch-size 512 --train-samples 1 --vocab-file gpt2/vocab.json --merge-file gpt2/merges.txt --clip-grad 1.0 --kill-switch-path kill-switch-2b855b55bperplexityval --bf16 --optimizer adam --adam-beta1 0.9 --adam-beta2 0.999 --adam-eps 1e-8 --lr 2e-4 --min-lr 2e-5 --lr-decay-style cosine --lr-decay-samples 1 --lr-warmup-samples 0 --clip-grad 1.0 --weight-decay 1e-1 --override-lr-scheduler --reset-progress --no-load-optim --log-interval 10 --save-interval 1000 --eval-interval 1 --eval-iters 100 --eval-only true --tensorboard-dir tensorboard_2b855b55bperplexityval --tensorboard-queue-size 5 --log-timers-to-tensorboard --log-batch-size-to-tensorboard --log-validation-ppl-to-tensorboard --save lm1-2b8-55b-c4-perplexity --load lm1-2b8-55b-c4-perplexity --train-weighted-split-paths-path train1b5.txt --valid-weighted-split-paths-path val.txt --data-impl mmap --deepspeed --deepspeed_config ds_configs/3489889.json --zero-stage 0 START 3489889: Wed 10 May 2023 10:05:59 AM EEST 0: 0: 0: ======================= ROCm System Management Interface ======================= 0: ================================= Concise Info ================================= 0: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 0: 0 45.0c 94.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 0: 1 44.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 0: 2 42.0c 83.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 0: 3 44.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 0: 4 44.0c 85.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 0: 5 47.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 0: 6 38.0c 87.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 0: 7 46.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 0: ================================================================================ 0: ============================= End of ROCm SMI Log ============================== 7: 7: 7: ======================= ROCm System Management Interface ======================= 7: ================================= Concise Info ================================= 7: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 7: 0 48.0c 97.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 7: 1 48.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 7: 2 46.0c 93.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 7: 3 49.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 7: 4 50.0c 88.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 7: 5 46.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 7: 6 44.0c 96.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 7: 7 44.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 7: ================================================================================ 7: ============================= End of ROCm SMI Log ============================== 21: 21: 21: ======================= ROCm System Management Interface ======================= 21: ================================= Concise Info ================================= 21: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 21: 0 47.0c 90.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 21: 1 46.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 21: 2 36.0c 89.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 21: 3 45.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 21: 4 42.0c 88.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 21: 5 46.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 21: 6 39.0c 90.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 21: 7 40.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 21: ================================================================================ 21: ============================= End of ROCm SMI Log ============================== 4: 4: 4: ======================= ROCm System Management Interface ======================= 4: ================================= Concise Info ================================= 4: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 4: 0 48.0c 94.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 4: 1 45.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 4: 2 43.0c 88.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 4: 3 48.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 4: 4 38.0c 90.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 4: 5 43.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 4: 6 43.0c 90.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 4: 7 43.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 4: ================================================================================ 4: ============================= End of ROCm SMI Log ============================== 25: 25: 25: ======================= ROCm System Management Interface ======================= 25: ================================= Concise Info ================================= 25: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 25: 0 45.0c 94.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 25: 1 45.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 25: 2 41.0c 89.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 25: 3 43.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 25: 4 42.0c 89.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 25: 5 46.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 25: 6 39.0c 96.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 25: 7 46.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 25: ================================================================================ 25: ============================= End of ROCm SMI Log ============================== 30: 30: 30: ======================= ROCm System Management Interface ======================= 30: ================================= Concise Info ================================= 30: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 30: 0 45.0c 94.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 30: 1 49.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 30: 2 42.0c 89.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 30: 3 44.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 30: 4 41.0c 85.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 30: 5 46.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 30: 6 44.0c 88.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 30: 7 44.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 30: ================================================================================ 30: ============================= End of ROCm SMI Log ============================== 29: 29: 29: ======================= ROCm System Management Interface ======================= 29: ================================= Concise Info ================================= 29: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 29: 0 47.0c 90.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 29: 1 47.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 29: 2 44.0c 95.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 29: 3 41.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 29: 4 48.0c 92.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 29: 5 44.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 29: 6 48.0c 100.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 29: 7 41.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 29: ================================================================================ 29: ============================= End of ROCm SMI Log ============================== 11: 11: 11: ======================= ROCm System Management Interface ======================= 11: ================================= Concise Info ================================= 11: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 11: 0 42.0c 98.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 11: 1 50.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 11: 2 40.0c 95.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 11: 3 45.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 11: 4 43.0c 93.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 11: 5 45.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 11: 6 43.0c 86.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 11: 7 43.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 11: ================================================================================ 11: ============================= End of ROCm SMI Log ============================== 18: 18: 18: ======================= ROCm System Management Interface ======================= 18: ================================= Concise Info ================================= 18: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 18: 0 45.0c 91.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 18: 1 48.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 18: 2 41.0c 93.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 18: 3 42.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 18: 4 44.0c 96.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 18: 5 42.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 18: 6 43.0c 92.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 18: 7 42.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 18: ================================================================================ 18: ============================= End of ROCm SMI Log ============================== 16: 16: 16: ======================= ROCm System Management Interface ======================= 16: ================================= Concise Info ================================= 16: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 16: 0 44.0c 88.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 16: 1 47.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 16: 2 38.0c 93.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 16: 3 41.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 16: 4 42.0c 94.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 16: 5 49.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 16: 6 43.0c 88.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 16: 7 44.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 16: ================================================================================ 16: ============================= End of ROCm SMI Log ============================== 17: 17: 17: ======================= ROCm System Management Interface ======================= 17: ================================= Concise Info ================================= 17: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 17: 0 41.0c 91.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 17: 1 47.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 17: 2 40.0c 88.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 17: 3 45.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 17: 4 47.0c 87.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 17: 5 44.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 17: 6 36.0c 91.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 17: 7 42.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 17: ================================================================================ 17: ============================= End of ROCm SMI Log ============================== 2: 2: 2: ======================= ROCm System Management Interface ======================= 2: ================================= Concise Info ================================= 2: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 2: 0 49.0c 91.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 2: 1 47.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 2: 2 44.0c 80.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 2: 3 43.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 2: 4 38.0c 91.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 2: 5 49.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 2: 6 45.0c 91.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 2: 7 47.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 2: ================================================================================ 2: ============================= End of ROCm SMI Log ============================== 27: 27: 27: ======================= ROCm System Management Interface ======================= 27: ================================= Concise Info ================================= 27: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 27: 0 46.0c 92.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 27: 1 49.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 27: 2 42.0c 91.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 27: 3 44.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 27: 4 46.0c 89.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 27: 5 49.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 27: 6 40.0c 87.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 27: 7 43.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 27: ================================================================================ 27: ============================= End of ROCm SMI Log ============================== 26: 26: 26: ======================= ROCm System Management Interface ======================= 26: ================================= Concise Info ================================= 26: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 26: 0 46.0c 92.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 26: 1 47.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 26: 2 42.0c 90.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 26: 3 48.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 26: 4 37.0c 92.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 26: 5 46.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 26: 6 40.0c 93.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 26: 7 45.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 26: ================================================================================ 26: ============================= End of ROCm SMI Log ============================== 1: 1: 1: ======================= ROCm System Management Interface ======================= 1: ================================= Concise Info ================================= 1: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 1: 0 39.0c 97.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 1: 1 45.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 1: 2 41.0c 90.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 1: 3 42.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 1: 4 41.0c 86.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 1: 5 43.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 1: 6 45.0c 85.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 1: 7 45.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 1: ================================================================================ 1: ============================= End of ROCm SMI Log ============================== 10: 10: 10: ======================= ROCm System Management Interface ======================= 10: ================================= Concise Info ================================= 10: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 10: 0 49.0c 85.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 10: 1 46.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 10: 2 35.0c 87.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 10: 3 48.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 10: 4 44.0c 90.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 10: 5 41.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 10: 6 40.0c 88.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 10: 7 43.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 10: ================================================================================ 10: ============================= End of ROCm SMI Log ============================== 9: 9: 9: ======================= ROCm System Management Interface ======================= 9: ================================= Concise Info ================================= 9: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 9: 0 48.0c 89.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 9: 1 45.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 9: 2 42.0c 93.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 9: 3 43.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 9: 4 46.0c 95.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 9: 5 50.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 9: 6 40.0c 89.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 9: 7 39.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 9: ================================================================================ 9: ============================= End of ROCm SMI Log ============================== 12: 12: 12: ======================= ROCm System Management Interface ======================= 12: ================================= Concise Info ================================= 12: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 12: 0 42.0c 91.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 12: 1 45.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 12: 2 39.0c 96.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 12: 3 39.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 12: 4 41.0c 87.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 12: 5 45.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 12: 6 41.0c 98.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 12: 7 48.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 12: ================================================================================ 12: ============================= End of ROCm SMI Log ============================== 19: 19: 19: ======================= ROCm System Management Interface ======================= 19: ================================= Concise Info ================================= 19: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 19: 0 44.0c 93.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 19: 1 50.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 19: 2 43.0c 88.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 19: 3 45.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 19: 4 43.0c 87.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 19: 5 47.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 19: 6 37.0c 95.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 19: 7 44.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 19: ================================================================================ 19: ============================= End of ROCm SMI Log ============================== 3: 3: 3: ======================= ROCm System Management Interface ======================= 3: ================================= Concise Info ================================= 3: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 3: 0 46.0c 92.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 3: 1 47.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 3: 2 41.0c 88.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 3: 3 45.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 3: 4 46.0c 78.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 3: 5 45.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 3: 6 46.0c 86.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 3: 7 46.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 3: ================================================================================ 3: ============================= End of ROCm SMI Log ============================== 24: 24: 24: ======================= ROCm System Management Interface ======================= 24: ================================= Concise Info ================================= 24: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 24: 0 45.0c 90.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 24: 1 40.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 24: 2 40.0c 95.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 24: 3 41.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 24: 4 47.0c 95.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 24: 5 45.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 24: 6 41.0c 87.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 24: 7 44.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 24: ================================================================================ 24: ============================= End of ROCm SMI Log ============================== 31: 31: 31: ======================= ROCm System Management Interface ======================= 31: ================================= Concise Info ================================= 31: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 31: 0 45.0c 89.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 31: 1 46.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 31: 2 45.0c 86.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 31: 3 43.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 31: 4 42.0c 93.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 31: 5 47.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 31: 6 43.0c 97.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 31: 7 48.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 31: ================================================================================ 31: ============================= End of ROCm SMI Log ============================== 23: 23: 23: ======================= ROCm System Management Interface ======================= 23: ================================= Concise Info ================================= 23: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 23: 0 47.0c 93.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 23: 1 38.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 23: 2 40.0c 99.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 23: 3 42.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 23: 4 45.0c 89.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 23: 5 46.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 23: 6 41.0c 94.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 23: 7 45.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 23: ================================================================================ 23: ============================= End of ROCm SMI Log ============================== 28: 28: 28: ======================= ROCm System Management Interface ======================= 28: ================================= Concise Info ================================= 28: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 28: 0 46.0c 92.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 28: 1 50.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 28: 2 42.0c 92.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 28: 3 43.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 28: 4 43.0c 87.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 28: 5 45.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 28: 6 43.0c 94.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 28: 7 40.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 28: ================================================================================ 28: ============================= End of ROCm SMI Log ============================== 22: 22: 22: ======================= ROCm System Management Interface ======================= 22: ================================= Concise Info ================================= 22: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 22: 0 43.0c 93.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 22: 1 44.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 22: 2 45.0c 85.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 22: 3 46.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 22: 4 43.0c 82.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 22: 5 48.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 22: 6 42.0c 94.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 22: 7 40.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 22: ================================================================================ 22: ============================= End of ROCm SMI Log ============================== 8: 8: 8: ======================= ROCm System Management Interface ======================= 8: ================================= Concise Info ================================= 8: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 8: 0 47.0c 88.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 8: 1 46.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 8: 2 41.0c 90.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 8: 3 44.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 8: 4 48.0c 95.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 8: 5 42.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 8: 6 43.0c 96.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 8: 7 41.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 8: ================================================================================ 8: ============================= End of ROCm SMI Log ============================== 5: 5: 5: ======================= ROCm System Management Interface ======================= 5: ================================= Concise Info ================================= 5: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 5: 0 46.0c 93.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 5: 1 41.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 5: 2 44.0c 92.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 5: 3 42.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 5: 4 43.0c 88.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 5: 5 43.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 5: 6 45.0c 88.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 5: 7 40.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 5: ================================================================================ 5: ============================= End of ROCm SMI Log ============================== 13: 13: 13: ======================= ROCm System Management Interface ======================= 13: ================================= Concise Info ================================= 13: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 13: 0 45.0c 90.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 13: 1 47.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 13: 2 38.0c 88.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 13: 3 47.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 13: 4 49.0c 83.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 13: 5 49.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 13: 6 39.0c 84.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 13: 7 46.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 13: ================================================================================ 13: ============================= End of ROCm SMI Log ============================== 6: 6: 6: ======================= ROCm System Management Interface ======================= 6: ================================= Concise Info ================================= 6: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 6: 0 50.0c 90.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 6: 1 45.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 6: 2 40.0c 92.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 6: 3 43.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 6: 4 46.0c 92.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 6: 5 50.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 6: 6 41.0c 86.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 6: 7 46.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 6: ================================================================================ 6: ============================= End of ROCm SMI Log ============================== 14: 14: 14: ======================= ROCm System Management Interface ======================= 14: ================================= Concise Info ================================= 14: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 14: 0 47.0c 84.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 14: 1 51.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 14: 2 40.0c 84.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 14: 3 46.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 14: 4 41.0c 83.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 14: 5 49.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 14: 6 39.0c 86.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 14: 7 43.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 14: ================================================================================ 14: ============================= End of ROCm SMI Log ============================== 15: 15: 15: ======================= ROCm System Management Interface ======================= 15: ================================= Concise Info ================================= 15: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 15: 0 44.0c 92.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 15: 1 48.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 15: 2 48.0c 86.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 15: 3 43.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 15: 4 44.0c 89.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 15: 5 49.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 15: 6 43.0c 91.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 15: 7 48.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 15: ================================================================================ 15: ============================= End of ROCm SMI Log ============================== 20: 20: 20: ======================= ROCm System Management Interface ======================= 20: ================================= Concise Info ================================= 20: GPU Temp AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 20: 0 42.0c 95.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 20: 1 45.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 20: 2 43.0c 87.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 20: 3 44.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 20: 4 39.0c 94.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 20: 5 46.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 20: 6 40.0c 88.0W 800Mhz 1600Mhz 0% auto 560.0W 0% 0% 20: 7 44.0c N/A 800Mhz 1600Mhz 0% auto 0.0W 0% 0% 20: ================================================================================ 20: ============================= End of ROCm SMI Log ============================== 3: Launching on nid005440 (3/32), master nid005437 port 9999, GPUs 8, CUDA: True 25: Launching on nid007185 (25/32), master nid005437 port 9999, GPUs 8, CUDA: True 16: Launching on nid007176 (16/32), master nid005437 port 9999, GPUs 8, CUDA: True 4: Launching on nid005441 (4/32), master nid005437 port 9999, GPUs 8, CUDA: True 18: Launching on nid007178 (18/32), master nid005437 port 9999, GPUs 8, CUDA: True 30: Launching on nid007190 (30/32), master nid005437 port 9999, GPUs 8, CUDA: True 1: Launching on nid005438 (1/32), master nid005437 port 9999, GPUs 8, CUDA: True 9: Launching on nid007169 (9/32), master nid005437 port 9999, GPUs 8, CUDA: True 29: Launching on nid007189 (29/32), master nid005437 port 9999, GPUs 8, CUDA: True 21: Launching on nid007181 (21/32), master nid005437 port 9999, GPUs 8, CUDA: True 27: Launching on nid007187 (27/32), master nid005437 port 9999, GPUs 8, CUDA: True 24: Launching on nid007184 (24/32), master nid005437 port 9999, GPUs 8, CUDA: True 19: Launching on nid007179 (19/32), master nid005437 port 9999, GPUs 8, CUDA: True 26: Launching on nid007186 (26/32), master nid005437 port 9999, GPUs 8, CUDA: True 2: Launching on nid005439 (2/32), master nid005437 port 9999, GPUs 8, CUDA: True 12: Launching on nid007172 (12/32), master nid005437 port 9999, GPUs 8, CUDA: True 10: Launching on nid007170 (10/32), master nid005437 port 9999, GPUs 8, CUDA: True 7: Launching on nid007167 (7/32), master nid005437 port 9999, GPUs 8, CUDA: True 17: Launching on nid007177 (17/32), master nid005437 port 9999, GPUs 8, CUDA: True 11: Launching on nid007171 (11/32), master nid005437 port 9999, GPUs 8, CUDA: True 22: Launching on nid007182 (22/32), master nid005437 port 9999, GPUs 8, CUDA: True 14: Launching on nid007174 (14/32), master nid005437 port 9999, GPUs 8, CUDA: True 31: Launching on nid007191 (31/32), master nid005437 port 9999, GPUs 8, CUDA: True 13: Launching on nid007173 (13/32), master nid005437 port 9999, GPUs 8, CUDA: True 6: Launching on nid005443 (6/32), master nid005437 port 9999, GPUs 8, CUDA: True 8: Launching on nid007168 (8/32), master nid005437 port 9999, GPUs 8, CUDA: True 23: Launching on nid007183 (23/32), master nid005437 port 9999, GPUs 8, CUDA: True 28: Launching on nid007188 (28/32), master nid005437 port 9999, GPUs 8, CUDA: True 5: Launching on nid005442 (5/32), master nid005437 port 9999, GPUs 8, CUDA: True 0: Launching on nid005437 (0/32), master nid005437 port 9999, GPUs 8, CUDA: True 15: Launching on nid007175 (15/32), master nid005437 port 9999, GPUs 8, CUDA: True 20: Launching on nid007180 (20/32), master nid005437 port 9999, GPUs 8, CUDA: True 0: using world size: 256, data-parallel-size: 256, tensor-model-parallel size: 1, pipeline-model-parallel size: 1 0: accumulate and all-reduce gradients in fp32 for bfloat16 data type. 0: using torch.bfloat16 for parameters ... 0: ------------------------ arguments ------------------------ 0: abort_on_unmet_fused_kernel_constraints ......... False 0: accumulate_allreduce_grads_in_fp32 .............. True 0: adam_beta1 ...................................... 0.9 0: adam_beta2 ...................................... 0.999 0: adam_eps ........................................ 1e-08 0: adlr_autoresume ................................. False 0: adlr_autoresume_interval ........................ 1000 0: apply_query_key_layer_scaling ................... True 0: apply_residual_connection_post_layernorm ........ False 0: attention_dropout ............................... 0.1 0: attention_softmax_in_fp32 ....................... False 0: bert_binary_head ................................ True 0: bert_load ....................................... None 0: bf16 ............................................ True 0: bias_dropout_fusion ............................. True 0: bias_gelu_fusion ................................ True 0: biencoder_projection_dim ........................ 0 0: biencoder_shared_query_context_model ............ False 0: block_data_path ................................. None 0: checkpoint_activations .......................... False 0: checkpoint_in_cpu ............................... False 0: checkpoint_num_layers ........................... 1 0: clip_grad ....................................... 1.0 0: codecarbon_dir .................................. None 0: consumed_train_samples .......................... 0 0: consumed_train_tokens ........................... 0 0: consumed_valid_samples .......................... 0 0: contigious_checkpointing ........................ False 0: cpu_optimizer ................................... False 0: cpu_torch_adam .................................. False 0: curriculum_learning ............................. False 0: data_impl ....................................... mmap 0: data_parallel_size .............................. 256 0: data_path ....................................... None 0: dataloader_type ................................. single 0: DDP_impl ........................................ local 0: decoder_seq_length .............................. None 0: deepscale ....................................... False 0: deepscale_config ................................ None 0: deepspeed ....................................... True 0: deepspeed_activation_checkpointing .............. False 0: deepspeed_config ................................ ds_configs/3489889.json 0: deepspeed_mpi ................................... False 0: distribute_checkpointed_activations ............. False 0: distributed_backend ............................. nccl 0: embed_layernorm ................................. False 0: embedding_path .................................. None 0: encoder_seq_length .............................. 2048 0: eod_mask_loss ................................... False 0: eval_interval ................................... 1 0: eval_iters ...................................... 100 0: eval_only ....................................... True 0: evidence_data_path .............................. None 0: exit_duration_in_mins ........................... None 0: exit_interval ................................... None 0: ffn_hidden_size ................................. 10240 0: finetune ........................................ False 0: fp16 ............................................ False 0: fp16_lm_cross_entropy ........................... False 0: fp32_residual_connection ........................ False 0: gigaflos_no_embeds .............................. 0 0: global_batch_size ............................... 512 0: glu_activation .................................. None 0: hidden_dropout .................................. 0.1 0: hidden_size ..................................... 2560 0: hysteresis ...................................... 2 0: ict_head_size ................................... None 0: ict_load ........................................ None 0: img_dim ......................................... 224 0: indexer_batch_size .............................. 128 0: indexer_log_interval ............................ 1000 0: inference ....................................... False 0: init_method_std ................................. 0.02 0: init_method_xavier_uniform ...................... False 0: initial_loss_scale .............................. 4294967296 0: kill_switch_path ................................ kill-switch-2b855b55bperplexityval 0: kv_channels ..................................... 128 0: layer_norm_fusion ............................... True 0: layernorm_epsilon ............................... 1e-05 0: lazy_mpu_init ................................... None 0: load ............................................ lm1-2b8-55b-c4-perplexity 0: local_rank ...................................... None 0: log_batch_size_to_tensorboard ................... True 0: log_interval .................................... 10 0: log_learning_rate_to_tensorboard ................ True 0: log_level ....................................... None 0: log_level_replica ............................... None 0: log_loss_scale_to_tensorboard ................... True 0: log_num_zeros_in_grad ........................... False 0: log_params_norm ................................. False 0: log_path ........................................ None 0: log_timers_to_tensorboard ....................... True 0: log_validation_ppl_to_tensorboard ............... True 0: loss_on_targets_only ............................ False 0: loss_scale ...................................... None 0: loss_scale_window ............................... 1000 0: lr .............................................. 0.0002 0: lr_decay_iters .................................. None 0: lr_decay_samples ................................ 1 0: lr_decay_style .................................. cosine 0: lr_decay_tokens ................................. None 0: lr_warmup_fraction .............................. None 0: lr_warmup_iters ................................. 0 0: lr_warmup_samples ............................... 0 0: make_vocab_size_divisible_by .................... 128 0: mask_prob ....................................... 0.15 0: masked_softmax_fusion ........................... True 0: max_position_embeddings ......................... 2048 0: mean_noise_span_length .......................... None 0: memory_centric_tiled_linear ..................... False 0: merge_file ...................................... gpt2/merges.txt 0: micro_batch_size ................................ 2 0: min_loss_scale .................................. 1.0 0: min_lr .......................................... 2e-05 0: mmap_warmup ..................................... False 0: no_load_optim ................................... True 0: no_load_rng ..................................... None 0: no_save_optim ................................... None 0: no_save_rng ..................................... None 0: noise_density ................................... None 0: num_attention_heads ............................. 20 0: num_channels .................................... 3 0: num_classes ..................................... 1000 0: num_layers ...................................... 34 0: num_layers_per_virtual_pipeline_stage ........... None 0: num_workers ..................................... 2 0: onnx_safe ....................................... None 0: openai_gelu ..................................... False 0: optimizer ....................................... adam 0: optimizer_fusion ................................ True 0: override_lr_scheduler ........................... True 0: pad_vocab_size_to ............................... None 0: params_dtype .................................... torch.bfloat16 0: partition_activations ........................... False 0: patch_dim ....................................... 16 0: pipeline_model_parallel_size .................... 1 0: position_embedding_type ......................... PositionEmbeddingType.absolute 0: pp_partition_method ............................. None 0: profile_backward ................................ False 0: query_in_block_prob ............................. 0.1 0: rampup_batch_size ............................... None 0: rank ............................................ 0 0: remote_device ................................... none 0: reset_attention_mask ............................ False 0: reset_position_ids .............................. False 0: reset_progress .................................. True 0: retriever_report_topk_accuracies ................ [] 0: retriever_score_scaling ......................... False 0: retriever_seq_length ............................ 256 0: reweight_loss_based_on_position_frequency ....... False 0: sample_rate ..................................... 1.0 0: save ............................................ lm1-2b8-55b-c4-perplexity 0: save_interval ................................... 1000 0: scatter_gather_tensors_in_pipeline .............. True 0: scattered_embeddings ............................ False 0: seed ............................................ 1234 0: seq_length ...................................... 2048 0: sgd_momentum .................................... 0.9 0: short_seq_prob .................................. 0.1 0: skip_train_iteration_range ...................... None 0: split ........................................... None 0: split_transformers .............................. False 0: sync_tp_duplicated_parameters ................... False 0: synchronize_each_layer .......................... False 0: tensor_model_parallel_size ...................... 1 0: tensorboard_dir ................................. tensorboard_2b855b55bperplexityval 0: tensorboard_log_interval ........................ 1 0: tensorboard_queue_size .......................... 5 0: test_weighted_split_paths ....................... None 0: test_weighted_split_paths_path .................. None 0: tile_factor ..................................... 1 0: titles_data_path ................................ None 0: tokenizer_name_or_path .......................... None 0: tokenizer_type .................................. GPT2BPETokenizer 0: train_iters ..................................... None 0: train_samples ................................... 1 0: train_tokens .................................... None 0: train_weighted_split_names ...................... ['train'] 0: train_weighted_split_paths ...................... [['/scratch/project_462000119/data/c4_subsampled/gpt2tok_c4_en_1B5_text_document']] 0: train_weighted_split_paths_path ................. None 0: train_weighted_split_splits ..................... [['0:1']] 0: train_weighted_split_weights .................... [['1.0']] 0: universal_checkpoint ............................ False 0: use_bnb_optimizer ............................... False 0: use_checkpoint_lr_scheduler ..................... False 0: use_contiguous_buffers_in_ddp ................... True 0: use_cpu_initialization .......................... None 0: use_one_sent_docs ............................... False 0: use_pin_memory .................................. False 0: valid_num_workers ............................... 2 0: valid_weighted_split_names ...................... ['validation'] 0: valid_weighted_split_paths ...................... [['/scratch/project_462000119/data/c4_validation/gpt2tok_c4validation_rerun_text_document']] 0: valid_weighted_split_paths_path ................. None 0: valid_weighted_split_splits ..................... [['0:1']] 0: valid_weighted_split_weights .................... [['1.0']] 0: virtual_pipeline_model_parallel_size ............ None 0: vocab_extra_ids ................................. 0 0: vocab_file ...................................... gpt2/vocab.json 0: weight_decay .................................... 0.1 0: world_size ...................................... 256 0: zero_allgather_bucket_size ...................... 0.0 0: zero_contigious_gradients ....................... False 0: zero_reduce_bucket_size ......................... 0.0 0: zero_reduce_scatter ............................. False 0: zero_stage ...................................... 0 0: -------------------- end of arguments --------------------- 0: setting number of micro-batches to constant 1 0: > building GPT2BPETokenizer tokenizer ... 31: > setting tensorboard ... 0: > padded vocab (size: 50257) with 47 dummy tokens (new size: 50304) 0: DeepSpeed general environment info: 0: torch install path ............... ['/pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/venv/lib/python3.9/site-packages/torch'] 0: torch version .................... 1.13.0+rocm5.2 0: torch cuda version ............... None 0: torch hip version ................ 5.2.21151-afdc89f8 0: nvcc version ..................... None 0: deepspeed install path ........... ['/pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/venv/lib/python3.9/site-packages/deepspeed'] 0: deepspeed info ................... 0.7.5, unknown, unknown 0: deepspeed wheel compiled w. ...... torch 1.13, hip 5.1 0: **** Git info for Megatron: git_hash=unknown git_branch=unknown **** 0: > initializing torch distributed ... 0: [2023-05-10 10:09:25,996] [INFO] [comm.py:633:init_distributed] Initializing TorchBackend in DeepSpeed with backend nccl 0: > initializing tensor model parallel with size 1 0: > initializing pipeline model parallel with size 1 0: > setting random seeds to 1234 ... 0: > initializing model parallel cuda seeds on global rank 0, model parallel rank 0, and data parallel rank 0 with model parallel seed: 3952 and data parallel seed: 1234 0: > compiling dataset index builder ... 0: make: Entering directory '/pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/data' 0: make: Nothing to be done for 'default'. 0: make: Leaving directory '/pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/data' 0: >>> done with dataset index builder. Compilation time: 0.092 seconds 0: > compiling and loading fused kernels ... 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_upper_triang_masked_softmax.cpp -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_upper_triang_masked_softmax_hip.cpp [skipped, already hipified] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_upper_triang_masked_softmax.h -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_upper_triang_masked_softmax_hip.h [skipped, already hipified] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/compat.h -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/compat.h [skipped, no changes] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/type_shim.h -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/type_shim.h [skipped, no changes] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_upper_triang_masked_softmax_cuda.cu -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_upper_triang_masked_softmax_hip.hip [skipped, already hipified] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/type_shim.h -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/type_shim.h [skipped, no changes] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/compat.h -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/compat.h [skipped, no changes] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_upper_triang_masked_softmax.h -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_upper_triang_masked_softmax_hip.h [skipped, already hipified] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_masked_softmax.h -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_masked_softmax_hip.h [skipped, already hipified] 0: Total number of unsupported CUDA function calls: 0 0: 0: 0: Total number of replaced kernel launches: 87 0: ninja: no work to do. 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_masked_softmax.cpp -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_masked_softmax_hip.cpp [skipped, already hipified] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_masked_softmax_cuda.cu -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_masked_softmax_hip.hip [skipped, already hipified] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/type_shim.h -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/type_shim.h [skipped, no changes] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/compat.h -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/compat.h [skipped, no changes] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_upper_triang_masked_softmax.h -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_upper_triang_masked_softmax_hip.h [skipped, already hipified] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_masked_softmax.h -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_masked_softmax_hip.h [skipped, already hipified] 0: Total number of unsupported CUDA function calls: 0 0: 0: 0: Total number of replaced kernel launches: 63 0: ninja: no work to do. 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/layer_norm_cuda.cpp -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/layer_norm_cuda.cpp [skipped, no changes] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/layer_norm_cuda_kernel.cu -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/layer_norm_hip_kernel.hip [skipped, already hipified] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/type_shim.h -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/type_shim.h [skipped, no changes] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/compat.h -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/compat.h [skipped, no changes] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_upper_triang_masked_softmax.h -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_upper_triang_masked_softmax_hip.h [skipped, already hipified] 0: /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_masked_softmax.h -> /pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/Megatron-DeepSpeed/megatron/fused_kernels/scaled_masked_softmax_hip.h [skipped, already hipified] 0: Total number of unsupported CUDA function calls: 0 0: 0: 0: Total number of replaced kernel launches: 67 0: [1/1] c++ layer_norm_cuda.o layer_norm_hip_kernel.cuda.o -shared -L/pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/venv/lib/python3.9/site-packages/torch/lib -lc10 -lc10_hip -ltorch_cpu -ltorch_hip -ltorch -ltorch_python -L/opt/rocm/lib -lamdhip64 -o fused_mix_prec_layer_norm_cuda.so 0: >>> done with compiling and loading fused kernels. Compilation time: 35.727 seconds 0: time to initialize megatron (seconds): 85.114 0: [after megatron is initialized] datetime: 2023-05-10 10:10:13 0: building GPT model ... 0: [2023-05-10 10:10:13,275] [INFO] [utils.py:827:see_memory_usage] Before Building Model 0: [2023-05-10 10:10:13,276] [INFO] [utils.py:828:see_memory_usage] MA 0.0 GB Max_MA 0.0 GB CA 0.0 GB Max_CA 0 GB 0: [2023-05-10 10:10:13,276] [INFO] [utils.py:836:see_memory_usage] CPU Virtual Memory: used = 40.1 GB, percent = 8.0% 0: SEED_LAYERS=False BASE_SEED=1234 SEED_FN=None 0: Using topology: {ProcessCoord(pipe=0, data=0, model=0): 0, ProcessCoord(pipe=0, data=1, model=0): 1, ProcessCoord(pipe=0, data=2, model=0): 2, ProcessCoord(pipe=0, data=3, model=0): 3, ProcessCoord(pipe=0, data=4, model=0): 4, ProcessCoord(pipe=0, data=5, model=0): 5, ProcessCoord(pipe=0, data=6, model=0): 6, ProcessCoord(pipe=0, data=7, model=0): 7, ProcessCoord(pipe=0, data=8, model=0): 8, ProcessCoord(pipe=0, data=9, model=0): 9, ProcessCoord(pipe=0, data=10, model=0): 10, ProcessCoord(pipe=0, data=11, model=0): 11, ProcessCoord(pipe=0, data=12, model=0): 12, ProcessCoord(pipe=0, data=13, model=0): 13, ProcessCoord(pipe=0, data=14, model=0): 14, ProcessCoord(pipe=0, data=15, model=0): 15, ProcessCoord(pipe=0, data=16, model=0): 16, ProcessCoord(pipe=0, data=17, model=0): 17, ProcessCoord(pipe=0, data=18, model=0): 18, ProcessCoord(pipe=0, data=19, model=0): 19, ProcessCoord(pipe=0, data=20, model=0): 20, ProcessCoord(pipe=0, data=21, model=0): 21, ProcessCoord(pipe=0, data=22, model=0): 22, ProcessCoord(pi 0: pe=0, data=23, model=0): 23, ProcessCoord(pipe=0, data=24, model=0): 24, ProcessCoord(pipe=0, data=25, model=0): 25, ProcessCoord(pipe=0, data=26, model=0): 26, ProcessCoord(pipe=0, data=27, model=0): 27, ProcessCoord(pipe=0, data=28, model=0): 28, ProcessCoord(pipe=0, data=29, model=0): 29, ProcessCoord(pipe=0, data=30, model=0): 30, ProcessCoord(pipe=0, data=31, model=0): 31, ProcessCoord(pipe=0, data=32, model=0): 32, ProcessCoord(pipe=0, data=33, model=0): 33, ProcessCoord(pipe=0, data=34, model=0): 34, ProcessCoord(pipe=0, data=35, model=0): 35, ProcessCoord(pipe=0, data=36, model=0): 36, ProcessCoord(pipe=0, data=37, model=0): 37, ProcessCoord(pipe=0, data=38, model=0): 38, ProcessCoord(pipe=0, data=39, model=0): 39, ProcessCoord(pipe=0, data=40, model=0): 40, ProcessCoord(pipe=0, data=41, model=0): 41, ProcessCoord(pipe=0, data=42, model=0): 42, ProcessCoord(pipe=0, data=43, model=0): 43, ProcessCoord(pipe=0, data=44, model=0): 44, ProcessCoord(pipe=0, data=45, model=0): 45, ProcessCoord(pipe=0, data=4 0: 6, model=0): 46, ProcessCoord(pipe=0, data=47, model=0): 47, ProcessCoord(pipe=0, data=48, model=0): 48, ProcessCoord(pipe=0, data=49, model=0): 49, ProcessCoord(pipe=0, data=50, model=0): 50, ProcessCoord(pipe=0, data=51, model=0): 51, ProcessCoord(pipe=0, data=52, model=0): 52, ProcessCoord(pipe=0, data=53, model=0): 53, ProcessCoord(pipe=0, data=54, model=0): 54, ProcessCoord(pipe=0, data=55, model=0): 55, ProcessCoord(pipe=0, data=56, model=0): 56, ProcessCoord(pipe=0, data=57, model=0): 57, ProcessCoord(pipe=0, data=58, model=0): 58, ProcessCoord(pipe=0, data=59, model=0): 59, ProcessCoord(pipe=0, data=60, model=0): 60, ProcessCoord(pipe=0, data=61, model=0): 61, ProcessCoord(pipe=0, data=62, model=0): 62, ProcessCoord(pipe=0, data=63, model=0): 63, ProcessCoord(pipe=0, data=64, model=0): 64, ProcessCoord(pipe=0, data=65, model=0): 65, ProcessCoord(pipe=0, data=66, model=0): 66, ProcessCoord(pipe=0, data=67, model=0): 67, ProcessCoord(pipe=0, data=68, model=0): 68, ProcessCoord(pipe=0, data=69, model=0): 0: 69, ProcessCoord(pipe=0, data=70, model=0): 70, ProcessCoord(pipe=0, data=71, model=0): 71, ProcessCoord(pipe=0, data=72, model=0): 72, ProcessCoord(pipe=0, data=73, model=0): 73, ProcessCoord(pipe=0, data=74, model=0): 74, ProcessCoord(pipe=0, data=75, model=0): 75, ProcessCoord(pipe=0, data=76, model=0): 76, ProcessCoord(pipe=0, data=77, model=0): 77, ProcessCoord(pipe=0, data=78, model=0): 78, ProcessCoord(pipe=0, data=79, model=0): 79, ProcessCoord(pipe=0, data=80, model=0): 80, ProcessCoord(pipe=0, data=81, model=0): 81, ProcessCoord(pipe=0, data=82, model=0): 82, ProcessCoord(pipe=0, data=83, model=0): 83, ProcessCoord(pipe=0, data=84, model=0): 84, ProcessCoord(pipe=0, data=85, model=0): 85, ProcessCoord(pipe=0, data=86, model=0): 86, ProcessCoord(pipe=0, data=87, model=0): 87, ProcessCoord(pipe=0, data=88, model=0): 88, ProcessCoord(pipe=0, data=89, model=0): 89, ProcessCoord(pipe=0, data=90, model=0): 90, ProcessCoord(pipe=0, data=91, model=0): 91, ProcessCoord(pipe=0, data=92, model=0): 92, Process 0: Coord(pipe=0, data=93, model=0): 93, ProcessCoord(pipe=0, data=94, model=0): 94, ProcessCoord(pipe=0, data=95, model=0): 95, ProcessCoord(pipe=0, data=96, model=0): 96, ProcessCoord(pipe=0, data=97, model=0): 97, ProcessCoord(pipe=0, data=98, model=0): 98, ProcessCoord(pipe=0, data=99, model=0): 99, ProcessCoord(pipe=0, data=100, model=0): 100, ProcessCoord(pipe=0, data=101, model=0): 101, ProcessCoord(pipe=0, data=102, model=0): 102, ProcessCoord(pipe=0, data=103, model=0): 103, ProcessCoord(pipe=0, data=104, model=0): 104, ProcessCoord(pipe=0, data=105, model=0): 105, ProcessCoord(pipe=0, data=106, model=0): 106, ProcessCoord(pipe=0, data=107, model=0): 107, ProcessCoord(pipe=0, data=108, model=0): 108, ProcessCoord(pipe=0, data=109, model=0): 109, ProcessCoord(pipe=0, data=110, model=0): 110, ProcessCoord(pipe=0, data=111, model=0): 111, ProcessCoord(pipe=0, data=112, model=0): 112, ProcessCoord(pipe=0, data=113, model=0): 113, ProcessCoord(pipe=0, data=114, model=0): 114, ProcessCoord(pipe=0, data=115, mo 0: del=0): 115, ProcessCoord(pipe=0, data=116, model=0): 116, ProcessCoord(pipe=0, data=117, model=0): 117, ProcessCoord(pipe=0, data=118, model=0): 118, ProcessCoord(pipe=0, data=119, model=0): 119, ProcessCoord(pipe=0, data=120, model=0): 120, ProcessCoord(pipe=0, data=121, model=0): 121, ProcessCoord(pipe=0, data=122, model=0): 122, ProcessCoord(pipe=0, data=123, model=0): 123, ProcessCoord(pipe=0, data=124, model=0): 124, ProcessCoord(pipe=0, data=125, model=0): 125, ProcessCoord(pipe=0, data=126, model=0): 126, ProcessCoord(pipe=0, data=127, model=0): 127, ProcessCoord(pipe=0, data=128, model=0): 128, ProcessCoord(pipe=0, data=129, model=0): 129, ProcessCoord(pipe=0, data=130, model=0): 130, ProcessCoord(pipe=0, data=131, model=0): 131, ProcessCoord(pipe=0, data=132, model=0): 132, ProcessCoord(pipe=0, data=133, model=0): 133, ProcessCoord(pipe=0, data=134, model=0): 134, ProcessCoord(pipe=0, data=135, model=0): 135, ProcessCoord(pipe=0, data=136, model=0): 136, ProcessCoord(pipe=0, data=137, model=0): 137, 0: ProcessCoord(pipe=0, data=138, model=0): 138, ProcessCoord(pipe=0, data=139, model=0): 139, ProcessCoord(pipe=0, data=140, model=0): 140, ProcessCoord(pipe=0, data=141, model=0): 141, ProcessCoord(pipe=0, data=142, model=0): 142, ProcessCoord(pipe=0, data=143, model=0): 143, ProcessCoord(pipe=0, data=144, model=0): 144, ProcessCoord(pipe=0, data=145, model=0): 145, ProcessCoord(pipe=0, data=146, model=0): 146, ProcessCoord(pipe=0, data=147, model=0): 147, ProcessCoord(pipe=0, data=148, model=0): 148, ProcessCoord(pipe=0, data=149, model=0): 149, ProcessCoord(pipe=0, data=150, model=0): 150, ProcessCoord(pipe=0, data=151, model=0): 151, ProcessCoord(pipe=0, data=152, model=0): 152, ProcessCoord(pipe=0, data=153, model=0): 153, ProcessCoord(pipe=0, data=154, model=0): 154, ProcessCoord(pipe=0, data=155, model=0): 155, ProcessCoord(pipe=0, data=156, model=0): 156, ProcessCoord(pipe=0, data=157, model=0): 157, ProcessCoord(pipe=0, data=158, model=0): 158, ProcessCoord(pipe=0, data=159, model=0): 159, ProcessCoor 0: d(pipe=0, data=160, model=0): 160, ProcessCoord(pipe=0, data=161, model=0): 161, ProcessCoord(pipe=0, data=162, model=0): 162, ProcessCoord(pipe=0, data=163, model=0): 163, ProcessCoord(pipe=0, data=164, model=0): 164, ProcessCoord(pipe=0, data=165, model=0): 165, ProcessCoord(pipe=0, data=166, model=0): 166, ProcessCoord(pipe=0, data=167, model=0): 167, ProcessCoord(pipe=0, data=168, model=0): 168, ProcessCoord(pipe=0, data=169, model=0): 169, ProcessCoord(pipe=0, data=170, model=0): 170, ProcessCoord(pipe=0, data=171, model=0): 171, ProcessCoord(pipe=0, data=172, model=0): 172, ProcessCoord(pipe=0, data=173, model=0): 173, ProcessCoord(pipe=0, data=174, model=0): 174, ProcessCoord(pipe=0, data=175, model=0): 175, ProcessCoord(pipe=0, data=176, model=0): 176, ProcessCoord(pipe=0, data=177, model=0): 177, ProcessCoord(pipe=0, data=178, model=0): 178, ProcessCoord(pipe=0, data=179, model=0): 179, ProcessCoord(pipe=0, data=180, model=0): 180, ProcessCoord(pipe=0, data=181, model=0): 181, ProcessCoord(pipe=0, da 0: ta=182, model=0): 182, ProcessCoord(pipe=0, data=183, model=0): 183, ProcessCoord(pipe=0, data=184, model=0): 184, ProcessCoord(pipe=0, data=185, model=0): 185, ProcessCoord(pipe=0, data=186, model=0): 186, ProcessCoord(pipe=0, data=187, model=0): 187, ProcessCoord(pipe=0, data=188, model=0): 188, ProcessCoord(pipe=0, data=189, model=0): 189, ProcessCoord(pipe=0, data=190, model=0): 190, ProcessCoord(pipe=0, data=191, model=0): 191, ProcessCoord(pipe=0, data=192, model=0): 192, ProcessCoord(pipe=0, data=193, model=0): 193, ProcessCoord(pipe=0, data=194, model=0): 194, ProcessCoord(pipe=0, data=195, model=0): 195, ProcessCoord(pipe=0, data=196, model=0): 196, ProcessCoord(pipe=0, data=197, model=0): 197, ProcessCoord(pipe=0, data=198, model=0): 198, ProcessCoord(pipe=0, data=199, model=0): 199, ProcessCoord(pipe=0, data=200, model=0): 200, ProcessCoord(pipe=0, data=201, model=0): 201, ProcessCoord(pipe=0, data=202, model=0): 202, ProcessCoord(pipe=0, data=203, model=0): 203, ProcessCoord(pipe=0, data=204, mode 0: l=0): 204, ProcessCoord(pipe=0, data=205, model=0): 205, ProcessCoord(pipe=0, data=206, model=0): 206, ProcessCoord(pipe=0, data=207, model=0): 207, ProcessCoord(pipe=0, data=208, model=0): 208, ProcessCoord(pipe=0, data=209, model=0): 209, ProcessCoord(pipe=0, data=210, model=0): 210, ProcessCoord(pipe=0, data=211, model=0): 211, ProcessCoord(pipe=0, data=212, model=0): 212, ProcessCoord(pipe=0, data=213, model=0): 213, ProcessCoord(pipe=0, data=214, model=0): 214, ProcessCoord(pipe=0, data=215, model=0): 215, ProcessCoord(pipe=0, data=216, model=0): 216, ProcessCoord(pipe=0, data=217, model=0): 217, ProcessCoord(pipe=0, data=218, model=0): 218, ProcessCoord(pipe=0, data=219, model=0): 219, ProcessCoord(pipe=0, data=220, model=0): 220, ProcessCoord(pipe=0, data=221, model=0): 221, ProcessCoord(pipe=0, data=222, model=0): 222, ProcessCoord(pipe=0, data=223, model=0): 223, ProcessCoord(pipe=0, data=224, model=0): 224, ProcessCoord(pipe=0, data=225, model=0): 225, ProcessCoord(pipe=0, data=226, model=0): 226, P 0: rocessCoord(pipe=0, data=227, model=0): 227, ProcessCoord(pipe=0, data=228, model=0): 228, ProcessCoord(pipe=0, data=229, model=0): 229, ProcessCoord(pipe=0, data=230, model=0): 230, ProcessCoord(pipe=0, data=231, model=0): 231, ProcessCoord(pipe=0, data=232, model=0): 232, ProcessCoord(pipe=0, data=233, model=0): 233, ProcessCoord(pipe=0, data=234, model=0): 234, ProcessCoord(pipe=0, data=235, model=0): 235, ProcessCoord(pipe=0, data=236, model=0): 236, ProcessCoord(pipe=0, data=237, model=0): 237, ProcessCoord(pipe=0, data=238, model=0): 238, ProcessCoord(pipe=0, data=239, model=0): 239, ProcessCoord(pipe=0, data=240, model=0): 240, ProcessCoord(pipe=0, data=241, model=0): 241, ProcessCoord(pipe=0, data=242, model=0): 242, ProcessCoord(pipe=0, data=243, model=0): 243, ProcessCoord(pipe=0, data=244, model=0): 244, ProcessCoord(pipe=0, data=245, model=0): 245, ProcessCoord(pipe=0, data=246, model=0): 246, ProcessCoord(pipe=0, data=247, model=0): 247, ProcessCoord(pipe=0, data=248, model=0): 248, ProcessCoord( 0: pipe=0, data=249, model=0): 249, ProcessCoord(pipe=0, data=250, model=0): 250, ProcessCoord(pipe=0, data=251, model=0): 251, ProcessCoord(pipe=0, data=252, model=0): 252, ProcessCoord(pipe=0, data=253, model=0): 253, ProcessCoord(pipe=0, data=254, model=0): 254, ProcessCoord(pipe=0, data=255, model=0): 255} 0: [2023-05-10 10:10:21,409] [INFO] [module.py:366:_partition_layers] Partitioning pipeline stages with method type:transformer 0: stage=0 layers=41 0: 0: _to_float16 0: 1: EmbeddingPipe 0: 2: 0: 3: ParallelTransformerLayerPipe 0: 4: ParallelTransformerLayerPipe 0: 5: ParallelTransformerLayerPipe 0: 6: ParallelTransformerLayerPipe 0: 7: ParallelTransformerLayerPipe 0: 8: ParallelTransformerLayerPipe 0: 9: ParallelTransformerLayerPipe 0: 10: ParallelTransformerLayerPipe 0: 11: ParallelTransformerLayerPipe 0: 12: ParallelTransformerLayerPipe 0: 13: ParallelTransformerLayerPipe 0: 14: ParallelTransformerLayerPipe 0: 15: ParallelTransformerLayerPipe 0: 16: ParallelTransformerLayerPipe 0: 17: ParallelTransformerLayerPipe 0: 18: ParallelTransformerLayerPipe 0: 19: ParallelTransformerLayerPipe 0: 20: ParallelTransformerLayerPipe 0: 21: ParallelTransformerLayerPipe 0: 22: ParallelTransformerLayerPipe 0: 23: ParallelTransformerLayerPipe 0: 24: ParallelTransformerLayerPipe 0: 25: ParallelTransformerLayerPipe 0: 26: ParallelTransformerLayerPipe 0: 27: ParallelTransformerLayerPipe 0: 28: ParallelTransformerLayerPipe 0: 29: ParallelTransformerLayerPipe 0: 30: ParallelTransformerLayerPipe 0: 31: ParallelTransformerLayerPipe 0: 32: ParallelTransformerLayerPipe 0: 33: ParallelTransformerLayerPipe 0: 34: ParallelTransformerLayerPipe 0: 35: ParallelTransformerLayerPipe 0: 36: ParallelTransformerLayerPipe 0: 37: undo 0: 38: MixedFusedLayerNorm 0: 39: EmbeddingPipe 0: 40: float16_to_fp32 0: loss: CrossEntropy 0: [2023-05-10 10:10:21,689] [INFO] [utils.py:827:see_memory_usage] After Building Model 0: [2023-05-10 10:10:21,690] [INFO] [utils.py:828:see_memory_usage] MA 5.26 GB Max_MA 5.26 GB CA 5.31 GB Max_CA 5 GB 0: [2023-05-10 10:10:21,690] [INFO] [utils.py:836:see_memory_usage] CPU Virtual Memory: used = 40.28 GB, percent = 8.0% 0: setting training iterations to 0 0: > learning rate decay style: cosine 0: DeepSpeed is enabled. 0: [2023-05-10 10:10:21,693] [INFO] [logging.py:68:log_dist] [Rank 0] DeepSpeed info: version=0.7.5, git-hash=unknown, git-branch=unknown 0: [2023-05-10 10:10:44,391] [INFO] [logging.py:68:log_dist] [Rank 0] DeepSpeed Flops Profiler Enabled: False 0: [2023-05-10 10:10:44,392] [INFO] [logging.py:68:log_dist] [Rank 0] Removing param_group that has no 'params' in the client Optimizer 0: [2023-05-10 10:10:44,392] [INFO] [logging.py:68:log_dist] [Rank 0] Using client Optimizer as basic optimizer 0: [2023-05-10 10:10:44,411] [INFO] [logging.py:68:log_dist] [Rank 0] DeepSpeed Basic Optimizer = FusedAdam 0: [2023-05-10 10:10:44,411] [INFO] [logging.py:68:log_dist] [Rank 0] Creating BF16 optimizer 0: [2023-05-10 10:10:44,536] [INFO] [utils.py:827:see_memory_usage] begin bf16_optimizer 0: [2023-05-10 10:10:44,536] [INFO] [utils.py:828:see_memory_usage] MA 5.25 GB Max_MA 5.27 GB CA 5.32 GB Max_CA 5 GB 0: [2023-05-10 10:10:44,537] [INFO] [utils.py:836:see_memory_usage] CPU Virtual Memory: used = 41.01 GB, percent = 8.2% 0: ninja: no work to do. 0: Time to load utils op: 0.31937503814697266 seconds 0: Time to load utils op: 0.36258602142333984 seconds 6: Time to load utils op: 0.35329365730285645 secondsTime to load utils op: 0.3533034324645996 seconds 6: 6: Time to load utils op: 0.3533053398132324 seconds 6: Time to load utils op: 0.35332632064819336 seconds 6: Time to load utils op: 0.35333800315856934 seconds 6: Time to load utils op: 0.353229284286499 secondsTime to load utils op: 0.3533327579498291 seconds 6: 6: Time to load utils op: 0.35334086418151855 seconds 9: Time to load utils op: 0.36747002601623535 seconds 9: Time to load utils op: 0.36652040481567383 seconds 9: Time to load utils op: 0.3665299415588379 seconds 9: Time to load utils op: 0.36812496185302734 seconds 9: Time to load utils op: 0.3663959503173828 seconds 9: Time to load utils op: 0.3672771453857422 secondsTime to load utils op: 0.3683497905731201 seconds 9: 9: Time to load utils op: 0.3683640956878662 seconds 8: Time to load utils op: 0.36431097984313965 secondsTime to load utils op: 0.36430883407592773 seconds 8: 8: Time to load utils op: 0.36432647705078125 seconds 8: Time to load utils op: 0.36435747146606445 seconds 8: Time to load utils op: 0.3643641471862793 secondsTime to load utils op: 0.36435723304748535 seconds 8: 8: Time to load utils op: 0.3643779754638672 secondsTime to load utils op: 0.36437392234802246 seconds 8: 0: Time to load utils op: 0.4174518585205078 seconds 0: Time to load utils op: 0.4172933101654053 seconds 0: Time to load utils op: 0.4180161952972412 seconds 0: Time to load utils op: 0.41826558113098145 seconds 0: Time to load utils op: 0.4176008701324463 seconds 0: Time to load utils op: 0.41773247718811035 seconds 18: Time to load utils op: 0.39951348304748535 seconds 18: Time to load utils op: 0.3998277187347412 seconds 18: Time to load utils op: 0.40004754066467285 seconds 18: Time to load utils op: 0.3998997211456299 seconds 18: Time to load utils op: 0.40073156356811523 secondsTime to load utils op: 0.4002096652984619 seconds 18: 18: Time to load utils op: 0.4002034664154053 seconds 18: Time to load utils op: 0.4010791778564453 seconds 3: Time to load utils op: 0.41860222816467285 secondsTime to load utils op: 0.41969776153564453 seconds 3: 3: Time to load utils op: 0.4206085205078125 seconds 3: Time to load utils op: 0.419386625289917 seconds 3: Time to load utils op: 0.41987085342407227 secondsTime to load utils op: 0.41861701011657715 seconds 3: 3: Time to load utils op: 0.42008090019226074 seconds 3: Time to load utils op: 0.4195530414581299 seconds 17: Time to load utils op: 0.399935245513916 seconds 17: Time to load utils op: 0.3999476432800293 secondsTime to load utils op: 0.39994287490844727 seconds 17: Time to load utils op: 0.3999607563018799 seconds 17: 17: Time to load utils op: 0.3999814987182617 seconds 17: Time to load utils op: 0.39998483657836914 seconds 17: Time to load utils op: 0.3999800682067871 secondsTime to load utils op: 0.3999781608581543 seconds 17: 28: Time to load utils op: 0.39063239097595215 secondsTime to load utils op: 0.3909730911254883 seconds 28: 28: Time to load utils op: 0.3908543586730957 seconds 28: Time to load utils op: 0.3912782669067383 secondsTime to load utils op: 0.390944242477417 seconds 28: 28: Time to load utils op: 0.39075231552124023 seconds 28: Time to load utils op: 0.3906838893890381 secondsTime to load utils op: 0.39017462730407715 seconds 28: 15: Time to load utils op: 0.4069559574127197 secondsTime to load utils op: 0.40672802925109863 seconds 15: 15: Time to load utils op: 0.40698862075805664 secondsTime to load utils op: 0.4070117473602295 secondsTime to load utils op: 0.40740966796875 seconds 15: 15: Time to load utils op: 0.40732741355895996 seconds 15: 15: Time to load utils op: 0.40686678886413574 seconds 15: Time to load utils op: 0.40705037117004395 seconds 7: Time to load utils op: 0.41312384605407715 secondsTime to load utils op: 0.4131321907043457 seconds 7: 7: Time to load utils op: 0.41315484046936035 secondsTime to load utils op: 0.41313934326171875 seconds 7: Time to load utils op: 0.41314029693603516 seconds 7: 7: Time to load utils op: 0.41315674781799316 secondsTime to load utils op: 0.4131631851196289 seconds 7: 7: Time to load utils op: 0.41316962242126465 seconds 16: Time to load utils op: 0.4056549072265625 secondsTime to load utils op: 0.4056713581085205 seconds 16: 23: Time to load utils op: 0.39775538444519043 seconds 23: Time to load utils op: 0.3977646827697754 seconds 23: Time to load utils op: 0.39867329597473145 seconds 23: Time to load utils op: 0.3988208770751953 seconds 20: Time to load utils op: 0.39711499214172363 seconds 20: Time to load utils op: 0.39713358879089355 seconds 16: Time to load utils op: 0.4057121276855469 seconds 23: Time to load utils op: 0.3978092670440674 secondsTime to load utils op: 0.39792537689208984 seconds 23: 22: Time to load utils op: 0.3997664451599121 seconds 22: Time to load utils op: 0.39910316467285156 seconds 16: Time to load utils op: 0.405684232711792 secondsTime to load utils op: 0.40568041801452637 seconds 16: 23: Time to load utils op: 0.3977079391479492 seconds 20: Time to load utils op: 0.3971414566040039 secondsTime to load utils op: 0.39714741706848145 seconds 20: 20: Time to load utils op: 0.3971526622772217 seconds 16: Time to load utils op: 0.40569472312927246 secondsTime to load utils op: 0.40604567527770996 seconds 16: 23: Time to load utils op: 0.3979463577270508 seconds 22: Time to load utils op: 0.3992037773132324 secondsTime to load utils op: 0.3980696201324463 seconds 22: 20: Time to load utils op: 0.3971579074859619 seconds 20: Time to load utils op: 0.3971676826477051 seconds 16: Time to load utils op: 0.4057285785675049 seconds 20: Time to load utils op: 0.3971874713897705 seconds 22: Time to load utils op: 0.39940619468688965 secondsTime to load utils op: 0.3997838497161865 seconds 22: Time to load utils op: 0.39922094345092773 seconds 22: 12: Time to load utils op: 0.4110245704650879 secondsTime to load utils op: 0.41149401664733887 seconds 12: 22: Time to load utils op: 0.3987388610839844 seconds 12: Time to load utils op: 0.41050004959106445 seconds 12: Time to load utils op: 0.4108703136444092 seconds 12: Time to load utils op: 0.41127538681030273 secondsTime to load utils op: 0.4107480049133301 secondsTime to load utils op: 0.41081833839416504 seconds 12: 12: 12: Time to load utils op: 0.4131021499633789 seconds 26: Time to load utils op: 0.38994646072387695 seconds 26: Time to load utils op: 0.3899664878845215 seconds 4: Time to load utils op: 0.41814756393432617 secondsTime to load utils op: 0.4181380271911621 seconds 4: 26: Time to load utils op: 0.3899705410003662 seconds 26: Time to load utils op: 0.38999295234680176 seconds 4: Time to load utils op: 0.418165922164917 secondsTime to load utils op: 0.41817307472229004 secondsTime to load utils op: 0.41817426681518555 seconds 4: 4: 4: Time to load utils op: 0.4181842803955078 seconds 4: Time to load utils op: 0.4181842803955078 secondsTime to load utils op: 0.4181978702545166 seconds 4: 26: Time to load utils op: 0.39001035690307617 secondsTime to load utils op: 0.3900158405303955 secondsTime to load utils op: 0.3900272846221924 seconds 26: 26: 26: Time to load utils op: 0.39002537727355957 seconds 14: Time to load utils op: 0.4054243564605713 secondsTime to load utils op: 0.40543174743652344 seconds 14: 14: Time to load utils op: 0.4054546356201172 seconds 14: Time to load utils op: 0.40547752380371094 seconds 1: Time to load utils op: 0.4207026958465576 secondsTime to load utils op: 0.42069125175476074 seconds 1: 1: Time to load utils op: 0.4207034111022949 seconds 14: Time to load utils op: 0.40548014640808105 seconds 14: Time to load utils op: 0.40549230575561523 seconds 14: Time to load utils op: 0.40549659729003906 secondsTime to load utils op: 0.4054911136627197 seconds 2: Time to load utils op: 0.4198925495147705 seconds 2: Time to load utils op: 0.4199032783508301 secondsTime to load utils op: 0.4198942184448242 seconds 2: 14: 1: Time to load utils op: 0.42072606086730957 seconds 24: Time to load utils op: 0.39211416244506836 seconds 24: Time to load utils op: 0.39211511611938477 seconds 2: Time to load utils op: 0.419919490814209 seconds 2: Time to load utils op: 0.41993212699890137 seconds 1: Time to load utils op: 0.42073750495910645 secondsTime to load utils op: 0.4207448959350586 secondsTime to load utils op: 0.42073869705200195 seconds 1: 1: 1: Time to load utils op: 0.42074012756347656 seconds 2: Time to load utils op: 0.4199354648590088 seconds 2: Time to load utils op: 0.41995716094970703 secondsTime to load utils op: 0.4199514389038086 seconds 2: 25: Time to load utils op: 0.3921473026275635 seconds 25: Time to load utils op: 0.39214468002319336 seconds 25: Time to load utils op: 0.39216041564941406 seconds 25: Time to load utils op: 0.39216136932373047 seconds 25: Time to load utils op: 0.3921661376953125 seconds 30: Time to load utils op: 0.38922548294067383 secondsTime to load utils op: 0.390134334564209 seconds 30: 24: Time to load utils op: 0.39214539527893066 secondsTime to load utils op: 0.392136812210083 seconds 24: 24: Time to load utils op: 0.39214587211608887 seconds 24: Time to load utils op: 0.39215898513793945 seconds 25: Time to load utils op: 0.3921825885772705 seconds 24: Time to load utils op: 0.39217209815979004 seconds 25: Time to load utils op: 0.39220356941223145 seconds 30: Time to load utils op: 0.39014649391174316 seconds 30: Time to load utils op: 0.3893120288848877 seconds 24: Time to load utils op: 0.39217209815979004 seconds 11: Time to load utils op: 0.40936994552612305 secondsTime to load utils op: 0.4093761444091797 seconds 11: 11: Time to load utils op: 0.4093811511993408 seconds 25: Time to load utils op: 0.3921937942504883 seconds 31: Time to load utils op: 0.38665270805358887 seconds 31: Time to load utils op: 0.388075590133667 seconds 31: Time to load utils op: 0.38885998725891113 seconds 31: Time to load utils op: 0.3885629177093506 seconds 30: Time to load utils op: 0.3908672332763672 secondsTime to load utils op: 0.3901669979095459 seconds 30: 30: Time to load utils op: 0.39074277877807617 seconds 11: Time to load utils op: 0.40941381454467773 seconds 13: Time to load utils op: 0.4111311435699463 secondsTime to load utils op: 0.4123094081878662 seconds 13: 13: Time to load utils op: 0.4107935428619385 seconds 11: Time to load utils op: 0.4094245433807373 secondsTime to load utils op: 0.40941381454467773 seconds 11: 11: Time to load utils op: 0.4094250202178955 seconds 11: Time to load utils op: 0.40943336486816406 seconds 31: Time to load utils op: 0.38793349266052246 seconds 31: Time to load utils op: 0.3887350559234619 seconds 31: Time to load utils op: 0.38825035095214844 seconds 31: Time to load utils op: 0.3894619941711426 seconds 30: Time to load utils op: 0.38956570625305176 seconds 13: Time to load utils op: 0.41162586212158203 secondsTime to load utils op: 0.41144466400146484 seconds 13: 13: Time to load utils op: 0.4110527038574219 seconds 13: Time to load utils op: 0.4116230010986328 seconds 13: Time to load utils op: 0.4112884998321533 seconds 29: Time to load utils op: 0.3909339904785156 seconds 29: Time to load utils op: 0.3925435543060303 seconds 27: Time to load utils op: 0.3937692642211914 secondsTime to load utils op: 0.3928995132446289 seconds 27: 29: Time to load utils op: 0.3921997547149658 seconds 29: Time to load utils op: 0.39118456840515137 seconds 29: Time to load utils op: 0.39139890670776367 seconds 27: Time to load utils op: 0.3935661315917969 seconds 27: Time to load utils op: 0.3937337398529053 seconds 29: Time to load utils op: 0.3909463882446289 seconds 27: Time to load utils op: 0.39374256134033203 seconds 29: Time to load utils op: 0.3923213481903076 secondsTime to load utils op: 0.3904755115509033 seconds 27: Time to load utils op: 0.39377665519714355 seconds 29: 27: Time to load utils op: 0.39376378059387207 seconds 27: Time to load utils op: 0.39227795600891113 seconds 19: Time to load utils op: 0.3999967575073242 seconds 19: Time to load utils op: 0.40001749992370605 seconds 21: Time to load utils op: 0.40155029296875 seconds 21: Time to load utils op: 0.40172553062438965 seconds 10: Time to load utils op: 0.41070103645324707 secondsTime to load utils op: 0.4107072353363037 seconds 10: 10: Time to load utils op: 0.4107182025909424 seconds 19: Time to load utils op: 0.4000422954559326 seconds 19: Time to load utils op: 0.4000535011291504 seconds 21: Time to load utils op: 0.4008345603942871 seconds 10: Time to load utils op: 0.4107511043548584 secondsTime to load utils op: 0.410752534866333 seconds 10: 19: Time to load utils op: 0.40005922317504883 secondsTime to load utils op: 0.40006208419799805 secondsTime to load utils op: 0.40006518363952637 seconds 21: Time to load utils op: 0.40147972106933594 secondsTime to load utils op: 0.4008772373199463 seconds 21: 10: Time to load utils op: 0.41069626808166504 secondsTime to load utils op: 0.41076207160949707 seconds 10: 10: Time to load utils op: 0.4107627868652344 seconds 19: 19: 21: Time to load utils op: 0.40177416801452637 seconds 21: Time to load utils op: 0.4017796516418457 secondsTime to load utils op: 0.40062379837036133 seconds 19: Time to load utils op: 0.4000687599182129 seconds 21: 5: Time to load utils op: 0.4274570941925049 seconds 5: Time to load utils op: 0.42748355865478516 seconds 5: Time to load utils op: 0.4274871349334717 seconds 5: Time to load utils op: 0.42751216888427734 seconds 5: Time to load utils op: 0.42752838134765625 seconds 5: Time to load utils op: 0.4275374412536621 seconds 5: Time to load utils op: 0.4271574020385742 seconds 5: Time to load utils op: 0.42754697799682617 seconds 0: Time to load utils op: 0.0005612373352050781 seconds 0: Time to load utils op: 0.0006220340728759766 seconds 0: Time to load utils op: 0.0006031990051269531 seconds 0: Time to load utils op: 0.0005645751953125 seconds 0: Time to load utils op: 0.0006518363952636719 secondsTime to load utils op: 0.0006365776062011719 seconds 0: 0: Time to load utils op: 0.0005102157592773438 seconds 6: Time to load utils op: 0.0011091232299804688 seconds 6: Time to load utils op: 0.0014526844024658203 seconds 6: Time to load utils op: 0.0014629364013671875 seconds 6: Time to load utils op: 0.0014195442199707031 secondsTime to load utils op: 0.0014333724975585938 seconds 6: 6: Time to load utils op: 0.0013844966888427734 seconds 6: Time to load utils op: 0.0014355182647705078 seconds 6: Time to load utils op: 0.0014791488647460938 seconds 0: [2023-05-10 10:10:44,981] [INFO] [utils.py:827:see_memory_usage] before initializing group 0 0: [2023-05-10 10:10:44,982] [INFO] [utils.py:828:see_memory_usage] MA 5.25 GB Max_MA 5.25 GB CA 5.32 GB Max_CA 5 GB 0: [2023-05-10 10:10:44,982] [INFO] [utils.py:836:see_memory_usage] CPU Virtual Memory: used = 41.18 GB, percent = 8.2% 3: Time to load utils op: 0.0007796287536621094 seconds 3: Time to load utils op: 0.0009713172912597656 seconds 8: Time to load utils op: 0.0010085105895996094 seconds 8: Time to load utils op: 0.0009377002716064453 seconds 3: Time to load utils op: 0.0011587142944335938 seconds 3: Time to load utils op: 0.0010738372802734375 seconds 3: Time to load utils op: 0.0011644363403320312 secondsTime to load utils op: 0.001127481460571289 secondsTime to load utils op: 0.0011227130889892578 seconds 3: 3: 8: Time to load utils op: 0.00118255615234375 seconds 3: Time to load utils op: 0.0012295246124267578 seconds 8: Time to load utils op: 0.0013246536254882812 seconds 8: Time to load utils op: 0.0013952255249023438 seconds 8: Time to load utils op: 0.0013835430145263672 secondsTime to load utils op: 0.0013492107391357422 seconds 8: 8: Time to load utils op: 0.001432657241821289 seconds 9: Time to load utils op: 0.0009443759918212891 seconds 9: Time to load utils op: 0.0013761520385742188 seconds 9: Time to load utils op: 0.0013284683227539062 secondsTime to load utils op: 0.001386880874633789 seconds 9: 9: Time to load utils op: 0.0013072490692138672 seconds 9: Time to load utils op: 0.0013394355773925781 seconds 9: Time to load utils op: 0.0013682842254638672 seconds 9: Time to load utils op: 0.0013730525970458984 seconds 28: Time to load utils op: 0.0009286403656005859 seconds 28: Time to load utils op: 0.001348257064819336 secondsTime to load utils op: 0.0013194084167480469 seconds 28: 28: Time to load utils op: 0.001249551773071289 seconds 28: Time to load utils op: 0.0012331008911132812 secondsTime to load utils op: 0.001214742660522461 seconds 28: 28: Time to load utils op: 0.0012924671173095703 seconds 28: Time to load utils op: 0.0013027191162109375 seconds 18: Time to load utils op: 0.0008363723754882812 seconds 18: Time to load utils op: 0.0008635520935058594 seconds 18: Time to load utils op: 0.0009567737579345703 seconds 18: Time to load utils op: 0.0008771419525146484 seconds 18: Time to load utils op: 0.0008764266967773438 seconds 18: Time to load utils op: 0.0008916854858398438 seconds 18: Time to load utils op: 0.0009164810180664062 seconds 18: Time to load utils op: 0.0009386539459228516 seconds 16: Time to load utils op: 0.0009245872497558594 seconds 16: Time to load utils op: 0.0012750625610351562 seconds 16: Time to load utils op: 0.0013120174407958984 seconds 16: Time to load utils op: 0.001165151596069336 seconds 16: Time to load utils op: 0.0012009143829345703 secondsTime to load utils op: 0.001184701919555664 seconds 16: Time to load utils op: 0.0012354850769042969 seconds 16: 21: Time to load utils op: 0.0009050369262695312 seconds 16: Time to load utils op: 0.0012216567993164062 seconds 21: Time to load utils op: 0.001096963882446289 seconds 29: Time to load utils op: 0.0005102157592773438 seconds 21: Time to load utils op: 0.0013866424560546875 seconds 21: Time to load utils op: 0.001318216323852539 seconds 29: Time to load utils op: 0.000518798828125 seconds 21: Time to load utils op: 0.0012950897216796875 seconds 21: Time to load utils op: 0.0012311935424804688 seconds 21: Time to load utils op: 0.001241922378540039 seconds 29: Time to load utils op: 0.00052642822265625 seconds 29: Time to load utils op: 0.0004630088806152344 secondsTime to load utils op: 0.0005178451538085938 secondsTime to load utils op: 0.0005006790161132812 seconds 29: Time to load utils op: 0.0005016326904296875 seconds 29: 29: Time to load utils op: 0.0005202293395996094 seconds 29: 21: Time to load utils op: 0.00135040283203125 seconds 15: Time to load utils op: 0.0008647441864013672 seconds 15: Time to load utils op: 0.0010209083557128906 seconds 15: Time to load utils op: 0.0012331008911132812 seconds 15: Time to load utils op: 0.0011508464813232422 seconds 15: Time to load utils op: 0.0011568069458007812 seconds 15: Time to load utils op: 0.0011701583862304688 seconds 15: Time to load utils op: 0.001155853271484375 seconds 15: Time to load utils op: 0.0009996891021728516 seconds 12: Time to load utils op: 0.001031637191772461 seconds 12: Time to load utils op: 0.0008749961853027344 seconds 12: Time to load utils op: 0.0010991096496582031 seconds 12: Time to load utils op: 0.0012068748474121094 seconds 12: Time to load utils op: 0.0009617805480957031 secondsTime to load utils op: 0.001050710678100586 seconds 12: 12: Time to load utils op: 0.0011265277862548828 seconds 12: Time to load utils op: 0.0011744499206542969 seconds 20: Time to load utils op: 0.0010268688201904297 seconds 20: Time to load utils op: 0.0009062290191650391 seconds 1: Time to load utils op: 0.0007932186126708984 seconds 1: Time to load utils op: 0.0007078647613525391 seconds 5: Time to load utils op: 0.0009198188781738281 seconds 20: Time to load utils op: 0.0014421939849853516 seconds 20: Time to load utils op: 0.0013270378112792969 seconds 4: Time to load utils op: 0.0008275508880615234 seconds 20: Time to load utils op: 0.001369476318359375 seconds 4: Time to load utils op: 0.0007486343383789062 seconds 20: Time to load utils op: 0.0013346672058105469 secondsTime to load utils op: 0.0013928413391113281 seconds 20: 20: Time to load utils op: 0.001516103744506836 seconds 2: Time to load utils op: 0.0006949901580810547 seconds 1: Time to load utils op: 0.001322031021118164 seconds 1: Time to load utils op: 0.001375436782836914 secondsTime to load utils op: 0.0012614727020263672 seconds 4: Time to load utils op: 0.0009722709655761719 seconds 1: 1: Time to load utils op: 0.0012431144714355469 seconds 2: Time to load utils op: 0.0008916854858398438 seconds 1: Time to load utils op: 0.001238107681274414 seconds 5: Time to load utils op: 0.0013322830200195312 secondsTime to load utils op: 0.0013456344604492188 secondsTime to load utils op: 0.0013606548309326172 seconds 5: 5: 5: Time to load utils op: 0.0013136863708496094 secondsTime to load utils op: 0.0013127326965332031 seconds 5: 1: Time to load utils op: 0.0013320446014404297 seconds 5: Time to load utils op: 0.0013031959533691406 seconds 5: Time to load utils op: 0.0014157295227050781 seconds 4: Time to load utils op: 0.0010936260223388672 seconds 2: Time to load utils op: 0.0011260509490966797 seconds 4: Time to load utils op: 0.0011553764343261719 seconds 4: Time to load utils op: 0.001149892807006836 seconds 4: Time to load utils op: 0.0009889602661132812 seconds 4: Time to load utils op: 0.0011823177337646484 seconds 2: Time to load utils op: 0.001165151596069336 seconds 2: Time to load utils op: 0.001294851303100586 seconds 2: Time to load utils op: 0.0012621879577636719 seconds 2: Time to load utils op: 0.0012578964233398438 seconds 2: Time to load utils op: 0.001352071762084961 seconds 7: Time to load utils op: 0.0009844303131103516 seconds 19: Time to load utils op: 0.0007746219635009766 secondsTime to load utils op: 0.0008437633514404297 seconds 19: 19: Time to load utils op: 0.0009062290191650391 seconds 7: Time to load utils op: 0.0014209747314453125 seconds 7: Time to load utils op: 0.001260995864868164 secondsTime to load utils op: 0.0013432502746582031 seconds 7: 7: Time to load utils op: 0.0014407634735107422 seconds 7: Time to load utils op: 0.0013363361358642578 seconds 7: Time to load utils op: 0.0014574527740478516 seconds 19: Time to load utils op: 0.0011756420135498047 secondsTime to load utils op: 0.0010843276977539062 seconds 19: 19: Time to load utils op: 0.001180887222290039 seconds 19: Time to load utils op: 0.0011601448059082031 seconds 7: Time to load utils op: 0.0013833045959472656 seconds 19: Time to load utils op: 0.0011997222900390625 seconds 0: [2023-05-10 10:10:45,106] [INFO] [utils.py:827:see_memory_usage] after initializing group 0 0: [2023-05-10 10:10:45,107] [INFO] [utils.py:828:see_memory_usage] MA 10.64 GB Max_MA 10.64 GB CA 13.39 GB Max_CA 13 GB 0: [2023-05-10 10:10:45,107] [INFO] [utils.py:836:see_memory_usage] CPU Virtual Memory: used = 41.18 GB, percent = 8.2% 17: Time to load utils op: 0.0018610954284667969 seconds 17: Time to load utils op: 0.0017621517181396484 seconds 17: Time to load utils op: 0.0019092559814453125 seconds 17: Time to load utils op: 0.002070188522338867 seconds 17: Time to load utils op: 0.0022156238555908203 seconds 17: Time to load utils op: 0.0021965503692626953 seconds 17: Time to load utils op: 0.0021545886993408203 seconds 17: Time to load utils op: 0.0022089481353759766 seconds 27: Time to load utils op: 0.000698089599609375 seconds 27: Time to load utils op: 0.0011186599731445312 seconds 27: Time to load utils op: 0.0011601448059082031 seconds 13: Time to load utils op: 0.0010981559753417969 seconds 27: Time to load utils op: 0.0013151168823242188 seconds 27: Time to load utils op: 0.0012581348419189453 seconds 27: Time to load utils op: 0.0012559890747070312 seconds 27: Time to load utils op: 0.0012540817260742188 seconds 27: Time to load utils op: 0.001338958740234375 seconds 13: Time to load utils op: 0.001474142074584961 secondsTime to load utils op: 0.0014505386352539062 secondsTime to load utils op: 0.0014095306396484375 seconds 13: 13: 13: Time to load utils op: 0.001445770263671875 seconds 13: Time to load utils op: 0.001451253890991211 secondsTime to load utils op: 0.0014297962188720703 seconds 13: 13: Time to load utils op: 0.0014374256134033203 seconds 23: Time to load utils op: 0.000701904296875 seconds 23: Time to load utils op: 0.0009419918060302734 seconds 23: Time to load utils op: 0.0009608268737792969 seconds 23: Time to load utils op: 0.001184225082397461 seconds 23: Time to load utils op: 0.0010678768157958984 seconds 23: Time to load utils op: 0.0011360645294189453 secondsTime to load utils op: 0.001123666763305664 seconds 23: 23: Time to load utils op: 0.0011610984802246094 seconds 22: Time to load utils op: 0.0005130767822265625 seconds 22: Time to load utils op: 0.0005164146423339844 seconds 22: Time to load utils op: 0.0005609989166259766 seconds 22: Time to load utils op: 0.0006434917449951172 seconds 22: Time to load utils op: 0.0005812644958496094 secondsTime to load utils op: 0.0005807876586914062 secondsTime to load utils op: 0.000621795654296875 seconds 22: 22: 22: Time to load utils op: 0.0006008148193359375 seconds 31: Time to load utils op: 0.0005486011505126953 seconds 31: Time to load utils op: 0.0005979537963867188 secondsTime to load utils op: 0.0006206035614013672 seconds 31: 31: Time to load utils op: 0.0006582736968994141 seconds 31: Time to load utils op: 0.0006232261657714844 seconds 31: Time to load utils op: 0.0005939006805419922 seconds 31: Time to load utils op: 0.0006356239318847656 seconds 31: Time to load utils op: 0.0006084442138671875 seconds 10: Time to load utils op: 0.0010616779327392578 seconds 11: Time to load utils op: 0.0014655590057373047 seconds 10: Time to load utils op: 0.0012235641479492188 seconds 26: Time to load utils op: 0.0012655258178710938 seconds 30: Time to load utils op: 0.0012128353118896484 seconds 14: Time to load utils op: 0.0014472007751464844 seconds 14: Time to load utils op: 0.0014646053314208984 seconds 26: Time to load utils op: 0.0013988018035888672 seconds 26: Time to load utils op: 0.0014061927795410156 seconds 26: Time to load utils op: 0.0014195442199707031 seconds 26: Time to load utils op: 0.0014281272888183594 seconds 30: Time to load utils op: 0.0014653205871582031 seconds 26: Time to load utils op: 0.0014066696166992188 seconds 26: Time to load utils op: 0.0014600753784179688 seconds 26: Time to load utils op: 0.0014634132385253906 seconds 25: Time to load utils op: 0.0018553733825683594 seconds 25: Time to load utils op: 0.0017879009246826172 seconds 10: Time to load utils op: 0.001708984375 seconds 11: Time to load utils op: 0.0022144317626953125 seconds 30: Time to load utils op: 0.0014982223510742188 secondsTime to load utils op: 0.0015332698822021484 seconds 30: 30: Time to load utils op: 0.0015091896057128906 seconds 30: Time to load utils op: 0.0014691352844238281 seconds 30: Time to load utils op: 0.0015878677368164062 seconds 25: Time to load utils op: 0.00189971923828125 seconds 10: Time to load utils op: 0.00191497802734375 secondsTime to load utils op: 0.0018894672393798828 seconds 10: 10: Time to load utils op: 0.0018754005432128906 seconds 10: Time to load utils op: 0.0018243789672851562 seconds 24: Time to load utils op: 0.001996755599975586 seconds 30: Time to load utils op: 0.0015933513641357422 seconds 24: Time to load utils op: 0.002025127410888672 seconds 11: Time to load utils op: 0.002228260040283203 seconds 10: Time to load utils op: 0.0019388198852539062 seconds 25: Time to load utils op: 0.0020139217376708984 seconds 14: Time to load utils op: 0.0020704269409179688 seconds 14: Time to load utils op: 0.001958608627319336 secondsTime to load utils op: 0.002068758010864258 secondsTime to load utils op: 0.0020651817321777344 seconds 14: Time to load utils op: 0.0019898414611816406 seconds 14: 14: 14: Time to load utils op: 0.0020821094512939453 seconds 11: Time to load utils op: 0.002269744873046875 seconds 11: Time to load utils op: 0.0023827552795410156 secondsTime to load utils op: 0.0022916793823242188 secondsTime to load utils op: 0.0023126602172851562 seconds 11: 11: 11: Time to load utils op: 0.002267122268676758 seconds 25: Time to load utils op: 0.0020406246185302734 secondsTime to load utils op: 0.002025604248046875 secondsTime to load utils op: 0.002009868621826172 seconds 25: 25: 24: Time to load utils op: 0.0022735595703125 seconds 24: Time to load utils op: 0.002346515655517578 seconds 25: Time to load utils op: 0.0020668506622314453 seconds 24: Time to load utils op: 0.002244234085083008 seconds 24: Time to load utils op: 0.0023827552795410156 seconds 24: Time to load utils op: 0.0023195743560791016 seconds 24: Time to load utils op: 0.0023651123046875 seconds 0: [2023-05-10 10:10:45,214] [INFO] [utils.py:827:see_memory_usage] before initializing group 1 0: [2023-05-10 10:10:45,214] [INFO] [utils.py:828:see_memory_usage] MA 10.64 GB Max_MA 10.64 GB CA 13.39 GB Max_CA 13 GB 0: [2023-05-10 10:10:45,215] [INFO] [utils.py:836:see_memory_usage] CPU Virtual Memory: used = 41.18 GB, percent = 8.2% 0: [2023-05-10 10:10:45,322] [INFO] [utils.py:827:see_memory_usage] after initializing group 1 0: [2023-05-10 10:10:45,323] [INFO] [utils.py:828:see_memory_usage] MA 15.73 GB Max_MA 15.73 GB CA 21.01 GB Max_CA 21 GB 0: [2023-05-10 10:10:45,323] [INFO] [utils.py:836:see_memory_usage] CPU Virtual Memory: used = 41.18 GB, percent = 8.2% 0: [2023-05-10 10:10:45,427] [INFO] [utils.py:827:see_memory_usage] before initializing group 2 0: [2023-05-10 10:10:45,427] [INFO] [utils.py:828:see_memory_usage] MA 15.73 GB Max_MA 15.73 GB CA 21.01 GB Max_CA 21 GB 0: [2023-05-10 10:10:45,427] [INFO] [utils.py:836:see_memory_usage] CPU Virtual Memory: used = 41.18 GB, percent = 8.2% 0: [2023-05-10 10:10:45,537] [INFO] [utils.py:827:see_memory_usage] after initializing group 2 0: [2023-05-10 10:10:45,538] [INFO] [utils.py:828:see_memory_usage] MA 15.74 GB Max_MA 15.74 GB CA 21.01 GB Max_CA 21 GB 0: [2023-05-10 10:10:45,538] [INFO] [utils.py:836:see_memory_usage] CPU Virtual Memory: used = 41.18 GB, percent = 8.2% 0: [2023-05-10 10:10:45,640] [INFO] [utils.py:827:see_memory_usage] before initialize_optimizer 0: [2023-05-10 10:10:45,641] [INFO] [utils.py:828:see_memory_usage] MA 15.74 GB Max_MA 15.74 GB CA 21.01 GB Max_CA 21 GB 0: [2023-05-10 10:10:45,641] [INFO] [utils.py:836:see_memory_usage] CPU Virtual Memory: used = 41.18 GB, percent = 8.2% 0: [2023-05-10 10:10:45,747] [INFO] [utils.py:827:see_memory_usage] end initialize_optimizer 0: [2023-05-10 10:10:45,748] [INFO] [utils.py:828:see_memory_usage] MA 15.82 GB Max_MA 15.82 GB CA 21.01 GB Max_CA 21 GB 0: [2023-05-10 10:10:45,748] [INFO] [utils.py:836:see_memory_usage] CPU Virtual Memory: used = 41.18 GB, percent = 8.2% 0: [2023-05-10 10:10:45,850] [INFO] [utils.py:827:see_memory_usage] end bf16_optimizer 0: [2023-05-10 10:10:45,850] [INFO] [utils.py:828:see_memory_usage] MA 15.82 GB Max_MA 15.82 GB CA 21.01 GB Max_CA 21 GB 0: [2023-05-10 10:10:45,851] [INFO] [utils.py:836:see_memory_usage] CPU Virtual Memory: used = 41.18 GB, percent = 8.2% 0: [2023-05-10 10:10:45,851] [INFO] [logging.py:68:log_dist] [Rank 0] DeepSpeed Final Optimizer = FusedAdam 0: [2023-05-10 10:10:45,851] [INFO] [logging.py:68:log_dist] [Rank 0] DeepSpeed using client LR scheduler 0: [2023-05-10 10:10:45,851] [INFO] [logging.py:68:log_dist] [Rank 0] DeepSpeed LR Scheduler = 0: [2023-05-10 10:10:45,851] [INFO] [logging.py:68:log_dist] [Rank 0] step=0, skipped=0, lr=[0.0002, 0.0002, 0.0002], mom=[(0.9, 0.999), (0.9, 0.999), (0.9, 0.999)] 0: [2023-05-10 10:10:45,852] [INFO] [config.py:1007:print] DeepSpeedEngine configuration: 0: [2023-05-10 10:10:45,852] [INFO] [config.py:1011:print] activation_checkpointing_config { 0: "partition_activations": false, 0: "contiguous_memory_optimization": false, 0: "cpu_checkpointing": false, 0: "number_checkpoints": null, 0: "synchronize_checkpoint_boundary": false, 0: "profile": false 0: } 0: [2023-05-10 10:10:45,852] [INFO] [config.py:1011:print] aio_config ................... {'block_size': 1048576, 'queue_depth': 8, 'thread_count': 1, 'single_submit': False, 'overlap_events': True} 0: [2023-05-10 10:10:45,852] [INFO] [config.py:1011:print] amp_enabled .................. False 0: [2023-05-10 10:10:45,852] [INFO] [config.py:1011:print] amp_params ................... False 0: [2023-05-10 10:10:45,852] [INFO] [config.py:1011:print] autotuning_config ............ { 0: "enabled": false, 0: "start_step": null, 0: "end_step": null, 0: "metric_path": null, 0: "arg_mappings": null, 0: "metric": "throughput", 0: "model_info": null, 0: "results_dir": "/pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/autotuning_results", 0: "exps_dir": "/pfs/lustrep4/scratch/project_462000119/muennighoff/nov-2022-bettercom/autotuning_exps", 0: "overwrite": true, 0: "fast": true, 0: "start_profile_step": 3, 0: "end_profile_step": 5, 0: "tuner_type": "gridsearch", 0: "tuner_early_stopping": 5, 0: "tuner_num_trials": 50, 0: "model_info_path": null, 0: "mp_size": 1, 0: "max_train_batch_size": null, 0: "min_train_batch_size": 1, 0: "max_train_micro_batch_size_per_gpu": 1.024000e+03, 0: "min_train_micro_batch_size_per_gpu": 1, 0: "num_tuning_micro_batch_sizes": 3 0: } 0: [2023-05-10 10:10:45,852] [INFO] [config.py:1011:print] bfloat16_enabled ............. True 0: [2023-05-10 10:10:45,852] [INFO] [config.py:1011:print] checkpoint_parallel_write_pipeline False 0: [2023-05-10 10:10:45,852] [INFO] [config.py:1011:print] checkpoint_tag_validation_enabled True 0: [2023-05-10 10:10:45,852] [INFO] [config.py:1011:print] checkpoint_tag_validation_fail False 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] comms_config ................. 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] communication_data_type ...... None 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] compression_config ........... {'weight_quantization': {'shared_parameters': {'enabled': False, 'quantizer_kernel': False, 'schedule_offset': 0, 'quantize_groups': 1, 'quantize_verbose': False, 'quantization_type': 'symmetric', 'quantize_weight_in_forward': False, 'rounding': 'nearest', 'fp16_mixed_quantize': False, 'quantize_change_ratio': 0.001}, 'different_groups': {}}, 'activation_quantization': {'shared_parameters': {'enabled': False, 'quantization_type': 'symmetric', 'range_calibration': 'dynamic', 'schedule_offset': 1000}, 'different_groups': {}}, 'sparse_pruning': {'shared_parameters': {'enabled': False, 'method': 'l1', 'schedule_offset': 1000}, 'different_groups': {}}, 'row_pruning': {'shared_parameters': {'enabled': False, 'method': 'l1', 'schedule_offset': 1000}, 'different_groups': {}}, 'head_pruning': {'shared_parameters': {'enabled': False, 'method': 'topk', 'schedule_offset': 1000}, 'different_groups': {}}, 'channel_pruning': {'shared_pa 0: rameters': {'enabled': False, 'method': 'l1', 'schedule_offset': 1000}, 'different_groups': {}}, 'layer_reduction': {'enabled': False}} 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] curriculum_enabled ........... False 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] curriculum_params ............ False 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] dataloader_drop_last ......... False 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] disable_allgather ............ False 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] dump_state ................... False 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] dynamic_loss_scale_args ...... None 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] eigenvalue_enabled ........... False 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] eigenvalue_gas_boundary_resolution 1 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] eigenvalue_layer_name ........ bert.encoder.layer 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] eigenvalue_layer_num ......... 0 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] eigenvalue_max_iter .......... 100 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] eigenvalue_stability ......... 1e-06 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] eigenvalue_tol ............... 0.01 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] eigenvalue_verbose ........... False 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] elasticity_enabled ........... False 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] flops_profiler_config ........ { 0: "enabled": false, 0: "profile_step": 1, 0: "module_depth": -1, 0: "top_modules": 1, 0: "detailed": true, 0: "output_file": null 0: } 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] fp16_auto_cast ............... None 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] fp16_enabled ................. False 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] fp16_master_weights_and_gradients False 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] global_rank .................. 0 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] gradient_accumulation_steps .. 1 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] gradient_clipping ............ 1.0 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] gradient_predivide_factor .... 1.0 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] initial_dynamic_scale ........ 1 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] load_universal_checkpoint .... False 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] loss_scale ................... 1.0 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] memory_breakdown ............. False 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] monitor_config ............... 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] nebula_config ................ { 0: "enabled": false, 0: "persistent_storage_path": null, 0: "persistent_time_interval": 100, 0: "num_of_version_in_retention": 2, 0: "enable_nebula_load": true, 0: "load_path": null 0: } 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] optimizer_legacy_fusion ...... False 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] optimizer_name ............... None 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] optimizer_params ............. None 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] pipeline ..................... {'stages': 'auto', 'partition': 'best', 'seed_layers': False, 'activation_checkpoint_interval': 0} 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] pld_enabled .................. False 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] pld_params ................... False 0: [2023-05-10 10:10:45,853] [INFO] [config.py:1011:print] prescale_gradients ........... False 0: [2023-05-10 10:10:45,854] [INFO] [config.py:1011:print] scheduler_name ............... None 0: [2023-05-10 10:10:45,854] [INFO] [config.py:1011:print] scheduler_params ............. None 0: [2023-05-10 10:10:45,854] [INFO] [config.py:1011:print] sparse_attention ............. None 0: [2023-05-10 10:10:45,854] [INFO] [config.py:1011:print] sparse_gradients_enabled ..... False 0: [2023-05-10 10:10:45,854] [INFO] [config.py:1011:print] steps_per_print .............. 2000 0: [2023-05-10 10:10:45,854] [INFO] [config.py:1011:print] train_batch_size ............. 512 0: [2023-05-10 10:10:45,854] [INFO] [config.py:1011:print] train_micro_batch_size_per_gpu 2 0: [2023-05-10 10:10:45,854] [INFO] [config.py:1011:print] use_node_local_storage ....... False 0: [2023-05-10 10:10:45,854] [INFO] [config.py:1011:print] wall_clock_breakdown ......... False 0: [2023-05-10 10:10:45,854] [INFO] [config.py:1011:print] world_size ................... 256 0: [2023-05-10 10:10:45,854] [INFO] [config.py:1011:print] zero_allow_untested_optimizer False 0: [2023-05-10 10:10:45,854] [INFO] [config.py:1011:print] zero_config .................. stage=0 contiguous_gradients=True reduce_scatter=True reduce_bucket_size=500000000 allgather_partitions=True allgather_bucket_size=500000000 overlap_comm=False load_from_fp32_weights=True elastic_checkpoint=False offload_param=None offload_optimizer=None sub_group_size=1000000000 cpu_offload_param=None cpu_offload_use_pin_memory=None cpu_offload=None prefetch_bucket_size=50000000 param_persistence_threshold=100000 model_persistence_threshold=9223372036854775807 max_live_parameters=1000000000 max_reuse_distance=1000000000 gather_16bit_weights_on_model_save=False stage3_gather_fp16_weights_on_model_save=False ignore_unused_parameters=True legacy_stage1=False round_robin_gradients=False 0: [2023-05-10 10:10:45,854] [INFO] [config.py:1011:print] zero_enabled ................. False 0: [2023-05-10 10:10:45,854] [INFO] [config.py:1011:print] zero_optimization_stage ...... 0 0: [2023-05-10 10:10:45,854] [INFO] [config.py:996:print_user_config] json = { 0: "train_micro_batch_size_per_gpu": 2, 0: "train_batch_size": 512, 0: "gradient_clipping": 1.0, 0: "zero_optimization": { 0: "stage": 0 0: }, 0: "bf16": { 0: "enabled": true 0: }, 0: "steps_per_print": 2.000000e+03, 0: "wall_clock_breakdown": false 0: } 0: Time to load utils op: 0.0004394054412841797 seconds 0: [2023-05-10 10:10:45,854] [INFO] [engine.py:87:__init__] CONFIG: micro_batches=1 micro_batch_size=2 0: [2023-05-10 10:10:45,927] [INFO] [engine.py:145:__init__] RANK=0 STAGE=0 LAYERS=41 [0, 41) STAGE_PARAMS=2809026560 (2809.027M) TOTAL_PARAMS=2809026560 (2809.027M) UNIQUE_PARAMS=2809026560 (2809.027M) 16: [2023-05-10 10:10:45,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 16: [2023-05-10 10:10:45,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 16: [2023-05-10 10:10:45,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 16: [2023-05-10 10:10:45,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 16: [2023-05-10 10:10:45,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 26: [2023-05-10 10:10:45,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 26: [2023-05-10 10:10:45,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 16: [2023-05-10 10:10:45,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 16: [2023-05-10 10:10:45,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 16: [2023-05-10 10:10:45,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 19: [2023-05-10 10:10:45,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 19: [2023-05-10 10:10:45,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 28: [2023-05-10 10:10:45,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 28: [2023-05-10 10:10:45,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 28: [2023-05-10 10:10:45,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 28: [2023-05-10 10:10:45,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 28: [2023-05-10 10:10:45,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 28: [2023-05-10 10:10:45,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 19: [2023-05-10 10:10:45,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 19: [2023-05-10 10:10:45,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 28: [2023-05-10 10:10:45,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 23: [2023-05-10 10:10:45,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 23: [2023-05-10 10:10:45,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 23: [2023-05-10 10:10:45,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 26: [2023-05-10 10:10:45,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 19: [2023-05-10 10:10:45,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 23: [2023-05-10 10:10:45,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 28: [2023-05-10 10:10:45,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 26: [2023-05-10 10:10:45,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 26: [2023-05-10 10:10:45,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 26: [2023-05-10 10:10:45,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 26: [2023-05-10 10:10:45,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 19: [2023-05-10 10:10:45,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 22: [2023-05-10 10:10:45,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 22: [2023-05-10 10:10:45,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 22: [2023-05-10 10:10:45,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 22: [2023-05-10 10:10:45,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 22: [2023-05-10 10:10:45,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 23: [2023-05-10 10:10:45,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 22: [2023-05-10 10:10:45,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 23: [2023-05-10 10:10:45,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 31: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 31: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 31: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 31: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 31: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 8: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 8: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 8: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 8: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 31: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 26: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 20: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 3: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 4: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 4: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 19: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 31: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 27: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 3: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 5: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 7: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 8: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 8: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 27: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 3: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 3: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 23: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 27: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 27: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 27: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 27: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 3: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 3: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 19: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 15: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 18: [2023-05-10 10:10:45,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 5: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 5: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 24: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 7: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 21: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 21: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 27: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 5: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 7: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 24: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 24: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 24: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 24: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 4: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 31: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 24: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 24: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 20: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 20: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 20: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 0: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 5: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 5: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 7: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 21: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 20: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 20: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 5: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 8: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 0: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 0: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 0: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 0: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 4: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 21: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 0: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 0: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 4: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 4: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 8: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 3: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 4: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 18: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 18: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 0: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 22: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 18: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 20: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 27: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 24: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 21: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 18: [2023-05-10 10:10:45,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 5: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 21: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 21: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 7: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 23: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 13: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 3: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 25: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 25: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 25: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 25: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 25: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 25: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 25: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 13: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 13: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 13: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 14: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 14: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 14: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 14: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 14: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 14: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 14: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 13: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 6: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 7: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 7: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 6: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 4: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 13: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 6: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 6: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 20: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 13: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 6: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 6: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 6: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 29: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 29: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 29: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 29: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 7: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 29: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 29: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 21: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 29: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 1: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 1: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 1: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 1: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 1: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 25: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 29: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 14: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 30: [2023-05-10 10:10:45,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 15: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 1: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 13: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 22: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 30: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 30: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 30: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 30: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 30: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 1: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 11: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 11: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 11: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 11: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 11: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 11: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 11: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 30: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 2: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 2: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 2: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 2: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 2: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 9: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 9: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 9: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 9: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 9: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 2: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 2: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 9: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 9: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 11: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 9: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 30: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 1: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 2: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 17: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 17: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 17: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 17: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 17: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 18: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 17: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 17: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 18: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 17: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 12: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 18: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 10: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 10: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 10: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 10: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 10: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 15: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 15: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 10: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 12: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 12: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 12: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 12: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 12: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 10: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 12: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 15: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 15: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 10: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 15: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 12: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 15: [2023-05-10 10:10:45,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 6: [2023-05-10 10:10:45,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 16: [2023-05-10 10:10:45,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 16: [2023-05-10 10:10:45,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 16: [2023-05-10 10:10:45,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 16: [2023-05-10 10:10:45,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 16: [2023-05-10 10:10:45,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 16: [2023-05-10 10:10:45,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 16: [2023-05-10 10:10:45,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 16: [2023-05-10 10:10:45,990] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 16: [2023-05-10 10:10:45,990] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 16: [2023-05-10 10:10:45,990] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 16: [2023-05-10 10:10:45,990] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 16: [2023-05-10 10:10:45,990] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 16: [2023-05-10 10:10:45,990] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 16: [2023-05-10 10:10:45,990] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 16: [2023-05-10 10:10:45,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 16: [2023-05-10 10:10:45,990] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 4: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 4: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 4: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 4: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 4: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 26: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 26: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 26: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 26: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 26: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 19: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 19: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 4: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 4: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 26: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 26: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 4: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 4: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 19: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 26: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 26: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 26: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 4: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 19: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 19: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 19: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 19: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 26: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 26: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 4: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 4: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 16: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 19: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 19: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 26: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 19: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 19: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 19: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 19: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 19: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 26: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 4: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 4: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 4: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 26: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 4: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 19: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 26: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 16: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 19: [2023-05-10 10:10:45,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 16: [2023-05-10 10:10:45,992] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 16: [2023-05-10 10:10:45,992] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 16: [2023-05-10 10:10:45,992] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 16: [2023-05-10 10:10:45,992] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 16: [2023-05-10 10:10:45,992] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 16: [2023-05-10 10:10:45,992] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 16: [2023-05-10 10:10:45,992] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 16: [2023-05-10 10:10:45,992] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 16: [2023-05-10 10:10:45,992] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 16: [2023-05-10 10:10:45,992] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 16: [2023-05-10 10:10:45,992] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 16: [2023-05-10 10:10:45,992] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 4: [2023-05-10 10:10:45,992] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 26: [2023-05-10 10:10:45,992] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 16: [2023-05-10 10:10:45,992] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 4: [2023-05-10 10:10:45,992] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 19: [2023-05-10 10:10:45,992] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 4: [2023-05-10 10:10:45,992] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 16: [2023-05-10 10:10:45,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 26: [2023-05-10 10:10:45,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 4: [2023-05-10 10:10:45,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 26: [2023-05-10 10:10:45,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 4: [2023-05-10 10:10:45,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 19: [2023-05-10 10:10:45,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 4: [2023-05-10 10:10:45,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 4: [2023-05-10 10:10:45,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 19: [2023-05-10 10:10:45,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 4: [2023-05-10 10:10:45,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 26: [2023-05-10 10:10:45,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 19: [2023-05-10 10:10:45,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 4: [2023-05-10 10:10:45,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 19: [2023-05-10 10:10:45,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 26: [2023-05-10 10:10:45,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 4: [2023-05-10 10:10:45,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 19: [2023-05-10 10:10:45,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 26: [2023-05-10 10:10:45,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 4: [2023-05-10 10:10:45,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 26: [2023-05-10 10:10:45,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 19: [2023-05-10 10:10:45,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 26: [2023-05-10 10:10:45,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 4: [2023-05-10 10:10:45,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 4: [2023-05-10 10:10:45,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 26: [2023-05-10 10:10:45,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 19: [2023-05-10 10:10:45,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 19: [2023-05-10 10:10:45,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 26: [2023-05-10 10:10:45,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 26: [2023-05-10 10:10:45,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 19: [2023-05-10 10:10:45,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 4: [2023-05-10 10:10:45,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 19: [2023-05-10 10:10:45,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 19: [2023-05-10 10:10:45,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 26: [2023-05-10 10:10:45,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 4: [2023-05-10 10:10:45,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 19: [2023-05-10 10:10:45,994] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 26: [2023-05-10 10:10:45,994] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 19: [2023-05-10 10:10:45,994] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 26: [2023-05-10 10:10:45,994] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 26: [2023-05-10 10:10:45,994] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 4: [2023-05-10 10:10:45,994] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 19: [2023-05-10 10:10:45,994] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 26: [2023-05-10 10:10:45,994] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 19: [2023-05-10 10:10:45,994] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 9: [2023-05-10 10:10:46,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 9: [2023-05-10 10:10:46,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 9: [2023-05-10 10:10:46,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 9: [2023-05-10 10:10:46,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 9: [2023-05-10 10:10:46,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 9: [2023-05-10 10:10:46,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 9: [2023-05-10 10:10:46,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 9: [2023-05-10 10:10:46,025] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 9: [2023-05-10 10:10:46,025] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 9: [2023-05-10 10:10:46,025] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 9: [2023-05-10 10:10:46,025] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 9: [2023-05-10 10:10:46,025] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 9: [2023-05-10 10:10:46,025] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 9: [2023-05-10 10:10:46,025] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 9: [2023-05-10 10:10:46,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 9: [2023-05-10 10:10:46,025] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 9: [2023-05-10 10:10:46,026] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 9: [2023-05-10 10:10:46,026] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 9: [2023-05-10 10:10:46,026] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 9: [2023-05-10 10:10:46,027] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 9: [2023-05-10 10:10:46,027] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 9: [2023-05-10 10:10:46,027] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 9: [2023-05-10 10:10:46,027] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 9: [2023-05-10 10:10:46,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 9: [2023-05-10 10:10:46,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 9: [2023-05-10 10:10:46,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 9: [2023-05-10 10:10:46,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 9: [2023-05-10 10:10:46,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 9: [2023-05-10 10:10:46,027] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 9: [2023-05-10 10:10:46,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 9: [2023-05-10 10:10:46,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 9: [2023-05-10 10:10:46,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 28: [2023-05-10 10:10:46,060] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 28: [2023-05-10 10:10:46,060] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 28: [2023-05-10 10:10:46,060] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 28: [2023-05-10 10:10:46,060] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 28: [2023-05-10 10:10:46,060] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 28: [2023-05-10 10:10:46,060] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 28: [2023-05-10 10:10:46,060] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 28: [2023-05-10 10:10:46,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 28: [2023-05-10 10:10:46,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 28: [2023-05-10 10:10:46,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 28: [2023-05-10 10:10:46,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 28: [2023-05-10 10:10:46,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 28: [2023-05-10 10:10:46,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 28: [2023-05-10 10:10:46,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 28: [2023-05-10 10:10:46,061] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 28: [2023-05-10 10:10:46,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 28: [2023-05-10 10:10:46,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 20: [2023-05-10 10:10:46,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 20: [2023-05-10 10:10:46,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 20: [2023-05-10 10:10:46,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 28: [2023-05-10 10:10:46,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 20: [2023-05-10 10:10:46,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 20: [2023-05-10 10:10:46,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 20: [2023-05-10 10:10:46,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 28: [2023-05-10 10:10:46,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 20: [2023-05-10 10:10:46,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 20: [2023-05-10 10:10:46,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 20: [2023-05-10 10:10:46,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 20: [2023-05-10 10:10:46,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 20: [2023-05-10 10:10:46,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 28: [2023-05-10 10:10:46,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 20: [2023-05-10 10:10:46,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 20: [2023-05-10 10:10:46,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 20: [2023-05-10 10:10:46,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 20: [2023-05-10 10:10:46,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 28: [2023-05-10 10:10:46,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 28: [2023-05-10 10:10:46,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 20: [2023-05-10 10:10:46,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 28: [2023-05-10 10:10:46,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 28: [2023-05-10 10:10:46,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 28: [2023-05-10 10:10:46,063] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 28: [2023-05-10 10:10:46,063] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 28: [2023-05-10 10:10:46,063] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 28: [2023-05-10 10:10:46,063] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 28: [2023-05-10 10:10:46,063] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 28: [2023-05-10 10:10:46,063] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 28: [2023-05-10 10:10:46,063] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 28: [2023-05-10 10:10:46,063] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 20: [2023-05-10 10:10:46,063] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 20: [2023-05-10 10:10:46,063] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 12: [2023-05-10 10:10:46,063] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 12: [2023-05-10 10:10:46,063] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 12: [2023-05-10 10:10:46,063] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 12: [2023-05-10 10:10:46,063] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 12: [2023-05-10 10:10:46,063] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 12: [2023-05-10 10:10:46,063] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 12: [2023-05-10 10:10:46,063] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 20: [2023-05-10 10:10:46,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 12: [2023-05-10 10:10:46,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 12: [2023-05-10 10:10:46,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 12: [2023-05-10 10:10:46,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 12: [2023-05-10 10:10:46,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 12: [2023-05-10 10:10:46,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 12: [2023-05-10 10:10:46,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 12: [2023-05-10 10:10:46,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 12: [2023-05-10 10:10:46,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 20: [2023-05-10 10:10:46,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 20: [2023-05-10 10:10:46,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 12: [2023-05-10 10:10:46,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 20: [2023-05-10 10:10:46,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 20: [2023-05-10 10:10:46,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 20: [2023-05-10 10:10:46,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 20: [2023-05-10 10:10:46,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 20: [2023-05-10 10:10:46,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 20: [2023-05-10 10:10:46,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 20: [2023-05-10 10:10:46,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 20: [2023-05-10 10:10:46,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 20: [2023-05-10 10:10:46,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 20: [2023-05-10 10:10:46,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 20: [2023-05-10 10:10:46,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 23: [2023-05-10 10:10:46,065] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 23: [2023-05-10 10:10:46,065] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 23: [2023-05-10 10:10:46,065] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 23: [2023-05-10 10:10:46,065] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 23: [2023-05-10 10:10:46,065] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 23: [2023-05-10 10:10:46,065] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 23: [2023-05-10 10:10:46,065] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 23: [2023-05-10 10:10:46,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 23: [2023-05-10 10:10:46,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 23: [2023-05-10 10:10:46,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 23: [2023-05-10 10:10:46,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 23: [2023-05-10 10:10:46,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 23: [2023-05-10 10:10:46,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 23: [2023-05-10 10:10:46,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 23: [2023-05-10 10:10:46,065] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 23: [2023-05-10 10:10:46,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 27: [2023-05-10 10:10:46,065] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 27: [2023-05-10 10:10:46,065] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 27: [2023-05-10 10:10:46,065] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 27: [2023-05-10 10:10:46,065] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 27: [2023-05-10 10:10:46,065] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 27: [2023-05-10 10:10:46,065] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 27: [2023-05-10 10:10:46,065] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 27: [2023-05-10 10:10:46,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 27: [2023-05-10 10:10:46,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 27: [2023-05-10 10:10:46,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 27: [2023-05-10 10:10:46,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 27: [2023-05-10 10:10:46,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 27: [2023-05-10 10:10:46,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 27: [2023-05-10 10:10:46,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 27: [2023-05-10 10:10:46,065] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 27: [2023-05-10 10:10:46,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 12: [2023-05-10 10:10:46,066] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 23: [2023-05-10 10:10:46,066] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 14: [2023-05-10 10:10:46,066] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 14: [2023-05-10 10:10:46,066] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 14: [2023-05-10 10:10:46,066] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 14: [2023-05-10 10:10:46,066] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 14: [2023-05-10 10:10:46,066] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 14: [2023-05-10 10:10:46,066] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 14: [2023-05-10 10:10:46,066] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 12: [2023-05-10 10:10:46,066] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 14: [2023-05-10 10:10:46,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 14: [2023-05-10 10:10:46,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 14: [2023-05-10 10:10:46,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 14: [2023-05-10 10:10:46,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 14: [2023-05-10 10:10:46,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 14: [2023-05-10 10:10:46,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 14: [2023-05-10 10:10:46,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 12: [2023-05-10 10:10:46,066] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 23: [2023-05-10 10:10:46,066] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 14: [2023-05-10 10:10:46,066] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 14: [2023-05-10 10:10:46,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 23: [2023-05-10 10:10:46,066] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 23: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 23: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 23: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 12: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 12: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 23: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 27: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 3: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 12: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 23: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 24: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 24: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 24: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 24: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 24: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 3: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 3: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 3: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 3: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 3: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 12: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 12: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 23: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 3: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 23: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 24: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 3: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 24: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 3: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 24: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 24: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 24: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 24: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 3: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 3: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 3: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 8: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 8: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 8: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 8: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 12: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 23: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 27: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 3: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 24: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 24: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 3: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 8: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 8: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 8: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 23: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 27: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 24: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 3: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 8: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 8: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 8: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 8: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 24: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 3: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 11: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 11: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 11: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 11: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 11: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 12: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 23: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 27: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 24: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 8: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 11: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 27: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 8: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 8: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 11: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 23: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 8: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 11: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 11: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 11: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 11: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 11: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 12: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 23: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 8: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 11: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 11: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 11: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 23: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 27: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 27: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 11: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 27: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 27: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 14: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 14: [2023-05-10 10:10:46,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 14: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 12: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 12: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 27: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 27: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 27: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 12: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 12: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 14: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 14: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 27: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 12: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 27: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 14: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 27: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 14: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 22: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 22: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 27: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 14: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 14: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 22: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 22: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 22: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 22: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 22: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 22: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 22: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 22: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 14: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 14: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 22: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 22: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 22: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 22: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 13: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 13: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 13: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 13: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 13: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 22: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 14: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 24: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 13: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 13: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 14: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 22: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 21: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 21: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 21: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 21: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 13: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 13: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 13: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 24: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 13: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 13: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 11: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 21: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 21: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 21: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 31: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 31: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 13: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 13: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 14: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 21: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 21: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 21: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 31: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 31: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 31: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 31: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 24: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 13: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 14: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 31: [2023-05-10 10:10:46,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 11: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 31: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 31: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 24: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 8: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 21: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 21: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 21: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 21: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 13: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 31: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 31: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 31: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 31: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 31: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 24: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 14: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 31: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 8: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 21: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 24: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 21: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 31: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 30: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 30: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 3: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 8: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 11: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 11: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 24: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 11: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 30: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 30: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 30: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 30: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 30: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 8: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 8: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 30: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 30: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 24: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 11: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 30: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 30: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 30: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 8: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 30: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 30: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 24: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 3: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 11: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 3: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 30: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 24: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 8: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 8: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 11: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 3: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 11: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 29: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 29: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 29: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 29: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 29: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 30: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 24: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 24: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 22: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 8: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 11: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 29: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 29: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 24: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 3: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 3: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 29: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 29: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 0: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 0: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 0: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 0: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 8: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 24: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 0: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 0: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 3: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 29: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 29: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 0: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 8: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 29: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 24: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 3: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 8: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 29: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 29: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 22: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 0: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 0: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 0: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 0: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 11: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 29: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 0: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 0: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 3: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 15: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 15: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 15: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 0: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 0: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 8: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 8: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 11: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 29: [2023-05-10 10:10:46,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 24: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 3: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 8: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 8: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 11: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 13: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 15: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 22: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 15: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 15: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 15: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 0: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 3: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 11: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 11: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 11: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 31: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 15: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 15: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 15: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 7: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 7: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 7: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 7: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 7: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 25: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 25: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 25: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 21: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 13: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 22: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 15: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 15: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 15: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 3: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 7: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 25: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 25: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 21: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 22: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 15: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 3: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 3: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 7: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 25: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 25: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 31: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 31: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 13: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 15: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 3: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 25: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 25: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 22: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 7: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 7: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 25: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 30: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 13: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 15: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 7: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 7: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 25: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 21: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 22: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 22: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 7: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 31: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 31: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 13: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 3: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 7: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 7: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 17: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 17: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 17: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 25: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 25: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 25: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 21: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 5: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 5: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 5: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 5: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 5: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 7: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 17: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 17: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 17: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 25: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 31: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 30: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 22: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 17: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 25: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 21: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 13: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 5: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 5: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 7: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 31: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 31: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 22: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 5: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 5: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 17: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 17: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 21: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 30: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 13: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 22: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 5: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 5: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 17: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 17: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 17: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 21: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 21: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 31: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 13: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 17: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 17: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 30: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 5: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 5: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 5: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 17: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 31: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 13: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 22: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 5: [2023-05-10 10:10:46,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 13: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 6: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 6: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 6: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 6: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 6: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 17: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 21: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 31: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 5: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 30: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 22: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 13: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 13: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 22: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 30: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 21: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 21: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 21: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 21: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 30: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 22: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 22: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 30: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 13: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 30: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 30: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 21: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 21: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 31: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 13: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 30: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 21: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 31: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 31: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 31: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 13: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 31: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 13: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 30: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 29: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 25: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 15: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 15: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 29: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 29: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 25: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 29: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 30: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 29: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 15: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 15: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 25: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 30: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 15: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 7: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 29: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 25: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 30: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 7: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 29: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 15: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 15: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 15: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 29: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 30: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 25: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 25: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 7: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 17: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 15: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 1: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 1: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 1: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 1: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 25: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 29: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 29: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 1: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 1: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 1: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 25: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 15: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 29: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 17: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 25: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 7: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 29: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 15: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 1: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 1: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 1: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 1: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 1: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 5: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 25: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 25: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 7: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 29: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 15: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 1: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 1: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 1: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 7: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 25: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 17: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 15: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 0: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 1: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 7: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 29: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 5: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 7: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 15: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 17: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 25: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 7: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 17: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 17: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 29: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 15: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 7: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 0: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 29: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 7: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 17: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 17: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 7: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 15: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 5: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 7: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 17: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 5: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 5: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 7: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 17: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 25: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 25: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 0: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 5: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 7: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 0: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 5: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 7: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 25: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 17: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 17: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 5: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 5: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 17: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 0: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 17: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 17: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 0: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 0: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 5: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 17: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 0: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 5: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 0: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 0: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 5: [2023-05-10 10:10:46,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 5: [2023-05-10 10:10:46,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 5: [2023-05-10 10:10:46,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 5: [2023-05-10 10:10:46,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 5: [2023-05-10 10:10:46,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 0: [2023-05-10 10:10:46,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 0: [2023-05-10 10:10:46,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 0: [2023-05-10 10:10:46,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 0: [2023-05-10 10:10:46,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 0: [2023-05-10 10:10:46,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 0: [2023-05-10 10:10:46,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 1: [2023-05-10 10:10:46,075] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 1: [2023-05-10 10:10:46,075] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 1: [2023-05-10 10:10:46,075] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 1: [2023-05-10 10:10:46,075] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 1: [2023-05-10 10:10:46,075] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 1: [2023-05-10 10:10:46,075] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 1: [2023-05-10 10:10:46,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 1: [2023-05-10 10:10:46,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 1: [2023-05-10 10:10:46,076] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 1: [2023-05-10 10:10:46,076] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 1: [2023-05-10 10:10:46,076] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 1: [2023-05-10 10:10:46,076] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 1: [2023-05-10 10:10:46,076] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 1: [2023-05-10 10:10:46,076] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 1: [2023-05-10 10:10:46,077] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 1: [2023-05-10 10:10:46,077] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 2: [2023-05-10 10:10:46,078] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 2: [2023-05-10 10:10:46,078] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 2: [2023-05-10 10:10:46,078] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 2: [2023-05-10 10:10:46,078] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 2: [2023-05-10 10:10:46,078] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 2: [2023-05-10 10:10:46,078] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 2: [2023-05-10 10:10:46,078] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 2: [2023-05-10 10:10:46,078] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 2: [2023-05-10 10:10:46,078] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 2: [2023-05-10 10:10:46,078] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 2: [2023-05-10 10:10:46,078] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 2: [2023-05-10 10:10:46,078] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 2: [2023-05-10 10:10:46,078] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 2: [2023-05-10 10:10:46,078] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 2: [2023-05-10 10:10:46,078] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 2: [2023-05-10 10:10:46,078] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 2: [2023-05-10 10:10:46,080] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 2: [2023-05-10 10:10:46,080] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 2: [2023-05-10 10:10:46,080] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 2: [2023-05-10 10:10:46,080] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 2: [2023-05-10 10:10:46,080] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 2: [2023-05-10 10:10:46,080] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 2: [2023-05-10 10:10:46,080] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 2: [2023-05-10 10:10:46,080] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 2: [2023-05-10 10:10:46,080] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 2: [2023-05-10 10:10:46,080] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 2: [2023-05-10 10:10:46,081] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 2: [2023-05-10 10:10:46,081] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 2: [2023-05-10 10:10:46,081] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 2: [2023-05-10 10:10:46,081] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 2: [2023-05-10 10:10:46,081] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 2: [2023-05-10 10:10:46,081] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 18: [2023-05-10 10:10:46,087] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 18: [2023-05-10 10:10:46,087] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 18: [2023-05-10 10:10:46,087] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 18: [2023-05-10 10:10:46,087] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 18: [2023-05-10 10:10:46,087] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 18: [2023-05-10 10:10:46,087] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 18: [2023-05-10 10:10:46,087] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 18: [2023-05-10 10:10:46,087] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 18: [2023-05-10 10:10:46,087] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 18: [2023-05-10 10:10:46,087] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 18: [2023-05-10 10:10:46,087] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 18: [2023-05-10 10:10:46,087] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 18: [2023-05-10 10:10:46,087] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 18: [2023-05-10 10:10:46,087] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 18: [2023-05-10 10:10:46,087] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 18: [2023-05-10 10:10:46,087] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 10: [2023-05-10 10:10:46,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 10: [2023-05-10 10:10:46,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 10: [2023-05-10 10:10:46,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 10: [2023-05-10 10:10:46,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 10: [2023-05-10 10:10:46,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 10: [2023-05-10 10:10:46,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 10: [2023-05-10 10:10:46,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 10: [2023-05-10 10:10:46,089] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 10: [2023-05-10 10:10:46,089] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 10: [2023-05-10 10:10:46,089] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 10: [2023-05-10 10:10:46,089] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 10: [2023-05-10 10:10:46,089] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 10: [2023-05-10 10:10:46,089] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 10: [2023-05-10 10:10:46,089] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 10: [2023-05-10 10:10:46,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 10: [2023-05-10 10:10:46,089] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 18: [2023-05-10 10:10:46,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 18: [2023-05-10 10:10:46,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 18: [2023-05-10 10:10:46,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 18: [2023-05-10 10:10:46,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 18: [2023-05-10 10:10:46,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 18: [2023-05-10 10:10:46,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 18: [2023-05-10 10:10:46,089] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 18: [2023-05-10 10:10:46,089] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 18: [2023-05-10 10:10:46,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 18: [2023-05-10 10:10:46,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 10: [2023-05-10 10:10:46,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 18: [2023-05-10 10:10:46,090] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 10: [2023-05-10 10:10:46,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 18: [2023-05-10 10:10:46,090] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 18: [2023-05-10 10:10:46,090] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 10: [2023-05-10 10:10:46,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 18: [2023-05-10 10:10:46,090] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 10: [2023-05-10 10:10:46,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 10: [2023-05-10 10:10:46,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 18: [2023-05-10 10:10:46,090] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 10: [2023-05-10 10:10:46,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 18: [2023-05-10 10:10:46,090] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 10: [2023-05-10 10:10:46,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 10: [2023-05-10 10:10:46,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 10: [2023-05-10 10:10:46,091] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 10: [2023-05-10 10:10:46,091] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 10: [2023-05-10 10:10:46,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 10: [2023-05-10 10:10:46,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 10: [2023-05-10 10:10:46,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 10: [2023-05-10 10:10:46,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 10: [2023-05-10 10:10:46,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 10: [2023-05-10 10:10:46,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 6: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 6: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 6: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 6: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 6: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 6: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 6: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 6: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 6: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 6: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 6: [2023-05-10 10:10:46,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt... 6: [2023-05-10 10:10:46,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 6: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 6: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 6: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 6: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 6: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 6: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 6: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 6: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/mp_rank_00_model_states.pt. 6: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 6: [2023-05-10 10:10:46,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 6: [2023-05-10 10:10:46,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 6: [2023-05-10 10:10:46,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 6: [2023-05-10 10:10:46,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 6: [2023-05-10 10:10:46,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 6: [2023-05-10 10:10:46,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 23: [2023-05-10 10:10:46,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 23: [2023-05-10 10:10:46,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 23: [2023-05-10 10:10:46,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 23: [2023-05-10 10:10:46,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 23: [2023-05-10 10:10:46,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 23: [2023-05-10 10:10:46,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 23: [2023-05-10 10:10:46,605] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 23: [2023-05-10 10:10:46,605] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 23: [2023-05-10 10:10:46,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 23: [2023-05-10 10:10:46,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 23: [2023-05-10 10:10:46,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 23: [2023-05-10 10:10:46,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 23: [2023-05-10 10:10:46,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 23: [2023-05-10 10:10:46,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 23: [2023-05-10 10:10:46,620] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 23: [2023-05-10 10:10:46,620] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 24: [2023-05-10 10:10:46,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 27: [2023-05-10 10:10:46,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 27: [2023-05-10 10:10:46,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 27: [2023-05-10 10:10:46,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 27: [2023-05-10 10:10:46,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 27: [2023-05-10 10:10:46,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 27: [2023-05-10 10:10:46,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 24: [2023-05-10 10:10:46,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 27: [2023-05-10 10:10:46,673] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 27: [2023-05-10 10:10:46,674] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 24: [2023-05-10 10:10:46,675] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 24: [2023-05-10 10:10:46,675] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 24: [2023-05-10 10:10:46,675] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 24: [2023-05-10 10:10:46,675] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 24: [2023-05-10 10:10:46,675] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 24: [2023-05-10 10:10:46,676] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 29: [2023-05-10 10:10:46,678] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 29: [2023-05-10 10:10:46,678] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 11: [2023-05-10 10:10:46,678] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 11: [2023-05-10 10:10:46,678] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 11: [2023-05-10 10:10:46,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 11: [2023-05-10 10:10:46,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 11: [2023-05-10 10:10:46,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 11: [2023-05-10 10:10:46,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 11: [2023-05-10 10:10:46,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 29: [2023-05-10 10:10:46,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 29: [2023-05-10 10:10:46,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 29: [2023-05-10 10:10:46,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 29: [2023-05-10 10:10:46,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 11: [2023-05-10 10:10:46,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 29: [2023-05-10 10:10:46,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 28: [2023-05-10 10:10:46,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 29: [2023-05-10 10:10:46,680] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 27: [2023-05-10 10:10:46,681] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 28: [2023-05-10 10:10:46,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 24: [2023-05-10 10:10:46,681] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 24: [2023-05-10 10:10:46,681] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 27: [2023-05-10 10:10:46,681] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 28: [2023-05-10 10:10:46,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 28: [2023-05-10 10:10:46,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 28: [2023-05-10 10:10:46,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 28: [2023-05-10 10:10:46,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 28: [2023-05-10 10:10:46,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 28: [2023-05-10 10:10:46,683] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 8: [2023-05-10 10:10:46,684] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 8: [2023-05-10 10:10:46,684] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 8: [2023-05-10 10:10:46,684] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 8: [2023-05-10 10:10:46,685] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 8: [2023-05-10 10:10:46,685] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 8: [2023-05-10 10:10:46,685] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 8: [2023-05-10 10:10:46,685] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 8: [2023-05-10 10:10:46,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 29: [2023-05-10 10:10:46,686] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 29: [2023-05-10 10:10:46,686] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 24: [2023-05-10 10:10:46,687] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 24: [2023-05-10 10:10:46,687] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 27: [2023-05-10 10:10:46,687] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 27: [2023-05-10 10:10:46,687] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 24: [2023-05-10 10:10:46,688] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 11: [2023-05-10 10:10:46,688] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 11: [2023-05-10 10:10:46,688] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 27: [2023-05-10 10:10:46,688] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 28: [2023-05-10 10:10:46,688] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 24: [2023-05-10 10:10:46,688] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 24: [2023-05-10 10:10:46,688] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 27: [2023-05-10 10:10:46,688] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 24: [2023-05-10 10:10:46,688] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 28: [2023-05-10 10:10:46,689] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 27: [2023-05-10 10:10:46,689] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 27: [2023-05-10 10:10:46,690] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 29: [2023-05-10 10:10:46,692] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 29: [2023-05-10 10:10:46,693] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 29: [2023-05-10 10:10:46,693] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 29: [2023-05-10 10:10:46,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 29: [2023-05-10 10:10:46,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 29: [2023-05-10 10:10:46,695] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 8: [2023-05-10 10:10:46,695] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 8: [2023-05-10 10:10:46,695] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 8: [2023-05-10 10:10:46,695] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 8: [2023-05-10 10:10:46,696] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 8: [2023-05-10 10:10:46,696] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 8: [2023-05-10 10:10:46,696] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 8: [2023-05-10 10:10:46,696] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 23: [2023-05-10 10:10:46,696] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 8: [2023-05-10 10:10:46,697] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 23: [2023-05-10 10:10:46,697] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 23: [2023-05-10 10:10:46,697] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 28: [2023-05-10 10:10:46,698] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 11: [2023-05-10 10:10:46,697] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 28: [2023-05-10 10:10:46,698] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 28: [2023-05-10 10:10:46,699] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 11: [2023-05-10 10:10:46,699] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 11: [2023-05-10 10:10:46,699] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 11: [2023-05-10 10:10:46,699] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 28: [2023-05-10 10:10:46,699] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 28: [2023-05-10 10:10:46,699] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 11: [2023-05-10 10:10:46,700] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 11: [2023-05-10 10:10:46,701] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 28: [2023-05-10 10:10:46,701] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 30: [2023-05-10 10:10:46,706] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 30: [2023-05-10 10:10:46,706] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 30: [2023-05-10 10:10:46,706] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 30: [2023-05-10 10:10:46,706] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 30: [2023-05-10 10:10:46,706] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 30: [2023-05-10 10:10:46,706] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 30: [2023-05-10 10:10:46,706] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 30: [2023-05-10 10:10:46,706] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 20: [2023-05-10 10:10:46,715] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 20: [2023-05-10 10:10:46,715] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 20: [2023-05-10 10:10:46,716] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 23: [2023-05-10 10:10:46,716] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 20: [2023-05-10 10:10:46,717] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 20: [2023-05-10 10:10:46,717] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 20: [2023-05-10 10:10:46,717] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 20: [2023-05-10 10:10:46,717] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 20: [2023-05-10 10:10:46,717] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 20: [2023-05-10 10:10:46,722] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 23: [2023-05-10 10:10:46,722] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 23: [2023-05-10 10:10:46,722] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 23: [2023-05-10 10:10:46,722] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 23: [2023-05-10 10:10:46,723] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 14: [2023-05-10 10:10:46,725] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 14: [2023-05-10 10:10:46,725] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 14: [2023-05-10 10:10:46,725] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 14: [2023-05-10 10:10:46,725] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 14: [2023-05-10 10:10:46,725] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 14: [2023-05-10 10:10:46,725] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 14: [2023-05-10 10:10:46,725] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 14: [2023-05-10 10:10:46,725] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 30: [2023-05-10 10:10:46,726] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 30: [2023-05-10 10:10:46,726] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 30: [2023-05-10 10:10:46,728] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 30: [2023-05-10 10:10:46,728] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 30: [2023-05-10 10:10:46,728] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 23: [2023-05-10 10:10:46,729] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 30: [2023-05-10 10:10:46,729] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 30: [2023-05-10 10:10:46,730] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 30: [2023-05-10 10:10:46,731] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 20: [2023-05-10 10:10:46,731] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 23: [2023-05-10 10:10:46,732] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 23: [2023-05-10 10:10:46,732] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 20: [2023-05-10 10:10:46,733] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 20: [2023-05-10 10:10:46,733] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 20: [2023-05-10 10:10:46,734] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 20: [2023-05-10 10:10:46,734] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 20: [2023-05-10 10:10:46,735] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 14: [2023-05-10 10:10:46,735] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 20: [2023-05-10 10:10:46,736] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 14: [2023-05-10 10:10:46,736] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 14: [2023-05-10 10:10:46,736] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 14: [2023-05-10 10:10:46,739] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 14: [2023-05-10 10:10:46,740] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 14: [2023-05-10 10:10:46,740] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 14: [2023-05-10 10:10:46,740] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 14: [2023-05-10 10:10:46,740] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 31: [2023-05-10 10:10:46,743] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 31: [2023-05-10 10:10:46,743] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 31: [2023-05-10 10:10:46,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 31: [2023-05-10 10:10:46,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 31: [2023-05-10 10:10:46,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 31: [2023-05-10 10:10:46,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 31: [2023-05-10 10:10:46,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 31: [2023-05-10 10:10:46,745] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 31: [2023-05-10 10:10:46,750] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 31: [2023-05-10 10:10:46,750] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 23: [2023-05-10 10:10:46,756] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 23: [2023-05-10 10:10:46,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 23: [2023-05-10 10:10:46,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 23: [2023-05-10 10:10:46,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 31: [2023-05-10 10:10:46,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 31: [2023-05-10 10:10:46,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 31: [2023-05-10 10:10:46,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 31: [2023-05-10 10:10:46,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 31: [2023-05-10 10:10:46,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 31: [2023-05-10 10:10:46,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 23: [2023-05-10 10:10:46,763] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 24: [2023-05-10 10:10:46,763] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 27: [2023-05-10 10:10:46,764] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 24: [2023-05-10 10:10:46,764] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 27: [2023-05-10 10:10:46,765] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 29: [2023-05-10 10:10:46,767] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 29: [2023-05-10 10:10:46,767] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 11: [2023-05-10 10:10:46,770] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 11: [2023-05-10 10:10:46,770] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 28: [2023-05-10 10:10:46,770] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 28: [2023-05-10 10:10:46,770] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 8: [2023-05-10 10:10:46,781] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 8: [2023-05-10 10:10:46,783] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 8: [2023-05-10 10:10:46,787] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 12: [2023-05-10 10:10:46,788] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 12: [2023-05-10 10:10:46,788] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 12: [2023-05-10 10:10:46,789] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 12: [2023-05-10 10:10:46,789] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 12: [2023-05-10 10:10:46,789] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 12: [2023-05-10 10:10:46,789] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 12: [2023-05-10 10:10:46,789] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 12: [2023-05-10 10:10:46,790] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 24: [2023-05-10 10:10:46,793] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 24: [2023-05-10 10:10:46,793] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 24: [2023-05-10 10:10:46,793] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 24: [2023-05-10 10:10:46,793] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 24: [2023-05-10 10:10:46,793] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 24: [2023-05-10 10:10:46,793] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 29: [2023-05-10 10:10:46,795] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 29: [2023-05-10 10:10:46,795] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 27: [2023-05-10 10:10:46,796] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 27: [2023-05-10 10:10:46,796] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 27: [2023-05-10 10:10:46,796] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 27: [2023-05-10 10:10:46,796] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 27: [2023-05-10 10:10:46,796] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 27: [2023-05-10 10:10:46,797] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 8: [2023-05-10 10:10:46,796] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 8: [2023-05-10 10:10:46,796] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 8: [2023-05-10 10:10:46,797] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 8: [2023-05-10 10:10:46,797] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 8: [2023-05-10 10:10:46,797] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 27: [2023-05-10 10:10:46,797] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 12: [2023-05-10 10:10:46,798] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 28: [2023-05-10 10:10:46,798] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 28: [2023-05-10 10:10:46,798] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 29: [2023-05-10 10:10:46,799] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 29: [2023-05-10 10:10:46,799] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 29: [2023-05-10 10:10:46,799] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 29: [2023-05-10 10:10:46,799] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 29: [2023-05-10 10:10:46,799] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 20: [2023-05-10 10:10:46,801] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 24: [2023-05-10 10:10:46,802] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 11: [2023-05-10 10:10:46,804] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 11: [2023-05-10 10:10:46,804] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 28: [2023-05-10 10:10:46,804] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 11: [2023-05-10 10:10:46,804] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 11: [2023-05-10 10:10:46,804] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 11: [2023-05-10 10:10:46,805] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 24: [2023-05-10 10:10:46,805] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 28: [2023-05-10 10:10:46,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 12: [2023-05-10 10:10:46,805] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 12: [2023-05-10 10:10:46,805] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 28: [2023-05-10 10:10:46,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 12: [2023-05-10 10:10:46,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 28: [2023-05-10 10:10:46,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 12: [2023-05-10 10:10:46,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 12: [2023-05-10 10:10:46,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 29: [2023-05-10 10:10:46,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 12: [2023-05-10 10:10:46,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 12: [2023-05-10 10:10:46,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 28: [2023-05-10 10:10:46,809] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 27: [2023-05-10 10:10:46,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 19: [2023-05-10 10:10:46,811] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 19: [2023-05-10 10:10:46,811] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 11: [2023-05-10 10:10:46,812] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 11: [2023-05-10 10:10:46,812] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 11: [2023-05-10 10:10:46,812] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 19: [2023-05-10 10:10:46,814] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 19: [2023-05-10 10:10:46,815] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 19: [2023-05-10 10:10:46,815] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 19: [2023-05-10 10:10:46,815] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 19: [2023-05-10 10:10:46,815] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 19: [2023-05-10 10:10:46,815] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 14: [2023-05-10 10:10:46,820] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 14: [2023-05-10 10:10:46,820] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 8: [2023-05-10 10:10:46,821] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 8: [2023-05-10 10:10:46,822] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 19: [2023-05-10 10:10:46,822] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 19: [2023-05-10 10:10:46,822] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 14: [2023-05-10 10:10:46,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 19: [2023-05-10 10:10:46,827] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 28: [2023-05-10 10:10:46,828] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 20: [2023-05-10 10:10:46,828] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 8: [2023-05-10 10:10:46,828] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 24: [2023-05-10 10:10:46,829] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 31: [2023-05-10 10:10:46,829] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 31: [2023-05-10 10:10:46,829] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 24: [2023-05-10 10:10:46,831] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 24: [2023-05-10 10:10:46,831] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 8: [2023-05-10 10:10:46,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 24: [2023-05-10 10:10:46,833] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 19: [2023-05-10 10:10:46,834] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 8: [2023-05-10 10:10:46,834] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 29: [2023-05-10 10:10:46,834] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 8: [2023-05-10 10:10:46,834] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 29: [2023-05-10 10:10:46,834] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 19: [2023-05-10 10:10:46,834] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 27: [2023-05-10 10:10:46,834] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 8: [2023-05-10 10:10:46,835] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 8: [2023-05-10 10:10:46,836] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 19: [2023-05-10 10:10:46,836] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 27: [2023-05-10 10:10:46,836] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 19: [2023-05-10 10:10:46,837] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 24: [2023-05-10 10:10:46,837] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 19: [2023-05-10 10:10:46,838] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 24: [2023-05-10 10:10:46,838] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 11: [2023-05-10 10:10:46,838] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 27: [2023-05-10 10:10:46,838] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 27: [2023-05-10 10:10:46,840] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 14: [2023-05-10 10:10:46,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 11: [2023-05-10 10:10:46,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 14: [2023-05-10 10:10:46,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 28: [2023-05-10 10:10:46,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 27: [2023-05-10 10:10:46,842] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 14: [2023-05-10 10:10:46,842] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 14: [2023-05-10 10:10:46,842] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 14: [2023-05-10 10:10:46,842] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 29: [2023-05-10 10:10:46,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 20: [2023-05-10 10:10:46,844] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 29: [2023-05-10 10:10:46,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 20: [2023-05-10 10:10:46,844] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 28: [2023-05-10 10:10:46,845] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 28: [2023-05-10 10:10:46,845] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 28: [2023-05-10 10:10:46,845] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 29: [2023-05-10 10:10:46,846] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 11: [2023-05-10 10:10:46,846] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 29: [2023-05-10 10:10:46,846] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 4: [2023-05-10 10:10:46,847] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 4: [2023-05-10 10:10:46,847] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 4: [2023-05-10 10:10:46,847] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 4: [2023-05-10 10:10:46,847] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 4: [2023-05-10 10:10:46,847] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 4: [2023-05-10 10:10:46,847] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 4: [2023-05-10 10:10:46,847] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 11: [2023-05-10 10:10:46,847] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 4: [2023-05-10 10:10:46,847] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 28: [2023-05-10 10:10:46,847] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 11: [2023-05-10 10:10:46,850] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 16: [2023-05-10 10:10:46,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 16: [2023-05-10 10:10:46,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 27: [2023-05-10 10:10:46,850] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 16: [2023-05-10 10:10:46,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 16: [2023-05-10 10:10:46,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 16: [2023-05-10 10:10:46,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 16: [2023-05-10 10:10:46,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 16: [2023-05-10 10:10:46,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 16: [2023-05-10 10:10:46,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 30: [2023-05-10 10:10:46,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 30: [2023-05-10 10:10:46,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 30: [2023-05-10 10:10:46,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 30: [2023-05-10 10:10:46,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 30: [2023-05-10 10:10:46,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 30: [2023-05-10 10:10:46,853] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 30: [2023-05-10 10:10:46,853] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 20: [2023-05-10 10:10:46,853] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 20: [2023-05-10 10:10:46,854] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 20: [2023-05-10 10:10:46,854] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 20: [2023-05-10 10:10:46,854] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 20: [2023-05-10 10:10:46,854] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 14: [2023-05-10 10:10:46,854] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 31: [2023-05-10 10:10:46,855] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 11: [2023-05-10 10:10:46,855] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 4: [2023-05-10 10:10:46,856] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 4: [2023-05-10 10:10:46,856] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 4: [2023-05-10 10:10:46,857] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 31: [2023-05-10 10:10:46,857] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 4: [2023-05-10 10:10:46,857] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 14: [2023-05-10 10:10:46,857] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 14: [2023-05-10 10:10:46,858] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 16: [2023-05-10 10:10:46,858] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 22: [2023-05-10 10:10:46,858] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 22: [2023-05-10 10:10:46,858] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 22: [2023-05-10 10:10:46,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 22: [2023-05-10 10:10:46,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 22: [2023-05-10 10:10:46,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 22: [2023-05-10 10:10:46,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 4: [2023-05-10 10:10:46,859] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 4: [2023-05-10 10:10:46,859] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 28: [2023-05-10 10:10:46,860] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 4: [2023-05-10 10:10:46,860] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 22: [2023-05-10 10:10:46,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 16: [2023-05-10 10:10:46,860] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 22: [2023-05-10 10:10:46,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 4: [2023-05-10 10:10:46,861] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 9: [2023-05-10 10:10:46,862] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 9: [2023-05-10 10:10:46,862] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 25: [2023-05-10 10:10:46,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 25: [2023-05-10 10:10:46,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 16: [2023-05-10 10:10:46,866] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 16: [2023-05-10 10:10:46,866] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 16: [2023-05-10 10:10:46,866] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 16: [2023-05-10 10:10:46,866] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 16: [2023-05-10 10:10:46,866] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 16: [2023-05-10 10:10:46,867] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 31: [2023-05-10 10:10:46,866] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 31: [2023-05-10 10:10:46,866] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 31: [2023-05-10 10:10:46,867] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 31: [2023-05-10 10:10:46,867] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 22: [2023-05-10 10:10:46,868] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 22: [2023-05-10 10:10:46,868] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 31: [2023-05-10 10:10:46,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 7: [2023-05-10 10:10:46,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 7: [2023-05-10 10:10:46,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 9: [2023-05-10 10:10:46,871] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 9: [2023-05-10 10:10:46,872] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 25: [2023-05-10 10:10:46,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 25: [2023-05-10 10:10:46,873] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 9: [2023-05-10 10:10:46,873] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 9: [2023-05-10 10:10:46,873] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 9: [2023-05-10 10:10:46,873] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 25: [2023-05-10 10:10:46,873] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 25: [2023-05-10 10:10:46,873] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 25: [2023-05-10 10:10:46,873] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 22: [2023-05-10 10:10:46,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 25: [2023-05-10 10:10:46,873] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 9: [2023-05-10 10:10:46,874] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 9: [2023-05-10 10:10:46,874] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 22: [2023-05-10 10:10:46,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 22: [2023-05-10 10:10:46,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 9: [2023-05-10 10:10:46,875] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 14: [2023-05-10 10:10:46,875] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 25: [2023-05-10 10:10:46,875] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 25: [2023-05-10 10:10:46,875] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 20: [2023-05-10 10:10:46,876] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 14: [2023-05-10 10:10:46,876] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 14: [2023-05-10 10:10:46,877] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 20: [2023-05-10 10:10:46,877] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 14: [2023-05-10 10:10:46,877] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 15: [2023-05-10 10:10:46,877] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 22: [2023-05-10 10:10:46,878] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 22: [2023-05-10 10:10:46,878] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 15: [2023-05-10 10:10:46,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 30: [2023-05-10 10:10:46,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 7: [2023-05-10 10:10:46,879] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 7: [2023-05-10 10:10:46,879] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 22: [2023-05-10 10:10:46,879] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 15: [2023-05-10 10:10:46,879] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 15: [2023-05-10 10:10:46,879] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 15: [2023-05-10 10:10:46,879] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 15: [2023-05-10 10:10:46,879] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 15: [2023-05-10 10:10:46,879] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 15: [2023-05-10 10:10:46,879] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 7: [2023-05-10 10:10:46,879] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 7: [2023-05-10 10:10:46,879] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 7: [2023-05-10 10:10:46,879] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 7: [2023-05-10 10:10:46,879] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 7: [2023-05-10 10:10:46,880] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 7: [2023-05-10 10:10:46,880] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 12: [2023-05-10 10:10:46,879] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 31: [2023-05-10 10:10:46,880] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 14: [2023-05-10 10:10:46,881] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 25: [2023-05-10 10:10:46,883] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 25: [2023-05-10 10:10:46,883] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 9: [2023-05-10 10:10:46,884] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 9: [2023-05-10 10:10:46,884] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 9: [2023-05-10 10:10:46,885] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 15: [2023-05-10 10:10:46,885] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 25: [2023-05-10 10:10:46,885] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 9: [2023-05-10 10:10:46,886] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 25: [2023-05-10 10:10:46,885] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 25: [2023-05-10 10:10:46,885] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 25: [2023-05-10 10:10:46,886] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 9: [2023-05-10 10:10:46,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 9: [2023-05-10 10:10:46,888] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 7: [2023-05-10 10:10:46,888] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 7: [2023-05-10 10:10:46,888] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 20: [2023-05-10 10:10:46,889] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 7: [2023-05-10 10:10:46,892] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 7: [2023-05-10 10:10:46,892] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 20: [2023-05-10 10:10:46,894] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 7: [2023-05-10 10:10:46,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 15: [2023-05-10 10:10:46,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 15: [2023-05-10 10:10:46,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 20: [2023-05-10 10:10:46,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 7: [2023-05-10 10:10:46,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 30: [2023-05-10 10:10:46,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 15: [2023-05-10 10:10:46,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 15: [2023-05-10 10:10:46,896] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 15: [2023-05-10 10:10:46,897] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 15: [2023-05-10 10:10:46,898] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 30: [2023-05-10 10:10:46,898] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 15: [2023-05-10 10:10:46,898] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 30: [2023-05-10 10:10:46,899] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 21: [2023-05-10 10:10:46,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 21: [2023-05-10 10:10:46,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 20: [2023-05-10 10:10:46,900] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 30: [2023-05-10 10:10:46,900] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 21: [2023-05-10 10:10:46,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 21: [2023-05-10 10:10:46,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 21: [2023-05-10 10:10:46,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 21: [2023-05-10 10:10:46,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 30: [2023-05-10 10:10:46,900] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 21: [2023-05-10 10:10:46,901] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 20: [2023-05-10 10:10:46,901] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 21: [2023-05-10 10:10:46,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 30: [2023-05-10 10:10:46,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 30: [2023-05-10 10:10:46,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 1: [2023-05-10 10:10:46,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 1: [2023-05-10 10:10:46,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 1: [2023-05-10 10:10:46,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 1: [2023-05-10 10:10:46,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 1: [2023-05-10 10:10:46,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 1: [2023-05-10 10:10:46,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 1: [2023-05-10 10:10:46,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 1: [2023-05-10 10:10:46,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 6: [2023-05-10 10:10:46,903] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 6: [2023-05-10 10:10:46,903] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 6: [2023-05-10 10:10:46,904] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 6: [2023-05-10 10:10:46,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 3: [2023-05-10 10:10:46,904] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 5: [2023-05-10 10:10:46,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 5: [2023-05-10 10:10:46,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 3: [2023-05-10 10:10:46,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 5: [2023-05-10 10:10:46,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 6: [2023-05-10 10:10:46,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 6: [2023-05-10 10:10:46,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 6: [2023-05-10 10:10:46,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 6: [2023-05-10 10:10:46,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 0: [2023-05-10 10:10:46,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 0: [2023-05-10 10:10:46,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 0: [2023-05-10 10:10:46,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 0: [2023-05-10 10:10:46,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 0: [2023-05-10 10:10:46,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 0: [2023-05-10 10:10:46,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 3: [2023-05-10 10:10:46,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 3: [2023-05-10 10:10:46,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 3: [2023-05-10 10:10:46,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 0: [2023-05-10 10:10:46,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 3: [2023-05-10 10:10:46,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 3: [2023-05-10 10:10:46,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 5: [2023-05-10 10:10:46,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 5: [2023-05-10 10:10:46,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 5: [2023-05-10 10:10:46,907] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 5: [2023-05-10 10:10:46,907] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 31: [2023-05-10 10:10:46,907] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 5: [2023-05-10 10:10:46,907] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 3: [2023-05-10 10:10:46,907] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 31: [2023-05-10 10:10:46,907] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 0: [2023-05-10 10:10:46,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 31: [2023-05-10 10:10:46,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 10: [2023-05-10 10:10:46,907] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 10: [2023-05-10 10:10:46,907] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 10: [2023-05-10 10:10:46,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 10: [2023-05-10 10:10:46,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 10: [2023-05-10 10:10:46,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 10: [2023-05-10 10:10:46,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 10: [2023-05-10 10:10:46,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 10: [2023-05-10 10:10:46,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 21: [2023-05-10 10:10:46,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 2: [2023-05-10 10:10:46,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 2: [2023-05-10 10:10:46,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 21: [2023-05-10 10:10:46,909] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 2: [2023-05-10 10:10:46,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 2: [2023-05-10 10:10:46,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 2: [2023-05-10 10:10:46,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 2: [2023-05-10 10:10:46,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 2: [2023-05-10 10:10:46,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 2: [2023-05-10 10:10:46,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 19: [2023-05-10 10:10:46,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 19: [2023-05-10 10:10:46,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 6: [2023-05-10 10:10:46,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 6: [2023-05-10 10:10:46,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 12: [2023-05-10 10:10:46,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 19: [2023-05-10 10:10:46,912] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 31: [2023-05-10 10:10:46,912] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 30: [2023-05-10 10:10:46,912] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 21: [2023-05-10 10:10:46,912] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 3: [2023-05-10 10:10:46,912] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 21: [2023-05-10 10:10:46,914] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 1: [2023-05-10 10:10:46,914] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 31: [2023-05-10 10:10:46,914] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 21: [2023-05-10 10:10:46,914] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 21: [2023-05-10 10:10:46,914] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 21: [2023-05-10 10:10:46,914] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 21: [2023-05-10 10:10:46,914] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 1: [2023-05-10 10:10:46,915] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 3: [2023-05-10 10:10:46,915] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 3: [2023-05-10 10:10:46,915] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 0: [2023-05-10 10:10:46,915] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 1: [2023-05-10 10:10:46,915] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 0: [2023-05-10 10:10:46,915] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 0: [2023-05-10 10:10:46,916] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 5: [2023-05-10 10:10:46,916] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 2: [2023-05-10 10:10:46,917] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 2: [2023-05-10 10:10:46,917] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 1: [2023-05-10 10:10:46,917] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 5: [2023-05-10 10:10:46,917] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 3: [2023-05-10 10:10:46,918] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 5: [2023-05-10 10:10:46,918] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 1: [2023-05-10 10:10:46,918] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 5: [2023-05-10 10:10:46,918] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 3: [2023-05-10 10:10:46,918] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 1: [2023-05-10 10:10:46,918] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 3: [2023-05-10 10:10:46,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 5: [2023-05-10 10:10:46,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 6: [2023-05-10 10:10:46,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 1: [2023-05-10 10:10:46,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 0: [2023-05-10 10:10:46,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 6: [2023-05-10 10:10:46,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 3: [2023-05-10 10:10:46,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 10: [2023-05-10 10:10:46,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 10: [2023-05-10 10:10:46,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 12: [2023-05-10 10:10:46,920] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 12: [2023-05-10 10:10:46,920] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 31: [2023-05-10 10:10:46,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 0: [2023-05-10 10:10:46,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 6: [2023-05-10 10:10:46,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 0: [2023-05-10 10:10:46,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 0: [2023-05-10 10:10:46,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 6: [2023-05-10 10:10:46,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 10: [2023-05-10 10:10:46,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 5: [2023-05-10 10:10:46,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 0: [2023-05-10 10:10:46,922] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 6: [2023-05-10 10:10:46,923] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 1: [2023-05-10 10:10:46,923] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 6: [2023-05-10 10:10:46,923] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 3: [2023-05-10 10:10:46,923] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 10: [2023-05-10 10:10:46,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 10: [2023-05-10 10:10:46,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 10: [2023-05-10 10:10:46,925] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 10: [2023-05-10 10:10:46,926] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 5: [2023-05-10 10:10:46,926] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 5: [2023-05-10 10:10:46,926] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 10: [2023-05-10 10:10:46,926] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 12: [2023-05-10 10:10:46,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 12: [2023-05-10 10:10:46,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 12: [2023-05-10 10:10:46,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 12: [2023-05-10 10:10:46,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 12: [2023-05-10 10:10:46,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 2: [2023-05-10 10:10:46,929] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 2: [2023-05-10 10:10:46,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 2: [2023-05-10 10:10:46,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 2: [2023-05-10 10:10:46,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 2: [2023-05-10 10:10:46,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 2: [2023-05-10 10:10:46,933] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 19: [2023-05-10 10:10:46,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 19: [2023-05-10 10:10:46,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 19: [2023-05-10 10:10:46,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 19: [2023-05-10 10:10:46,936] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 19: [2023-05-10 10:10:46,936] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 16: [2023-05-10 10:10:46,940] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 16: [2023-05-10 10:10:46,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 18: [2023-05-10 10:10:46,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 17: [2023-05-10 10:10:46,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 18: [2023-05-10 10:10:46,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 18: [2023-05-10 10:10:46,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 18: [2023-05-10 10:10:46,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 18: [2023-05-10 10:10:46,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 17: [2023-05-10 10:10:46,944] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 17: [2023-05-10 10:10:46,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 18: [2023-05-10 10:10:46,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 18: [2023-05-10 10:10:46,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 18: [2023-05-10 10:10:46,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 17: [2023-05-10 10:10:46,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 17: [2023-05-10 10:10:46,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 17: [2023-05-10 10:10:46,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 17: [2023-05-10 10:10:46,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 17: [2023-05-10 10:10:46,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 13: [2023-05-10 10:10:46,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 13: [2023-05-10 10:10:46,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 13: [2023-05-10 10:10:46,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 13: [2023-05-10 10:10:46,947] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 13: [2023-05-10 10:10:46,947] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 13: [2023-05-10 10:10:46,947] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 13: [2023-05-10 10:10:46,947] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 4: [2023-05-10 10:10:46,947] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 4: [2023-05-10 10:10:46,947] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 4: [2023-05-10 10:10:46,947] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 4: [2023-05-10 10:10:46,947] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 13: [2023-05-10 10:10:46,949] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 19: [2023-05-10 10:10:46,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 26: [2023-05-10 10:10:46,951] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 26: [2023-05-10 10:10:46,951] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 26: [2023-05-10 10:10:46,951] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 26: [2023-05-10 10:10:46,951] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 26: [2023-05-10 10:10:46,951] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 26: [2023-05-10 10:10:46,951] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 26: [2023-05-10 10:10:46,951] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 26: [2023-05-10 10:10:46,951] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 18: [2023-05-10 10:10:46,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 18: [2023-05-10 10:10:46,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 17: [2023-05-10 10:10:46,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 22: [2023-05-10 10:10:46,952] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 18: [2023-05-10 10:10:46,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 22: [2023-05-10 10:10:46,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 18: [2023-05-10 10:10:46,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 19: [2023-05-10 10:10:46,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 19: [2023-05-10 10:10:46,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 4: [2023-05-10 10:10:46,955] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 4: [2023-05-10 10:10:46,955] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 4: [2023-05-10 10:10:46,955] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 18: [2023-05-10 10:10:46,955] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 18: [2023-05-10 10:10:46,956] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 18: [2023-05-10 10:10:46,957] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 12: [2023-05-10 10:10:46,958] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 18: [2023-05-10 10:10:46,958] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 26: [2023-05-10 10:10:46,959] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 26: [2023-05-10 10:10:46,959] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 4: [2023-05-10 10:10:46,960] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 12: [2023-05-10 10:10:46,961] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 9: [2023-05-10 10:10:46,962] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 9: [2023-05-10 10:10:46,962] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 17: [2023-05-10 10:10:46,962] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 17: [2023-05-10 10:10:46,962] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 17: [2023-05-10 10:10:46,964] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 17: [2023-05-10 10:10:46,964] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 15: [2023-05-10 10:10:46,963] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 25: [2023-05-10 10:10:46,964] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 25: [2023-05-10 10:10:46,964] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 13: [2023-05-10 10:10:46,964] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 17: [2023-05-10 10:10:46,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 17: [2023-05-10 10:10:46,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 13: [2023-05-10 10:10:46,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 13: [2023-05-10 10:10:46,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 17: [2023-05-10 10:10:46,966] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 16: [2023-05-10 10:10:46,966] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 13: [2023-05-10 10:10:46,967] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 13: [2023-05-10 10:10:46,967] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 13: [2023-05-10 10:10:46,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 26: [2023-05-10 10:10:46,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 13: [2023-05-10 10:10:46,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 13: [2023-05-10 10:10:46,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 7: [2023-05-10 10:10:46,968] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 7: [2023-05-10 10:10:46,968] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 26: [2023-05-10 10:10:46,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 26: [2023-05-10 10:10:46,969] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 26: [2023-05-10 10:10:46,970] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 12: [2023-05-10 10:10:46,970] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 26: [2023-05-10 10:10:46,971] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 16: [2023-05-10 10:10:46,971] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 12: [2023-05-10 10:10:46,971] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 26: [2023-05-10 10:10:46,972] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt... 19: [2023-05-10 10:10:46,973] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 19: [2023-05-10 10:10:46,973] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 12: [2023-05-10 10:10:46,974] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 12: [2023-05-10 10:10:46,974] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 25: [2023-05-10 10:10:46,974] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 12: [2023-05-10 10:10:46,975] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 16: [2023-05-10 10:10:46,975] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 16: [2023-05-10 10:10:46,975] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 16: [2023-05-10 10:10:46,975] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 16: [2023-05-10 10:10:46,975] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 9: [2023-05-10 10:10:46,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 9: [2023-05-10 10:10:46,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 9: [2023-05-10 10:10:46,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 9: [2023-05-10 10:10:46,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 16: [2023-05-10 10:10:46,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 9: [2023-05-10 10:10:46,977] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 25: [2023-05-10 10:10:46,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 25: [2023-05-10 10:10:46,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 25: [2023-05-10 10:10:46,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 25: [2023-05-10 10:10:46,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 25: [2023-05-10 10:10:46,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 4: [2023-05-10 10:10:46,979] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 19: [2023-05-10 10:10:46,980] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 19: [2023-05-10 10:10:46,980] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 9: [2023-05-10 10:10:46,981] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 22: [2023-05-10 10:10:46,981] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 22: [2023-05-10 10:10:46,981] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 22: [2023-05-10 10:10:46,981] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 16: [2023-05-10 10:10:46,981] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 19: [2023-05-10 10:10:46,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 7: [2023-05-10 10:10:46,982] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 7: [2023-05-10 10:10:46,982] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 4: [2023-05-10 10:10:46,983] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 7: [2023-05-10 10:10:46,983] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 7: [2023-05-10 10:10:46,983] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 22: [2023-05-10 10:10:46,984] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 22: [2023-05-10 10:10:46,986] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 7: [2023-05-10 10:10:46,988] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 7: [2023-05-10 10:10:46,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 4: [2023-05-10 10:10:46,989] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 4: [2023-05-10 10:10:46,989] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 6: [2023-05-10 10:10:46,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 6: [2023-05-10 10:10:46,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 15: [2023-05-10 10:10:46,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 4: [2023-05-10 10:10:46,992] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 4: [2023-05-10 10:10:46,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 4: [2023-05-10 10:10:46,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 21: [2023-05-10 10:10:46,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 21: [2023-05-10 10:10:46,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 3: [2023-05-10 10:10:46,995] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 2: [2023-05-10 10:10:46,996] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 2: [2023-05-10 10:10:46,996] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 1: [2023-05-10 10:10:46,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 1: [2023-05-10 10:10:46,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 3: [2023-05-10 10:10:46,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 22: [2023-05-10 10:10:46,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 22: [2023-05-10 10:10:46,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 4: [2023-05-10 10:10:47,000] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 5: [2023-05-10 10:10:47,000] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 25: [2023-05-10 10:10:47,000] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 25: [2023-05-10 10:10:47,001] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 3: [2023-05-10 10:10:47,001] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 21: [2023-05-10 10:10:47,001] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 5: [2023-05-10 10:10:47,002] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 5: [2023-05-10 10:10:47,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 9: [2023-05-10 10:10:47,003] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 0: [2023-05-10 10:10:47,004] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 0: [2023-05-10 10:10:47,004] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 10: [2023-05-10 10:10:47,005] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 10: [2023-05-10 10:10:47,005] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 9: [2023-05-10 10:10:47,005] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 1: [2023-05-10 10:10:47,006] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 0: [2023-05-10 10:10:47,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 16: [2023-05-10 10:10:47,008] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 7: [2023-05-10 10:10:47,008] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 22: [2023-05-10 10:10:47,010] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 15: [2023-05-10 10:10:47,009] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 15: [2023-05-10 10:10:47,010] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 15: [2023-05-10 10:10:47,010] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 15: [2023-05-10 10:10:47,010] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 10: [2023-05-10 10:10:47,010] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 15: [2023-05-10 10:10:47,010] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 25: [2023-05-10 10:10:47,011] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 25: [2023-05-10 10:10:47,011] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 9: [2023-05-10 10:10:47,011] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 7: [2023-05-10 10:10:47,012] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 25: [2023-05-10 10:10:47,012] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 9: [2023-05-10 10:10:47,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 25: [2023-05-10 10:10:47,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 22: [2023-05-10 10:10:47,015] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 22: [2023-05-10 10:10:47,015] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 22: [2023-05-10 10:10:47,015] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 25: [2023-05-10 10:10:47,015] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 21: [2023-05-10 10:10:47,016] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 6: [2023-05-10 10:10:47,017] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 9: [2023-05-10 10:10:47,017] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 25: [2023-05-10 10:10:47,017] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 6: [2023-05-10 10:10:47,017] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 9: [2023-05-10 10:10:47,017] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 16: [2023-05-10 10:10:47,019] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 1: [2023-05-10 10:10:47,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 1: [2023-05-10 10:10:47,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 1: [2023-05-10 10:10:47,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 3: [2023-05-10 10:10:47,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 3: [2023-05-10 10:10:47,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 3: [2023-05-10 10:10:47,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 3: [2023-05-10 10:10:47,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 9: [2023-05-10 10:10:47,019] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 5: [2023-05-10 10:10:47,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 5: [2023-05-10 10:10:47,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 5: [2023-05-10 10:10:47,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 16: [2023-05-10 10:10:47,020] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 21: [2023-05-10 10:10:47,021] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 21: [2023-05-10 10:10:47,021] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 21: [2023-05-10 10:10:47,021] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 9: [2023-05-10 10:10:47,021] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 16: [2023-05-10 10:10:47,021] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 16: [2023-05-10 10:10:47,021] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 21: [2023-05-10 10:10:47,021] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 21: [2023-05-10 10:10:47,022] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 10: [2023-05-10 10:10:47,022] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 1: [2023-05-10 10:10:47,022] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 21: [2023-05-10 10:10:47,023] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 7: [2023-05-10 10:10:47,024] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 1: [2023-05-10 10:10:47,024] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 16: [2023-05-10 10:10:47,024] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 6: [2023-05-10 10:10:47,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 6: [2023-05-10 10:10:47,026] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 2: [2023-05-10 10:10:47,026] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 15: [2023-05-10 10:10:47,027] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 15: [2023-05-10 10:10:47,027] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 10: [2023-05-10 10:10:47,027] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 10: [2023-05-10 10:10:47,027] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 10: [2023-05-10 10:10:47,027] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 2: [2023-05-10 10:10:47,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 10: [2023-05-10 10:10:47,028] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 7: [2023-05-10 10:10:47,024] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 7: [2023-05-10 10:10:47,024] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 7: [2023-05-10 10:10:47,025] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 7: [2023-05-10 10:10:47,025] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 7: [2023-05-10 10:10:47,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 3: [2023-05-10 10:10:47,029] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 3: [2023-05-10 10:10:47,030] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 6: [2023-05-10 10:10:47,030] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 6: [2023-05-10 10:10:47,030] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 6: [2023-05-10 10:10:47,030] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 3: [2023-05-10 10:10:47,031] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 0: [2023-05-10 10:10:47,032] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 0: [2023-05-10 10:10:47,032] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 0: [2023-05-10 10:10:47,032] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 0: [2023-05-10 10:10:47,032] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 0: [2023-05-10 10:10:47,032] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 5: [2023-05-10 10:10:47,032] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 5: [2023-05-10 10:10:47,032] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 17: [2023-05-10 10:10:47,032] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 6: [2023-05-10 10:10:47,033] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 22: [2023-05-10 10:10:47,033] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 21: [2023-05-10 10:10:47,033] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 22: [2023-05-10 10:10:47,034] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 3: [2023-05-10 10:10:47,034] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 2: [2023-05-10 10:10:47,036] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 2: [2023-05-10 10:10:47,036] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 2: [2023-05-10 10:10:47,036] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 0: [2023-05-10 10:10:47,037] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 5: [2023-05-10 10:10:47,037] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 5: [2023-05-10 10:10:47,038] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 5: [2023-05-10 10:10:47,039] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 22: [2023-05-10 10:10:47,039] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 0: [2023-05-10 10:10:47,040] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 26: [2023-05-10 10:10:47,041] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 26: [2023-05-10 10:10:47,041] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 18: [2023-05-10 10:10:47,041] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 18: [2023-05-10 10:10:47,041] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 18: [2023-05-10 10:10:47,041] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 18: [2023-05-10 10:10:47,041] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 1: [2023-05-10 10:10:47,042] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 1: [2023-05-10 10:10:47,043] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 0: [2023-05-10 10:10:47,044] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 10: [2023-05-10 10:10:47,047] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 2: [2023-05-10 10:10:47,046] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 2: [2023-05-10 10:10:47,046] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 2: [2023-05-10 10:10:47,046] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 15: [2023-05-10 10:10:47,048] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 18: [2023-05-10 10:10:47,048] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 18: [2023-05-10 10:10:47,048] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 18: [2023-05-10 10:10:47,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 5: [2023-05-10 10:10:47,051] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 15: [2023-05-10 10:10:47,052] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 18: [2023-05-10 10:10:47,051] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 15: [2023-05-10 10:10:47,052] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 1: [2023-05-10 10:10:47,052] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 3: [2023-05-10 10:10:47,052] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 10: [2023-05-10 10:10:47,052] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 10: [2023-05-10 10:10:47,053] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 3: [2023-05-10 10:10:47,054] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 3: [2023-05-10 10:10:47,054] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 15: [2023-05-10 10:10:47,054] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 3: [2023-05-10 10:10:47,054] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 15: [2023-05-10 10:10:47,055] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 5: [2023-05-10 10:10:47,055] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 21: [2023-05-10 10:10:47,056] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 1: [2023-05-10 10:10:47,058] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 5: [2023-05-10 10:10:47,059] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 10: [2023-05-10 10:10:47,059] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 1: [2023-05-10 10:10:47,059] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 3: [2023-05-10 10:10:47,060] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 6: [2023-05-10 10:10:47,059] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 21: [2023-05-10 10:10:47,060] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 21: [2023-05-10 10:10:47,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 1: [2023-05-10 10:10:47,060] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 1: [2023-05-10 10:10:47,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 1: [2023-05-10 10:10:47,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 6: [2023-05-10 10:10:47,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 17: [2023-05-10 10:10:47,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 21: [2023-05-10 10:10:47,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 21: [2023-05-10 10:10:47,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 5: [2023-05-10 10:10:47,063] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 5: [2023-05-10 10:10:47,063] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 15: [2023-05-10 10:10:47,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 10: [2023-05-10 10:10:47,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 10: [2023-05-10 10:10:47,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 10: [2023-05-10 10:10:47,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 15: [2023-05-10 10:10:47,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 0: [2023-05-10 10:10:47,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 0: [2023-05-10 10:10:47,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 10: [2023-05-10 10:10:47,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 0: [2023-05-10 10:10:47,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 2: [2023-05-10 10:10:47,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 2: [2023-05-10 10:10:47,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 6: [2023-05-10 10:10:47,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 6: [2023-05-10 10:10:47,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 26: [2023-05-10 10:10:47,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 6: [2023-05-10 10:10:47,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 26: [2023-05-10 10:10:47,073] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 18: [2023-05-10 10:10:47,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 0: [2023-05-10 10:10:47,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 0: [2023-05-10 10:10:47,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 6: [2023-05-10 10:10:47,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 26: [2023-05-10 10:10:47,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 26: [2023-05-10 10:10:47,074] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 26: [2023-05-10 10:10:47,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 17: [2023-05-10 10:10:47,075] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 17: [2023-05-10 10:10:47,075] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 2: [2023-05-10 10:10:47,078] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 18: [2023-05-10 10:10:47,080] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 18: [2023-05-10 10:10:47,080] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 17: [2023-05-10 10:10:47,081] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 18: [2023-05-10 10:10:47,082] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 13: [2023-05-10 10:10:47,082] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 17: [2023-05-10 10:10:47,082] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 17: [2023-05-10 10:10:47,082] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 17: [2023-05-10 10:10:47,082] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 17: [2023-05-10 10:10:47,083] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 13: [2023-05-10 10:10:47,083] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 13: [2023-05-10 10:10:47,083] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 2: [2023-05-10 10:10:47,084] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 2: [2023-05-10 10:10:47,086] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 2: [2023-05-10 10:10:47,087] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 18: [2023-05-10 10:10:47,087] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 18: [2023-05-10 10:10:47,087] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 26: [2023-05-10 10:10:47,087] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 26: [2023-05-10 10:10:47,087] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 18: [2023-05-10 10:10:47,088] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 18: [2023-05-10 10:10:47,088] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 26: [2023-05-10 10:10:47,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 26: [2023-05-10 10:10:47,103] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 13: [2023-05-10 10:10:47,105] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 13: [2023-05-10 10:10:47,105] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 13: [2023-05-10 10:10:47,105] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 13: [2023-05-10 10:10:47,105] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 13: [2023-05-10 10:10:47,105] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_01-model_00-model_states.pt. 26: [2023-05-10 10:10:47,106] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 26: [2023-05-10 10:10:47,108] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 17: [2023-05-10 10:10:47,116] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 17: [2023-05-10 10:10:47,117] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 17: [2023-05-10 10:10:47,118] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 26: [2023-05-10 10:10:47,119] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 13: [2023-05-10 10:10:47,119] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 26: [2023-05-10 10:10:47,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 17: [2023-05-10 10:10:47,125] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 26: [2023-05-10 10:10:47,125] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 13: [2023-05-10 10:10:47,125] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 17: [2023-05-10 10:10:47,125] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 17: [2023-05-10 10:10:47,126] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 17: [2023-05-10 10:10:47,126] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 13: [2023-05-10 10:10:47,127] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 13: [2023-05-10 10:10:47,144] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 13: [2023-05-10 10:10:47,144] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 13: [2023-05-10 10:10:47,150] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 13: [2023-05-10 10:10:47,150] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 13: [2023-05-10 10:10:47,154] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 23: [2023-05-10 10:10:47,498] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 23: [2023-05-10 10:10:47,498] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 23: [2023-05-10 10:10:47,498] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 23: [2023-05-10 10:10:47,498] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 23: [2023-05-10 10:10:47,498] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 23: [2023-05-10 10:10:47,498] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 23: [2023-05-10 10:10:47,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 29: [2023-05-10 10:10:47,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 23: [2023-05-10 10:10:47,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 29: [2023-05-10 10:10:47,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 29: [2023-05-10 10:10:47,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 8: [2023-05-10 10:10:47,498] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 8: [2023-05-10 10:10:47,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 8: [2023-05-10 10:10:47,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 8: [2023-05-10 10:10:47,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 8: [2023-05-10 10:10:47,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 8: [2023-05-10 10:10:47,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 8: [2023-05-10 10:10:47,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 29: [2023-05-10 10:10:47,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 29: [2023-05-10 10:10:47,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 29: [2023-05-10 10:10:47,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 29: [2023-05-10 10:10:47,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 8: [2023-05-10 10:10:47,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 29: [2023-05-10 10:10:47,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 23: [2023-05-10 10:10:47,503] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 23: [2023-05-10 10:10:47,503] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 29: [2023-05-10 10:10:47,504] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 23: [2023-05-10 10:10:47,504] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 29: [2023-05-10 10:10:47,504] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 8: [2023-05-10 10:10:47,504] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 8: [2023-05-10 10:10:47,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 8: [2023-05-10 10:10:47,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 8: [2023-05-10 10:10:47,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 8: [2023-05-10 10:10:47,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 8: [2023-05-10 10:10:47,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 8: [2023-05-10 10:10:47,506] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 8: [2023-05-10 10:10:47,506] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 23: [2023-05-10 10:10:47,507] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 23: [2023-05-10 10:10:47,507] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 23: [2023-05-10 10:10:47,507] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 23: [2023-05-10 10:10:47,508] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 23: [2023-05-10 10:10:47,508] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 29: [2023-05-10 10:10:47,508] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 29: [2023-05-10 10:10:47,508] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 29: [2023-05-10 10:10:47,509] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 29: [2023-05-10 10:10:47,509] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 29: [2023-05-10 10:10:47,509] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 29: [2023-05-10 10:10:47,510] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 29: [2023-05-10 10:10:47,554] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 29: [2023-05-10 10:10:47,554] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 23: [2023-05-10 10:10:47,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 23: [2023-05-10 10:10:47,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 23: [2023-05-10 10:10:47,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 8: [2023-05-10 10:10:47,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 8: [2023-05-10 10:10:47,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 8: [2023-05-10 10:10:47,560] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 23: [2023-05-10 10:10:47,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 23: [2023-05-10 10:10:47,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 23: [2023-05-10 10:10:47,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 23: [2023-05-10 10:10:47,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 8: [2023-05-10 10:10:47,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 8: [2023-05-10 10:10:47,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 8: [2023-05-10 10:10:47,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 8: [2023-05-10 10:10:47,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 8: [2023-05-10 10:10:47,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 29: [2023-05-10 10:10:47,572] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 23: [2023-05-10 10:10:47,572] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 29: [2023-05-10 10:10:47,578] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 29: [2023-05-10 10:10:47,578] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 29: [2023-05-10 10:10:47,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 29: [2023-05-10 10:10:47,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 29: [2023-05-10 10:10:47,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 29: [2023-05-10 10:10:47,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 29: [2023-05-10 10:10:47,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 23: [2023-05-10 10:10:47,584] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 23: [2023-05-10 10:10:47,586] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 23: [2023-05-10 10:10:47,586] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 8: [2023-05-10 10:10:47,590] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 8: [2023-05-10 10:10:47,591] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 8: [2023-05-10 10:10:47,593] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 29: [2023-05-10 10:10:47,595] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 8: [2023-05-10 10:10:47,601] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 8: [2023-05-10 10:10:47,602] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 23: [2023-05-10 10:10:47,602] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 23: [2023-05-10 10:10:47,602] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 23: [2023-05-10 10:10:47,602] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 8: [2023-05-10 10:10:47,603] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 8: [2023-05-10 10:10:47,603] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 8: [2023-05-10 10:10:47,604] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 23: [2023-05-10 10:10:47,605] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 23: [2023-05-10 10:10:47,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 29: [2023-05-10 10:10:47,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 29: [2023-05-10 10:10:47,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 29: [2023-05-10 10:10:47,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 29: [2023-05-10 10:10:47,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 29: [2023-05-10 10:10:47,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 20: [2023-05-10 10:10:47,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 20: [2023-05-10 10:10:47,638] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 20: [2023-05-10 10:10:47,638] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 20: [2023-05-10 10:10:47,638] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 20: [2023-05-10 10:10:47,638] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 20: [2023-05-10 10:10:47,638] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 20: [2023-05-10 10:10:47,638] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 20: [2023-05-10 10:10:47,639] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 27: [2023-05-10 10:10:47,640] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 27: [2023-05-10 10:10:47,640] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 27: [2023-05-10 10:10:47,640] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 27: [2023-05-10 10:10:47,641] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 27: [2023-05-10 10:10:47,641] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 27: [2023-05-10 10:10:47,641] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 27: [2023-05-10 10:10:47,641] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 27: [2023-05-10 10:10:47,641] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 20: [2023-05-10 10:10:47,642] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 11: [2023-05-10 10:10:47,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 11: [2023-05-10 10:10:47,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 19: [2023-05-10 10:10:47,646] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 27: [2023-05-10 10:10:47,646] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 24: [2023-05-10 10:10:47,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 28: [2023-05-10 10:10:47,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 28: [2023-05-10 10:10:47,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 19: [2023-05-10 10:10:47,646] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 19: [2023-05-10 10:10:47,646] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 19: [2023-05-10 10:10:47,646] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 20: [2023-05-10 10:10:47,647] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 19: [2023-05-10 10:10:47,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 19: [2023-05-10 10:10:47,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 19: [2023-05-10 10:10:47,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 12: [2023-05-10 10:10:47,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 19: [2023-05-10 10:10:47,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 14: [2023-05-10 10:10:47,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 14: [2023-05-10 10:10:47,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 27: [2023-05-10 10:10:47,647] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 28: [2023-05-10 10:10:47,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 28: [2023-05-10 10:10:47,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 11: [2023-05-10 10:10:47,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 11: [2023-05-10 10:10:47,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 28: [2023-05-10 10:10:47,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 28: [2023-05-10 10:10:47,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 28: [2023-05-10 10:10:47,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 11: [2023-05-10 10:10:47,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 11: [2023-05-10 10:10:47,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 11: [2023-05-10 10:10:47,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 24: [2023-05-10 10:10:47,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 1: [2023-05-10 10:10:47,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 1: [2023-05-10 10:10:47,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 1: [2023-05-10 10:10:47,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 1: [2023-05-10 10:10:47,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 14: [2023-05-10 10:10:47,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 14: [2023-05-10 10:10:47,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 14: [2023-05-10 10:10:47,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 20: [2023-05-10 10:10:47,648] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 1: [2023-05-10 10:10:47,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 1: [2023-05-10 10:10:47,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 1: [2023-05-10 10:10:47,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 14: [2023-05-10 10:10:47,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 28: [2023-05-10 10:10:47,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 11: [2023-05-10 10:10:47,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 14: [2023-05-10 10:10:47,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 24: [2023-05-10 10:10:47,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 12: [2023-05-10 10:10:47,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 24: [2023-05-10 10:10:47,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 24: [2023-05-10 10:10:47,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 24: [2023-05-10 10:10:47,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 24: [2023-05-10 10:10:47,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 12: [2023-05-10 10:10:47,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 1: [2023-05-10 10:10:47,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 12: [2023-05-10 10:10:47,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 14: [2023-05-10 10:10:47,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 24: [2023-05-10 10:10:47,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 12: [2023-05-10 10:10:47,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 12: [2023-05-10 10:10:47,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 12: [2023-05-10 10:10:47,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 20: [2023-05-10 10:10:47,649] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 12: [2023-05-10 10:10:47,649] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 20: [2023-05-10 10:10:47,649] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 22: [2023-05-10 10:10:47,649] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 22: [2023-05-10 10:10:47,649] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 20: [2023-05-10 10:10:47,649] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 31: [2023-05-10 10:10:47,649] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 22: [2023-05-10 10:10:47,649] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 22: [2023-05-10 10:10:47,649] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 22: [2023-05-10 10:10:47,650] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 22: [2023-05-10 10:10:47,650] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 22: [2023-05-10 10:10:47,650] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 20: [2023-05-10 10:10:47,650] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 31: [2023-05-10 10:10:47,650] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 31: [2023-05-10 10:10:47,650] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 22: [2023-05-10 10:10:47,650] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 20: [2023-05-10 10:10:47,650] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 27: [2023-05-10 10:10:47,650] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 31: [2023-05-10 10:10:47,650] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 31: [2023-05-10 10:10:47,650] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 31: [2023-05-10 10:10:47,650] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 31: [2023-05-10 10:10:47,650] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 28: [2023-05-10 10:10:47,651] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 31: [2023-05-10 10:10:47,651] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 27: [2023-05-10 10:10:47,651] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 28: [2023-05-10 10:10:47,651] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 19: [2023-05-10 10:10:47,651] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 27: [2023-05-10 10:10:47,651] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 11: [2023-05-10 10:10:47,651] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 19: [2023-05-10 10:10:47,651] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 11: [2023-05-10 10:10:47,652] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 24: [2023-05-10 10:10:47,652] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 27: [2023-05-10 10:10:47,652] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 24: [2023-05-10 10:10:47,652] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 19: [2023-05-10 10:10:47,652] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 12: [2023-05-10 10:10:47,652] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 27: [2023-05-10 10:10:47,652] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 27: [2023-05-10 10:10:47,652] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 14: [2023-05-10 10:10:47,653] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 14: [2023-05-10 10:10:47,653] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 31: [2023-05-10 10:10:47,654] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 14: [2023-05-10 10:10:47,654] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 1: [2023-05-10 10:10:47,654] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 1: [2023-05-10 10:10:47,654] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 1: [2023-05-10 10:10:47,655] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 31: [2023-05-10 10:10:47,655] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 22: [2023-05-10 10:10:47,654] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 19: [2023-05-10 10:10:47,655] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 24: [2023-05-10 10:10:47,655] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 22: [2023-05-10 10:10:47,655] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 24: [2023-05-10 10:10:47,655] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 19: [2023-05-10 10:10:47,655] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 24: [2023-05-10 10:10:47,655] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 24: [2023-05-10 10:10:47,656] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 19: [2023-05-10 10:10:47,656] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 24: [2023-05-10 10:10:47,656] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 24: [2023-05-10 10:10:47,656] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 14: [2023-05-10 10:10:47,656] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 19: [2023-05-10 10:10:47,657] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 1: [2023-05-10 10:10:47,657] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 1: [2023-05-10 10:10:47,657] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 11: [2023-05-10 10:10:47,657] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 28: [2023-05-10 10:10:47,657] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 1: [2023-05-10 10:10:47,657] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 19: [2023-05-10 10:10:47,657] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 14: [2023-05-10 10:10:47,657] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 1: [2023-05-10 10:10:47,657] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 11: [2023-05-10 10:10:47,657] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 14: [2023-05-10 10:10:47,657] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 1: [2023-05-10 10:10:47,658] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 11: [2023-05-10 10:10:47,658] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 14: [2023-05-10 10:10:47,658] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 11: [2023-05-10 10:10:47,658] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 28: [2023-05-10 10:10:47,658] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 14: [2023-05-10 10:10:47,658] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 28: [2023-05-10 10:10:47,658] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 12: [2023-05-10 10:10:47,659] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 12: [2023-05-10 10:10:47,659] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 28: [2023-05-10 10:10:47,659] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 31: [2023-05-10 10:10:47,659] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 11: [2023-05-10 10:10:47,659] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 28: [2023-05-10 10:10:47,659] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 11: [2023-05-10 10:10:47,659] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 31: [2023-05-10 10:10:47,659] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 31: [2023-05-10 10:10:47,659] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 28: [2023-05-10 10:10:47,660] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 31: [2023-05-10 10:10:47,660] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 12: [2023-05-10 10:10:47,660] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 31: [2023-05-10 10:10:47,660] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 31: [2023-05-10 10:10:47,660] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 22: [2023-05-10 10:10:47,660] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 12: [2023-05-10 10:10:47,660] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 22: [2023-05-10 10:10:47,660] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 22: [2023-05-10 10:10:47,661] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 12: [2023-05-10 10:10:47,661] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 12: [2023-05-10 10:10:47,661] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 22: [2023-05-10 10:10:47,661] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 12: [2023-05-10 10:10:47,661] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 22: [2023-05-10 10:10:47,662] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 22: [2023-05-10 10:10:47,662] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 0: [2023-05-10 10:10:47,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 0: [2023-05-10 10:10:47,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 0: [2023-05-10 10:10:47,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 0: [2023-05-10 10:10:47,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 0: [2023-05-10 10:10:47,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 0: [2023-05-10 10:10:47,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 0: [2023-05-10 10:10:47,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 0: [2023-05-10 10:10:47,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 6: [2023-05-10 10:10:47,667] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 6: [2023-05-10 10:10:47,667] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 30: [2023-05-10 10:10:47,667] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 30: [2023-05-10 10:10:47,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 6: [2023-05-10 10:10:47,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 30: [2023-05-10 10:10:47,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 30: [2023-05-10 10:10:47,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 30: [2023-05-10 10:10:47,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 30: [2023-05-10 10:10:47,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 30: [2023-05-10 10:10:47,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 6: [2023-05-10 10:10:47,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 30: [2023-05-10 10:10:47,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 6: [2023-05-10 10:10:47,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 6: [2023-05-10 10:10:47,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 6: [2023-05-10 10:10:47,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 6: [2023-05-10 10:10:47,669] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 0: [2023-05-10 10:10:47,670] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 0: [2023-05-10 10:10:47,670] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 0: [2023-05-10 10:10:47,670] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 0: [2023-05-10 10:10:47,671] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 6: [2023-05-10 10:10:47,672] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 0: [2023-05-10 10:10:47,672] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 6: [2023-05-10 10:10:47,672] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 0: [2023-05-10 10:10:47,672] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 0: [2023-05-10 10:10:47,672] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 0: [2023-05-10 10:10:47,673] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 21: [2023-05-10 10:10:47,673] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 21: [2023-05-10 10:10:47,673] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 21: [2023-05-10 10:10:47,673] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 21: [2023-05-10 10:10:47,673] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 21: [2023-05-10 10:10:47,673] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 21: [2023-05-10 10:10:47,673] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 21: [2023-05-10 10:10:47,673] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 21: [2023-05-10 10:10:47,674] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 6: [2023-05-10 10:10:47,677] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 6: [2023-05-10 10:10:47,677] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 30: [2023-05-10 10:10:47,678] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 6: [2023-05-10 10:10:47,679] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 30: [2023-05-10 10:10:47,679] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 6: [2023-05-10 10:10:47,679] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 6: [2023-05-10 10:10:47,679] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 30: [2023-05-10 10:10:47,679] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 6: [2023-05-10 10:10:47,679] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 30: [2023-05-10 10:10:47,680] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 21: [2023-05-10 10:10:47,679] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 21: [2023-05-10 10:10:47,680] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 21: [2023-05-10 10:10:47,680] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 21: [2023-05-10 10:10:47,680] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 30: [2023-05-10 10:10:47,680] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 21: [2023-05-10 10:10:47,680] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 30: [2023-05-10 10:10:47,680] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 30: [2023-05-10 10:10:47,681] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 30: [2023-05-10 10:10:47,681] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 21: [2023-05-10 10:10:47,681] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 21: [2023-05-10 10:10:47,682] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 21: [2023-05-10 10:10:47,682] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 20: [2023-05-10 10:10:47,691] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 27: [2023-05-10 10:10:47,697] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 27: [2023-05-10 10:10:47,698] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 18: [2023-05-10 10:10:47,698] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 18: [2023-05-10 10:10:47,698] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 18: [2023-05-10 10:10:47,698] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 18: [2023-05-10 10:10:47,698] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 18: [2023-05-10 10:10:47,698] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 18: [2023-05-10 10:10:47,698] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 18: [2023-05-10 10:10:47,698] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 18: [2023-05-10 10:10:47,699] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 24: [2023-05-10 10:10:47,702] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 11: [2023-05-10 10:10:47,702] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 11: [2023-05-10 10:10:47,702] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 28: [2023-05-10 10:10:47,702] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 28: [2023-05-10 10:10:47,702] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 24: [2023-05-10 10:10:47,703] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 12: [2023-05-10 10:10:47,704] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 31: [2023-05-10 10:10:47,704] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 18: [2023-05-10 10:10:47,704] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 18: [2023-05-10 10:10:47,704] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 18: [2023-05-10 10:10:47,704] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 31: [2023-05-10 10:10:47,705] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 18: [2023-05-10 10:10:47,704] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 19: [2023-05-10 10:10:47,705] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 19: [2023-05-10 10:10:47,706] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 19: [2023-05-10 10:10:47,706] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 18: [2023-05-10 10:10:47,705] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 22: [2023-05-10 10:10:47,706] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 18: [2023-05-10 10:10:47,706] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 14: [2023-05-10 10:10:47,707] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 14: [2023-05-10 10:10:47,707] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 18: [2023-05-10 10:10:47,707] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 18: [2023-05-10 10:10:47,707] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 22: [2023-05-10 10:10:47,708] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 1: [2023-05-10 10:10:47,708] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 1: [2023-05-10 10:10:47,708] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 14: [2023-05-10 10:10:47,709] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 1: [2023-05-10 10:10:47,710] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 20: [2023-05-10 10:10:47,713] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 19: [2023-05-10 10:10:47,715] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 19: [2023-05-10 10:10:47,715] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 27: [2023-05-10 10:10:47,716] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 1: [2023-05-10 10:10:47,716] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 20: [2023-05-10 10:10:47,716] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 20: [2023-05-10 10:10:47,716] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 19: [2023-05-10 10:10:47,718] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 19: [2023-05-10 10:10:47,718] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 19: [2023-05-10 10:10:47,718] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 14: [2023-05-10 10:10:47,721] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 14: [2023-05-10 10:10:47,721] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 27: [2023-05-10 10:10:47,721] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 27: [2023-05-10 10:10:47,721] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 27: [2023-05-10 10:10:47,721] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 27: [2023-05-10 10:10:47,721] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 27: [2023-05-10 10:10:47,721] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 1: [2023-05-10 10:10:47,722] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 1: [2023-05-10 10:10:47,722] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 1: [2023-05-10 10:10:47,722] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 1: [2023-05-10 10:10:47,722] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 14: [2023-05-10 10:10:47,722] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 14: [2023-05-10 10:10:47,722] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 14: [2023-05-10 10:10:47,722] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 6: [2023-05-10 10:10:47,722] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 6: [2023-05-10 10:10:47,722] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 31: [2023-05-10 10:10:47,723] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 28: [2023-05-10 10:10:47,724] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 28: [2023-05-10 10:10:47,724] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 11: [2023-05-10 10:10:47,726] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 11: [2023-05-10 10:10:47,726] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 11: [2023-05-10 10:10:47,726] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 24: [2023-05-10 10:10:47,726] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 24: [2023-05-10 10:10:47,726] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 24: [2023-05-10 10:10:47,726] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 24: [2023-05-10 10:10:47,726] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 24: [2023-05-10 10:10:47,726] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 24: [2023-05-10 10:10:47,726] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 31: [2023-05-10 10:10:47,727] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 31: [2023-05-10 10:10:47,727] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 28: [2023-05-10 10:10:47,727] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 28: [2023-05-10 10:10:47,727] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 27: [2023-05-10 10:10:47,727] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 20: [2023-05-10 10:10:47,727] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 20: [2023-05-10 10:10:47,727] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 20: [2023-05-10 10:10:47,727] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 20: [2023-05-10 10:10:47,727] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 20: [2023-05-10 10:10:47,727] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 11: [2023-05-10 10:10:47,728] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 11: [2023-05-10 10:10:47,728] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 25: [2023-05-10 10:10:47,727] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 25: [2023-05-10 10:10:47,727] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 28: [2023-05-10 10:10:47,728] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 28: [2023-05-10 10:10:47,728] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 11: [2023-05-10 10:10:47,728] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 0: [2023-05-10 10:10:47,728] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 0: [2023-05-10 10:10:47,728] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 0: [2023-05-10 10:10:47,728] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 11: [2023-05-10 10:10:47,728] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 11: [2023-05-10 10:10:47,728] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 25: [2023-05-10 10:10:47,728] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 25: [2023-05-10 10:10:47,728] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 22: [2023-05-10 10:10:47,728] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 22: [2023-05-10 10:10:47,728] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 25: [2023-05-10 10:10:47,728] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 24: [2023-05-10 10:10:47,728] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 25: [2023-05-10 10:10:47,728] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 25: [2023-05-10 10:10:47,728] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 28: [2023-05-10 10:10:47,729] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 25: [2023-05-10 10:10:47,729] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 27: [2023-05-10 10:10:47,729] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 24: [2023-05-10 10:10:47,729] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 28: [2023-05-10 10:10:47,729] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 12: [2023-05-10 10:10:47,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 12: [2023-05-10 10:10:47,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 31: [2023-05-10 10:10:47,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 31: [2023-05-10 10:10:47,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 31: [2023-05-10 10:10:47,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 31: [2023-05-10 10:10:47,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 31: [2023-05-10 10:10:47,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 22: [2023-05-10 10:10:47,731] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 22: [2023-05-10 10:10:47,731] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 22: [2023-05-10 10:10:47,731] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 19: [2023-05-10 10:10:47,731] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 22: [2023-05-10 10:10:47,731] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 12: [2023-05-10 10:10:47,732] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 19: [2023-05-10 10:10:47,732] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 19: [2023-05-10 10:10:47,733] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 21: [2023-05-10 10:10:47,734] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 21: [2023-05-10 10:10:47,734] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 25: [2023-05-10 10:10:47,733] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 25: [2023-05-10 10:10:47,734] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 21: [2023-05-10 10:10:47,734] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 25: [2023-05-10 10:10:47,734] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 22: [2023-05-10 10:10:47,735] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 14: [2023-05-10 10:10:47,735] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 25: [2023-05-10 10:10:47,735] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 25: [2023-05-10 10:10:47,735] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 25: [2023-05-10 10:10:47,736] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 1: [2023-05-10 10:10:47,736] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 1: [2023-05-10 10:10:47,736] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 25: [2023-05-10 10:10:47,736] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 25: [2023-05-10 10:10:47,736] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 14: [2023-05-10 10:10:47,736] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 22: [2023-05-10 10:10:47,737] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 0: [2023-05-10 10:10:47,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 0: [2023-05-10 10:10:47,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 0: [2023-05-10 10:10:47,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 0: [2023-05-10 10:10:47,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 0: [2023-05-10 10:10:47,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 12: [2023-05-10 10:10:47,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 12: [2023-05-10 10:10:47,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 12: [2023-05-10 10:10:47,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 12: [2023-05-10 10:10:47,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 12: [2023-05-10 10:10:47,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 7: [2023-05-10 10:10:47,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 7: [2023-05-10 10:10:47,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 7: [2023-05-10 10:10:47,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 7: [2023-05-10 10:10:47,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 7: [2023-05-10 10:10:47,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 7: [2023-05-10 10:10:47,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 7: [2023-05-10 10:10:47,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 10: [2023-05-10 10:10:47,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 10: [2023-05-10 10:10:47,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 10: [2023-05-10 10:10:47,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 10: [2023-05-10 10:10:47,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 10: [2023-05-10 10:10:47,739] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 7: [2023-05-10 10:10:47,739] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 10: [2023-05-10 10:10:47,739] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 10: [2023-05-10 10:10:47,739] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 10: [2023-05-10 10:10:47,739] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 14: [2023-05-10 10:10:47,740] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 1: [2023-05-10 10:10:47,742] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 6: [2023-05-10 10:10:47,742] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 6: [2023-05-10 10:10:47,742] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 7: [2023-05-10 10:10:47,743] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 7: [2023-05-10 10:10:47,743] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 7: [2023-05-10 10:10:47,744] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 19: [2023-05-10 10:10:47,744] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 21: [2023-05-10 10:10:47,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 20: [2023-05-10 10:10:47,744] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 21: [2023-05-10 10:10:47,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 21: [2023-05-10 10:10:47,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 7: [2023-05-10 10:10:47,745] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 21: [2023-05-10 10:10:47,745] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 21: [2023-05-10 10:10:47,745] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 10: [2023-05-10 10:10:47,745] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 10: [2023-05-10 10:10:47,745] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 7: [2023-05-10 10:10:47,745] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 7: [2023-05-10 10:10:47,746] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 1: [2023-05-10 10:10:47,746] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 10: [2023-05-10 10:10:47,746] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 6: [2023-05-10 10:10:47,746] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 6: [2023-05-10 10:10:47,746] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 20: [2023-05-10 10:10:47,746] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 10: [2023-05-10 10:10:47,746] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 7: [2023-05-10 10:10:47,746] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 7: [2023-05-10 10:10:47,747] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 10: [2023-05-10 10:10:47,747] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 10: [2023-05-10 10:10:47,747] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 10: [2023-05-10 10:10:47,747] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 27: [2023-05-10 10:10:47,748] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 10: [2023-05-10 10:10:47,748] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 19: [2023-05-10 10:10:47,749] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 6: [2023-05-10 10:10:47,750] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 6: [2023-05-10 10:10:47,750] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 6: [2023-05-10 10:10:47,750] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 6: [2023-05-10 10:10:47,750] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 30: [2023-05-10 10:10:47,751] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 30: [2023-05-10 10:10:47,751] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 19: [2023-05-10 10:10:47,752] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 0: [2023-05-10 10:10:47,753] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 31: [2023-05-10 10:10:47,753] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 19: [2023-05-10 10:10:47,754] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 19: [2023-05-10 10:10:47,754] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 1: [2023-05-10 10:10:47,754] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 12: [2023-05-10 10:10:47,755] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 1: [2023-05-10 10:10:47,756] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 0: [2023-05-10 10:10:47,756] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 0: [2023-05-10 10:10:47,756] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 1: [2023-05-10 10:10:47,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 1: [2023-05-10 10:10:47,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 14: [2023-05-10 10:10:47,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 27: [2023-05-10 10:10:47,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 11: [2023-05-10 10:10:47,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 14: [2023-05-10 10:10:47,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 11: [2023-05-10 10:10:47,759] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 4: [2023-05-10 10:10:47,758] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 4: [2023-05-10 10:10:47,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 12: [2023-05-10 10:10:47,759] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 24: [2023-05-10 10:10:47,759] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 21: [2023-05-10 10:10:47,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 4: [2023-05-10 10:10:47,760] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 4: [2023-05-10 10:10:47,760] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 20: [2023-05-10 10:10:47,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 24: [2023-05-10 10:10:47,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 28: [2023-05-10 10:10:47,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 4: [2023-05-10 10:10:47,760] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 4: [2023-05-10 10:10:47,760] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 4: [2023-05-10 10:10:47,760] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 14: [2023-05-10 10:10:47,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 27: [2023-05-10 10:10:47,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 4: [2023-05-10 10:10:47,760] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 22: [2023-05-10 10:10:47,761] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 14: [2023-05-10 10:10:47,761] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 14: [2023-05-10 10:10:47,761] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 24: [2023-05-10 10:10:47,761] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 27: [2023-05-10 10:10:47,761] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 27: [2023-05-10 10:10:47,761] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 27: [2023-05-10 10:10:47,761] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 18: [2023-05-10 10:10:47,761] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 18: [2023-05-10 10:10:47,761] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 18: [2023-05-10 10:10:47,761] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 18: [2023-05-10 10:10:47,761] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 24: [2023-05-10 10:10:47,763] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 21: [2023-05-10 10:10:47,764] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 22: [2023-05-10 10:10:47,764] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 28: [2023-05-10 10:10:47,764] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 24: [2023-05-10 10:10:47,764] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 24: [2023-05-10 10:10:47,764] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 11: [2023-05-10 10:10:47,765] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 4: [2023-05-10 10:10:47,765] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 4: [2023-05-10 10:10:47,765] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 20: [2023-05-10 10:10:47,765] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 13: [2023-05-10 10:10:47,765] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 13: [2023-05-10 10:10:47,765] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 13: [2023-05-10 10:10:47,765] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 20: [2023-05-10 10:10:47,765] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 4: [2023-05-10 10:10:47,765] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 20: [2023-05-10 10:10:47,766] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 4: [2023-05-10 10:10:47,766] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 13: [2023-05-10 10:10:47,766] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 13: [2023-05-10 10:10:47,766] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 13: [2023-05-10 10:10:47,766] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 13: [2023-05-10 10:10:47,766] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 4: [2023-05-10 10:10:47,766] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 20: [2023-05-10 10:10:47,766] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 4: [2023-05-10 10:10:47,766] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 13: [2023-05-10 10:10:47,766] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 4: [2023-05-10 10:10:47,766] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 18: [2023-05-10 10:10:47,765] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 18: [2023-05-10 10:10:47,765] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 18: [2023-05-10 10:10:47,765] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 18: [2023-05-10 10:10:47,765] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 28: [2023-05-10 10:10:47,766] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 11: [2023-05-10 10:10:47,767] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 11: [2023-05-10 10:10:47,767] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 4: [2023-05-10 10:10:47,767] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 30: [2023-05-10 10:10:47,767] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 30: [2023-05-10 10:10:47,767] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 30: [2023-05-10 10:10:47,767] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 30: [2023-05-10 10:10:47,767] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 30: [2023-05-10 10:10:47,767] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 30: [2023-05-10 10:10:47,767] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 21: [2023-05-10 10:10:47,767] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 11: [2023-05-10 10:10:47,768] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 22: [2023-05-10 10:10:47,768] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 22: [2023-05-10 10:10:47,768] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 31: [2023-05-10 10:10:47,769] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 6: [2023-05-10 10:10:47,770] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 6: [2023-05-10 10:10:47,770] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 0: [2023-05-10 10:10:47,770] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 28: [2023-05-10 10:10:47,771] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 28: [2023-05-10 10:10:47,771] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 22: [2023-05-10 10:10:47,771] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 22: [2023-05-10 10:10:47,771] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 28: [2023-05-10 10:10:47,771] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 0: [2023-05-10 10:10:47,771] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 0: [2023-05-10 10:10:47,771] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 31: [2023-05-10 10:10:47,772] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 31: [2023-05-10 10:10:47,772] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 31: [2023-05-10 10:10:47,772] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 31: [2023-05-10 10:10:47,772] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 0: [2023-05-10 10:10:47,775] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 12: [2023-05-10 10:10:47,775] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 12: [2023-05-10 10:10:47,775] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 0: [2023-05-10 10:10:47,777] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 13: [2023-05-10 10:10:47,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 30: [2023-05-10 10:10:47,777] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 30: [2023-05-10 10:10:47,777] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 12: [2023-05-10 10:10:47,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 12: [2023-05-10 10:10:47,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 13: [2023-05-10 10:10:47,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 12: [2023-05-10 10:10:47,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 13: [2023-05-10 10:10:47,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 17: [2023-05-10 10:10:47,779] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 13: [2023-05-10 10:10:47,780] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 17: [2023-05-10 10:10:47,780] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 17: [2023-05-10 10:10:47,780] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 13: [2023-05-10 10:10:47,780] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 17: [2023-05-10 10:10:47,780] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 17: [2023-05-10 10:10:47,780] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 17: [2023-05-10 10:10:47,780] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 17: [2023-05-10 10:10:47,780] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 13: [2023-05-10 10:10:47,780] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 17: [2023-05-10 10:10:47,780] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 13: [2023-05-10 10:10:47,781] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 13: [2023-05-10 10:10:47,781] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 6: [2023-05-10 10:10:47,782] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 6: [2023-05-10 10:10:47,783] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 6: [2023-05-10 10:10:47,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 17: [2023-05-10 10:10:47,783] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 6: [2023-05-10 10:10:47,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 21: [2023-05-10 10:10:47,785] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 21: [2023-05-10 10:10:47,786] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 21: [2023-05-10 10:10:47,786] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 21: [2023-05-10 10:10:47,786] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 21: [2023-05-10 10:10:47,786] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 18: [2023-05-10 10:10:47,789] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 18: [2023-05-10 10:10:47,789] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 17: [2023-05-10 10:10:47,790] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 17: [2023-05-10 10:10:47,791] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 17: [2023-05-10 10:10:47,791] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 25: [2023-05-10 10:10:47,791] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 25: [2023-05-10 10:10:47,791] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 25: [2023-05-10 10:10:47,791] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 17: [2023-05-10 10:10:47,792] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 17: [2023-05-10 10:10:47,792] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 17: [2023-05-10 10:10:47,793] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 18: [2023-05-10 10:10:47,792] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 17: [2023-05-10 10:10:47,793] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 18: [2023-05-10 10:10:47,793] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 25: [2023-05-10 10:10:47,796] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 25: [2023-05-10 10:10:47,796] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 25: [2023-05-10 10:10:47,796] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 25: [2023-05-10 10:10:47,796] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 25: [2023-05-10 10:10:47,797] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 10: [2023-05-10 10:10:47,798] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 10: [2023-05-10 10:10:47,798] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 10: [2023-05-10 10:10:47,799] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 18: [2023-05-10 10:10:47,800] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 18: [2023-05-10 10:10:47,801] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 7: [2023-05-10 10:10:47,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 7: [2023-05-10 10:10:47,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 7: [2023-05-10 10:10:47,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 7: [2023-05-10 10:10:47,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 18: [2023-05-10 10:10:47,803] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 18: [2023-05-10 10:10:47,803] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 30: [2023-05-10 10:10:47,804] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 10: [2023-05-10 10:10:47,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 30: [2023-05-10 10:10:47,805] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 30: [2023-05-10 10:10:47,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 30: [2023-05-10 10:10:47,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 7: [2023-05-10 10:10:47,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 7: [2023-05-10 10:10:47,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 7: [2023-05-10 10:10:47,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 7: [2023-05-10 10:10:47,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 30: [2023-05-10 10:10:47,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 30: [2023-05-10 10:10:47,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 16: [2023-05-10 10:10:47,812] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 16: [2023-05-10 10:10:47,812] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 10: [2023-05-10 10:10:47,813] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 10: [2023-05-10 10:10:47,813] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 10: [2023-05-10 10:10:47,813] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 10: [2023-05-10 10:10:47,813] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 16: [2023-05-10 10:10:47,813] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 16: [2023-05-10 10:10:47,813] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 16: [2023-05-10 10:10:47,813] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 16: [2023-05-10 10:10:47,813] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 16: [2023-05-10 10:10:47,813] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 16: [2023-05-10 10:10:47,814] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 16: [2023-05-10 10:10:47,819] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 16: [2023-05-10 10:10:47,820] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 25: [2023-05-10 10:10:47,820] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 25: [2023-05-10 10:10:47,822] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 25: [2023-05-10 10:10:47,822] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 4: [2023-05-10 10:10:47,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 4: [2023-05-10 10:10:47,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 4: [2023-05-10 10:10:47,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 16: [2023-05-10 10:10:47,824] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 16: [2023-05-10 10:10:47,825] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 16: [2023-05-10 10:10:47,825] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 16: [2023-05-10 10:10:47,825] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 4: [2023-05-10 10:10:47,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 4: [2023-05-10 10:10:47,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 4: [2023-05-10 10:10:47,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 4: [2023-05-10 10:10:47,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 4: [2023-05-10 10:10:47,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 16: [2023-05-10 10:10:47,827] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 16: [2023-05-10 10:10:47,827] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 10: [2023-05-10 10:10:47,828] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 25: [2023-05-10 10:10:47,828] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 7: [2023-05-10 10:10:47,830] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 10: [2023-05-10 10:10:47,831] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 10: [2023-05-10 10:10:47,831] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 10: [2023-05-10 10:10:47,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 7: [2023-05-10 10:10:47,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 25: [2023-05-10 10:10:47,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 25: [2023-05-10 10:10:47,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 25: [2023-05-10 10:10:47,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 7: [2023-05-10 10:10:47,833] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 7: [2023-05-10 10:10:47,833] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 17: [2023-05-10 10:10:47,833] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 25: [2023-05-10 10:10:47,834] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 2: [2023-05-10 10:10:47,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 2: [2023-05-10 10:10:47,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 7: [2023-05-10 10:10:47,837] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 2: [2023-05-10 10:10:47,838] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 2: [2023-05-10 10:10:47,838] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 2: [2023-05-10 10:10:47,838] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 2: [2023-05-10 10:10:47,838] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 2: [2023-05-10 10:10:47,838] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 2: [2023-05-10 10:10:47,838] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 9: [2023-05-10 10:10:47,838] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 9: [2023-05-10 10:10:47,838] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 7: [2023-05-10 10:10:47,838] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 7: [2023-05-10 10:10:47,838] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 9: [2023-05-10 10:10:47,839] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 9: [2023-05-10 10:10:47,839] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 9: [2023-05-10 10:10:47,839] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 9: [2023-05-10 10:10:47,839] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 9: [2023-05-10 10:10:47,839] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 9: [2023-05-10 10:10:47,839] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 5: [2023-05-10 10:10:47,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 5: [2023-05-10 10:10:47,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 7: [2023-05-10 10:10:47,840] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 5: [2023-05-10 10:10:47,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 5: [2023-05-10 10:10:47,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 5: [2023-05-10 10:10:47,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 5: [2023-05-10 10:10:47,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 15: [2023-05-10 10:10:47,839] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 15: [2023-05-10 10:10:47,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 15: [2023-05-10 10:10:47,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 15: [2023-05-10 10:10:47,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 15: [2023-05-10 10:10:47,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 15: [2023-05-10 10:10:47,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 15: [2023-05-10 10:10:47,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 5: [2023-05-10 10:10:47,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 5: [2023-05-10 10:10:47,841] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 15: [2023-05-10 10:10:47,841] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 2: [2023-05-10 10:10:47,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 2: [2023-05-10 10:10:47,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 26: [2023-05-10 10:10:47,844] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 26: [2023-05-10 10:10:47,844] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 15: [2023-05-10 10:10:47,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 9: [2023-05-10 10:10:47,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 9: [2023-05-10 10:10:47,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 9: [2023-05-10 10:10:47,845] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 10: [2023-05-10 10:10:47,845] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 10: [2023-05-10 10:10:47,846] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 9: [2023-05-10 10:10:47,846] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 10: [2023-05-10 10:10:47,846] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 9: [2023-05-10 10:10:47,847] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 5: [2023-05-10 10:10:47,846] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 5: [2023-05-10 10:10:47,847] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 9: [2023-05-10 10:10:47,847] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 10: [2023-05-10 10:10:47,847] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 5: [2023-05-10 10:10:47,847] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 9: [2023-05-10 10:10:47,847] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 2: [2023-05-10 10:10:47,847] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 2: [2023-05-10 10:10:47,848] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 9: [2023-05-10 10:10:47,848] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 26: [2023-05-10 10:10:47,848] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 26: [2023-05-10 10:10:47,848] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 26: [2023-05-10 10:10:47,848] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 26: [2023-05-10 10:10:47,848] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 26: [2023-05-10 10:10:47,848] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 2: [2023-05-10 10:10:47,848] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 5: [2023-05-10 10:10:47,848] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 26: [2023-05-10 10:10:47,848] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 5: [2023-05-10 10:10:47,849] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 23: [2023-05-10 10:10:47,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 23: [2023-05-10 10:10:47,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 23: [2023-05-10 10:10:47,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 5: [2023-05-10 10:10:47,849] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 23: [2023-05-10 10:10:47,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 23: [2023-05-10 10:10:47,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 23: [2023-05-10 10:10:47,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 23: [2023-05-10 10:10:47,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 2: [2023-05-10 10:10:47,849] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 23: [2023-05-10 10:10:47,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 26: [2023-05-10 10:10:47,849] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 2: [2023-05-10 10:10:47,850] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 26: [2023-05-10 10:10:47,850] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 5: [2023-05-10 10:10:47,850] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 2: [2023-05-10 10:10:47,850] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 15: [2023-05-10 10:10:47,850] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 15: [2023-05-10 10:10:47,850] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 5: [2023-05-10 10:10:47,851] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 15: [2023-05-10 10:10:47,851] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 15: [2023-05-10 10:10:47,851] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 15: [2023-05-10 10:10:47,852] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 15: [2023-05-10 10:10:47,852] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 15: [2023-05-10 10:10:47,852] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 13: [2023-05-10 10:10:47,853] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 4: [2023-05-10 10:10:47,853] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 13: [2023-05-10 10:10:47,853] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 13: [2023-05-10 10:10:47,853] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 23: [2023-05-10 10:10:47,853] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 23: [2023-05-10 10:10:47,854] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 23: [2023-05-10 10:10:47,854] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 4: [2023-05-10 10:10:47,854] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 4: [2023-05-10 10:10:47,855] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 4: [2023-05-10 10:10:47,855] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 4: [2023-05-10 10:10:47,856] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 23: [2023-05-10 10:10:47,857] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 23: [2023-05-10 10:10:47,857] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 26: [2023-05-10 10:10:47,857] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 26: [2023-05-10 10:10:47,857] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 23: [2023-05-10 10:10:47,857] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 26: [2023-05-10 10:10:47,857] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 4: [2023-05-10 10:10:47,858] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 4: [2023-05-10 10:10:47,858] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 23: [2023-05-10 10:10:47,858] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 23: [2023-05-10 10:10:47,858] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 26: [2023-05-10 10:10:47,858] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 17: [2023-05-10 10:10:47,858] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 26: [2023-05-10 10:10:47,859] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 26: [2023-05-10 10:10:47,859] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 17: [2023-05-10 10:10:47,861] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 17: [2023-05-10 10:10:47,861] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 4: [2023-05-10 10:10:47,862] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 13: [2023-05-10 10:10:47,866] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 13: [2023-05-10 10:10:47,866] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 13: [2023-05-10 10:10:47,866] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 13: [2023-05-10 10:10:47,866] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 13: [2023-05-10 10:10:47,866] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 17: [2023-05-10 10:10:47,870] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 17: [2023-05-10 10:10:47,870] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 17: [2023-05-10 10:10:47,870] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 17: [2023-05-10 10:10:47,870] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 17: [2023-05-10 10:10:47,870] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 16: [2023-05-10 10:10:47,870] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 16: [2023-05-10 10:10:47,875] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 3: [2023-05-10 10:10:47,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 3: [2023-05-10 10:10:47,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 3: [2023-05-10 10:10:47,881] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 3: [2023-05-10 10:10:47,881] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 3: [2023-05-10 10:10:47,881] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 3: [2023-05-10 10:10:47,881] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 3: [2023-05-10 10:10:47,881] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 3: [2023-05-10 10:10:47,882] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 13: [2023-05-10 10:10:47,883] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 3: [2023-05-10 10:10:47,883] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 3: [2023-05-10 10:10:47,883] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 13: [2023-05-10 10:10:47,884] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 13: [2023-05-10 10:10:47,884] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 17: [2023-05-10 10:10:47,885] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 3: [2023-05-10 10:10:47,885] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 17: [2023-05-10 10:10:47,886] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 3: [2023-05-10 10:10:47,890] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 3: [2023-05-10 10:10:47,890] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 3: [2023-05-10 10:10:47,890] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 3: [2023-05-10 10:10:47,890] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 2: [2023-05-10 10:10:47,891] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 2: [2023-05-10 10:10:47,891] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 3: [2023-05-10 10:10:47,891] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt... 15: [2023-05-10 10:10:47,893] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 16: [2023-05-10 10:10:47,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 16: [2023-05-10 10:10:47,897] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 16: [2023-05-10 10:10:47,898] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 16: [2023-05-10 10:10:47,898] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 16: [2023-05-10 10:10:47,898] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 16: [2023-05-10 10:10:47,898] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 16: [2023-05-10 10:10:47,898] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 5: [2023-05-10 10:10:47,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 5: [2023-05-10 10:10:47,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 26: [2023-05-10 10:10:47,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 26: [2023-05-10 10:10:47,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 5: [2023-05-10 10:10:47,901] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 9: [2023-05-10 10:10:47,903] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 9: [2023-05-10 10:10:47,903] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 9: [2023-05-10 10:10:47,903] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 17: [2023-05-10 10:10:47,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 9: [2023-05-10 10:10:47,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 13: [2023-05-10 10:10:47,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 16: [2023-05-10 10:10:47,906] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 9: [2023-05-10 10:10:47,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 9: [2023-05-10 10:10:47,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 9: [2023-05-10 10:10:47,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 9: [2023-05-10 10:10:47,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 13: [2023-05-10 10:10:47,907] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 13: [2023-05-10 10:10:47,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 17: [2023-05-10 10:10:47,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 17: [2023-05-10 10:10:47,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 23: [2023-05-10 10:10:47,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 23: [2023-05-10 10:10:47,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 23: [2023-05-10 10:10:47,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 13: [2023-05-10 10:10:47,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 13: [2023-05-10 10:10:47,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 17: [2023-05-10 10:10:47,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 17: [2023-05-10 10:10:47,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 5: [2023-05-10 10:10:47,913] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 5: [2023-05-10 10:10:47,913] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 5: [2023-05-10 10:10:47,913] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 5: [2023-05-10 10:10:47,913] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 5: [2023-05-10 10:10:47,914] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 2: [2023-05-10 10:10:47,916] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 2: [2023-05-10 10:10:47,916] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 2: [2023-05-10 10:10:47,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 2: [2023-05-10 10:10:47,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 15: [2023-05-10 10:10:47,918] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 23: [2023-05-10 10:10:47,918] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 15: [2023-05-10 10:10:47,919] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 15: [2023-05-10 10:10:47,919] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 2: [2023-05-10 10:10:47,919] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 2: [2023-05-10 10:10:47,919] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 2: [2023-05-10 10:10:47,919] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 23: [2023-05-10 10:10:47,922] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 23: [2023-05-10 10:10:47,922] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 23: [2023-05-10 10:10:47,922] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 2: [2023-05-10 10:10:47,922] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 23: [2023-05-10 10:10:47,922] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 26: [2023-05-10 10:10:47,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 26: [2023-05-10 10:10:47,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 26: [2023-05-10 10:10:47,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 5: [2023-05-10 10:10:47,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 5: [2023-05-10 10:10:47,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 26: [2023-05-10 10:10:47,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 26: [2023-05-10 10:10:47,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 26: [2023-05-10 10:10:47,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 26: [2023-05-10 10:10:47,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 15: [2023-05-10 10:10:47,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 26: [2023-05-10 10:10:47,929] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 15: [2023-05-10 10:10:47,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 15: [2023-05-10 10:10:47,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 15: [2023-05-10 10:10:47,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 15: [2023-05-10 10:10:47,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 5: [2023-05-10 10:10:47,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 23: [2023-05-10 10:10:47,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 9: [2023-05-10 10:10:47,933] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 23: [2023-05-10 10:10:47,933] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 23: [2023-05-10 10:10:47,934] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 9: [2023-05-10 10:10:47,935] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 9: [2023-05-10 10:10:47,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 3: [2023-05-10 10:10:47,936] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 3: [2023-05-10 10:10:47,936] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 3: [2023-05-10 10:10:47,936] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 16: [2023-05-10 10:10:47,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 9: [2023-05-10 10:10:47,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 9: [2023-05-10 10:10:47,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 9: [2023-05-10 10:10:47,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 9: [2023-05-10 10:10:47,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 16: [2023-05-10 10:10:47,942] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 16: [2023-05-10 10:10:47,942] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 9: [2023-05-10 10:10:47,942] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 16: [2023-05-10 10:10:47,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 16: [2023-05-10 10:10:47,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 16: [2023-05-10 10:10:47,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 23: [2023-05-10 10:10:47,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 5: [2023-05-10 10:10:47,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 3: [2023-05-10 10:10:47,948] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 23: [2023-05-10 10:10:47,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 23: [2023-05-10 10:10:47,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 2: [2023-05-10 10:10:47,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 23: [2023-05-10 10:10:47,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 15: [2023-05-10 10:10:47,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 2: [2023-05-10 10:10:47,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 23: [2023-05-10 10:10:47,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 5: [2023-05-10 10:10:47,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 15: [2023-05-10 10:10:47,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 5: [2023-05-10 10:10:47,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 5: [2023-05-10 10:10:47,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 3: [2023-05-10 10:10:47,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 3: [2023-05-10 10:10:47,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 3: [2023-05-10 10:10:47,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 3: [2023-05-10 10:10:47,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_03-model_00-model_states.pt. 5: [2023-05-10 10:10:47,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 2: [2023-05-10 10:10:47,958] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 26: [2023-05-10 10:10:47,958] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 2: [2023-05-10 10:10:47,960] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 3: [2023-05-10 10:10:47,961] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 2: [2023-05-10 10:10:47,961] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 3: [2023-05-10 10:10:47,962] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 3: [2023-05-10 10:10:47,963] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 2: [2023-05-10 10:10:47,963] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 26: [2023-05-10 10:10:47,963] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 26: [2023-05-10 10:10:47,963] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 26: [2023-05-10 10:10:47,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 15: [2023-05-10 10:10:47,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 15: [2023-05-10 10:10:47,966] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 15: [2023-05-10 10:10:47,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 26: [2023-05-10 10:10:47,969] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 15: [2023-05-10 10:10:47,969] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 15: [2023-05-10 10:10:47,969] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 26: [2023-05-10 10:10:47,971] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 3: [2023-05-10 10:10:47,975] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 3: [2023-05-10 10:10:47,986] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 3: [2023-05-10 10:10:47,986] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 3: [2023-05-10 10:10:47,986] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 3: [2023-05-10 10:10:47,989] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 29: [2023-05-10 10:10:48,061] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 29: [2023-05-10 10:10:48,061] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 29: [2023-05-10 10:10:48,061] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 29: [2023-05-10 10:10:48,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 29: [2023-05-10 10:10:48,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 29: [2023-05-10 10:10:48,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 29: [2023-05-10 10:10:48,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 29: [2023-05-10 10:10:48,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 29: [2023-05-10 10:10:48,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 29: [2023-05-10 10:10:48,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 29: [2023-05-10 10:10:48,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 29: [2023-05-10 10:10:48,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 29: [2023-05-10 10:10:48,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 29: [2023-05-10 10:10:48,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 29: [2023-05-10 10:10:48,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 29: [2023-05-10 10:10:48,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 29: [2023-05-10 10:10:48,116] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 29: [2023-05-10 10:10:48,116] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 29: [2023-05-10 10:10:48,135] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 29: [2023-05-10 10:10:48,138] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 29: [2023-05-10 10:10:48,138] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 29: [2023-05-10 10:10:48,143] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 29: [2023-05-10 10:10:48,143] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 29: [2023-05-10 10:10:48,143] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 29: [2023-05-10 10:10:48,143] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 29: [2023-05-10 10:10:48,143] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 29: [2023-05-10 10:10:48,157] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 8: [2023-05-10 10:10:48,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 29: [2023-05-10 10:10:48,173] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 8: [2023-05-10 10:10:48,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 8: [2023-05-10 10:10:48,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 8: [2023-05-10 10:10:48,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 8: [2023-05-10 10:10:48,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 8: [2023-05-10 10:10:48,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 8: [2023-05-10 10:10:48,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 29: [2023-05-10 10:10:48,173] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 8: [2023-05-10 10:10:48,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 29: [2023-05-10 10:10:48,174] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 29: [2023-05-10 10:10:48,175] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 29: [2023-05-10 10:10:48,175] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 8: [2023-05-10 10:10:48,178] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 8: [2023-05-10 10:10:48,179] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 8: [2023-05-10 10:10:48,179] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 8: [2023-05-10 10:10:48,179] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 8: [2023-05-10 10:10:48,179] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 8: [2023-05-10 10:10:48,179] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 8: [2023-05-10 10:10:48,180] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 8: [2023-05-10 10:10:48,180] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 19: [2023-05-10 10:10:48,194] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 19: [2023-05-10 10:10:48,195] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 19: [2023-05-10 10:10:48,195] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 19: [2023-05-10 10:10:48,195] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 19: [2023-05-10 10:10:48,195] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 19: [2023-05-10 10:10:48,195] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 19: [2023-05-10 10:10:48,195] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 19: [2023-05-10 10:10:48,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 31: [2023-05-10 10:10:48,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 31: [2023-05-10 10:10:48,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 31: [2023-05-10 10:10:48,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 31: [2023-05-10 10:10:48,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 31: [2023-05-10 10:10:48,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 31: [2023-05-10 10:10:48,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 31: [2023-05-10 10:10:48,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 19: [2023-05-10 10:10:48,199] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 19: [2023-05-10 10:10:48,200] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 31: [2023-05-10 10:10:48,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 19: [2023-05-10 10:10:48,201] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 31: [2023-05-10 10:10:48,203] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 19: [2023-05-10 10:10:48,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 31: [2023-05-10 10:10:48,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 19: [2023-05-10 10:10:48,205] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 19: [2023-05-10 10:10:48,205] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 19: [2023-05-10 10:10:48,206] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 19: [2023-05-10 10:10:48,206] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 31: [2023-05-10 10:10:48,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 31: [2023-05-10 10:10:48,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 31: [2023-05-10 10:10:48,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 31: [2023-05-10 10:10:48,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 31: [2023-05-10 10:10:48,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 31: [2023-05-10 10:10:48,210] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 1: [2023-05-10 10:10:48,217] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 1: [2023-05-10 10:10:48,217] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 1: [2023-05-10 10:10:48,217] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 1: [2023-05-10 10:10:48,217] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 1: [2023-05-10 10:10:48,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 1: [2023-05-10 10:10:48,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 1: [2023-05-10 10:10:48,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 1: [2023-05-10 10:10:48,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 1: [2023-05-10 10:10:48,223] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 1: [2023-05-10 10:10:48,224] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 1: [2023-05-10 10:10:48,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 1: [2023-05-10 10:10:48,226] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 1: [2023-05-10 10:10:48,226] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 1: [2023-05-10 10:10:48,226] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 1: [2023-05-10 10:10:48,226] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 1: [2023-05-10 10:10:48,227] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 8: [2023-05-10 10:10:48,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 8: [2023-05-10 10:10:48,236] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 8: [2023-05-10 10:10:48,238] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 8: [2023-05-10 10:10:48,246] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 8: [2023-05-10 10:10:48,246] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 8: [2023-05-10 10:10:48,246] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 8: [2023-05-10 10:10:48,246] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 8: [2023-05-10 10:10:48,246] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 18: [2023-05-10 10:10:48,248] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 18: [2023-05-10 10:10:48,248] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 18: [2023-05-10 10:10:48,248] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 18: [2023-05-10 10:10:48,249] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 18: [2023-05-10 10:10:48,249] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 18: [2023-05-10 10:10:48,249] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 18: [2023-05-10 10:10:48,249] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 18: [2023-05-10 10:10:48,249] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 25: [2023-05-10 10:10:48,249] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 25: [2023-05-10 10:10:48,249] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 25: [2023-05-10 10:10:48,249] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 25: [2023-05-10 10:10:48,249] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 25: [2023-05-10 10:10:48,249] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 25: [2023-05-10 10:10:48,249] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 25: [2023-05-10 10:10:48,249] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 25: [2023-05-10 10:10:48,249] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 18: [2023-05-10 10:10:48,254] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 18: [2023-05-10 10:10:48,254] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 19: [2023-05-10 10:10:48,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 19: [2023-05-10 10:10:48,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 19: [2023-05-10 10:10:48,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 18: [2023-05-10 10:10:48,254] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 18: [2023-05-10 10:10:48,254] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 31: [2023-05-10 10:10:48,255] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 18: [2023-05-10 10:10:48,255] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 31: [2023-05-10 10:10:48,256] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 18: [2023-05-10 10:10:48,256] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 18: [2023-05-10 10:10:48,256] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 25: [2023-05-10 10:10:48,256] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 25: [2023-05-10 10:10:48,257] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 25: [2023-05-10 10:10:48,257] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 25: [2023-05-10 10:10:48,257] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 25: [2023-05-10 10:10:48,257] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 18: [2023-05-10 10:10:48,257] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 25: [2023-05-10 10:10:48,257] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 25: [2023-05-10 10:10:48,257] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 25: [2023-05-10 10:10:48,258] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 0: [2023-05-10 10:10:48,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 0: [2023-05-10 10:10:48,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 0: [2023-05-10 10:10:48,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 0: [2023-05-10 10:10:48,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 14: [2023-05-10 10:10:48,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 0: [2023-05-10 10:10:48,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 0: [2023-05-10 10:10:48,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 0: [2023-05-10 10:10:48,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 14: [2023-05-10 10:10:48,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 14: [2023-05-10 10:10:48,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 0: [2023-05-10 10:10:48,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 14: [2023-05-10 10:10:48,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 14: [2023-05-10 10:10:48,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 14: [2023-05-10 10:10:48,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 15: [2023-05-10 10:10:48,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 15: [2023-05-10 10:10:48,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 15: [2023-05-10 10:10:48,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 15: [2023-05-10 10:10:48,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 15: [2023-05-10 10:10:48,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 14: [2023-05-10 10:10:48,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 15: [2023-05-10 10:10:48,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 15: [2023-05-10 10:10:48,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 15: [2023-05-10 10:10:48,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 14: [2023-05-10 10:10:48,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 6: [2023-05-10 10:10:48,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 6: [2023-05-10 10:10:48,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 6: [2023-05-10 10:10:48,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 6: [2023-05-10 10:10:48,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 6: [2023-05-10 10:10:48,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 6: [2023-05-10 10:10:48,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 6: [2023-05-10 10:10:48,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 5: [2023-05-10 10:10:48,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 5: [2023-05-10 10:10:48,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 5: [2023-05-10 10:10:48,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 6: [2023-05-10 10:10:48,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 5: [2023-05-10 10:10:48,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 5: [2023-05-10 10:10:48,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 5: [2023-05-10 10:10:48,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 5: [2023-05-10 10:10:48,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 5: [2023-05-10 10:10:48,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 8: [2023-05-10 10:10:48,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 8: [2023-05-10 10:10:48,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 8: [2023-05-10 10:10:48,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 0: [2023-05-10 10:10:48,265] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 15: [2023-05-10 10:10:48,265] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 0: [2023-05-10 10:10:48,265] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 0: [2023-05-10 10:10:48,265] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 6: [2023-05-10 10:10:48,265] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 6: [2023-05-10 10:10:48,266] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 0: [2023-05-10 10:10:48,266] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 19: [2023-05-10 10:10:48,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 0: [2023-05-10 10:10:48,267] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 19: [2023-05-10 10:10:48,267] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 0: [2023-05-10 10:10:48,267] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 14: [2023-05-10 10:10:48,266] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 14: [2023-05-10 10:10:48,267] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 0: [2023-05-10 10:10:48,267] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 0: [2023-05-10 10:10:48,267] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 14: [2023-05-10 10:10:48,267] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 28: [2023-05-10 10:10:48,268] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 28: [2023-05-10 10:10:48,268] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 5: [2023-05-10 10:10:48,268] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 5: [2023-05-10 10:10:48,269] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 19: [2023-05-10 10:10:48,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 19: [2023-05-10 10:10:48,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 19: [2023-05-10 10:10:48,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 14: [2023-05-10 10:10:48,269] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 5: [2023-05-10 10:10:48,269] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 14: [2023-05-10 10:10:48,269] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 28: [2023-05-10 10:10:48,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 28: [2023-05-10 10:10:48,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 28: [2023-05-10 10:10:48,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 28: [2023-05-10 10:10:48,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 28: [2023-05-10 10:10:48,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 24: [2023-05-10 10:10:48,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 24: [2023-05-10 10:10:48,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 24: [2023-05-10 10:10:48,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 24: [2023-05-10 10:10:48,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 24: [2023-05-10 10:10:48,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 24: [2023-05-10 10:10:48,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 24: [2023-05-10 10:10:48,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 24: [2023-05-10 10:10:48,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 14: [2023-05-10 10:10:48,270] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 14: [2023-05-10 10:10:48,271] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 5: [2023-05-10 10:10:48,271] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 28: [2023-05-10 10:10:48,271] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 14: [2023-05-10 10:10:48,271] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 15: [2023-05-10 10:10:48,271] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 15: [2023-05-10 10:10:48,272] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 5: [2023-05-10 10:10:48,272] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 15: [2023-05-10 10:10:48,272] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 5: [2023-05-10 10:10:48,272] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 15: [2023-05-10 10:10:48,272] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 15: [2023-05-10 10:10:48,273] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 28: [2023-05-10 10:10:48,273] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 15: [2023-05-10 10:10:48,273] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 8: [2023-05-10 10:10:48,273] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 6: [2023-05-10 10:10:48,273] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 5: [2023-05-10 10:10:48,273] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 8: [2023-05-10 10:10:48,273] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 28: [2023-05-10 10:10:48,273] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 6: [2023-05-10 10:10:48,273] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 8: [2023-05-10 10:10:48,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 5: [2023-05-10 10:10:48,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 8: [2023-05-10 10:10:48,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 15: [2023-05-10 10:10:48,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 8: [2023-05-10 10:10:48,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 6: [2023-05-10 10:10:48,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 24: [2023-05-10 10:10:48,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 24: [2023-05-10 10:10:48,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 6: [2023-05-10 10:10:48,275] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 31: [2023-05-10 10:10:48,274] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 6: [2023-05-10 10:10:48,275] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 6: [2023-05-10 10:10:48,276] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 24: [2023-05-10 10:10:48,277] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 19: [2023-05-10 10:10:48,277] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 1: [2023-05-10 10:10:48,277] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 1: [2023-05-10 10:10:48,277] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 31: [2023-05-10 10:10:48,277] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 31: [2023-05-10 10:10:48,277] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 24: [2023-05-10 10:10:48,277] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 24: [2023-05-10 10:10:48,277] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 24: [2023-05-10 10:10:48,278] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 19: [2023-05-10 10:10:48,278] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 24: [2023-05-10 10:10:48,278] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 24: [2023-05-10 10:10:48,278] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 28: [2023-05-10 10:10:48,278] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 19: [2023-05-10 10:10:48,279] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 28: [2023-05-10 10:10:48,280] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 28: [2023-05-10 10:10:48,280] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 28: [2023-05-10 10:10:48,280] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 28: [2023-05-10 10:10:48,280] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 31: [2023-05-10 10:10:48,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 31: [2023-05-10 10:10:48,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 31: [2023-05-10 10:10:48,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 31: [2023-05-10 10:10:48,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 31: [2023-05-10 10:10:48,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 28: [2023-05-10 10:10:48,281] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 1: [2023-05-10 10:10:48,287] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 1: [2023-05-10 10:10:48,291] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 1: [2023-05-10 10:10:48,291] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 1: [2023-05-10 10:10:48,291] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 1: [2023-05-10 10:10:48,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 1: [2023-05-10 10:10:48,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 19: [2023-05-10 10:10:48,293] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 10: [2023-05-10 10:10:48,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 10: [2023-05-10 10:10:48,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 10: [2023-05-10 10:10:48,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 10: [2023-05-10 10:10:48,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 10: [2023-05-10 10:10:48,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 10: [2023-05-10 10:10:48,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 10: [2023-05-10 10:10:48,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 10: [2023-05-10 10:10:48,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 19: [2023-05-10 10:10:48,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 19: [2023-05-10 10:10:48,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 19: [2023-05-10 10:10:48,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 19: [2023-05-10 10:10:48,298] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 1: [2023-05-10 10:10:48,299] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 1: [2023-05-10 10:10:48,299] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 31: [2023-05-10 10:10:48,299] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 16: [2023-05-10 10:10:48,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 10: [2023-05-10 10:10:48,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 10: [2023-05-10 10:10:48,301] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 16: [2023-05-10 10:10:48,301] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 16: [2023-05-10 10:10:48,301] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 16: [2023-05-10 10:10:48,301] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 16: [2023-05-10 10:10:48,301] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 16: [2023-05-10 10:10:48,301] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 10: [2023-05-10 10:10:48,301] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 16: [2023-05-10 10:10:48,301] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 10: [2023-05-10 10:10:48,301] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 16: [2023-05-10 10:10:48,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 10: [2023-05-10 10:10:48,302] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 10: [2023-05-10 10:10:48,303] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 10: [2023-05-10 10:10:48,303] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 10: [2023-05-10 10:10:48,303] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 22: [2023-05-10 10:10:48,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 22: [2023-05-10 10:10:48,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 22: [2023-05-10 10:10:48,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 22: [2023-05-10 10:10:48,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 22: [2023-05-10 10:10:48,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 22: [2023-05-10 10:10:48,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 22: [2023-05-10 10:10:48,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 22: [2023-05-10 10:10:48,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 16: [2023-05-10 10:10:48,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 16: [2023-05-10 10:10:48,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 16: [2023-05-10 10:10:48,309] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 16: [2023-05-10 10:10:48,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 16: [2023-05-10 10:10:48,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 16: [2023-05-10 10:10:48,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 22: [2023-05-10 10:10:48,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 16: [2023-05-10 10:10:48,311] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 22: [2023-05-10 10:10:48,311] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 16: [2023-05-10 10:10:48,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 31: [2023-05-10 10:10:48,311] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 31: [2023-05-10 10:10:48,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 18: [2023-05-10 10:10:48,313] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 18: [2023-05-10 10:10:48,313] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 18: [2023-05-10 10:10:48,313] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 18: [2023-05-10 10:10:48,313] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 25: [2023-05-10 10:10:48,313] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 31: [2023-05-10 10:10:48,314] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 31: [2023-05-10 10:10:48,314] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 31: [2023-05-10 10:10:48,314] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 15: [2023-05-10 10:10:48,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 1: [2023-05-10 10:10:48,314] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 22: [2023-05-10 10:10:48,314] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 22: [2023-05-10 10:10:48,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 22: [2023-05-10 10:10:48,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 22: [2023-05-10 10:10:48,316] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 25: [2023-05-10 10:10:48,316] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 25: [2023-05-10 10:10:48,316] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 25: [2023-05-10 10:10:48,316] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 6: [2023-05-10 10:10:48,316] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 6: [2023-05-10 10:10:48,316] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 22: [2023-05-10 10:10:48,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 22: [2023-05-10 10:10:48,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 18: [2023-05-10 10:10:48,316] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 18: [2023-05-10 10:10:48,316] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 18: [2023-05-10 10:10:48,317] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 18: [2023-05-10 10:10:48,317] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 25: [2023-05-10 10:10:48,317] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 25: [2023-05-10 10:10:48,317] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 25: [2023-05-10 10:10:48,317] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 25: [2023-05-10 10:10:48,317] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 1: [2023-05-10 10:10:48,320] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 20: [2023-05-10 10:10:48,320] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 20: [2023-05-10 10:10:48,320] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 20: [2023-05-10 10:10:48,320] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 20: [2023-05-10 10:10:48,321] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 20: [2023-05-10 10:10:48,321] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 20: [2023-05-10 10:10:48,321] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 20: [2023-05-10 10:10:48,321] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 20: [2023-05-10 10:10:48,321] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 5: [2023-05-10 10:10:48,321] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 14: [2023-05-10 10:10:48,321] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 14: [2023-05-10 10:10:48,321] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 5: [2023-05-10 10:10:48,322] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 5: [2023-05-10 10:10:48,322] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 14: [2023-05-10 10:10:48,323] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 1: [2023-05-10 10:10:48,320] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 1: [2023-05-10 10:10:48,321] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 1: [2023-05-10 10:10:48,324] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 0: [2023-05-10 10:10:48,324] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 0: [2023-05-10 10:10:48,324] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 0: [2023-05-10 10:10:48,324] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 20: [2023-05-10 10:10:48,324] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 28: [2023-05-10 10:10:48,324] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 28: [2023-05-10 10:10:48,324] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 1: [2023-05-10 10:10:48,325] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 24: [2023-05-10 10:10:48,326] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 24: [2023-05-10 10:10:48,326] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 21: [2023-05-10 10:10:48,331] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 21: [2023-05-10 10:10:48,331] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 21: [2023-05-10 10:10:48,331] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 21: [2023-05-10 10:10:48,331] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 21: [2023-05-10 10:10:48,331] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 21: [2023-05-10 10:10:48,331] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 21: [2023-05-10 10:10:48,331] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 4: [2023-05-10 10:10:48,331] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 4: [2023-05-10 10:10:48,331] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 4: [2023-05-10 10:10:48,331] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 4: [2023-05-10 10:10:48,331] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 4: [2023-05-10 10:10:48,331] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 21: [2023-05-10 10:10:48,332] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 4: [2023-05-10 10:10:48,331] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 4: [2023-05-10 10:10:48,331] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 4: [2023-05-10 10:10:48,332] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 20: [2023-05-10 10:10:48,331] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 20: [2023-05-10 10:10:48,332] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 20: [2023-05-10 10:10:48,332] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 14: [2023-05-10 10:10:48,332] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 30: [2023-05-10 10:10:48,332] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 30: [2023-05-10 10:10:48,332] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 30: [2023-05-10 10:10:48,332] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 30: [2023-05-10 10:10:48,332] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 30: [2023-05-10 10:10:48,332] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 20: [2023-05-10 10:10:48,333] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 30: [2023-05-10 10:10:48,332] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 30: [2023-05-10 10:10:48,332] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 30: [2023-05-10 10:10:48,333] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 14: [2023-05-10 10:10:48,333] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 20: [2023-05-10 10:10:48,333] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 17: [2023-05-10 10:10:48,333] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 20: [2023-05-10 10:10:48,334] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 20: [2023-05-10 10:10:48,334] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 17: [2023-05-10 10:10:48,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 17: [2023-05-10 10:10:48,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 17: [2023-05-10 10:10:48,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 17: [2023-05-10 10:10:48,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 17: [2023-05-10 10:10:48,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 17: [2023-05-10 10:10:48,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 17: [2023-05-10 10:10:48,335] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 5: [2023-05-10 10:10:48,335] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 5: [2023-05-10 10:10:48,335] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 15: [2023-05-10 10:10:48,336] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 14: [2023-05-10 10:10:48,336] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 14: [2023-05-10 10:10:48,336] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 14: [2023-05-10 10:10:48,336] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 6: [2023-05-10 10:10:48,337] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 21: [2023-05-10 10:10:48,336] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 21: [2023-05-10 10:10:48,337] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 18: [2023-05-10 10:10:48,337] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 6: [2023-05-10 10:10:48,337] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 4: [2023-05-10 10:10:48,337] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 4: [2023-05-10 10:10:48,337] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 4: [2023-05-10 10:10:48,337] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 18: [2023-05-10 10:10:48,337] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 0: [2023-05-10 10:10:48,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 0: [2023-05-10 10:10:48,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 0: [2023-05-10 10:10:48,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 0: [2023-05-10 10:10:48,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 21: [2023-05-10 10:10:48,338] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 0: [2023-05-10 10:10:48,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 4: [2023-05-10 10:10:48,338] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 21: [2023-05-10 10:10:48,338] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 4: [2023-05-10 10:10:48,338] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 5: [2023-05-10 10:10:48,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 5: [2023-05-10 10:10:48,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 5: [2023-05-10 10:10:48,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 18: [2023-05-10 10:10:48,338] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 4: [2023-05-10 10:10:48,338] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 17: [2023-05-10 10:10:48,338] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 21: [2023-05-10 10:10:48,339] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 4: [2023-05-10 10:10:48,339] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 21: [2023-05-10 10:10:48,339] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 6: [2023-05-10 10:10:48,339] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 6: [2023-05-10 10:10:48,339] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 4: [2023-05-10 10:10:48,340] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 25: [2023-05-10 10:10:48,340] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 18: [2023-05-10 10:10:48,340] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 21: [2023-05-10 10:10:48,340] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 21: [2023-05-10 10:10:48,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 25: [2023-05-10 10:10:48,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 15: [2023-05-10 10:10:48,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 26: [2023-05-10 10:10:48,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 26: [2023-05-10 10:10:48,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 15: [2023-05-10 10:10:48,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 5: [2023-05-10 10:10:48,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 30: [2023-05-10 10:10:48,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 18: [2023-05-10 10:10:48,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 30: [2023-05-10 10:10:48,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 30: [2023-05-10 10:10:48,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 14: [2023-05-10 10:10:48,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 18: [2023-05-10 10:10:48,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 26: [2023-05-10 10:10:48,345] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 26: [2023-05-10 10:10:48,345] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 26: [2023-05-10 10:10:48,345] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 26: [2023-05-10 10:10:48,345] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 26: [2023-05-10 10:10:48,345] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 26: [2023-05-10 10:10:48,345] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 17: [2023-05-10 10:10:48,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 28: [2023-05-10 10:10:48,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 17: [2023-05-10 10:10:48,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 28: [2023-05-10 10:10:48,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 30: [2023-05-10 10:10:48,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 30: [2023-05-10 10:10:48,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 17: [2023-05-10 10:10:48,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 30: [2023-05-10 10:10:48,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 14: [2023-05-10 10:10:48,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 30: [2023-05-10 10:10:48,347] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 18: [2023-05-10 10:10:48,347] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 25: [2023-05-10 10:10:48,347] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 6: [2023-05-10 10:10:48,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 6: [2023-05-10 10:10:48,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 6: [2023-05-10 10:10:48,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 6: [2023-05-10 10:10:48,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 18: [2023-05-10 10:10:48,347] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 17: [2023-05-10 10:10:48,347] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 13: [2023-05-10 10:10:48,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 13: [2023-05-10 10:10:48,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 13: [2023-05-10 10:10:48,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 13: [2023-05-10 10:10:48,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 13: [2023-05-10 10:10:48,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 13: [2023-05-10 10:10:48,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 13: [2023-05-10 10:10:48,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 17: [2023-05-10 10:10:48,347] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 30: [2023-05-10 10:10:48,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 0: [2023-05-10 10:10:48,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 26: [2023-05-10 10:10:48,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 17: [2023-05-10 10:10:48,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 26: [2023-05-10 10:10:48,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 13: [2023-05-10 10:10:48,348] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 17: [2023-05-10 10:10:48,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 28: [2023-05-10 10:10:48,348] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 5: [2023-05-10 10:10:48,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 28: [2023-05-10 10:10:48,348] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 5: [2023-05-10 10:10:48,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 25: [2023-05-10 10:10:48,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 25: [2023-05-10 10:10:48,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 15: [2023-05-10 10:10:48,349] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 15: [2023-05-10 10:10:48,349] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 15: [2023-05-10 10:10:48,349] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 15: [2023-05-10 10:10:48,349] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 15: [2023-05-10 10:10:48,349] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 25: [2023-05-10 10:10:48,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 25: [2023-05-10 10:10:48,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 25: [2023-05-10 10:10:48,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 24: [2023-05-10 10:10:48,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 24: [2023-05-10 10:10:48,350] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 24: [2023-05-10 10:10:48,350] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 24: [2023-05-10 10:10:48,350] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 24: [2023-05-10 10:10:48,350] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 24: [2023-05-10 10:10:48,350] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 9: [2023-05-10 10:10:48,350] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 9: [2023-05-10 10:10:48,350] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 24: [2023-05-10 10:10:48,350] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 28: [2023-05-10 10:10:48,350] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 28: [2023-05-10 10:10:48,350] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 28: [2023-05-10 10:10:48,350] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 0: [2023-05-10 10:10:48,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 28: [2023-05-10 10:10:48,350] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 0: [2023-05-10 10:10:48,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 24: [2023-05-10 10:10:48,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 9: [2023-05-10 10:10:48,352] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 9: [2023-05-10 10:10:48,352] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 9: [2023-05-10 10:10:48,352] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 9: [2023-05-10 10:10:48,352] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 9: [2023-05-10 10:10:48,352] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 14: [2023-05-10 10:10:48,352] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 9: [2023-05-10 10:10:48,352] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 26: [2023-05-10 10:10:48,354] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 26: [2023-05-10 10:10:48,355] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 26: [2023-05-10 10:10:48,355] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 26: [2023-05-10 10:10:48,356] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 9: [2023-05-10 10:10:48,356] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 9: [2023-05-10 10:10:48,356] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 26: [2023-05-10 10:10:48,356] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 26: [2023-05-10 10:10:48,357] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 10: [2023-05-10 10:10:48,356] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 10: [2023-05-10 10:10:48,356] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 10: [2023-05-10 10:10:48,356] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 16: [2023-05-10 10:10:48,357] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 9: [2023-05-10 10:10:48,358] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 9: [2023-05-10 10:10:48,359] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 9: [2023-05-10 10:10:48,359] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 9: [2023-05-10 10:10:48,360] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 14: [2023-05-10 10:10:48,360] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 9: [2023-05-10 10:10:48,361] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 5: [2023-05-10 10:10:48,361] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 9: [2023-05-10 10:10:48,361] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 10: [2023-05-10 10:10:48,361] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 16: [2023-05-10 10:10:48,361] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 7: [2023-05-10 10:10:48,362] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 7: [2023-05-10 10:10:48,362] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 13: [2023-05-10 10:10:48,362] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 7: [2023-05-10 10:10:48,363] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 7: [2023-05-10 10:10:48,363] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 7: [2023-05-10 10:10:48,363] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 7: [2023-05-10 10:10:48,363] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 7: [2023-05-10 10:10:48,363] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 13: [2023-05-10 10:10:48,363] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 7: [2023-05-10 10:10:48,363] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 5: [2023-05-10 10:10:48,364] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 13: [2023-05-10 10:10:48,364] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 22: [2023-05-10 10:10:48,364] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 14: [2023-05-10 10:10:48,365] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 14: [2023-05-10 10:10:48,365] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 14: [2023-05-10 10:10:48,365] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 22: [2023-05-10 10:10:48,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 13: [2023-05-10 10:10:48,365] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 13: [2023-05-10 10:10:48,366] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 13: [2023-05-10 10:10:48,366] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 5: [2023-05-10 10:10:48,366] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 13: [2023-05-10 10:10:48,366] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 14: [2023-05-10 10:10:48,366] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 6: [2023-05-10 10:10:48,367] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 6: [2023-05-10 10:10:48,367] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 13: [2023-05-10 10:10:48,367] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 0: [2023-05-10 10:10:48,367] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 0: [2023-05-10 10:10:48,368] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 0: [2023-05-10 10:10:48,368] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 5: [2023-05-10 10:10:48,369] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 5: [2023-05-10 10:10:48,369] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 7: [2023-05-10 10:10:48,369] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 7: [2023-05-10 10:10:48,369] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 7: [2023-05-10 10:10:48,369] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 7: [2023-05-10 10:10:48,369] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 7: [2023-05-10 10:10:48,369] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 7: [2023-05-10 10:10:48,370] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 7: [2023-05-10 10:10:48,371] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 0: [2023-05-10 10:10:48,371] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 0: [2023-05-10 10:10:48,371] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 10: [2023-05-10 10:10:48,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 10: [2023-05-10 10:10:48,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 10: [2023-05-10 10:10:48,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 10: [2023-05-10 10:10:48,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 7: [2023-05-10 10:10:48,371] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 15: [2023-05-10 10:10:48,373] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 20: [2023-05-10 10:10:48,374] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 6: [2023-05-10 10:10:48,376] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 6: [2023-05-10 10:10:48,377] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 6: [2023-05-10 10:10:48,378] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 16: [2023-05-10 10:10:48,378] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 6: [2023-05-10 10:10:48,378] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 28: [2023-05-10 10:10:48,379] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 16: [2023-05-10 10:10:48,380] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 15: [2023-05-10 10:10:48,380] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 28: [2023-05-10 10:10:48,380] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 15: [2023-05-10 10:10:48,381] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 24: [2023-05-10 10:10:48,381] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 24: [2023-05-10 10:10:48,381] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 15: [2023-05-10 10:10:48,382] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 15: [2023-05-10 10:10:48,383] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 28: [2023-05-10 10:10:48,383] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 23: [2023-05-10 10:10:48,383] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 23: [2023-05-10 10:10:48,383] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 15: [2023-05-10 10:10:48,384] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 22: [2023-05-10 10:10:48,384] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 22: [2023-05-10 10:10:48,384] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 24: [2023-05-10 10:10:48,384] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 23: [2023-05-10 10:10:48,384] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 10: [2023-05-10 10:10:48,384] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 28: [2023-05-10 10:10:48,384] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 16: [2023-05-10 10:10:48,384] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 23: [2023-05-10 10:10:48,384] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 24: [2023-05-10 10:10:48,384] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 23: [2023-05-10 10:10:48,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 23: [2023-05-10 10:10:48,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 23: [2023-05-10 10:10:48,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 24: [2023-05-10 10:10:48,385] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 16: [2023-05-10 10:10:48,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 16: [2023-05-10 10:10:48,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 16: [2023-05-10 10:10:48,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 16: [2023-05-10 10:10:48,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 24: [2023-05-10 10:10:48,385] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 23: [2023-05-10 10:10:48,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 16: [2023-05-10 10:10:48,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 10: [2023-05-10 10:10:48,386] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 15: [2023-05-10 10:10:48,386] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 10: [2023-05-10 10:10:48,386] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 28: [2023-05-10 10:10:48,387] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 28: [2023-05-10 10:10:48,388] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 10: [2023-05-10 10:10:48,388] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 17: [2023-05-10 10:10:48,388] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 22: [2023-05-10 10:10:48,389] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 22: [2023-05-10 10:10:48,389] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 22: [2023-05-10 10:10:48,389] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 22: [2023-05-10 10:10:48,389] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 23: [2023-05-10 10:10:48,388] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 23: [2023-05-10 10:10:48,389] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 23: [2023-05-10 10:10:48,390] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 22: [2023-05-10 10:10:48,390] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 22: [2023-05-10 10:10:48,390] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 21: [2023-05-10 10:10:48,390] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 21: [2023-05-10 10:10:48,390] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 23: [2023-05-10 10:10:48,392] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 23: [2023-05-10 10:10:48,392] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 23: [2023-05-10 10:10:48,392] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 23: [2023-05-10 10:10:48,393] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 23: [2023-05-10 10:10:48,393] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 21: [2023-05-10 10:10:48,395] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 20: [2023-05-10 10:10:48,396] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 4: [2023-05-10 10:10:48,396] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 4: [2023-05-10 10:10:48,397] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 4: [2023-05-10 10:10:48,397] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 4: [2023-05-10 10:10:48,397] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 10: [2023-05-10 10:10:48,399] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 10: [2023-05-10 10:10:48,400] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 10: [2023-05-10 10:10:48,401] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 10: [2023-05-10 10:10:48,401] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 26: [2023-05-10 10:10:48,401] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 26: [2023-05-10 10:10:48,401] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 4: [2023-05-10 10:10:48,401] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 4: [2023-05-10 10:10:48,401] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 4: [2023-05-10 10:10:48,401] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 4: [2023-05-10 10:10:48,401] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 21: [2023-05-10 10:10:48,403] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 20: [2023-05-10 10:10:48,403] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 20: [2023-05-10 10:10:48,403] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 16: [2023-05-10 10:10:48,404] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 21: [2023-05-10 10:10:48,406] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 21: [2023-05-10 10:10:48,406] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 21: [2023-05-10 10:10:48,406] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 21: [2023-05-10 10:10:48,406] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 12: [2023-05-10 10:10:48,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 12: [2023-05-10 10:10:48,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 17: [2023-05-10 10:10:48,410] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 12: [2023-05-10 10:10:48,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 22: [2023-05-10 10:10:48,410] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 3: [2023-05-10 10:10:48,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 3: [2023-05-10 10:10:48,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 12: [2023-05-10 10:10:48,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 12: [2023-05-10 10:10:48,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 12: [2023-05-10 10:10:48,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 12: [2023-05-10 10:10:48,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 12: [2023-05-10 10:10:48,411] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 21: [2023-05-10 10:10:48,411] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 3: [2023-05-10 10:10:48,411] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 3: [2023-05-10 10:10:48,411] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 3: [2023-05-10 10:10:48,411] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 3: [2023-05-10 10:10:48,411] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 3: [2023-05-10 10:10:48,411] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 16: [2023-05-10 10:10:48,411] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 22: [2023-05-10 10:10:48,412] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 3: [2023-05-10 10:10:48,412] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 21: [2023-05-10 10:10:48,412] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 20: [2023-05-10 10:10:48,412] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 20: [2023-05-10 10:10:48,412] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 20: [2023-05-10 10:10:48,412] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 20: [2023-05-10 10:10:48,412] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 20: [2023-05-10 10:10:48,412] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 27: [2023-05-10 10:10:48,413] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 27: [2023-05-10 10:10:48,414] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 22: [2023-05-10 10:10:48,414] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 27: [2023-05-10 10:10:48,414] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 27: [2023-05-10 10:10:48,414] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 27: [2023-05-10 10:10:48,415] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 9: [2023-05-10 10:10:48,414] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 9: [2023-05-10 10:10:48,414] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 9: [2023-05-10 10:10:48,414] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 27: [2023-05-10 10:10:48,415] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 27: [2023-05-10 10:10:48,415] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 27: [2023-05-10 10:10:48,415] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 12: [2023-05-10 10:10:48,415] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 2: [2023-05-10 10:10:48,415] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 2: [2023-05-10 10:10:48,415] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 3: [2023-05-10 10:10:48,415] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 3: [2023-05-10 10:10:48,415] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 3: [2023-05-10 10:10:48,416] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 22: [2023-05-10 10:10:48,416] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 17: [2023-05-10 10:10:48,416] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 17: [2023-05-10 10:10:48,416] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 11: [2023-05-10 10:10:48,418] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 11: [2023-05-10 10:10:48,418] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 11: [2023-05-10 10:10:48,419] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 11: [2023-05-10 10:10:48,419] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 11: [2023-05-10 10:10:48,419] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 2: [2023-05-10 10:10:48,418] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 2: [2023-05-10 10:10:48,418] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 2: [2023-05-10 10:10:48,418] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 2: [2023-05-10 10:10:48,418] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 2: [2023-05-10 10:10:48,418] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 11: [2023-05-10 10:10:48,419] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 11: [2023-05-10 10:10:48,419] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 2: [2023-05-10 10:10:48,419] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 11: [2023-05-10 10:10:48,419] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 30: [2023-05-10 10:10:48,419] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 27: [2023-05-10 10:10:48,419] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 9: [2023-05-10 10:10:48,419] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 9: [2023-05-10 10:10:48,419] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 9: [2023-05-10 10:10:48,419] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 9: [2023-05-10 10:10:48,419] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 2: [2023-05-10 10:10:48,420] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 16: [2023-05-10 10:10:48,420] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 27: [2023-05-10 10:10:48,420] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 2: [2023-05-10 10:10:48,420] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 9: [2023-05-10 10:10:48,420] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 3: [2023-05-10 10:10:48,419] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 22: [2023-05-10 10:10:48,420] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 3: [2023-05-10 10:10:48,420] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 3: [2023-05-10 10:10:48,421] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 4: [2023-05-10 10:10:48,420] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 16: [2023-05-10 10:10:48,421] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 22: [2023-05-10 10:10:48,421] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 12: [2023-05-10 10:10:48,421] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 12: [2023-05-10 10:10:48,421] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 16: [2023-05-10 10:10:48,421] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 16: [2023-05-10 10:10:48,421] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 3: [2023-05-10 10:10:48,421] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 12: [2023-05-10 10:10:48,422] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 3: [2023-05-10 10:10:48,422] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 12: [2023-05-10 10:10:48,422] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 27: [2023-05-10 10:10:48,422] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 30: [2023-05-10 10:10:48,422] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 27: [2023-05-10 10:10:48,423] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 12: [2023-05-10 10:10:48,423] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 11: [2023-05-10 10:10:48,423] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 26: [2023-05-10 10:10:48,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 12: [2023-05-10 10:10:48,423] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 26: [2023-05-10 10:10:48,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 26: [2023-05-10 10:10:48,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 4: [2023-05-10 10:10:48,423] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 11: [2023-05-10 10:10:48,423] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 27: [2023-05-10 10:10:48,423] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 12: [2023-05-10 10:10:48,423] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 27: [2023-05-10 10:10:48,423] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 21: [2023-05-10 10:10:48,424] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 27: [2023-05-10 10:10:48,424] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 27: [2023-05-10 10:10:48,425] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 7: [2023-05-10 10:10:48,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 7: [2023-05-10 10:10:48,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 26: [2023-05-10 10:10:48,426] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 26: [2023-05-10 10:10:48,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 26: [2023-05-10 10:10:48,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 26: [2023-05-10 10:10:48,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 4: [2023-05-10 10:10:48,426] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 4: [2023-05-10 10:10:48,426] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 26: [2023-05-10 10:10:48,426] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 17: [2023-05-10 10:10:48,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 17: [2023-05-10 10:10:48,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 17: [2023-05-10 10:10:48,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 17: [2023-05-10 10:10:48,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 17: [2023-05-10 10:10:48,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 7: [2023-05-10 10:10:48,427] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 7: [2023-05-10 10:10:48,427] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 11: [2023-05-10 10:10:48,428] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 2: [2023-05-10 10:10:48,428] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 11: [2023-05-10 10:10:48,429] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 2: [2023-05-10 10:10:48,429] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 11: [2023-05-10 10:10:48,429] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 20: [2023-05-10 10:10:48,429] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 20: [2023-05-10 10:10:48,429] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 2: [2023-05-10 10:10:48,429] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 11: [2023-05-10 10:10:48,430] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 2: [2023-05-10 10:10:48,430] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 11: [2023-05-10 10:10:48,430] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 7: [2023-05-10 10:10:48,429] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 7: [2023-05-10 10:10:48,429] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 7: [2023-05-10 10:10:48,429] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 7: [2023-05-10 10:10:48,429] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 2: [2023-05-10 10:10:48,430] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 11: [2023-05-10 10:10:48,430] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 2: [2023-05-10 10:10:48,431] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt... 30: [2023-05-10 10:10:48,431] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 30: [2023-05-10 10:10:48,431] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 30: [2023-05-10 10:10:48,431] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 30: [2023-05-10 10:10:48,431] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 30: [2023-05-10 10:10:48,431] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 4: [2023-05-10 10:10:48,432] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 4: [2023-05-10 10:10:48,432] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 30: [2023-05-10 10:10:48,432] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 4: [2023-05-10 10:10:48,432] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 29: [2023-05-10 10:10:48,432] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 29: [2023-05-10 10:10:48,432] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 29: [2023-05-10 10:10:48,432] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 29: [2023-05-10 10:10:48,433] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 29: [2023-05-10 10:10:48,433] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 29: [2023-05-10 10:10:48,433] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 29: [2023-05-10 10:10:48,433] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 29: [2023-05-10 10:10:48,433] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 4: [2023-05-10 10:10:48,435] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 21: [2023-05-10 10:10:48,435] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 29: [2023-05-10 10:10:48,437] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 29: [2023-05-10 10:10:48,437] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 21: [2023-05-10 10:10:48,439] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 21: [2023-05-10 10:10:48,439] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 13: [2023-05-10 10:10:48,439] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 13: [2023-05-10 10:10:48,439] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 13: [2023-05-10 10:10:48,439] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 29: [2023-05-10 10:10:48,441] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 29: [2023-05-10 10:10:48,441] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 21: [2023-05-10 10:10:48,441] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 29: [2023-05-10 10:10:48,441] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 20: [2023-05-10 10:10:48,441] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 21: [2023-05-10 10:10:48,441] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 29: [2023-05-10 10:10:48,442] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 29: [2023-05-10 10:10:48,442] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 30: [2023-05-10 10:10:48,442] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 29: [2023-05-10 10:10:48,442] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 23: [2023-05-10 10:10:48,443] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 23: [2023-05-10 10:10:48,443] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 23: [2023-05-10 10:10:48,443] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 17: [2023-05-10 10:10:48,444] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 9: [2023-05-10 10:10:48,444] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 9: [2023-05-10 10:10:48,444] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 20: [2023-05-10 10:10:48,444] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 9: [2023-05-10 10:10:48,445] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 17: [2023-05-10 10:10:48,445] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 20: [2023-05-10 10:10:48,446] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 30: [2023-05-10 10:10:48,446] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 20: [2023-05-10 10:10:48,447] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 20: [2023-05-10 10:10:48,447] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 26: [2023-05-10 10:10:48,448] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 9: [2023-05-10 10:10:48,450] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 7: [2023-05-10 10:10:48,450] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 9: [2023-05-10 10:10:48,451] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 7: [2023-05-10 10:10:48,451] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 9: [2023-05-10 10:10:48,451] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 9: [2023-05-10 10:10:48,452] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 9: [2023-05-10 10:10:48,452] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 23: [2023-05-10 10:10:48,453] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 26: [2023-05-10 10:10:48,453] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 26: [2023-05-10 10:10:48,455] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 7: [2023-05-10 10:10:48,455] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 13: [2023-05-10 10:10:48,455] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 13: [2023-05-10 10:10:48,455] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 13: [2023-05-10 10:10:48,455] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 13: [2023-05-10 10:10:48,455] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 13: [2023-05-10 10:10:48,456] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 7: [2023-05-10 10:10:48,456] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 7: [2023-05-10 10:10:48,457] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 17: [2023-05-10 10:10:48,457] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 26: [2023-05-10 10:10:48,457] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 26: [2023-05-10 10:10:48,457] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 7: [2023-05-10 10:10:48,458] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 7: [2023-05-10 10:10:48,458] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 7: [2023-05-10 10:10:48,458] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 23: [2023-05-10 10:10:48,459] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 23: [2023-05-10 10:10:48,459] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 23: [2023-05-10 10:10:48,459] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 23: [2023-05-10 10:10:48,459] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 26: [2023-05-10 10:10:48,462] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 17: [2023-05-10 10:10:48,463] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 17: [2023-05-10 10:10:48,464] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 17: [2023-05-10 10:10:48,464] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 17: [2023-05-10 10:10:48,464] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 30: [2023-05-10 10:10:48,465] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 12: [2023-05-10 10:10:48,468] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 13: [2023-05-10 10:10:48,468] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 30: [2023-05-10 10:10:48,468] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 30: [2023-05-10 10:10:48,469] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 30: [2023-05-10 10:10:48,470] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 30: [2023-05-10 10:10:48,470] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 30: [2023-05-10 10:10:48,470] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 2: [2023-05-10 10:10:48,471] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 2: [2023-05-10 10:10:48,471] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 13: [2023-05-10 10:10:48,471] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 13: [2023-05-10 10:10:48,471] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 3: [2023-05-10 10:10:48,471] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 3: [2023-05-10 10:10:48,471] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 3: [2023-05-10 10:10:48,471] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 23: [2023-05-10 10:10:48,471] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 23: [2023-05-10 10:10:48,471] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 23: [2023-05-10 10:10:48,474] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 27: [2023-05-10 10:10:48,473] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 27: [2023-05-10 10:10:48,474] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 11: [2023-05-10 10:10:48,476] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 11: [2023-05-10 10:10:48,476] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 3: [2023-05-10 10:10:48,480] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 23: [2023-05-10 10:10:48,480] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 29: [2023-05-10 10:10:48,488] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 23: [2023-05-10 10:10:48,489] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 23: [2023-05-10 10:10:48,489] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 3: [2023-05-10 10:10:48,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 3: [2023-05-10 10:10:48,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 3: [2023-05-10 10:10:48,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 3: [2023-05-10 10:10:48,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 23: [2023-05-10 10:10:48,489] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 23: [2023-05-10 10:10:48,489] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 29: [2023-05-10 10:10:48,490] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 2: [2023-05-10 10:10:48,492] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 12: [2023-05-10 10:10:48,492] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 2: [2023-05-10 10:10:48,492] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 13: [2023-05-10 10:10:48,493] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 3: [2023-05-10 10:10:48,493] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 3: [2023-05-10 10:10:48,493] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 12: [2023-05-10 10:10:48,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 12: [2023-05-10 10:10:48,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 13: [2023-05-10 10:10:48,494] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 13: [2023-05-10 10:10:48,494] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 27: [2023-05-10 10:10:48,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 27: [2023-05-10 10:10:48,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 27: [2023-05-10 10:10:48,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 27: [2023-05-10 10:10:48,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 27: [2023-05-10 10:10:48,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 13: [2023-05-10 10:10:48,494] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 27: [2023-05-10 10:10:48,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 13: [2023-05-10 10:10:48,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 27: [2023-05-10 10:10:48,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 27: [2023-05-10 10:10:48,497] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 2: [2023-05-10 10:10:48,498] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 2: [2023-05-10 10:10:48,498] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 2: [2023-05-10 10:10:48,498] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 11: [2023-05-10 10:10:48,498] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 11: [2023-05-10 10:10:48,498] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 11: [2023-05-10 10:10:48,498] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 3: [2023-05-10 10:10:48,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 11: [2023-05-10 10:10:48,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 11: [2023-05-10 10:10:48,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 11: [2023-05-10 10:10:48,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 11: [2023-05-10 10:10:48,500] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 11: [2023-05-10 10:10:48,500] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 2: [2023-05-10 10:10:48,501] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 2: [2023-05-10 10:10:48,501] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 2: [2023-05-10 10:10:48,501] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 12: [2023-05-10 10:10:48,505] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 12: [2023-05-10 10:10:48,505] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 12: [2023-05-10 10:10:48,505] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 12: [2023-05-10 10:10:48,505] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 12: [2023-05-10 10:10:48,505] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_04-model_00-model_states.pt. 3: [2023-05-10 10:10:48,508] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 29: [2023-05-10 10:10:48,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 29: [2023-05-10 10:10:48,510] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 29: [2023-05-10 10:10:48,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 29: [2023-05-10 10:10:48,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 29: [2023-05-10 10:10:48,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 29: [2023-05-10 10:10:48,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 29: [2023-05-10 10:10:48,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 29: [2023-05-10 10:10:48,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 3: [2023-05-10 10:10:48,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 3: [2023-05-10 10:10:48,517] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 12: [2023-05-10 10:10:48,517] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 3: [2023-05-10 10:10:48,518] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 3: [2023-05-10 10:10:48,520] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 27: [2023-05-10 10:10:48,522] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 12: [2023-05-10 10:10:48,524] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 11: [2023-05-10 10:10:48,525] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 27: [2023-05-10 10:10:48,527] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 2: [2023-05-10 10:10:48,527] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 2: [2023-05-10 10:10:48,527] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 27: [2023-05-10 10:10:48,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 27: [2023-05-10 10:10:48,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 27: [2023-05-10 10:10:48,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 27: [2023-05-10 10:10:48,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 11: [2023-05-10 10:10:48,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 2: [2023-05-10 10:10:48,531] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 11: [2023-05-10 10:10:48,531] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 11: [2023-05-10 10:10:48,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 11: [2023-05-10 10:10:48,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 11: [2023-05-10 10:10:48,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 29: [2023-05-10 10:10:48,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 2: [2023-05-10 10:10:48,535] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 2: [2023-05-10 10:10:48,535] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 2: [2023-05-10 10:10:48,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 12: [2023-05-10 10:10:48,538] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 12: [2023-05-10 10:10:48,538] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 12: [2023-05-10 10:10:48,539] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 12: [2023-05-10 10:10:48,540] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 12: [2023-05-10 10:10:48,541] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 29: [2023-05-10 10:10:48,542] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 29: [2023-05-10 10:10:48,543] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 29: [2023-05-10 10:10:48,546] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 29: [2023-05-10 10:10:48,546] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 29: [2023-05-10 10:10:48,546] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 8: [2023-05-10 10:10:48,605] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 8: [2023-05-10 10:10:48,605] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 8: [2023-05-10 10:10:48,605] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 8: [2023-05-10 10:10:48,606] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 8: [2023-05-10 10:10:48,606] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 8: [2023-05-10 10:10:48,606] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 8: [2023-05-10 10:10:48,606] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 8: [2023-05-10 10:10:48,606] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 8: [2023-05-10 10:10:48,611] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 8: [2023-05-10 10:10:48,612] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 8: [2023-05-10 10:10:48,612] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 8: [2023-05-10 10:10:48,612] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 8: [2023-05-10 10:10:48,612] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 8: [2023-05-10 10:10:48,612] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 8: [2023-05-10 10:10:48,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 8: [2023-05-10 10:10:48,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 1: [2023-05-10 10:10:48,633] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 1: [2023-05-10 10:10:48,633] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 1: [2023-05-10 10:10:48,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 1: [2023-05-10 10:10:48,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 1: [2023-05-10 10:10:48,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 1: [2023-05-10 10:10:48,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 1: [2023-05-10 10:10:48,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 1: [2023-05-10 10:10:48,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 1: [2023-05-10 10:10:48,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 1: [2023-05-10 10:10:48,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 1: [2023-05-10 10:10:48,641] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 1: [2023-05-10 10:10:48,642] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 1: [2023-05-10 10:10:48,642] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 1: [2023-05-10 10:10:48,642] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 1: [2023-05-10 10:10:48,642] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 1: [2023-05-10 10:10:48,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 8: [2023-05-10 10:10:48,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 8: [2023-05-10 10:10:48,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 8: [2023-05-10 10:10:48,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 8: [2023-05-10 10:10:48,678] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 8: [2023-05-10 10:10:48,678] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 8: [2023-05-10 10:10:48,678] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 8: [2023-05-10 10:10:48,678] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 8: [2023-05-10 10:10:48,678] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 28: [2023-05-10 10:10:48,680] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 28: [2023-05-10 10:10:48,680] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 28: [2023-05-10 10:10:48,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 28: [2023-05-10 10:10:48,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 28: [2023-05-10 10:10:48,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 28: [2023-05-10 10:10:48,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 28: [2023-05-10 10:10:48,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 28: [2023-05-10 10:10:48,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 28: [2023-05-10 10:10:48,684] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 28: [2023-05-10 10:10:48,685] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 8: [2023-05-10 10:10:48,689] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 28: [2023-05-10 10:10:48,690] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 28: [2023-05-10 10:10:48,691] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 28: [2023-05-10 10:10:48,691] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 28: [2023-05-10 10:10:48,692] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 28: [2023-05-10 10:10:48,692] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 28: [2023-05-10 10:10:48,692] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 1: [2023-05-10 10:10:48,696] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 1: [2023-05-10 10:10:48,696] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 8: [2023-05-10 10:10:48,698] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 8: [2023-05-10 10:10:48,698] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 1: [2023-05-10 10:10:48,699] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 1: [2023-05-10 10:10:48,704] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 18: [2023-05-10 10:10:48,703] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 18: [2023-05-10 10:10:48,703] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 1: [2023-05-10 10:10:48,705] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 1: [2023-05-10 10:10:48,705] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 1: [2023-05-10 10:10:48,705] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 1: [2023-05-10 10:10:48,705] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 18: [2023-05-10 10:10:48,705] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 18: [2023-05-10 10:10:48,705] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 18: [2023-05-10 10:10:48,705] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 18: [2023-05-10 10:10:48,705] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 18: [2023-05-10 10:10:48,705] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 18: [2023-05-10 10:10:48,706] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 8: [2023-05-10 10:10:48,707] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 8: [2023-05-10 10:10:48,707] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 8: [2023-05-10 10:10:48,708] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 8: [2023-05-10 10:10:48,708] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 8: [2023-05-10 10:10:48,709] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 18: [2023-05-10 10:10:48,709] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 18: [2023-05-10 10:10:48,709] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 18: [2023-05-10 10:10:48,710] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 15: [2023-05-10 10:10:48,710] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 18: [2023-05-10 10:10:48,710] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 15: [2023-05-10 10:10:48,711] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 15: [2023-05-10 10:10:48,711] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 15: [2023-05-10 10:10:48,711] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 15: [2023-05-10 10:10:48,711] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 15: [2023-05-10 10:10:48,711] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 15: [2023-05-10 10:10:48,711] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 15: [2023-05-10 10:10:48,711] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 18: [2023-05-10 10:10:48,712] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 18: [2023-05-10 10:10:48,713] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 18: [2023-05-10 10:10:48,714] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 18: [2023-05-10 10:10:48,714] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 15: [2023-05-10 10:10:48,715] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 1: [2023-05-10 10:10:48,720] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 31: [2023-05-10 10:10:48,719] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 31: [2023-05-10 10:10:48,719] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 31: [2023-05-10 10:10:48,720] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 31: [2023-05-10 10:10:48,720] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 31: [2023-05-10 10:10:48,720] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 31: [2023-05-10 10:10:48,720] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 31: [2023-05-10 10:10:48,720] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 1: [2023-05-10 10:10:48,720] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 31: [2023-05-10 10:10:48,721] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 15: [2023-05-10 10:10:48,721] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 15: [2023-05-10 10:10:48,722] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 15: [2023-05-10 10:10:48,722] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 15: [2023-05-10 10:10:48,722] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 15: [2023-05-10 10:10:48,723] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 31: [2023-05-10 10:10:48,723] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 15: [2023-05-10 10:10:48,724] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 15: [2023-05-10 10:10:48,724] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 31: [2023-05-10 10:10:48,724] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 1: [2023-05-10 10:10:48,726] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 31: [2023-05-10 10:10:48,728] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 31: [2023-05-10 10:10:48,729] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 31: [2023-05-10 10:10:48,729] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 31: [2023-05-10 10:10:48,729] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 31: [2023-05-10 10:10:48,729] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 31: [2023-05-10 10:10:48,730] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 1: [2023-05-10 10:10:48,733] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 1: [2023-05-10 10:10:48,734] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 1: [2023-05-10 10:10:48,734] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 1: [2023-05-10 10:10:48,735] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 1: [2023-05-10 10:10:48,736] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 28: [2023-05-10 10:10:48,737] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 28: [2023-05-10 10:10:48,737] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 17: [2023-05-10 10:10:48,743] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 17: [2023-05-10 10:10:48,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 17: [2023-05-10 10:10:48,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 17: [2023-05-10 10:10:48,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 17: [2023-05-10 10:10:48,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 17: [2023-05-10 10:10:48,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 17: [2023-05-10 10:10:48,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 17: [2023-05-10 10:10:48,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 17: [2023-05-10 10:10:48,748] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 17: [2023-05-10 10:10:48,755] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 17: [2023-05-10 10:10:48,755] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 17: [2023-05-10 10:10:48,756] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 17: [2023-05-10 10:10:48,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 17: [2023-05-10 10:10:48,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 17: [2023-05-10 10:10:48,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 17: [2023-05-10 10:10:48,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 28: [2023-05-10 10:10:48,758] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 28: [2023-05-10 10:10:48,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 28: [2023-05-10 10:10:48,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 28: [2023-05-10 10:10:48,760] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 28: [2023-05-10 10:10:48,760] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 28: [2023-05-10 10:10:48,761] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 28: [2023-05-10 10:10:48,761] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 28: [2023-05-10 10:10:48,762] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 15: [2023-05-10 10:10:48,763] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 18: [2023-05-10 10:10:48,767] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 18: [2023-05-10 10:10:48,767] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 18: [2023-05-10 10:10:48,767] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 18: [2023-05-10 10:10:48,767] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 18: [2023-05-10 10:10:48,772] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 18: [2023-05-10 10:10:48,772] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 18: [2023-05-10 10:10:48,772] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 18: [2023-05-10 10:10:48,772] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 31: [2023-05-10 10:10:48,774] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 5: [2023-05-10 10:10:48,777] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 5: [2023-05-10 10:10:48,777] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 31: [2023-05-10 10:10:48,777] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 5: [2023-05-10 10:10:48,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 5: [2023-05-10 10:10:48,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 5: [2023-05-10 10:10:48,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 5: [2023-05-10 10:10:48,779] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 5: [2023-05-10 10:10:48,779] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 5: [2023-05-10 10:10:48,779] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 24: [2023-05-10 10:10:48,780] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 24: [2023-05-10 10:10:48,780] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 6: [2023-05-10 10:10:48,781] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 6: [2023-05-10 10:10:48,781] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 6: [2023-05-10 10:10:48,781] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 6: [2023-05-10 10:10:48,781] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 6: [2023-05-10 10:10:48,781] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 6: [2023-05-10 10:10:48,781] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 6: [2023-05-10 10:10:48,781] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 6: [2023-05-10 10:10:48,782] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 24: [2023-05-10 10:10:48,782] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 24: [2023-05-10 10:10:48,782] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 24: [2023-05-10 10:10:48,782] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 24: [2023-05-10 10:10:48,782] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 24: [2023-05-10 10:10:48,782] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 24: [2023-05-10 10:10:48,782] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 5: [2023-05-10 10:10:48,783] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 5: [2023-05-10 10:10:48,783] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 5: [2023-05-10 10:10:48,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 24: [2023-05-10 10:10:48,785] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 23: [2023-05-10 10:10:48,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 23: [2023-05-10 10:10:48,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 23: [2023-05-10 10:10:48,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 23: [2023-05-10 10:10:48,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 6: [2023-05-10 10:10:48,785] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 6: [2023-05-10 10:10:48,785] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 23: [2023-05-10 10:10:48,786] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 23: [2023-05-10 10:10:48,786] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 23: [2023-05-10 10:10:48,786] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 24: [2023-05-10 10:10:48,786] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 23: [2023-05-10 10:10:48,786] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 15: [2023-05-10 10:10:48,786] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 28: [2023-05-10 10:10:48,787] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 29: [2023-05-10 10:10:48,787] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 29: [2023-05-10 10:10:48,787] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 29: [2023-05-10 10:10:48,787] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 5: [2023-05-10 10:10:48,787] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 5: [2023-05-10 10:10:48,788] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 29: [2023-05-10 10:10:48,788] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 29: [2023-05-10 10:10:48,788] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 29: [2023-05-10 10:10:48,788] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 29: [2023-05-10 10:10:48,788] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 29: [2023-05-10 10:10:48,788] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 5: [2023-05-10 10:10:48,788] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 5: [2023-05-10 10:10:48,789] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 5: [2023-05-10 10:10:48,789] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 24: [2023-05-10 10:10:48,789] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 24: [2023-05-10 10:10:48,789] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 24: [2023-05-10 10:10:48,790] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 24: [2023-05-10 10:10:48,790] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 18: [2023-05-10 10:10:48,790] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 24: [2023-05-10 10:10:48,790] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 23: [2023-05-10 10:10:48,790] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 23: [2023-05-10 10:10:48,790] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 23: [2023-05-10 10:10:48,790] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 24: [2023-05-10 10:10:48,790] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 6: [2023-05-10 10:10:48,790] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 6: [2023-05-10 10:10:48,791] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 6: [2023-05-10 10:10:48,792] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 15: [2023-05-10 10:10:48,792] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 15: [2023-05-10 10:10:48,792] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 6: [2023-05-10 10:10:48,792] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 29: [2023-05-10 10:10:48,792] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 29: [2023-05-10 10:10:48,792] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 6: [2023-05-10 10:10:48,793] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 6: [2023-05-10 10:10:48,793] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 28: [2023-05-10 10:10:48,794] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 28: [2023-05-10 10:10:48,794] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 23: [2023-05-10 10:10:48,793] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 23: [2023-05-10 10:10:48,794] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 18: [2023-05-10 10:10:48,794] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 23: [2023-05-10 10:10:48,794] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 31: [2023-05-10 10:10:48,794] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 23: [2023-05-10 10:10:48,795] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 23: [2023-05-10 10:10:48,795] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 31: [2023-05-10 10:10:48,795] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 31: [2023-05-10 10:10:48,796] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 28: [2023-05-10 10:10:48,796] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 28: [2023-05-10 10:10:48,796] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 29: [2023-05-10 10:10:48,797] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 18: [2023-05-10 10:10:48,796] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 29: [2023-05-10 10:10:48,797] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 18: [2023-05-10 10:10:48,797] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 29: [2023-05-10 10:10:48,797] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 28: [2023-05-10 10:10:48,797] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 29: [2023-05-10 10:10:48,797] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 29: [2023-05-10 10:10:48,798] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 29: [2023-05-10 10:10:48,798] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 21: [2023-05-10 10:10:48,798] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 21: [2023-05-10 10:10:48,798] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 21: [2023-05-10 10:10:48,798] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 21: [2023-05-10 10:10:48,798] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 21: [2023-05-10 10:10:48,798] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 21: [2023-05-10 10:10:48,798] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 21: [2023-05-10 10:10:48,798] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 21: [2023-05-10 10:10:48,799] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 18: [2023-05-10 10:10:48,799] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 15: [2023-05-10 10:10:48,800] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 15: [2023-05-10 10:10:48,800] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 15: [2023-05-10 10:10:48,800] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 15: [2023-05-10 10:10:48,800] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 15: [2023-05-10 10:10:48,800] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 17: [2023-05-10 10:10:48,800] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 31: [2023-05-10 10:10:48,801] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 31: [2023-05-10 10:10:48,801] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 31: [2023-05-10 10:10:48,801] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 31: [2023-05-10 10:10:48,801] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 31: [2023-05-10 10:10:48,801] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 18: [2023-05-10 10:10:48,803] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 18: [2023-05-10 10:10:48,803] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 18: [2023-05-10 10:10:48,803] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 21: [2023-05-10 10:10:48,805] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 21: [2023-05-10 10:10:48,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 21: [2023-05-10 10:10:48,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 21: [2023-05-10 10:10:48,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 21: [2023-05-10 10:10:48,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 21: [2023-05-10 10:10:48,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 21: [2023-05-10 10:10:48,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 21: [2023-05-10 10:10:48,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 15: [2023-05-10 10:10:48,819] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 31: [2023-05-10 10:10:48,821] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 15: [2023-05-10 10:10:48,822] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 30: [2023-05-10 10:10:48,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 30: [2023-05-10 10:10:48,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 30: [2023-05-10 10:10:48,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 30: [2023-05-10 10:10:48,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 30: [2023-05-10 10:10:48,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 30: [2023-05-10 10:10:48,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 30: [2023-05-10 10:10:48,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 30: [2023-05-10 10:10:48,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 17: [2023-05-10 10:10:48,823] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 22: [2023-05-10 10:10:48,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 22: [2023-05-10 10:10:48,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 22: [2023-05-10 10:10:48,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 22: [2023-05-10 10:10:48,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 22: [2023-05-10 10:10:48,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 22: [2023-05-10 10:10:48,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 22: [2023-05-10 10:10:48,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 22: [2023-05-10 10:10:48,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 19: [2023-05-10 10:10:48,827] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 19: [2023-05-10 10:10:48,827] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 17: [2023-05-10 10:10:48,828] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 17: [2023-05-10 10:10:48,829] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 19: [2023-05-10 10:10:48,830] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 19: [2023-05-10 10:10:48,830] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 19: [2023-05-10 10:10:48,830] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 19: [2023-05-10 10:10:48,830] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 19: [2023-05-10 10:10:48,830] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 19: [2023-05-10 10:10:48,831] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 22: [2023-05-10 10:10:48,830] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 22: [2023-05-10 10:10:48,831] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 31: [2023-05-10 10:10:48,831] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 19: [2023-05-10 10:10:48,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 31: [2023-05-10 10:10:48,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 15: [2023-05-10 10:10:48,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 19: [2023-05-10 10:10:48,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 31: [2023-05-10 10:10:48,833] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 15: [2023-05-10 10:10:48,833] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 31: [2023-05-10 10:10:48,833] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 30: [2023-05-10 10:10:48,833] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 30: [2023-05-10 10:10:48,834] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 17: [2023-05-10 10:10:48,834] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 17: [2023-05-10 10:10:48,834] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 17: [2023-05-10 10:10:48,834] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 17: [2023-05-10 10:10:48,834] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 17: [2023-05-10 10:10:48,834] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 31: [2023-05-10 10:10:48,834] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 15: [2023-05-10 10:10:48,835] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 15: [2023-05-10 10:10:48,835] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 19: [2023-05-10 10:10:48,834] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 30: [2023-05-10 10:10:48,835] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 30: [2023-05-10 10:10:48,835] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 22: [2023-05-10 10:10:48,835] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 22: [2023-05-10 10:10:48,835] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 22: [2023-05-10 10:10:48,835] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 30: [2023-05-10 10:10:48,835] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 30: [2023-05-10 10:10:48,836] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 15: [2023-05-10 10:10:48,836] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 30: [2023-05-10 10:10:48,836] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 30: [2023-05-10 10:10:48,836] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 6: [2023-05-10 10:10:48,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 6: [2023-05-10 10:10:48,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 22: [2023-05-10 10:10:48,837] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 22: [2023-05-10 10:10:48,837] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 22: [2023-05-10 10:10:48,837] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 5: [2023-05-10 10:10:48,837] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 5: [2023-05-10 10:10:48,837] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 24: [2023-05-10 10:10:48,838] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 5: [2023-05-10 10:10:48,838] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 24: [2023-05-10 10:10:48,838] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 19: [2023-05-10 10:10:48,839] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 19: [2023-05-10 10:10:48,839] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 19: [2023-05-10 10:10:48,839] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 19: [2023-05-10 10:10:48,840] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 19: [2023-05-10 10:10:48,840] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 29: [2023-05-10 10:10:48,843] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 29: [2023-05-10 10:10:48,843] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 23: [2023-05-10 10:10:48,844] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 23: [2023-05-10 10:10:48,844] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 23: [2023-05-10 10:10:48,844] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 10: [2023-05-10 10:10:48,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 10: [2023-05-10 10:10:48,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 10: [2023-05-10 10:10:48,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 10: [2023-05-10 10:10:48,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 10: [2023-05-10 10:10:48,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 10: [2023-05-10 10:10:48,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 10: [2023-05-10 10:10:48,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 10: [2023-05-10 10:10:48,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 5: [2023-05-10 10:10:48,852] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 5: [2023-05-10 10:10:48,852] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 5: [2023-05-10 10:10:48,854] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 5: [2023-05-10 10:10:48,854] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 5: [2023-05-10 10:10:48,854] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 23: [2023-05-10 10:10:48,855] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 6: [2023-05-10 10:10:48,856] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 6: [2023-05-10 10:10:48,856] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 20: [2023-05-10 10:10:48,856] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 20: [2023-05-10 10:10:48,857] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 20: [2023-05-10 10:10:48,857] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 20: [2023-05-10 10:10:48,857] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 20: [2023-05-10 10:10:48,857] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 20: [2023-05-10 10:10:48,857] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 20: [2023-05-10 10:10:48,857] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 10: [2023-05-10 10:10:48,857] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 20: [2023-05-10 10:10:48,857] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 13: [2023-05-10 10:10:48,857] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 13: [2023-05-10 10:10:48,857] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 13: [2023-05-10 10:10:48,857] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 13: [2023-05-10 10:10:48,858] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 13: [2023-05-10 10:10:48,858] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 13: [2023-05-10 10:10:48,858] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 13: [2023-05-10 10:10:48,858] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 13: [2023-05-10 10:10:48,858] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 10: [2023-05-10 10:10:48,858] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 10: [2023-05-10 10:10:48,858] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 10: [2023-05-10 10:10:48,858] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 10: [2023-05-10 10:10:48,859] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 10: [2023-05-10 10:10:48,859] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 23: [2023-05-10 10:10:48,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 23: [2023-05-10 10:10:48,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 23: [2023-05-10 10:10:48,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 21: [2023-05-10 10:10:48,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 21: [2023-05-10 10:10:48,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 6: [2023-05-10 10:10:48,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 23: [2023-05-10 10:10:48,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 6: [2023-05-10 10:10:48,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 10: [2023-05-10 10:10:48,860] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 10: [2023-05-10 10:10:48,860] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 20: [2023-05-10 10:10:48,861] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 21: [2023-05-10 10:10:48,861] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 24: [2023-05-10 10:10:48,862] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 24: [2023-05-10 10:10:48,862] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 24: [2023-05-10 10:10:48,862] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 24: [2023-05-10 10:10:48,862] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 24: [2023-05-10 10:10:48,862] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 24: [2023-05-10 10:10:48,862] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 24: [2023-05-10 10:10:48,862] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 17: [2023-05-10 10:10:48,862] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 17: [2023-05-10 10:10:48,862] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 29: [2023-05-10 10:10:48,862] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 5: [2023-05-10 10:10:48,863] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 24: [2023-05-10 10:10:48,863] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 6: [2023-05-10 10:10:48,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 6: [2023-05-10 10:10:48,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 6: [2023-05-10 10:10:48,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 6: [2023-05-10 10:10:48,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 5: [2023-05-10 10:10:48,863] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 5: [2023-05-10 10:10:48,863] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 17: [2023-05-10 10:10:48,865] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 0: [2023-05-10 10:10:48,866] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 0: [2023-05-10 10:10:48,866] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 0: [2023-05-10 10:10:48,866] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 29: [2023-05-10 10:10:48,866] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 0: [2023-05-10 10:10:48,867] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 0: [2023-05-10 10:10:48,867] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 0: [2023-05-10 10:10:48,867] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 0: [2023-05-10 10:10:48,867] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 0: [2023-05-10 10:10:48,867] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 23: [2023-05-10 10:10:48,867] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 20: [2023-05-10 10:10:48,867] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 20: [2023-05-10 10:10:48,868] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 29: [2023-05-10 10:10:48,868] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 29: [2023-05-10 10:10:48,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 29: [2023-05-10 10:10:48,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 29: [2023-05-10 10:10:48,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 29: [2023-05-10 10:10:48,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 29: [2023-05-10 10:10:48,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 20: [2023-05-10 10:10:48,868] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 20: [2023-05-10 10:10:48,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 20: [2023-05-10 10:10:48,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 20: [2023-05-10 10:10:48,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 23: [2023-05-10 10:10:48,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 23: [2023-05-10 10:10:48,870] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 13: [2023-05-10 10:10:48,870] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 20: [2023-05-10 10:10:48,870] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 13: [2023-05-10 10:10:48,871] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 17: [2023-05-10 10:10:48,871] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 0: [2023-05-10 10:10:48,871] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 0: [2023-05-10 10:10:48,871] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 21: [2023-05-10 10:10:48,871] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 13: [2023-05-10 10:10:48,871] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 13: [2023-05-10 10:10:48,872] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 0: [2023-05-10 10:10:48,872] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 13: [2023-05-10 10:10:48,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 0: [2023-05-10 10:10:48,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 0: [2023-05-10 10:10:48,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 17: [2023-05-10 10:10:48,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 0: [2023-05-10 10:10:48,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 21: [2023-05-10 10:10:48,873] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 21: [2023-05-10 10:10:48,873] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 21: [2023-05-10 10:10:48,873] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 21: [2023-05-10 10:10:48,873] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 17: [2023-05-10 10:10:48,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 17: [2023-05-10 10:10:48,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 13: [2023-05-10 10:10:48,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 0: [2023-05-10 10:10:48,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 13: [2023-05-10 10:10:48,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 0: [2023-05-10 10:10:48,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 13: [2023-05-10 10:10:48,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 5: [2023-05-10 10:10:48,877] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 5: [2023-05-10 10:10:48,878] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 4: [2023-05-10 10:10:48,880] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 4: [2023-05-10 10:10:48,880] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 21: [2023-05-10 10:10:48,881] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 4: [2023-05-10 10:10:48,881] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 4: [2023-05-10 10:10:48,881] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 5: [2023-05-10 10:10:48,881] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 4: [2023-05-10 10:10:48,881] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 4: [2023-05-10 10:10:48,881] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 4: [2023-05-10 10:10:48,881] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 4: [2023-05-10 10:10:48,881] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 5: [2023-05-10 10:10:48,882] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 21: [2023-05-10 10:10:48,882] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 22: [2023-05-10 10:10:48,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 5: [2023-05-10 10:10:48,883] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 23: [2023-05-10 10:10:48,885] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 23: [2023-05-10 10:10:48,885] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 19: [2023-05-10 10:10:48,885] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 19: [2023-05-10 10:10:48,885] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 19: [2023-05-10 10:10:48,885] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 6: [2023-05-10 10:10:48,886] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 23: [2023-05-10 10:10:48,886] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 29: [2023-05-10 10:10:48,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 4: [2023-05-10 10:10:48,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 4: [2023-05-10 10:10:48,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 4: [2023-05-10 10:10:48,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 4: [2023-05-10 10:10:48,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 4: [2023-05-10 10:10:48,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 4: [2023-05-10 10:10:48,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 4: [2023-05-10 10:10:48,888] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 23: [2023-05-10 10:10:48,888] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 6: [2023-05-10 10:10:48,888] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 21: [2023-05-10 10:10:48,888] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 22: [2023-05-10 10:10:48,888] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 4: [2023-05-10 10:10:48,889] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 23: [2023-05-10 10:10:48,890] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 3: [2023-05-10 10:10:48,892] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 3: [2023-05-10 10:10:48,892] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 24: [2023-05-10 10:10:48,893] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 3: [2023-05-10 10:10:48,893] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 3: [2023-05-10 10:10:48,893] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 16: [2023-05-10 10:10:48,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 6: [2023-05-10 10:10:48,893] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 24: [2023-05-10 10:10:48,894] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 16: [2023-05-10 10:10:48,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 3: [2023-05-10 10:10:48,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 3: [2023-05-10 10:10:48,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 3: [2023-05-10 10:10:48,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 6: [2023-05-10 10:10:48,894] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 3: [2023-05-10 10:10:48,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 16: [2023-05-10 10:10:48,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 16: [2023-05-10 10:10:48,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 16: [2023-05-10 10:10:48,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 16: [2023-05-10 10:10:48,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 16: [2023-05-10 10:10:48,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 12: [2023-05-10 10:10:48,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 12: [2023-05-10 10:10:48,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 12: [2023-05-10 10:10:48,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 12: [2023-05-10 10:10:48,895] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 12: [2023-05-10 10:10:48,895] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 12: [2023-05-10 10:10:48,895] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 12: [2023-05-10 10:10:48,895] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 12: [2023-05-10 10:10:48,895] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 24: [2023-05-10 10:10:48,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 16: [2023-05-10 10:10:48,895] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 24: [2023-05-10 10:10:48,896] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 3: [2023-05-10 10:10:48,896] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 6: [2023-05-10 10:10:48,897] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 3: [2023-05-10 10:10:48,897] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 6: [2023-05-10 10:10:48,897] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 29: [2023-05-10 10:10:48,897] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 24: [2023-05-10 10:10:48,897] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 3: [2023-05-10 10:10:48,898] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 19: [2023-05-10 10:10:48,898] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 19: [2023-05-10 10:10:48,898] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 24: [2023-05-10 10:10:48,899] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 12: [2023-05-10 10:10:48,899] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 29: [2023-05-10 10:10:48,899] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 16: [2023-05-10 10:10:48,899] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 16: [2023-05-10 10:10:48,900] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 21: [2023-05-10 10:10:48,901] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 27: [2023-05-10 10:10:48,901] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 27: [2023-05-10 10:10:48,901] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 29: [2023-05-10 10:10:48,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 29: [2023-05-10 10:10:48,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 27: [2023-05-10 10:10:48,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 3: [2023-05-10 10:10:48,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 27: [2023-05-10 10:10:48,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 27: [2023-05-10 10:10:48,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 27: [2023-05-10 10:10:48,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 27: [2023-05-10 10:10:48,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 19: [2023-05-10 10:10:48,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 27: [2023-05-10 10:10:48,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 19: [2023-05-10 10:10:48,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 3: [2023-05-10 10:10:48,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 3: [2023-05-10 10:10:48,903] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 3: [2023-05-10 10:10:48,903] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 3: [2023-05-10 10:10:48,903] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 21: [2023-05-10 10:10:48,903] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 16: [2023-05-10 10:10:48,903] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 16: [2023-05-10 10:10:48,903] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 16: [2023-05-10 10:10:48,903] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 16: [2023-05-10 10:10:48,904] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 29: [2023-05-10 10:10:48,904] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 12: [2023-05-10 10:10:48,904] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 16: [2023-05-10 10:10:48,904] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 19: [2023-05-10 10:10:48,904] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 12: [2023-05-10 10:10:48,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 16: [2023-05-10 10:10:48,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 2: [2023-05-10 10:10:48,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 2: [2023-05-10 10:10:48,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 12: [2023-05-10 10:10:48,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 12: [2023-05-10 10:10:48,906] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 21: [2023-05-10 10:10:48,906] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 12: [2023-05-10 10:10:48,906] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 12: [2023-05-10 10:10:48,906] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 22: [2023-05-10 10:10:48,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 22: [2023-05-10 10:10:48,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 21: [2023-05-10 10:10:48,907] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 12: [2023-05-10 10:10:48,907] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 21: [2023-05-10 10:10:48,907] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 27: [2023-05-10 10:10:48,907] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 30: [2023-05-10 10:10:48,907] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 30: [2023-05-10 10:10:48,907] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 22: [2023-05-10 10:10:48,907] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 22: [2023-05-10 10:10:48,907] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 22: [2023-05-10 10:10:48,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 22: [2023-05-10 10:10:48,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 27: [2023-05-10 10:10:48,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 2: [2023-05-10 10:10:48,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 2: [2023-05-10 10:10:48,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 2: [2023-05-10 10:10:48,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 2: [2023-05-10 10:10:48,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 2: [2023-05-10 10:10:48,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 19: [2023-05-10 10:10:48,909] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 2: [2023-05-10 10:10:48,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 19: [2023-05-10 10:10:48,909] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 2: [2023-05-10 10:10:48,910] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 19: [2023-05-10 10:10:48,910] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 22: [2023-05-10 10:10:48,910] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 2: [2023-05-10 10:10:48,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 20: [2023-05-10 10:10:48,911] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 27: [2023-05-10 10:10:48,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 27: [2023-05-10 10:10:48,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 27: [2023-05-10 10:10:48,912] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 10: [2023-05-10 10:10:48,912] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 27: [2023-05-10 10:10:48,912] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 10: [2023-05-10 10:10:48,913] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 10: [2023-05-10 10:10:48,913] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 27: [2023-05-10 10:10:48,913] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 27: [2023-05-10 10:10:48,913] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 22: [2023-05-10 10:10:48,916] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 2: [2023-05-10 10:10:48,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 2: [2023-05-10 10:10:48,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 2: [2023-05-10 10:10:48,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 2: [2023-05-10 10:10:48,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 2: [2023-05-10 10:10:48,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 2: [2023-05-10 10:10:48,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 10: [2023-05-10 10:10:48,924] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 10: [2023-05-10 10:10:48,924] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 10: [2023-05-10 10:10:48,924] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 10: [2023-05-10 10:10:48,924] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 19: [2023-05-10 10:10:48,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 10: [2023-05-10 10:10:48,924] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 26: [2023-05-10 10:10:48,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 26: [2023-05-10 10:10:48,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 30: [2023-05-10 10:10:48,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 30: [2023-05-10 10:10:48,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 30: [2023-05-10 10:10:48,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 30: [2023-05-10 10:10:48,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 30: [2023-05-10 10:10:48,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 30: [2023-05-10 10:10:48,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 26: [2023-05-10 10:10:48,927] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 26: [2023-05-10 10:10:48,927] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 26: [2023-05-10 10:10:48,927] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 26: [2023-05-10 10:10:48,927] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 26: [2023-05-10 10:10:48,927] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 26: [2023-05-10 10:10:48,927] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 19: [2023-05-10 10:10:48,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 19: [2023-05-10 10:10:48,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 19: [2023-05-10 10:10:48,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 26: [2023-05-10 10:10:48,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 26: [2023-05-10 10:10:48,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 30: [2023-05-10 10:10:48,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 0: [2023-05-10 10:10:48,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 0: [2023-05-10 10:10:48,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 0: [2023-05-10 10:10:48,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 20: [2023-05-10 10:10:48,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 30: [2023-05-10 10:10:48,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 19: [2023-05-10 10:10:48,932] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 9: [2023-05-10 10:10:48,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 9: [2023-05-10 10:10:48,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 26: [2023-05-10 10:10:48,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 26: [2023-05-10 10:10:48,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 9: [2023-05-10 10:10:48,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 9: [2023-05-10 10:10:48,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 9: [2023-05-10 10:10:48,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 9: [2023-05-10 10:10:48,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 9: [2023-05-10 10:10:48,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 7: [2023-05-10 10:10:48,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 7: [2023-05-10 10:10:48,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 26: [2023-05-10 10:10:48,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 10: [2023-05-10 10:10:48,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 20: [2023-05-10 10:10:48,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 20: [2023-05-10 10:10:48,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 9: [2023-05-10 10:10:48,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 7: [2023-05-10 10:10:48,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 7: [2023-05-10 10:10:48,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 7: [2023-05-10 10:10:48,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 7: [2023-05-10 10:10:48,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 7: [2023-05-10 10:10:48,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 7: [2023-05-10 10:10:48,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 22: [2023-05-10 10:10:48,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 22: [2023-05-10 10:10:48,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 26: [2023-05-10 10:10:48,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 26: [2023-05-10 10:10:48,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 26: [2023-05-10 10:10:48,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 10: [2023-05-10 10:10:48,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 9: [2023-05-10 10:10:48,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 8: [2023-05-10 10:10:48,940] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 8: [2023-05-10 10:10:48,940] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 8: [2023-05-10 10:10:48,940] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 9: [2023-05-10 10:10:48,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 8: [2023-05-10 10:10:48,940] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 8: [2023-05-10 10:10:48,940] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 8: [2023-05-10 10:10:48,940] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 8: [2023-05-10 10:10:48,940] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 8: [2023-05-10 10:10:48,941] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 0: [2023-05-10 10:10:48,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 0: [2023-05-10 10:10:48,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 0: [2023-05-10 10:10:48,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 0: [2023-05-10 10:10:48,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 0: [2023-05-10 10:10:48,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 11: [2023-05-10 10:10:48,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 11: [2023-05-10 10:10:48,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 11: [2023-05-10 10:10:48,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 11: [2023-05-10 10:10:48,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 11: [2023-05-10 10:10:48,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 11: [2023-05-10 10:10:48,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 11: [2023-05-10 10:10:48,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 11: [2023-05-10 10:10:48,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 10: [2023-05-10 10:10:48,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 22: [2023-05-10 10:10:48,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 22: [2023-05-10 10:10:48,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 22: [2023-05-10 10:10:48,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 7: [2023-05-10 10:10:48,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 7: [2023-05-10 10:10:48,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 9: [2023-05-10 10:10:48,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 9: [2023-05-10 10:10:48,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 7: [2023-05-10 10:10:48,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 7: [2023-05-10 10:10:48,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 9: [2023-05-10 10:10:48,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 4: [2023-05-10 10:10:48,944] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 4: [2023-05-10 10:10:48,944] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 4: [2023-05-10 10:10:48,944] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 9: [2023-05-10 10:10:48,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 22: [2023-05-10 10:10:48,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 7: [2023-05-10 10:10:48,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 9: [2023-05-10 10:10:48,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 9: [2023-05-10 10:10:48,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 7: [2023-05-10 10:10:48,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 7: [2023-05-10 10:10:48,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 20: [2023-05-10 10:10:48,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 20: [2023-05-10 10:10:48,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 20: [2023-05-10 10:10:48,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 20: [2023-05-10 10:10:48,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 20: [2023-05-10 10:10:48,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 8: [2023-05-10 10:10:48,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 8: [2023-05-10 10:10:48,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 8: [2023-05-10 10:10:48,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 8: [2023-05-10 10:10:48,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 8: [2023-05-10 10:10:48,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 8: [2023-05-10 10:10:48,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 7: [2023-05-10 10:10:48,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 8: [2023-05-10 10:10:48,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 8: [2023-05-10 10:10:48,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 11: [2023-05-10 10:10:48,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 11: [2023-05-10 10:10:48,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 13: [2023-05-10 10:10:48,948] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 13: [2023-05-10 10:10:48,948] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 13: [2023-05-10 10:10:48,948] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 4: [2023-05-10 10:10:48,949] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 4: [2023-05-10 10:10:48,949] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 4: [2023-05-10 10:10:48,949] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 4: [2023-05-10 10:10:48,949] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 4: [2023-05-10 10:10:48,950] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 3: [2023-05-10 10:10:48,950] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 3: [2023-05-10 10:10:48,950] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 3: [2023-05-10 10:10:48,950] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 16: [2023-05-10 10:10:48,950] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 10: [2023-05-10 10:10:48,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 12: [2023-05-10 10:10:48,951] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 0: [2023-05-10 10:10:48,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 11: [2023-05-10 10:10:48,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 11: [2023-05-10 10:10:48,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 11: [2023-05-10 10:10:48,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 11: [2023-05-10 10:10:48,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 11: [2023-05-10 10:10:48,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 16: [2023-05-10 10:10:48,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 11: [2023-05-10 10:10:48,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 10: [2023-05-10 10:10:48,955] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 10: [2023-05-10 10:10:48,956] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 10: [2023-05-10 10:10:48,957] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 10: [2023-05-10 10:10:48,957] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 0: [2023-05-10 10:10:48,957] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 0: [2023-05-10 10:10:48,958] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 30: [2023-05-10 10:10:48,958] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 27: [2023-05-10 10:10:48,958] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 27: [2023-05-10 10:10:48,961] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 2: [2023-05-10 10:10:48,960] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 2: [2023-05-10 10:10:48,960] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 3: [2023-05-10 10:10:48,961] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 13: [2023-05-10 10:10:48,962] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 13: [2023-05-10 10:10:48,962] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 13: [2023-05-10 10:10:48,962] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 13: [2023-05-10 10:10:48,962] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 13: [2023-05-10 10:10:48,962] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 30: [2023-05-10 10:10:48,962] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 30: [2023-05-10 10:10:48,963] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 30: [2023-05-10 10:10:48,963] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 20: [2023-05-10 10:10:48,963] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 20: [2023-05-10 10:10:48,963] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 30: [2023-05-10 10:10:48,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 30: [2023-05-10 10:10:48,966] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 3: [2023-05-10 10:10:48,967] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 3: [2023-05-10 10:10:48,967] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 3: [2023-05-10 10:10:48,967] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 3: [2023-05-10 10:10:48,967] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 1: [2023-05-10 10:10:48,971] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 4: [2023-05-10 10:10:48,970] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 1: [2023-05-10 10:10:48,971] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 1: [2023-05-10 10:10:48,971] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 1: [2023-05-10 10:10:48,971] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 1: [2023-05-10 10:10:48,971] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 1: [2023-05-10 10:10:48,971] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 16: [2023-05-10 10:10:48,971] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 1: [2023-05-10 10:10:48,971] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 1: [2023-05-10 10:10:48,971] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 3: [2023-05-10 10:10:48,972] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 4: [2023-05-10 10:10:48,972] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 4: [2023-05-10 10:10:48,973] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 0: [2023-05-10 10:10:48,973] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 0: [2023-05-10 10:10:48,973] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 13: [2023-05-10 10:10:48,973] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 0: [2023-05-10 10:10:48,974] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 3: [2023-05-10 10:10:48,974] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 3: [2023-05-10 10:10:48,974] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 16: [2023-05-10 10:10:48,974] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 20: [2023-05-10 10:10:48,975] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 14: [2023-05-10 10:10:48,975] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 14: [2023-05-10 10:10:48,975] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 4: [2023-05-10 10:10:48,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 14: [2023-05-10 10:10:48,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 14: [2023-05-10 10:10:48,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 14: [2023-05-10 10:10:48,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 14: [2023-05-10 10:10:48,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 14: [2023-05-10 10:10:48,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 14: [2023-05-10 10:10:48,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 0: [2023-05-10 10:10:48,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 0: [2023-05-10 10:10:48,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 1: [2023-05-10 10:10:48,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 1: [2023-05-10 10:10:48,977] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 16: [2023-05-10 10:10:48,977] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 16: [2023-05-10 10:10:48,977] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 16: [2023-05-10 10:10:48,977] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 16: [2023-05-10 10:10:48,977] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 4: [2023-05-10 10:10:48,977] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 16: [2023-05-10 10:10:48,977] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 12: [2023-05-10 10:10:48,977] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 12: [2023-05-10 10:10:48,977] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 1: [2023-05-10 10:10:48,978] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 12: [2023-05-10 10:10:48,978] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 16: [2023-05-10 10:10:48,978] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 1: [2023-05-10 10:10:48,978] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 1: [2023-05-10 10:10:48,979] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 1: [2023-05-10 10:10:48,979] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 13: [2023-05-10 10:10:48,979] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 20: [2023-05-10 10:10:48,979] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 20: [2023-05-10 10:10:48,979] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 13: [2023-05-10 10:10:48,979] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 1: [2023-05-10 10:10:48,980] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 4: [2023-05-10 10:10:48,980] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 4: [2023-05-10 10:10:48,980] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 27: [2023-05-10 10:10:48,980] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 1: [2023-05-10 10:10:48,980] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 4: [2023-05-10 10:10:48,980] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 20: [2023-05-10 10:10:48,981] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 20: [2023-05-10 10:10:48,981] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 2: [2023-05-10 10:10:48,981] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 27: [2023-05-10 10:10:48,981] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 2: [2023-05-10 10:10:48,981] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 14: [2023-05-10 10:10:48,981] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 27: [2023-05-10 10:10:48,981] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 27: [2023-05-10 10:10:48,981] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 27: [2023-05-10 10:10:48,981] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 27: [2023-05-10 10:10:48,981] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 27: [2023-05-10 10:10:48,982] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 14: [2023-05-10 10:10:48,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 14: [2023-05-10 10:10:48,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 26: [2023-05-10 10:10:48,982] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 26: [2023-05-10 10:10:48,982] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 27: [2023-05-10 10:10:48,984] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 14: [2023-05-10 10:10:48,984] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 14: [2023-05-10 10:10:48,984] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 14: [2023-05-10 10:10:48,985] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 3: [2023-05-10 10:10:48,986] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 14: [2023-05-10 10:10:48,986] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 14: [2023-05-10 10:10:48,986] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 12: [2023-05-10 10:10:48,987] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 12: [2023-05-10 10:10:48,987] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 12: [2023-05-10 10:10:48,987] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 12: [2023-05-10 10:10:48,987] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 12: [2023-05-10 10:10:48,987] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 2: [2023-05-10 10:10:48,988] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 2: [2023-05-10 10:10:48,988] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 2: [2023-05-10 10:10:48,988] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 2: [2023-05-10 10:10:48,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 2: [2023-05-10 10:10:48,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 2: [2023-05-10 10:10:48,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 3: [2023-05-10 10:10:48,994] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 3: [2023-05-10 10:10:48,995] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 3: [2023-05-10 10:10:48,997] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 3: [2023-05-10 10:10:48,997] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 13: [2023-05-10 10:10:48,998] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 13: [2023-05-10 10:10:48,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 8: [2023-05-10 10:10:49,000] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 11: [2023-05-10 10:10:48,999] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 11: [2023-05-10 10:10:48,999] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 9: [2023-05-10 10:10:49,000] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 9: [2023-05-10 10:10:49,000] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 9: [2023-05-10 10:10:49,000] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 12: [2023-05-10 10:10:49,001] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 13: [2023-05-10 10:10:49,002] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 13: [2023-05-10 10:10:49,002] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 8: [2023-05-10 10:10:49,002] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 8: [2023-05-10 10:10:49,002] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 13: [2023-05-10 10:10:49,002] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 7: [2023-05-10 10:10:49,002] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 7: [2023-05-10 10:10:49,002] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 7: [2023-05-10 10:10:49,002] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 7: [2023-05-10 10:10:49,002] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 16: [2023-05-10 10:10:49,003] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 9: [2023-05-10 10:10:49,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 9: [2023-05-10 10:10:49,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 9: [2023-05-10 10:10:49,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 9: [2023-05-10 10:10:49,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 9: [2023-05-10 10:10:49,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 12: [2023-05-10 10:10:49,003] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 26: [2023-05-10 10:10:49,004] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 26: [2023-05-10 10:10:49,004] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 26: [2023-05-10 10:10:49,004] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 16: [2023-05-10 10:10:49,005] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 7: [2023-05-10 10:10:49,005] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 7: [2023-05-10 10:10:49,006] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 7: [2023-05-10 10:10:49,006] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 7: [2023-05-10 10:10:49,006] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 26: [2023-05-10 10:10:49,007] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 26: [2023-05-10 10:10:49,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 26: [2023-05-10 10:10:49,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 26: [2023-05-10 10:10:49,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 26: [2023-05-10 10:10:49,007] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 16: [2023-05-10 10:10:49,008] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 16: [2023-05-10 10:10:49,009] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 27: [2023-05-10 10:10:49,009] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 16: [2023-05-10 10:10:49,010] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 16: [2023-05-10 10:10:49,011] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 27: [2023-05-10 10:10:49,011] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 8: [2023-05-10 10:10:49,013] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 8: [2023-05-10 10:10:49,014] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 8: [2023-05-10 10:10:49,014] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 8: [2023-05-10 10:10:49,014] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 27: [2023-05-10 10:10:49,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 8: [2023-05-10 10:10:49,014] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 27: [2023-05-10 10:10:49,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 27: [2023-05-10 10:10:49,015] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 27: [2023-05-10 10:10:49,015] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 12: [2023-05-10 10:10:49,019] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 12: [2023-05-10 10:10:49,019] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 12: [2023-05-10 10:10:49,020] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 2: [2023-05-10 10:10:49,020] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 12: [2023-05-10 10:10:49,020] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 25: [2023-05-10 10:10:49,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 25: [2023-05-10 10:10:49,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 25: [2023-05-10 10:10:49,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 25: [2023-05-10 10:10:49,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 25: [2023-05-10 10:10:49,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 25: [2023-05-10 10:10:49,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 25: [2023-05-10 10:10:49,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 2: [2023-05-10 10:10:49,020] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 25: [2023-05-10 10:10:49,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 12: [2023-05-10 10:10:49,021] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 2: [2023-05-10 10:10:49,021] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 11: [2023-05-10 10:10:49,021] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 11: [2023-05-10 10:10:49,021] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 11: [2023-05-10 10:10:49,021] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 11: [2023-05-10 10:10:49,023] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 11: [2023-05-10 10:10:49,023] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 11: [2023-05-10 10:10:49,024] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 2: [2023-05-10 10:10:49,024] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 11: [2023-05-10 10:10:49,024] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 11: [2023-05-10 10:10:49,024] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 2: [2023-05-10 10:10:49,026] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 2: [2023-05-10 10:10:49,026] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 25: [2023-05-10 10:10:49,026] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 25: [2023-05-10 10:10:49,026] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 8: [2023-05-10 10:10:49,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 25: [2023-05-10 10:10:49,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 25: [2023-05-10 10:10:49,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 25: [2023-05-10 10:10:49,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 25: [2023-05-10 10:10:49,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 25: [2023-05-10 10:10:49,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 28: [2023-05-10 10:10:49,028] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 28: [2023-05-10 10:10:49,028] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 28: [2023-05-10 10:10:49,028] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 28: [2023-05-10 10:10:49,028] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 28: [2023-05-10 10:10:49,028] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 25: [2023-05-10 10:10:49,029] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt... 28: [2023-05-10 10:10:49,028] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 28: [2023-05-10 10:10:49,028] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 9: [2023-05-10 10:10:49,029] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 28: [2023-05-10 10:10:49,029] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 9: [2023-05-10 10:10:49,029] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 9: [2023-05-10 10:10:49,029] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 7: [2023-05-10 10:10:49,029] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 7: [2023-05-10 10:10:49,029] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 7: [2023-05-10 10:10:49,030] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 9: [2023-05-10 10:10:49,030] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 1: [2023-05-10 10:10:49,030] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 1: [2023-05-10 10:10:49,030] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 9: [2023-05-10 10:10:49,031] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 7: [2023-05-10 10:10:49,032] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 9: [2023-05-10 10:10:49,032] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 28: [2023-05-10 10:10:49,032] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 28: [2023-05-10 10:10:49,033] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 9: [2023-05-10 10:10:49,033] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 26: [2023-05-10 10:10:49,033] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 9: [2023-05-10 10:10:49,033] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 26: [2023-05-10 10:10:49,033] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 7: [2023-05-10 10:10:49,034] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 7: [2023-05-10 10:10:49,034] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 7: [2023-05-10 10:10:49,034] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 7: [2023-05-10 10:10:49,034] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 14: [2023-05-10 10:10:49,034] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 14: [2023-05-10 10:10:49,034] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 1: [2023-05-10 10:10:49,035] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 26: [2023-05-10 10:10:49,036] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 26: [2023-05-10 10:10:49,037] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 26: [2023-05-10 10:10:49,038] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 8: [2023-05-10 10:10:49,038] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 28: [2023-05-10 10:10:49,038] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 1: [2023-05-10 10:10:49,039] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 28: [2023-05-10 10:10:49,039] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 14: [2023-05-10 10:10:49,039] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 28: [2023-05-10 10:10:49,039] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 28: [2023-05-10 10:10:49,040] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 28: [2023-05-10 10:10:49,040] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 28: [2023-05-10 10:10:49,040] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 8: [2023-05-10 10:10:49,041] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 8: [2023-05-10 10:10:49,042] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 26: [2023-05-10 10:10:49,043] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 8: [2023-05-10 10:10:49,044] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 8: [2023-05-10 10:10:49,045] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 8: [2023-05-10 10:10:49,045] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 1: [2023-05-10 10:10:49,046] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 1: [2023-05-10 10:10:49,046] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 1: [2023-05-10 10:10:49,046] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 1: [2023-05-10 10:10:49,046] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 14: [2023-05-10 10:10:49,046] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 8: [2023-05-10 10:10:49,047] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 11: [2023-05-10 10:10:49,048] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 18: [2023-05-10 10:10:49,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 18: [2023-05-10 10:10:49,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 18: [2023-05-10 10:10:49,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 18: [2023-05-10 10:10:49,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 18: [2023-05-10 10:10:49,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 18: [2023-05-10 10:10:49,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 18: [2023-05-10 10:10:49,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 18: [2023-05-10 10:10:49,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 11: [2023-05-10 10:10:49,050] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 14: [2023-05-10 10:10:49,050] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 14: [2023-05-10 10:10:49,050] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 14: [2023-05-10 10:10:49,051] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 11: [2023-05-10 10:10:49,051] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 14: [2023-05-10 10:10:49,051] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 11: [2023-05-10 10:10:49,054] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 11: [2023-05-10 10:10:49,055] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 18: [2023-05-10 10:10:49,054] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 18: [2023-05-10 10:10:49,054] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 18: [2023-05-10 10:10:49,055] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 1: [2023-05-10 10:10:49,055] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 18: [2023-05-10 10:10:49,055] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 11: [2023-05-10 10:10:49,056] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 1: [2023-05-10 10:10:49,056] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 18: [2023-05-10 10:10:49,056] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 14: [2023-05-10 10:10:49,057] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 18: [2023-05-10 10:10:49,057] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 18: [2023-05-10 10:10:49,058] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 18: [2023-05-10 10:10:49,058] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 14: [2023-05-10 10:10:49,059] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 1: [2023-05-10 10:10:49,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 1: [2023-05-10 10:10:49,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 14: [2023-05-10 10:10:49,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 1: [2023-05-10 10:10:49,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 1: [2023-05-10 10:10:49,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 1: [2023-05-10 10:10:49,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 1: [2023-05-10 10:10:49,075] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 14: [2023-05-10 10:10:49,075] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 14: [2023-05-10 10:10:49,078] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 14: [2023-05-10 10:10:49,079] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 14: [2023-05-10 10:10:49,081] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 14: [2023-05-10 10:10:49,084] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 28: [2023-05-10 10:10:49,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 28: [2023-05-10 10:10:49,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 25: [2023-05-10 10:10:49,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 25: [2023-05-10 10:10:49,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 25: [2023-05-10 10:10:49,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 25: [2023-05-10 10:10:49,087] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 25: [2023-05-10 10:10:49,087] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 25: [2023-05-10 10:10:49,087] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 25: [2023-05-10 10:10:49,087] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 25: [2023-05-10 10:10:49,087] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_05-model_00-model_states.pt. 28: [2023-05-10 10:10:49,105] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 28: [2023-05-10 10:10:49,105] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 28: [2023-05-10 10:10:49,106] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 28: [2023-05-10 10:10:49,106] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 28: [2023-05-10 10:10:49,109] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 28: [2023-05-10 10:10:49,109] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 28: [2023-05-10 10:10:49,109] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 28: [2023-05-10 10:10:49,110] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 18: [2023-05-10 10:10:49,112] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 18: [2023-05-10 10:10:49,112] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 18: [2023-05-10 10:10:49,112] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 18: [2023-05-10 10:10:49,112] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 25: [2023-05-10 10:10:49,113] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 25: [2023-05-10 10:10:49,114] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 25: [2023-05-10 10:10:49,115] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 25: [2023-05-10 10:10:49,115] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 25: [2023-05-10 10:10:49,116] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 25: [2023-05-10 10:10:49,116] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 18: [2023-05-10 10:10:49,116] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 18: [2023-05-10 10:10:49,116] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 18: [2023-05-10 10:10:49,116] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 18: [2023-05-10 10:10:49,116] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 25: [2023-05-10 10:10:49,117] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 25: [2023-05-10 10:10:49,118] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 28: [2023-05-10 10:10:49,135] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 28: [2023-05-10 10:10:49,136] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 18: [2023-05-10 10:10:49,137] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 18: [2023-05-10 10:10:49,139] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 18: [2023-05-10 10:10:49,139] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 28: [2023-05-10 10:10:49,141] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 28: [2023-05-10 10:10:49,142] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 28: [2023-05-10 10:10:49,142] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 18: [2023-05-10 10:10:49,142] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 18: [2023-05-10 10:10:49,146] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 18: [2023-05-10 10:10:49,146] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 28: [2023-05-10 10:10:49,147] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 18: [2023-05-10 10:10:49,147] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 18: [2023-05-10 10:10:49,147] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 31: [2023-05-10 10:10:49,150] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 31: [2023-05-10 10:10:49,150] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 31: [2023-05-10 10:10:49,150] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 31: [2023-05-10 10:10:49,150] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 31: [2023-05-10 10:10:49,150] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 31: [2023-05-10 10:10:49,150] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 31: [2023-05-10 10:10:49,150] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 31: [2023-05-10 10:10:49,151] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 24: [2023-05-10 10:10:49,153] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 24: [2023-05-10 10:10:49,153] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 24: [2023-05-10 10:10:49,153] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 24: [2023-05-10 10:10:49,153] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 24: [2023-05-10 10:10:49,153] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 24: [2023-05-10 10:10:49,153] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 24: [2023-05-10 10:10:49,153] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 24: [2023-05-10 10:10:49,153] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 31: [2023-05-10 10:10:49,154] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 31: [2023-05-10 10:10:49,155] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 24: [2023-05-10 10:10:49,158] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 24: [2023-05-10 10:10:49,158] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 31: [2023-05-10 10:10:49,159] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 31: [2023-05-10 10:10:49,159] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 31: [2023-05-10 10:10:49,159] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 31: [2023-05-10 10:10:49,160] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 31: [2023-05-10 10:10:49,160] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 31: [2023-05-10 10:10:49,160] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 24: [2023-05-10 10:10:49,161] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 24: [2023-05-10 10:10:49,161] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 24: [2023-05-10 10:10:49,161] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 24: [2023-05-10 10:10:49,162] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 24: [2023-05-10 10:10:49,162] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 24: [2023-05-10 10:10:49,162] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 5: [2023-05-10 10:10:49,163] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 5: [2023-05-10 10:10:49,163] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 5: [2023-05-10 10:10:49,163] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 5: [2023-05-10 10:10:49,164] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 5: [2023-05-10 10:10:49,164] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 5: [2023-05-10 10:10:49,164] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 5: [2023-05-10 10:10:49,164] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 5: [2023-05-10 10:10:49,165] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 5: [2023-05-10 10:10:49,168] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 5: [2023-05-10 10:10:49,169] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 5: [2023-05-10 10:10:49,170] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 5: [2023-05-10 10:10:49,173] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 5: [2023-05-10 10:10:49,173] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 5: [2023-05-10 10:10:49,174] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 5: [2023-05-10 10:10:49,174] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 5: [2023-05-10 10:10:49,175] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 31: [2023-05-10 10:10:49,205] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 31: [2023-05-10 10:10:49,207] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 24: [2023-05-10 10:10:49,209] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 24: [2023-05-10 10:10:49,209] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 23: [2023-05-10 10:10:49,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 23: [2023-05-10 10:10:49,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 23: [2023-05-10 10:10:49,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 23: [2023-05-10 10:10:49,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 23: [2023-05-10 10:10:49,213] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 23: [2023-05-10 10:10:49,213] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 23: [2023-05-10 10:10:49,213] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 23: [2023-05-10 10:10:49,213] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 29: [2023-05-10 10:10:49,213] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 29: [2023-05-10 10:10:49,213] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 29: [2023-05-10 10:10:49,213] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 29: [2023-05-10 10:10:49,213] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 29: [2023-05-10 10:10:49,213] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 29: [2023-05-10 10:10:49,213] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 29: [2023-05-10 10:10:49,213] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 29: [2023-05-10 10:10:49,213] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 23: [2023-05-10 10:10:49,218] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 29: [2023-05-10 10:10:49,218] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 23: [2023-05-10 10:10:49,218] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 29: [2023-05-10 10:10:49,218] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 23: [2023-05-10 10:10:49,218] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 23: [2023-05-10 10:10:49,219] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 23: [2023-05-10 10:10:49,220] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 23: [2023-05-10 10:10:49,221] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 23: [2023-05-10 10:10:49,222] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 23: [2023-05-10 10:10:49,222] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 29: [2023-05-10 10:10:49,222] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 29: [2023-05-10 10:10:49,223] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 29: [2023-05-10 10:10:49,223] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 29: [2023-05-10 10:10:49,223] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 5: [2023-05-10 10:10:49,223] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 5: [2023-05-10 10:10:49,223] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 29: [2023-05-10 10:10:49,224] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 5: [2023-05-10 10:10:49,224] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 29: [2023-05-10 10:10:49,224] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 31: [2023-05-10 10:10:49,228] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 31: [2023-05-10 10:10:49,228] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 31: [2023-05-10 10:10:49,228] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 31: [2023-05-10 10:10:49,232] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 31: [2023-05-10 10:10:49,232] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 31: [2023-05-10 10:10:49,232] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 31: [2023-05-10 10:10:49,232] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 31: [2023-05-10 10:10:49,232] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 24: [2023-05-10 10:10:49,233] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 24: [2023-05-10 10:10:49,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 24: [2023-05-10 10:10:49,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 24: [2023-05-10 10:10:49,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 24: [2023-05-10 10:10:49,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 24: [2023-05-10 10:10:49,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 24: [2023-05-10 10:10:49,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 21: [2023-05-10 10:10:49,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 21: [2023-05-10 10:10:49,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 21: [2023-05-10 10:10:49,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 21: [2023-05-10 10:10:49,235] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 21: [2023-05-10 10:10:49,235] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 21: [2023-05-10 10:10:49,235] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 21: [2023-05-10 10:10:49,235] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 21: [2023-05-10 10:10:49,235] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 24: [2023-05-10 10:10:49,236] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 5: [2023-05-10 10:10:49,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 5: [2023-05-10 10:10:49,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 5: [2023-05-10 10:10:49,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 5: [2023-05-10 10:10:49,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 5: [2023-05-10 10:10:49,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 21: [2023-05-10 10:10:49,240] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 21: [2023-05-10 10:10:49,241] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 21: [2023-05-10 10:10:49,241] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 21: [2023-05-10 10:10:49,242] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 21: [2023-05-10 10:10:49,242] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 21: [2023-05-10 10:10:49,243] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 21: [2023-05-10 10:10:49,244] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 21: [2023-05-10 10:10:49,244] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 5: [2023-05-10 10:10:49,248] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 5: [2023-05-10 10:10:49,248] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 5: [2023-05-10 10:10:49,248] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 20: [2023-05-10 10:10:49,248] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 20: [2023-05-10 10:10:49,248] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 20: [2023-05-10 10:10:49,248] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 20: [2023-05-10 10:10:49,249] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 20: [2023-05-10 10:10:49,249] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 20: [2023-05-10 10:10:49,249] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 20: [2023-05-10 10:10:49,249] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 20: [2023-05-10 10:10:49,249] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 20: [2023-05-10 10:10:49,252] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 31: [2023-05-10 10:10:49,255] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 20: [2023-05-10 10:10:49,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 20: [2023-05-10 10:10:49,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 20: [2023-05-10 10:10:49,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 20: [2023-05-10 10:10:49,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 20: [2023-05-10 10:10:49,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 20: [2023-05-10 10:10:49,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 24: [2023-05-10 10:10:49,262] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 20: [2023-05-10 10:10:49,262] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 31: [2023-05-10 10:10:49,262] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 5: [2023-05-10 10:10:49,263] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 31: [2023-05-10 10:10:49,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 31: [2023-05-10 10:10:49,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 31: [2023-05-10 10:10:49,265] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 31: [2023-05-10 10:10:49,265] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 24: [2023-05-10 10:10:49,266] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 5: [2023-05-10 10:10:49,266] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 5: [2023-05-10 10:10:49,267] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 24: [2023-05-10 10:10:49,267] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 24: [2023-05-10 10:10:49,267] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 24: [2023-05-10 10:10:49,269] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 24: [2023-05-10 10:10:49,270] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 29: [2023-05-10 10:10:49,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 29: [2023-05-10 10:10:49,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 5: [2023-05-10 10:10:49,271] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 5: [2023-05-10 10:10:49,271] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 23: [2023-05-10 10:10:49,274] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 23: [2023-05-10 10:10:49,274] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 23: [2023-05-10 10:10:49,274] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 30: [2023-05-10 10:10:49,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 30: [2023-05-10 10:10:49,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 23: [2023-05-10 10:10:49,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 23: [2023-05-10 10:10:49,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 23: [2023-05-10 10:10:49,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 23: [2023-05-10 10:10:49,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 23: [2023-05-10 10:10:49,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 30: [2023-05-10 10:10:49,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 30: [2023-05-10 10:10:49,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 30: [2023-05-10 10:10:49,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 30: [2023-05-10 10:10:49,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 30: [2023-05-10 10:10:49,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 30: [2023-05-10 10:10:49,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 29: [2023-05-10 10:10:49,288] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 29: [2023-05-10 10:10:49,291] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 29: [2023-05-10 10:10:49,291] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 29: [2023-05-10 10:10:49,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 29: [2023-05-10 10:10:49,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 29: [2023-05-10 10:10:49,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 29: [2023-05-10 10:10:49,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 29: [2023-05-10 10:10:49,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 21: [2023-05-10 10:10:49,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 21: [2023-05-10 10:10:49,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 23: [2023-05-10 10:10:49,296] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 21: [2023-05-10 10:10:49,297] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 30: [2023-05-10 10:10:49,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 30: [2023-05-10 10:10:49,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 30: [2023-05-10 10:10:49,298] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 30: [2023-05-10 10:10:49,298] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 30: [2023-05-10 10:10:49,298] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 30: [2023-05-10 10:10:49,298] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 23: [2023-05-10 10:10:49,299] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 23: [2023-05-10 10:10:49,299] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 30: [2023-05-10 10:10:49,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 30: [2023-05-10 10:10:49,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 10: [2023-05-10 10:10:49,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 10: [2023-05-10 10:10:49,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 10: [2023-05-10 10:10:49,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 20: [2023-05-10 10:10:49,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 10: [2023-05-10 10:10:49,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 10: [2023-05-10 10:10:49,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 10: [2023-05-10 10:10:49,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 10: [2023-05-10 10:10:49,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 10: [2023-05-10 10:10:49,303] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 22: [2023-05-10 10:10:49,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 22: [2023-05-10 10:10:49,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 22: [2023-05-10 10:10:49,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 22: [2023-05-10 10:10:49,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 22: [2023-05-10 10:10:49,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 22: [2023-05-10 10:10:49,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 0: [2023-05-10 10:10:49,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 22: [2023-05-10 10:10:49,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 0: [2023-05-10 10:10:49,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 0: [2023-05-10 10:10:49,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 22: [2023-05-10 10:10:49,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 17: [2023-05-10 10:10:49,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 0: [2023-05-10 10:10:49,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 0: [2023-05-10 10:10:49,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 0: [2023-05-10 10:10:49,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 0: [2023-05-10 10:10:49,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 0: [2023-05-10 10:10:49,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 17: [2023-05-10 10:10:49,306] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 17: [2023-05-10 10:10:49,306] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 17: [2023-05-10 10:10:49,306] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 17: [2023-05-10 10:10:49,306] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 17: [2023-05-10 10:10:49,306] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 17: [2023-05-10 10:10:49,306] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 17: [2023-05-10 10:10:49,306] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 21: [2023-05-10 10:10:49,309] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 10: [2023-05-10 10:10:49,309] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 0: [2023-05-10 10:10:49,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 21: [2023-05-10 10:10:49,309] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 21: [2023-05-10 10:10:49,309] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 21: [2023-05-10 10:10:49,309] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 0: [2023-05-10 10:10:49,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 10: [2023-05-10 10:10:49,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 17: [2023-05-10 10:10:49,309] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 21: [2023-05-10 10:10:49,310] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 0: [2023-05-10 10:10:49,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 22: [2023-05-10 10:10:49,309] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 10: [2023-05-10 10:10:49,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 10: [2023-05-10 10:10:49,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 10: [2023-05-10 10:10:49,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 13: [2023-05-10 10:10:49,310] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 13: [2023-05-10 10:10:49,310] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 13: [2023-05-10 10:10:49,310] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 22: [2023-05-10 10:10:49,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 13: [2023-05-10 10:10:49,311] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 13: [2023-05-10 10:10:49,311] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 13: [2023-05-10 10:10:49,311] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 13: [2023-05-10 10:10:49,311] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 13: [2023-05-10 10:10:49,311] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 10: [2023-05-10 10:10:49,311] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 10: [2023-05-10 10:10:49,311] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 0: [2023-05-10 10:10:49,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 10: [2023-05-10 10:10:49,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 0: [2023-05-10 10:10:49,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 0: [2023-05-10 10:10:49,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 0: [2023-05-10 10:10:49,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 29: [2023-05-10 10:10:49,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 0: [2023-05-10 10:10:49,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 23: [2023-05-10 10:10:49,314] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 23: [2023-05-10 10:10:49,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 22: [2023-05-10 10:10:49,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 22: [2023-05-10 10:10:49,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 22: [2023-05-10 10:10:49,316] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 21: [2023-05-10 10:10:49,316] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 23: [2023-05-10 10:10:49,316] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 22: [2023-05-10 10:10:49,316] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 22: [2023-05-10 10:10:49,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 22: [2023-05-10 10:10:49,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 17: [2023-05-10 10:10:49,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 17: [2023-05-10 10:10:49,318] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 17: [2023-05-10 10:10:49,318] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 15: [2023-05-10 10:10:49,318] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 23: [2023-05-10 10:10:49,318] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 15: [2023-05-10 10:10:49,318] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 23: [2023-05-10 10:10:49,318] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 15: [2023-05-10 10:10:49,318] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 17: [2023-05-10 10:10:49,318] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 15: [2023-05-10 10:10:49,318] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 15: [2023-05-10 10:10:49,318] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 15: [2023-05-10 10:10:49,318] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 15: [2023-05-10 10:10:49,318] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 15: [2023-05-10 10:10:49,319] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 17: [2023-05-10 10:10:49,319] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 21: [2023-05-10 10:10:49,319] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 17: [2023-05-10 10:10:49,320] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 17: [2023-05-10 10:10:49,320] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 8: [2023-05-10 10:10:49,321] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 8: [2023-05-10 10:10:49,321] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 8: [2023-05-10 10:10:49,321] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 8: [2023-05-10 10:10:49,321] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 8: [2023-05-10 10:10:49,321] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 8: [2023-05-10 10:10:49,321] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 8: [2023-05-10 10:10:49,321] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 8: [2023-05-10 10:10:49,322] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 20: [2023-05-10 10:10:49,322] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 15: [2023-05-10 10:10:49,323] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 29: [2023-05-10 10:10:49,323] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 13: [2023-05-10 10:10:49,323] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 29: [2023-05-10 10:10:49,323] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 13: [2023-05-10 10:10:49,323] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 13: [2023-05-10 10:10:49,324] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 29: [2023-05-10 10:10:49,324] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 29: [2023-05-10 10:10:49,324] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 13: [2023-05-10 10:10:49,324] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 21: [2023-05-10 10:10:49,325] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 13: [2023-05-10 10:10:49,325] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 13: [2023-05-10 10:10:49,325] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 13: [2023-05-10 10:10:49,326] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 13: [2023-05-10 10:10:49,326] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 29: [2023-05-10 10:10:49,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 8: [2023-05-10 10:10:49,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 8: [2023-05-10 10:10:49,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 8: [2023-05-10 10:10:49,328] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 8: [2023-05-10 10:10:49,328] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 8: [2023-05-10 10:10:49,328] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 1: [2023-05-10 10:10:49,328] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 1: [2023-05-10 10:10:49,328] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 1: [2023-05-10 10:10:49,328] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 1: [2023-05-10 10:10:49,328] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 1: [2023-05-10 10:10:49,328] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 1: [2023-05-10 10:10:49,328] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 1: [2023-05-10 10:10:49,328] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 8: [2023-05-10 10:10:49,328] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 8: [2023-05-10 10:10:49,328] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 1: [2023-05-10 10:10:49,329] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 8: [2023-05-10 10:10:49,329] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 15: [2023-05-10 10:10:49,329] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 15: [2023-05-10 10:10:49,329] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 15: [2023-05-10 10:10:49,329] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 15: [2023-05-10 10:10:49,330] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 15: [2023-05-10 10:10:49,330] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 15: [2023-05-10 10:10:49,331] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 15: [2023-05-10 10:10:49,331] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 20: [2023-05-10 10:10:49,331] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 20: [2023-05-10 10:10:49,331] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 1: [2023-05-10 10:10:49,334] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 1: [2023-05-10 10:10:49,335] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 1: [2023-05-10 10:10:49,336] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 1: [2023-05-10 10:10:49,336] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 1: [2023-05-10 10:10:49,336] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 1: [2023-05-10 10:10:49,337] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 1: [2023-05-10 10:10:49,338] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 1: [2023-05-10 10:10:49,338] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 21: [2023-05-10 10:10:49,338] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 21: [2023-05-10 10:10:49,339] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 21: [2023-05-10 10:10:49,339] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 20: [2023-05-10 10:10:49,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 20: [2023-05-10 10:10:49,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 20: [2023-05-10 10:10:49,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 20: [2023-05-10 10:10:49,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 20: [2023-05-10 10:10:49,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 21: [2023-05-10 10:10:49,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 21: [2023-05-10 10:10:49,342] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 20: [2023-05-10 10:10:49,359] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 20: [2023-05-10 10:10:49,359] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 17: [2023-05-10 10:10:49,359] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 27: [2023-05-10 10:10:49,363] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 27: [2023-05-10 10:10:49,363] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 27: [2023-05-10 10:10:49,363] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 27: [2023-05-10 10:10:49,363] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 27: [2023-05-10 10:10:49,363] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 27: [2023-05-10 10:10:49,363] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 27: [2023-05-10 10:10:49,364] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 10: [2023-05-10 10:10:49,363] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 10: [2023-05-10 10:10:49,363] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 27: [2023-05-10 10:10:49,364] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 22: [2023-05-10 10:10:49,364] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 22: [2023-05-10 10:10:49,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 16: [2023-05-10 10:10:49,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 16: [2023-05-10 10:10:49,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 16: [2023-05-10 10:10:49,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 16: [2023-05-10 10:10:49,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 16: [2023-05-10 10:10:49,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 16: [2023-05-10 10:10:49,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 16: [2023-05-10 10:10:49,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 4: [2023-05-10 10:10:49,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 4: [2023-05-10 10:10:49,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 4: [2023-05-10 10:10:49,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 4: [2023-05-10 10:10:49,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 4: [2023-05-10 10:10:49,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 4: [2023-05-10 10:10:49,367] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 4: [2023-05-10 10:10:49,367] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 16: [2023-05-10 10:10:49,367] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 10: [2023-05-10 10:10:49,367] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 4: [2023-05-10 10:10:49,367] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 0: [2023-05-10 10:10:49,368] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 0: [2023-05-10 10:10:49,368] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 20: [2023-05-10 10:10:49,368] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 0: [2023-05-10 10:10:49,368] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 27: [2023-05-10 10:10:49,369] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 27: [2023-05-10 10:10:49,369] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 10: [2023-05-10 10:10:49,370] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 16: [2023-05-10 10:10:49,372] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 30: [2023-05-10 10:10:49,372] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 30: [2023-05-10 10:10:49,372] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 16: [2023-05-10 10:10:49,372] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 15: [2023-05-10 10:10:49,372] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 4: [2023-05-10 10:10:49,372] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 4: [2023-05-10 10:10:49,372] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 4: [2023-05-10 10:10:49,372] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 27: [2023-05-10 10:10:49,372] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 4: [2023-05-10 10:10:49,373] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 27: [2023-05-10 10:10:49,373] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 27: [2023-05-10 10:10:49,373] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 4: [2023-05-10 10:10:49,374] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 27: [2023-05-10 10:10:49,374] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 4: [2023-05-10 10:10:49,374] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 27: [2023-05-10 10:10:49,374] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 4: [2023-05-10 10:10:49,374] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 27: [2023-05-10 10:10:49,374] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 4: [2023-05-10 10:10:49,374] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 16: [2023-05-10 10:10:49,374] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 16: [2023-05-10 10:10:49,374] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 16: [2023-05-10 10:10:49,374] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 20: [2023-05-10 10:10:49,375] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 16: [2023-05-10 10:10:49,375] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 16: [2023-05-10 10:10:49,376] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 20: [2023-05-10 10:10:49,376] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 16: [2023-05-10 10:10:49,376] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 10: [2023-05-10 10:10:49,377] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 10: [2023-05-10 10:10:49,377] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 10: [2023-05-10 10:10:49,377] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 10: [2023-05-10 10:10:49,377] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 20: [2023-05-10 10:10:49,377] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 20: [2023-05-10 10:10:49,377] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 0: [2023-05-10 10:10:49,380] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 0: [2023-05-10 10:10:49,380] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 0: [2023-05-10 10:10:49,380] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 0: [2023-05-10 10:10:49,380] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 0: [2023-05-10 10:10:49,380] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 8: [2023-05-10 10:10:49,380] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 17: [2023-05-10 10:10:49,381] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 8: [2023-05-10 10:10:49,383] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 8: [2023-05-10 10:10:49,383] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 30: [2023-05-10 10:10:49,386] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 30: [2023-05-10 10:10:49,386] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 30: [2023-05-10 10:10:49,386] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 22: [2023-05-10 10:10:49,386] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 22: [2023-05-10 10:10:49,386] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 30: [2023-05-10 10:10:49,386] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 30: [2023-05-10 10:10:49,386] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 30: [2023-05-10 10:10:49,388] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 22: [2023-05-10 10:10:49,388] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 22: [2023-05-10 10:10:49,388] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 22: [2023-05-10 10:10:49,388] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 22: [2023-05-10 10:10:49,388] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 22: [2023-05-10 10:10:49,389] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 1: [2023-05-10 10:10:49,389] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 1: [2023-05-10 10:10:49,389] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 0: [2023-05-10 10:10:49,389] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 17: [2023-05-10 10:10:49,390] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 17: [2023-05-10 10:10:49,390] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 22: [2023-05-10 10:10:49,391] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 10: [2023-05-10 10:10:49,391] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 0: [2023-05-10 10:10:49,393] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 0: [2023-05-10 10:10:49,393] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 8: [2023-05-10 10:10:49,393] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 8: [2023-05-10 10:10:49,393] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 8: [2023-05-10 10:10:49,393] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 1: [2023-05-10 10:10:49,393] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 8: [2023-05-10 10:10:49,394] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 8: [2023-05-10 10:10:49,394] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 15: [2023-05-10 10:10:49,394] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 10: [2023-05-10 10:10:49,395] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 18: [2023-05-10 10:10:49,395] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 18: [2023-05-10 10:10:49,395] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 30: [2023-05-10 10:10:49,395] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 30: [2023-05-10 10:10:49,395] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 18: [2023-05-10 10:10:49,395] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 18: [2023-05-10 10:10:49,396] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 18: [2023-05-10 10:10:49,396] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 18: [2023-05-10 10:10:49,396] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 18: [2023-05-10 10:10:49,396] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 18: [2023-05-10 10:10:49,396] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 17: [2023-05-10 10:10:49,397] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 17: [2023-05-10 10:10:49,397] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 17: [2023-05-10 10:10:49,397] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 17: [2023-05-10 10:10:49,397] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 17: [2023-05-10 10:10:49,398] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 13: [2023-05-10 10:10:49,399] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 10: [2023-05-10 10:10:49,399] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 13: [2023-05-10 10:10:49,399] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 13: [2023-05-10 10:10:49,399] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 10: [2023-05-10 10:10:49,401] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 18: [2023-05-10 10:10:49,400] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 18: [2023-05-10 10:10:49,400] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 15: [2023-05-10 10:10:49,401] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 15: [2023-05-10 10:10:49,401] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 18: [2023-05-10 10:10:49,401] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 18: [2023-05-10 10:10:49,401] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 1: [2023-05-10 10:10:49,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 1: [2023-05-10 10:10:49,403] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 6: [2023-05-10 10:10:49,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 6: [2023-05-10 10:10:49,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 6: [2023-05-10 10:10:49,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 6: [2023-05-10 10:10:49,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 6: [2023-05-10 10:10:49,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 6: [2023-05-10 10:10:49,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 6: [2023-05-10 10:10:49,403] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 6: [2023-05-10 10:10:49,403] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 1: [2023-05-10 10:10:49,404] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 1: [2023-05-10 10:10:49,404] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 1: [2023-05-10 10:10:49,404] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 18: [2023-05-10 10:10:49,403] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 18: [2023-05-10 10:10:49,403] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 8: [2023-05-10 10:10:49,404] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 18: [2023-05-10 10:10:49,404] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 18: [2023-05-10 10:10:49,405] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 6: [2023-05-10 10:10:49,406] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 6: [2023-05-10 10:10:49,407] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 15: [2023-05-10 10:10:49,407] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 15: [2023-05-10 10:10:49,408] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 15: [2023-05-10 10:10:49,408] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 15: [2023-05-10 10:10:49,408] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 15: [2023-05-10 10:10:49,408] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 10: [2023-05-10 10:10:49,408] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 9: [2023-05-10 10:10:49,408] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 9: [2023-05-10 10:10:49,408] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 28: [2023-05-10 10:10:49,408] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 28: [2023-05-10 10:10:49,408] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 10: [2023-05-10 10:10:49,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 10: [2023-05-10 10:10:49,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 10: [2023-05-10 10:10:49,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 0: [2023-05-10 10:10:49,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 0: [2023-05-10 10:10:49,411] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 8: [2023-05-10 10:10:49,411] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 8: [2023-05-10 10:10:49,411] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 0: [2023-05-10 10:10:49,411] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 1: [2023-05-10 10:10:49,412] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 9: [2023-05-10 10:10:49,412] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 9: [2023-05-10 10:10:49,412] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 9: [2023-05-10 10:10:49,412] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 9: [2023-05-10 10:10:49,412] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 9: [2023-05-10 10:10:49,412] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 9: [2023-05-10 10:10:49,412] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 1: [2023-05-10 10:10:49,412] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 6: [2023-05-10 10:10:49,412] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 12: [2023-05-10 10:10:49,412] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 12: [2023-05-10 10:10:49,412] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 12: [2023-05-10 10:10:49,412] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 12: [2023-05-10 10:10:49,413] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 12: [2023-05-10 10:10:49,413] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 12: [2023-05-10 10:10:49,413] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 12: [2023-05-10 10:10:49,413] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 12: [2023-05-10 10:10:49,413] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 28: [2023-05-10 10:10:49,413] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 28: [2023-05-10 10:10:49,413] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 28: [2023-05-10 10:10:49,413] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 28: [2023-05-10 10:10:49,413] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 28: [2023-05-10 10:10:49,413] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 28: [2023-05-10 10:10:49,413] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 28: [2023-05-10 10:10:49,413] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 28: [2023-05-10 10:10:49,413] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 6: [2023-05-10 10:10:49,413] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 0: [2023-05-10 10:10:49,413] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 0: [2023-05-10 10:10:49,413] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 13: [2023-05-10 10:10:49,413] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 13: [2023-05-10 10:10:49,413] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 13: [2023-05-10 10:10:49,413] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 13: [2023-05-10 10:10:49,413] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 13: [2023-05-10 10:10:49,413] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 6: [2023-05-10 10:10:49,414] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 9: [2023-05-10 10:10:49,414] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 6: [2023-05-10 10:10:49,414] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 9: [2023-05-10 10:10:49,414] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 6: [2023-05-10 10:10:49,414] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 6: [2023-05-10 10:10:49,415] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 12: [2023-05-10 10:10:49,417] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 22: [2023-05-10 10:10:49,418] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 22: [2023-05-10 10:10:49,418] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 22: [2023-05-10 10:10:49,418] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 9: [2023-05-10 10:10:49,418] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 8: [2023-05-10 10:10:49,419] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 17: [2023-05-10 10:10:49,419] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 17: [2023-05-10 10:10:49,419] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 9: [2023-05-10 10:10:49,419] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 9: [2023-05-10 10:10:49,419] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 9: [2023-05-10 10:10:49,419] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 1: [2023-05-10 10:10:49,420] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 8: [2023-05-10 10:10:49,420] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 9: [2023-05-10 10:10:49,421] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 9: [2023-05-10 10:10:49,421] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 8: [2023-05-10 10:10:49,422] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 8: [2023-05-10 10:10:49,422] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 28: [2023-05-10 10:10:49,422] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 28: [2023-05-10 10:10:49,422] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 22: [2023-05-10 10:10:49,422] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 8: [2023-05-10 10:10:49,423] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 12: [2023-05-10 10:10:49,422] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 12: [2023-05-10 10:10:49,423] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 12: [2023-05-10 10:10:49,423] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 28: [2023-05-10 10:10:49,423] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 27: [2023-05-10 10:10:49,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 28: [2023-05-10 10:10:49,424] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 28: [2023-05-10 10:10:49,424] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 28: [2023-05-10 10:10:49,424] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 30: [2023-05-10 10:10:49,424] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 12: [2023-05-10 10:10:49,424] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 16: [2023-05-10 10:10:49,424] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 27: [2023-05-10 10:10:49,424] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 12: [2023-05-10 10:10:49,425] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 12: [2023-05-10 10:10:49,425] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 13: [2023-05-10 10:10:49,425] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 12: [2023-05-10 10:10:49,425] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 22: [2023-05-10 10:10:49,425] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 22: [2023-05-10 10:10:49,425] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 13: [2023-05-10 10:10:49,426] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 16: [2023-05-10 10:10:49,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 13: [2023-05-10 10:10:49,426] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 30: [2023-05-10 10:10:49,428] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 30: [2023-05-10 10:10:49,428] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 30: [2023-05-10 10:10:49,428] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 30: [2023-05-10 10:10:49,428] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 30: [2023-05-10 10:10:49,428] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 17: [2023-05-10 10:10:49,429] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 1: [2023-05-10 10:10:49,429] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 17: [2023-05-10 10:10:49,430] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 15: [2023-05-10 10:10:49,430] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 15: [2023-05-10 10:10:49,431] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 4: [2023-05-10 10:10:49,431] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 1: [2023-05-10 10:10:49,431] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 4: [2023-05-10 10:10:49,431] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 4: [2023-05-10 10:10:49,431] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 17: [2023-05-10 10:10:49,431] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 4: [2023-05-10 10:10:49,431] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 1: [2023-05-10 10:10:49,433] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 17: [2023-05-10 10:10:49,433] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 17: [2023-05-10 10:10:49,433] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 1: [2023-05-10 10:10:49,434] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 1: [2023-05-10 10:10:49,435] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 4: [2023-05-10 10:10:49,436] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 25: [2023-05-10 10:10:49,435] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 25: [2023-05-10 10:10:49,435] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 4: [2023-05-10 10:10:49,436] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 4: [2023-05-10 10:10:49,436] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 4: [2023-05-10 10:10:49,436] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 25: [2023-05-10 10:10:49,437] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 25: [2023-05-10 10:10:49,437] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 25: [2023-05-10 10:10:49,437] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 25: [2023-05-10 10:10:49,438] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 25: [2023-05-10 10:10:49,438] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 25: [2023-05-10 10:10:49,438] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 25: [2023-05-10 10:10:49,442] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 25: [2023-05-10 10:10:49,442] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 27: [2023-05-10 10:10:49,442] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 16: [2023-05-10 10:10:49,444] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 25: [2023-05-10 10:10:49,444] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 15: [2023-05-10 10:10:49,444] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 25: [2023-05-10 10:10:49,444] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 25: [2023-05-10 10:10:49,445] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 25: [2023-05-10 10:10:49,445] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 25: [2023-05-10 10:10:49,445] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 16: [2023-05-10 10:10:49,445] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 15: [2023-05-10 10:10:49,445] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 27: [2023-05-10 10:10:49,446] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 27: [2023-05-10 10:10:49,446] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 27: [2023-05-10 10:10:49,446] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 27: [2023-05-10 10:10:49,446] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 27: [2023-05-10 10:10:49,446] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 15: [2023-05-10 10:10:49,446] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 25: [2023-05-10 10:10:49,446] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 27: [2023-05-10 10:10:49,446] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 15: [2023-05-10 10:10:49,446] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 15: [2023-05-10 10:10:49,446] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 13: [2023-05-10 10:10:49,447] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 13: [2023-05-10 10:10:49,447] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 13: [2023-05-10 10:10:49,447] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 16: [2023-05-10 10:10:49,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 16: [2023-05-10 10:10:49,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 16: [2023-05-10 10:10:49,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 16: [2023-05-10 10:10:49,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 27: [2023-05-10 10:10:49,448] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 13: [2023-05-10 10:10:49,449] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 16: [2023-05-10 10:10:49,450] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 13: [2023-05-10 10:10:49,450] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 16: [2023-05-10 10:10:49,455] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 4: [2023-05-10 10:10:49,456] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 2: [2023-05-10 10:10:49,457] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 2: [2023-05-10 10:10:49,457] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 6: [2023-05-10 10:10:49,457] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 6: [2023-05-10 10:10:49,457] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 2: [2023-05-10 10:10:49,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 2: [2023-05-10 10:10:49,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 2: [2023-05-10 10:10:49,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 2: [2023-05-10 10:10:49,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 2: [2023-05-10 10:10:49,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 2: [2023-05-10 10:10:49,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 18: [2023-05-10 10:10:49,457] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 18: [2023-05-10 10:10:49,457] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 18: [2023-05-10 10:10:49,457] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 18: [2023-05-10 10:10:49,457] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 4: [2023-05-10 10:10:49,458] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 4: [2023-05-10 10:10:49,459] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 4: [2023-05-10 10:10:49,462] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 18: [2023-05-10 10:10:49,462] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 18: [2023-05-10 10:10:49,462] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 2: [2023-05-10 10:10:49,462] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 2: [2023-05-10 10:10:49,462] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 18: [2023-05-10 10:10:49,463] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 18: [2023-05-10 10:10:49,463] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 19: [2023-05-10 10:10:49,464] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 19: [2023-05-10 10:10:49,464] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 4: [2023-05-10 10:10:49,464] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 19: [2023-05-10 10:10:49,465] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 28: [2023-05-10 10:10:49,464] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 28: [2023-05-10 10:10:49,464] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 19: [2023-05-10 10:10:49,465] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 19: [2023-05-10 10:10:49,465] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 19: [2023-05-10 10:10:49,465] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 19: [2023-05-10 10:10:49,466] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 19: [2023-05-10 10:10:49,466] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 14: [2023-05-10 10:10:49,466] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 14: [2023-05-10 10:10:49,466] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 14: [2023-05-10 10:10:49,466] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 14: [2023-05-10 10:10:49,466] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 14: [2023-05-10 10:10:49,466] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 14: [2023-05-10 10:10:49,466] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 14: [2023-05-10 10:10:49,466] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 14: [2023-05-10 10:10:49,467] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 2: [2023-05-10 10:10:49,468] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 2: [2023-05-10 10:10:49,469] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 27: [2023-05-10 10:10:49,469] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 2: [2023-05-10 10:10:49,469] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 19: [2023-05-10 10:10:49,469] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 19: [2023-05-10 10:10:49,469] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 12: [2023-05-10 10:10:49,469] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 2: [2023-05-10 10:10:49,470] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 19: [2023-05-10 10:10:49,470] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 2: [2023-05-10 10:10:49,470] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 4: [2023-05-10 10:10:49,470] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 2: [2023-05-10 10:10:49,470] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 4: [2023-05-10 10:10:49,470] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 4: [2023-05-10 10:10:49,470] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 16: [2023-05-10 10:10:49,471] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 9: [2023-05-10 10:10:49,472] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 9: [2023-05-10 10:10:49,472] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 14: [2023-05-10 10:10:49,472] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 14: [2023-05-10 10:10:49,472] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 14: [2023-05-10 10:10:49,473] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 14: [2023-05-10 10:10:49,474] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 19: [2023-05-10 10:10:49,474] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 14: [2023-05-10 10:10:49,475] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 19: [2023-05-10 10:10:49,475] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 16: [2023-05-10 10:10:49,475] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 19: [2023-05-10 10:10:49,476] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 14: [2023-05-10 10:10:49,476] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 19: [2023-05-10 10:10:49,476] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 19: [2023-05-10 10:10:49,476] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 14: [2023-05-10 10:10:49,476] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 14: [2023-05-10 10:10:49,476] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 9: [2023-05-10 10:10:49,476] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 9: [2023-05-10 10:10:49,476] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 9: [2023-05-10 10:10:49,476] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 9: [2023-05-10 10:10:49,476] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 6: [2023-05-10 10:10:49,478] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 6: [2023-05-10 10:10:49,478] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 27: [2023-05-10 10:10:49,478] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 27: [2023-05-10 10:10:49,478] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 9: [2023-05-10 10:10:49,478] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 9: [2023-05-10 10:10:49,478] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 27: [2023-05-10 10:10:49,479] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 27: [2023-05-10 10:10:49,479] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 27: [2023-05-10 10:10:49,479] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 6: [2023-05-10 10:10:49,480] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 16: [2023-05-10 10:10:49,480] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 16: [2023-05-10 10:10:49,480] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 6: [2023-05-10 10:10:49,480] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 16: [2023-05-10 10:10:49,481] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 18: [2023-05-10 10:10:49,483] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 18: [2023-05-10 10:10:49,484] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 28: [2023-05-10 10:10:49,486] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 28: [2023-05-10 10:10:49,486] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 6: [2023-05-10 10:10:49,486] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 6: [2023-05-10 10:10:49,486] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 6: [2023-05-10 10:10:49,486] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 6: [2023-05-10 10:10:49,486] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 16: [2023-05-10 10:10:49,487] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 18: [2023-05-10 10:10:49,486] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 18: [2023-05-10 10:10:49,488] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 18: [2023-05-10 10:10:49,489] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 28: [2023-05-10 10:10:49,490] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 28: [2023-05-10 10:10:49,491] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 28: [2023-05-10 10:10:49,491] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 28: [2023-05-10 10:10:49,491] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 28: [2023-05-10 10:10:49,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 28: [2023-05-10 10:10:49,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 12: [2023-05-10 10:10:49,493] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 18: [2023-05-10 10:10:49,493] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 18: [2023-05-10 10:10:49,494] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 18: [2023-05-10 10:10:49,494] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 12: [2023-05-10 10:10:49,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 12: [2023-05-10 10:10:49,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 26: [2023-05-10 10:10:49,497] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 26: [2023-05-10 10:10:49,497] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 3: [2023-05-10 10:10:49,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 3: [2023-05-10 10:10:49,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 3: [2023-05-10 10:10:49,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 3: [2023-05-10 10:10:49,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 9: [2023-05-10 10:10:49,499] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 26: [2023-05-10 10:10:49,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 26: [2023-05-10 10:10:49,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 3: [2023-05-10 10:10:49,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 26: [2023-05-10 10:10:49,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 26: [2023-05-10 10:10:49,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 3: [2023-05-10 10:10:49,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 3: [2023-05-10 10:10:49,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 26: [2023-05-10 10:10:49,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 3: [2023-05-10 10:10:49,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 26: [2023-05-10 10:10:49,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 9: [2023-05-10 10:10:49,499] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 11: [2023-05-10 10:10:49,500] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 11: [2023-05-10 10:10:49,500] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 11: [2023-05-10 10:10:49,500] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 11: [2023-05-10 10:10:49,500] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 11: [2023-05-10 10:10:49,500] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 11: [2023-05-10 10:10:49,500] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 11: [2023-05-10 10:10:49,500] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 11: [2023-05-10 10:10:49,500] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 25: [2023-05-10 10:10:49,500] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 25: [2023-05-10 10:10:49,500] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 25: [2023-05-10 10:10:49,500] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 26: [2023-05-10 10:10:49,502] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 26: [2023-05-10 10:10:49,502] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 9: [2023-05-10 10:10:49,503] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 3: [2023-05-10 10:10:49,503] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 3: [2023-05-10 10:10:49,504] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 3: [2023-05-10 10:10:49,504] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 9: [2023-05-10 10:10:49,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 6: [2023-05-10 10:10:49,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 6: [2023-05-10 10:10:49,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 11: [2023-05-10 10:10:49,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 11: [2023-05-10 10:10:49,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 9: [2023-05-10 10:10:49,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 9: [2023-05-10 10:10:49,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 25: [2023-05-10 10:10:49,505] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 25: [2023-05-10 10:10:49,505] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 25: [2023-05-10 10:10:49,505] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 25: [2023-05-10 10:10:49,505] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 25: [2023-05-10 10:10:49,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 9: [2023-05-10 10:10:49,506] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 12: [2023-05-10 10:10:49,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 12: [2023-05-10 10:10:49,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 12: [2023-05-10 10:10:49,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 12: [2023-05-10 10:10:49,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 12: [2023-05-10 10:10:49,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 9: [2023-05-10 10:10:49,507] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 3: [2023-05-10 10:10:49,508] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 3: [2023-05-10 10:10:49,508] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 3: [2023-05-10 10:10:49,508] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 26: [2023-05-10 10:10:49,509] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 11: [2023-05-10 10:10:49,508] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 11: [2023-05-10 10:10:49,509] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 3: [2023-05-10 10:10:49,509] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 3: [2023-05-10 10:10:49,509] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 26: [2023-05-10 10:10:49,509] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 11: [2023-05-10 10:10:49,510] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 11: [2023-05-10 10:10:49,510] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 26: [2023-05-10 10:10:49,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 26: [2023-05-10 10:10:49,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 11: [2023-05-10 10:10:49,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 11: [2023-05-10 10:10:49,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 26: [2023-05-10 10:10:49,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 26: [2023-05-10 10:10:49,512] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 2: [2023-05-10 10:10:49,513] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 2: [2023-05-10 10:10:49,513] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 6: [2023-05-10 10:10:49,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 6: [2023-05-10 10:10:49,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 6: [2023-05-10 10:10:49,517] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 6: [2023-05-10 10:10:49,517] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 12: [2023-05-10 10:10:49,518] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 24: [2023-05-10 10:10:49,518] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 24: [2023-05-10 10:10:49,518] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 24: [2023-05-10 10:10:49,519] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 24: [2023-05-10 10:10:49,519] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 24: [2023-05-10 10:10:49,519] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 24: [2023-05-10 10:10:49,519] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 24: [2023-05-10 10:10:49,519] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 24: [2023-05-10 10:10:49,519] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 28: [2023-05-10 10:10:49,519] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 12: [2023-05-10 10:10:49,520] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 24: [2023-05-10 10:10:49,523] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 24: [2023-05-10 10:10:49,523] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 19: [2023-05-10 10:10:49,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 19: [2023-05-10 10:10:49,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 19: [2023-05-10 10:10:49,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 25: [2023-05-10 10:10:49,525] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 28: [2023-05-10 10:10:49,525] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 28: [2023-05-10 10:10:49,525] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 24: [2023-05-10 10:10:49,526] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 24: [2023-05-10 10:10:49,526] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 24: [2023-05-10 10:10:49,526] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 28: [2023-05-10 10:10:49,527] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 24: [2023-05-10 10:10:49,527] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 24: [2023-05-10 10:10:49,527] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 24: [2023-05-10 10:10:49,527] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 28: [2023-05-10 10:10:49,527] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 28: [2023-05-10 10:10:49,527] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 14: [2023-05-10 10:10:49,527] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 14: [2023-05-10 10:10:49,527] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 25: [2023-05-10 10:10:49,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 25: [2023-05-10 10:10:49,529] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 5: [2023-05-10 10:10:49,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 14: [2023-05-10 10:10:49,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 5: [2023-05-10 10:10:49,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 5: [2023-05-10 10:10:49,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 5: [2023-05-10 10:10:49,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 5: [2023-05-10 10:10:49,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 5: [2023-05-10 10:10:49,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 5: [2023-05-10 10:10:49,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 5: [2023-05-10 10:10:49,531] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 2: [2023-05-10 10:10:49,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 2: [2023-05-10 10:10:49,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 25: [2023-05-10 10:10:49,535] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 5: [2023-05-10 10:10:49,535] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 25: [2023-05-10 10:10:49,535] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 19: [2023-05-10 10:10:49,536] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 19: [2023-05-10 10:10:49,536] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 5: [2023-05-10 10:10:49,536] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 5: [2023-05-10 10:10:49,536] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 25: [2023-05-10 10:10:49,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 25: [2023-05-10 10:10:49,538] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 25: [2023-05-10 10:10:49,538] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 2: [2023-05-10 10:10:49,539] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 2: [2023-05-10 10:10:49,539] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 2: [2023-05-10 10:10:49,539] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 19: [2023-05-10 10:10:49,539] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 19: [2023-05-10 10:10:49,539] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 19: [2023-05-10 10:10:49,539] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 5: [2023-05-10 10:10:49,539] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 5: [2023-05-10 10:10:49,539] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 14: [2023-05-10 10:10:49,540] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 12: [2023-05-10 10:10:49,540] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 14: [2023-05-10 10:10:49,540] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 14: [2023-05-10 10:10:49,540] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 14: [2023-05-10 10:10:49,540] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 5: [2023-05-10 10:10:49,540] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 5: [2023-05-10 10:10:49,541] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 14: [2023-05-10 10:10:49,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 5: [2023-05-10 10:10:49,541] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 12: [2023-05-10 10:10:49,541] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 2: [2023-05-10 10:10:49,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 2: [2023-05-10 10:10:49,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 2: [2023-05-10 10:10:49,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 12: [2023-05-10 10:10:49,544] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 12: [2023-05-10 10:10:49,544] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 12: [2023-05-10 10:10:49,544] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 19: [2023-05-10 10:10:49,547] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 19: [2023-05-10 10:10:49,549] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 19: [2023-05-10 10:10:49,550] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 14: [2023-05-10 10:10:49,551] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 14: [2023-05-10 10:10:49,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 26: [2023-05-10 10:10:49,554] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 26: [2023-05-10 10:10:49,554] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 7: [2023-05-10 10:10:49,554] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 7: [2023-05-10 10:10:49,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 7: [2023-05-10 10:10:49,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 7: [2023-05-10 10:10:49,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 7: [2023-05-10 10:10:49,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 7: [2023-05-10 10:10:49,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 7: [2023-05-10 10:10:49,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 7: [2023-05-10 10:10:49,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 14: [2023-05-10 10:10:49,556] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 31: [2023-05-10 10:10:49,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 3: [2023-05-10 10:10:49,557] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 3: [2023-05-10 10:10:49,557] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 3: [2023-05-10 10:10:49,557] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 31: [2023-05-10 10:10:49,557] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 31: [2023-05-10 10:10:49,557] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 11: [2023-05-10 10:10:49,557] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 11: [2023-05-10 10:10:49,557] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 31: [2023-05-10 10:10:49,557] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 31: [2023-05-10 10:10:49,557] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 31: [2023-05-10 10:10:49,557] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 31: [2023-05-10 10:10:49,557] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 31: [2023-05-10 10:10:49,558] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 7: [2023-05-10 10:10:49,560] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 7: [2023-05-10 10:10:49,560] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 31: [2023-05-10 10:10:49,560] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 7: [2023-05-10 10:10:49,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 31: [2023-05-10 10:10:49,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 7: [2023-05-10 10:10:49,562] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 7: [2023-05-10 10:10:49,562] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 7: [2023-05-10 10:10:49,562] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 7: [2023-05-10 10:10:49,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 7: [2023-05-10 10:10:49,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt... 19: [2023-05-10 10:10:49,564] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 31: [2023-05-10 10:10:49,566] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 31: [2023-05-10 10:10:49,566] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 31: [2023-05-10 10:10:49,566] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 31: [2023-05-10 10:10:49,567] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 2: [2023-05-10 10:10:49,567] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 19: [2023-05-10 10:10:49,567] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 3: [2023-05-10 10:10:49,567] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 31: [2023-05-10 10:10:49,568] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 19: [2023-05-10 10:10:49,568] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 31: [2023-05-10 10:10:49,568] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 19: [2023-05-10 10:10:49,569] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 2: [2023-05-10 10:10:49,569] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 19: [2023-05-10 10:10:49,569] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 14: [2023-05-10 10:10:49,569] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 14: [2023-05-10 10:10:49,570] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 14: [2023-05-10 10:10:49,570] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 2: [2023-05-10 10:10:49,571] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 14: [2023-05-10 10:10:49,572] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 2: [2023-05-10 10:10:49,573] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 2: [2023-05-10 10:10:49,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 3: [2023-05-10 10:10:49,574] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 3: [2023-05-10 10:10:49,574] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 3: [2023-05-10 10:10:49,574] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 3: [2023-05-10 10:10:49,574] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 2: [2023-05-10 10:10:49,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 14: [2023-05-10 10:10:49,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 24: [2023-05-10 10:10:49,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 24: [2023-05-10 10:10:49,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 26: [2023-05-10 10:10:49,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 26: [2023-05-10 10:10:49,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 26: [2023-05-10 10:10:49,578] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 11: [2023-05-10 10:10:49,578] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 11: [2023-05-10 10:10:49,579] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 11: [2023-05-10 10:10:49,579] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 3: [2023-05-10 10:10:49,579] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 26: [2023-05-10 10:10:49,579] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 11: [2023-05-10 10:10:49,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 11: [2023-05-10 10:10:49,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 3: [2023-05-10 10:10:49,580] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 3: [2023-05-10 10:10:49,580] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 11: [2023-05-10 10:10:49,580] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 26: [2023-05-10 10:10:49,580] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 11: [2023-05-10 10:10:49,580] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 11: [2023-05-10 10:10:49,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 26: [2023-05-10 10:10:49,582] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 26: [2023-05-10 10:10:49,582] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 26: [2023-05-10 10:10:49,582] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 5: [2023-05-10 10:10:49,588] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 5: [2023-05-10 10:10:49,590] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 5: [2023-05-10 10:10:49,590] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 3: [2023-05-10 10:10:49,590] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 22: [2023-05-10 10:10:49,595] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 22: [2023-05-10 10:10:49,595] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 22: [2023-05-10 10:10:49,595] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 22: [2023-05-10 10:10:49,596] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 22: [2023-05-10 10:10:49,596] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 22: [2023-05-10 10:10:49,596] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 22: [2023-05-10 10:10:49,596] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 22: [2023-05-10 10:10:49,596] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 24: [2023-05-10 10:10:49,598] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 24: [2023-05-10 10:10:49,598] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 24: [2023-05-10 10:10:49,598] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 24: [2023-05-10 10:10:49,598] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 24: [2023-05-10 10:10:49,598] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 24: [2023-05-10 10:10:49,598] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 24: [2023-05-10 10:10:49,599] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 24: [2023-05-10 10:10:49,600] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 22: [2023-05-10 10:10:49,601] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 3: [2023-05-10 10:10:49,601] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 26: [2023-05-10 10:10:49,602] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 22: [2023-05-10 10:10:49,602] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 3: [2023-05-10 10:10:49,603] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 3: [2023-05-10 10:10:49,603] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 3: [2023-05-10 10:10:49,604] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 5: [2023-05-10 10:10:49,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 5: [2023-05-10 10:10:49,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 26: [2023-05-10 10:10:49,604] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 22: [2023-05-10 10:10:49,605] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 22: [2023-05-10 10:10:49,605] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 22: [2023-05-10 10:10:49,606] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 26: [2023-05-10 10:10:49,607] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 5: [2023-05-10 10:10:49,607] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 5: [2023-05-10 10:10:49,607] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 5: [2023-05-10 10:10:49,607] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 22: [2023-05-10 10:10:49,607] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 22: [2023-05-10 10:10:49,607] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 11: [2023-05-10 10:10:49,608] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 22: [2023-05-10 10:10:49,608] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 11: [2023-05-10 10:10:49,609] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 11: [2023-05-10 10:10:49,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 26: [2023-05-10 10:10:49,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 5: [2023-05-10 10:10:49,611] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 26: [2023-05-10 10:10:49,611] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 31: [2023-05-10 10:10:49,611] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 31: [2023-05-10 10:10:49,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 11: [2023-05-10 10:10:49,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 11: [2023-05-10 10:10:49,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 26: [2023-05-10 10:10:49,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 21: [2023-05-10 10:10:49,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 21: [2023-05-10 10:10:49,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 21: [2023-05-10 10:10:49,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 21: [2023-05-10 10:10:49,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 21: [2023-05-10 10:10:49,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 21: [2023-05-10 10:10:49,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 21: [2023-05-10 10:10:49,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 11: [2023-05-10 10:10:49,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 21: [2023-05-10 10:10:49,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 5: [2023-05-10 10:10:49,618] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 5: [2023-05-10 10:10:49,618] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 7: [2023-05-10 10:10:49,619] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 7: [2023-05-10 10:10:49,619] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 7: [2023-05-10 10:10:49,619] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 7: [2023-05-10 10:10:49,619] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 21: [2023-05-10 10:10:49,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 21: [2023-05-10 10:10:49,620] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 21: [2023-05-10 10:10:49,621] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 21: [2023-05-10 10:10:49,621] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 21: [2023-05-10 10:10:49,622] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 21: [2023-05-10 10:10:49,622] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 21: [2023-05-10 10:10:49,622] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 7: [2023-05-10 10:10:49,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 7: [2023-05-10 10:10:49,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 7: [2023-05-10 10:10:49,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 7: [2023-05-10 10:10:49,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_06-model_00-model_states.pt. 21: [2023-05-10 10:10:49,623] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 24: [2023-05-10 10:10:49,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 5: [2023-05-10 10:10:49,630] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 24: [2023-05-10 10:10:49,630] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 24: [2023-05-10 10:10:49,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 24: [2023-05-10 10:10:49,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 31: [2023-05-10 10:10:49,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 31: [2023-05-10 10:10:49,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 5: [2023-05-10 10:10:49,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 31: [2023-05-10 10:10:49,633] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 5: [2023-05-10 10:10:49,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 24: [2023-05-10 10:10:49,634] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 5: [2023-05-10 10:10:49,634] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 24: [2023-05-10 10:10:49,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 5: [2023-05-10 10:10:49,636] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 31: [2023-05-10 10:10:49,639] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 31: [2023-05-10 10:10:49,639] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 31: [2023-05-10 10:10:49,639] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 31: [2023-05-10 10:10:49,639] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 31: [2023-05-10 10:10:49,639] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 7: [2023-05-10 10:10:49,646] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 7: [2023-05-10 10:10:49,649] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 7: [2023-05-10 10:10:49,649] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 7: [2023-05-10 10:10:49,651] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 7: [2023-05-10 10:10:49,651] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 7: [2023-05-10 10:10:49,651] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 7: [2023-05-10 10:10:49,652] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 7: [2023-05-10 10:10:49,652] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 22: [2023-05-10 10:10:49,654] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 17: [2023-05-10 10:10:49,655] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 0: [2023-05-10 10:10:49,655] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 0: [2023-05-10 10:10:49,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 0: [2023-05-10 10:10:49,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 0: [2023-05-10 10:10:49,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 0: [2023-05-10 10:10:49,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 0: [2023-05-10 10:10:49,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 0: [2023-05-10 10:10:49,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 0: [2023-05-10 10:10:49,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 17: [2023-05-10 10:10:49,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 17: [2023-05-10 10:10:49,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 22: [2023-05-10 10:10:49,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 17: [2023-05-10 10:10:49,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 17: [2023-05-10 10:10:49,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 17: [2023-05-10 10:10:49,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 17: [2023-05-10 10:10:49,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 15: [2023-05-10 10:10:49,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 23: [2023-05-10 10:10:49,655] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 23: [2023-05-10 10:10:49,655] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 23: [2023-05-10 10:10:49,655] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 23: [2023-05-10 10:10:49,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 23: [2023-05-10 10:10:49,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 17: [2023-05-10 10:10:49,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 23: [2023-05-10 10:10:49,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 23: [2023-05-10 10:10:49,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 23: [2023-05-10 10:10:49,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 15: [2023-05-10 10:10:49,657] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 10: [2023-05-10 10:10:49,657] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 10: [2023-05-10 10:10:49,657] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 10: [2023-05-10 10:10:49,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 10: [2023-05-10 10:10:49,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 10: [2023-05-10 10:10:49,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 10: [2023-05-10 10:10:49,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 10: [2023-05-10 10:10:49,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 10: [2023-05-10 10:10:49,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 15: [2023-05-10 10:10:49,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 15: [2023-05-10 10:10:49,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 15: [2023-05-10 10:10:49,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 15: [2023-05-10 10:10:49,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 15: [2023-05-10 10:10:49,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 15: [2023-05-10 10:10:49,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 17: [2023-05-10 10:10:49,659] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 0: [2023-05-10 10:10:49,660] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 23: [2023-05-10 10:10:49,661] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 0: [2023-05-10 10:10:49,661] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 23: [2023-05-10 10:10:49,661] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 15: [2023-05-10 10:10:49,661] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 0: [2023-05-10 10:10:49,661] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 31: [2023-05-10 10:10:49,661] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 30: [2023-05-10 10:10:49,661] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 23: [2023-05-10 10:10:49,661] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 30: [2023-05-10 10:10:49,661] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 30: [2023-05-10 10:10:49,661] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 30: [2023-05-10 10:10:49,661] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 30: [2023-05-10 10:10:49,662] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 30: [2023-05-10 10:10:49,662] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 30: [2023-05-10 10:10:49,662] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 30: [2023-05-10 10:10:49,662] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 0: [2023-05-10 10:10:49,662] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 0: [2023-05-10 10:10:49,662] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 0: [2023-05-10 10:10:49,663] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 0: [2023-05-10 10:10:49,663] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 0: [2023-05-10 10:10:49,663] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 13: [2023-05-10 10:10:49,664] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 13: [2023-05-10 10:10:49,664] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 13: [2023-05-10 10:10:49,664] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 10: [2023-05-10 10:10:49,664] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 13: [2023-05-10 10:10:49,664] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 13: [2023-05-10 10:10:49,664] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 13: [2023-05-10 10:10:49,664] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 13: [2023-05-10 10:10:49,664] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 10: [2023-05-10 10:10:49,665] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 23: [2023-05-10 10:10:49,664] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 23: [2023-05-10 10:10:49,665] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 23: [2023-05-10 10:10:49,665] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 13: [2023-05-10 10:10:49,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 23: [2023-05-10 10:10:49,665] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 23: [2023-05-10 10:10:49,665] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 10: [2023-05-10 10:10:49,665] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 10: [2023-05-10 10:10:49,666] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 15: [2023-05-10 10:10:49,665] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 10: [2023-05-10 10:10:49,666] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 10: [2023-05-10 10:10:49,667] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 10: [2023-05-10 10:10:49,667] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 10: [2023-05-10 10:10:49,668] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 17: [2023-05-10 10:10:49,667] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 17: [2023-05-10 10:10:49,668] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 17: [2023-05-10 10:10:49,668] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 17: [2023-05-10 10:10:49,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 17: [2023-05-10 10:10:49,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 17: [2023-05-10 10:10:49,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 15: [2023-05-10 10:10:49,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 15: [2023-05-10 10:10:49,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 17: [2023-05-10 10:10:49,670] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 15: [2023-05-10 10:10:49,671] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 15: [2023-05-10 10:10:49,671] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 15: [2023-05-10 10:10:49,671] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 31: [2023-05-10 10:10:49,672] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 31: [2023-05-10 10:10:49,672] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 15: [2023-05-10 10:10:49,672] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 21: [2023-05-10 10:10:49,673] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 21: [2023-05-10 10:10:49,673] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 30: [2023-05-10 10:10:49,673] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 30: [2023-05-10 10:10:49,673] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 31: [2023-05-10 10:10:49,674] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 30: [2023-05-10 10:10:49,674] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 30: [2023-05-10 10:10:49,674] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 30: [2023-05-10 10:10:49,674] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 30: [2023-05-10 10:10:49,675] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 30: [2023-05-10 10:10:49,675] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 31: [2023-05-10 10:10:49,675] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 31: [2023-05-10 10:10:49,675] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 30: [2023-05-10 10:10:49,676] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 13: [2023-05-10 10:10:49,677] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 13: [2023-05-10 10:10:49,677] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 13: [2023-05-10 10:10:49,677] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 13: [2023-05-10 10:10:49,678] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 22: [2023-05-10 10:10:49,677] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 22: [2023-05-10 10:10:49,677] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 21: [2023-05-10 10:10:49,677] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 13: [2023-05-10 10:10:49,678] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 13: [2023-05-10 10:10:49,678] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 13: [2023-05-10 10:10:49,679] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 22: [2023-05-10 10:10:49,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 22: [2023-05-10 10:10:49,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 22: [2023-05-10 10:10:49,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 22: [2023-05-10 10:10:49,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 13: [2023-05-10 10:10:49,680] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 21: [2023-05-10 10:10:49,683] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 22: [2023-05-10 10:10:49,684] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 22: [2023-05-10 10:10:49,684] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 21: [2023-05-10 10:10:49,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 21: [2023-05-10 10:10:49,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 21: [2023-05-10 10:10:49,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 21: [2023-05-10 10:10:49,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 21: [2023-05-10 10:10:49,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 21: [2023-05-10 10:10:49,696] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 20: [2023-05-10 10:10:49,701] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 20: [2023-05-10 10:10:49,701] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 20: [2023-05-10 10:10:49,701] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 20: [2023-05-10 10:10:49,701] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 20: [2023-05-10 10:10:49,701] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 20: [2023-05-10 10:10:49,701] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 20: [2023-05-10 10:10:49,701] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 20: [2023-05-10 10:10:49,702] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 1: [2023-05-10 10:10:49,703] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 1: [2023-05-10 10:10:49,703] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 1: [2023-05-10 10:10:49,703] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 1: [2023-05-10 10:10:49,703] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 1: [2023-05-10 10:10:49,703] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 1: [2023-05-10 10:10:49,703] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 1: [2023-05-10 10:10:49,703] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 1: [2023-05-10 10:10:49,703] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 21: [2023-05-10 10:10:49,705] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 22: [2023-05-10 10:10:49,706] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 22: [2023-05-10 10:10:49,708] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 20: [2023-05-10 10:10:49,707] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 1: [2023-05-10 10:10:49,709] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 1: [2023-05-10 10:10:49,709] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 17: [2023-05-10 10:10:49,709] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 15: [2023-05-10 10:10:49,709] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 1: [2023-05-10 10:10:49,710] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 1: [2023-05-10 10:10:49,711] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 1: [2023-05-10 10:10:49,711] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 1: [2023-05-10 10:10:49,712] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 1: [2023-05-10 10:10:49,712] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 1: [2023-05-10 10:10:49,712] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 22: [2023-05-10 10:10:49,712] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 22: [2023-05-10 10:10:49,712] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 21: [2023-05-10 10:10:49,713] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 23: [2023-05-10 10:10:49,714] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 23: [2023-05-10 10:10:49,714] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 23: [2023-05-10 10:10:49,714] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 22: [2023-05-10 10:10:49,714] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 20: [2023-05-10 10:10:49,714] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 22: [2023-05-10 10:10:49,715] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 20: [2023-05-10 10:10:49,715] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 20: [2023-05-10 10:10:49,715] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 20: [2023-05-10 10:10:49,716] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 20: [2023-05-10 10:10:49,716] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 20: [2023-05-10 10:10:49,717] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 20: [2023-05-10 10:10:49,718] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 0: [2023-05-10 10:10:49,718] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 0: [2023-05-10 10:10:49,718] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 0: [2023-05-10 10:10:49,718] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 10: [2023-05-10 10:10:49,718] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 10: [2023-05-10 10:10:49,718] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 4: [2023-05-10 10:10:49,719] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 4: [2023-05-10 10:10:49,719] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 4: [2023-05-10 10:10:49,719] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 4: [2023-05-10 10:10:49,719] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 4: [2023-05-10 10:10:49,719] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 21: [2023-05-10 10:10:49,719] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 4: [2023-05-10 10:10:49,719] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 4: [2023-05-10 10:10:49,719] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 4: [2023-05-10 10:10:49,719] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 21: [2023-05-10 10:10:49,720] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 21: [2023-05-10 10:10:49,720] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 21: [2023-05-10 10:10:49,720] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 10: [2023-05-10 10:10:49,721] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 8: [2023-05-10 10:10:49,724] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 8: [2023-05-10 10:10:49,724] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 8: [2023-05-10 10:10:49,724] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 4: [2023-05-10 10:10:49,724] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 23: [2023-05-10 10:10:49,724] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 8: [2023-05-10 10:10:49,724] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 8: [2023-05-10 10:10:49,724] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 8: [2023-05-10 10:10:49,724] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 8: [2023-05-10 10:10:49,724] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 4: [2023-05-10 10:10:49,725] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 8: [2023-05-10 10:10:49,725] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 4: [2023-05-10 10:10:49,725] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 4: [2023-05-10 10:10:49,725] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 4: [2023-05-10 10:10:49,725] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 4: [2023-05-10 10:10:49,726] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 4: [2023-05-10 10:10:49,726] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 4: [2023-05-10 10:10:49,727] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 10: [2023-05-10 10:10:49,727] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 8: [2023-05-10 10:10:49,729] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 8: [2023-05-10 10:10:49,729] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 28: [2023-05-10 10:10:49,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 28: [2023-05-10 10:10:49,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 23: [2023-05-10 10:10:49,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 23: [2023-05-10 10:10:49,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 23: [2023-05-10 10:10:49,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 8: [2023-05-10 10:10:49,730] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 23: [2023-05-10 10:10:49,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 8: [2023-05-10 10:10:49,730] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 8: [2023-05-10 10:10:49,731] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 0: [2023-05-10 10:10:49,731] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 0: [2023-05-10 10:10:49,731] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 0: [2023-05-10 10:10:49,731] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 0: [2023-05-10 10:10:49,731] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 0: [2023-05-10 10:10:49,731] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 8: [2023-05-10 10:10:49,731] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 28: [2023-05-10 10:10:49,731] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 28: [2023-05-10 10:10:49,731] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 28: [2023-05-10 10:10:49,731] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 28: [2023-05-10 10:10:49,731] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 28: [2023-05-10 10:10:49,731] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 28: [2023-05-10 10:10:49,731] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 17: [2023-05-10 10:10:49,731] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 8: [2023-05-10 10:10:49,731] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 8: [2023-05-10 10:10:49,732] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 15: [2023-05-10 10:10:49,732] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 10: [2023-05-10 10:10:49,733] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 10: [2023-05-10 10:10:49,733] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 10: [2023-05-10 10:10:49,733] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 10: [2023-05-10 10:10:49,733] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 29: [2023-05-10 10:10:49,733] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 29: [2023-05-10 10:10:49,733] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 29: [2023-05-10 10:10:49,734] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 29: [2023-05-10 10:10:49,735] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 29: [2023-05-10 10:10:49,735] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 29: [2023-05-10 10:10:49,735] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 29: [2023-05-10 10:10:49,735] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 29: [2023-05-10 10:10:49,735] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 28: [2023-05-10 10:10:49,735] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 28: [2023-05-10 10:10:49,736] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 23: [2023-05-10 10:10:49,737] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 29: [2023-05-10 10:10:49,737] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 29: [2023-05-10 10:10:49,738] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 23: [2023-05-10 10:10:49,738] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 23: [2023-05-10 10:10:49,738] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 0: [2023-05-10 10:10:49,739] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 17: [2023-05-10 10:10:49,739] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 17: [2023-05-10 10:10:49,739] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 10: [2023-05-10 10:10:49,742] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 18: [2023-05-10 10:10:49,742] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 18: [2023-05-10 10:10:49,742] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 18: [2023-05-10 10:10:49,742] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 18: [2023-05-10 10:10:49,742] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 18: [2023-05-10 10:10:49,742] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 18: [2023-05-10 10:10:49,742] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 18: [2023-05-10 10:10:49,742] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 18: [2023-05-10 10:10:49,742] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 0: [2023-05-10 10:10:49,743] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 15: [2023-05-10 10:10:49,743] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 15: [2023-05-10 10:10:49,743] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 29: [2023-05-10 10:10:49,743] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 28: [2023-05-10 10:10:49,743] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 0: [2023-05-10 10:10:49,743] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 28: [2023-05-10 10:10:49,743] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 29: [2023-05-10 10:10:49,743] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 29: [2023-05-10 10:10:49,744] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 28: [2023-05-10 10:10:49,744] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 28: [2023-05-10 10:10:49,744] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 15: [2023-05-10 10:10:49,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 29: [2023-05-10 10:10:49,744] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 10: [2023-05-10 10:10:49,744] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 28: [2023-05-10 10:10:49,744] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 29: [2023-05-10 10:10:49,745] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 28: [2023-05-10 10:10:49,745] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 29: [2023-05-10 10:10:49,745] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 15: [2023-05-10 10:10:49,745] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 15: [2023-05-10 10:10:49,745] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 15: [2023-05-10 10:10:49,745] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 15: [2023-05-10 10:10:49,746] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 17: [2023-05-10 10:10:49,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 17: [2023-05-10 10:10:49,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 17: [2023-05-10 10:10:49,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 17: [2023-05-10 10:10:49,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 17: [2023-05-10 10:10:49,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 18: [2023-05-10 10:10:49,747] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 18: [2023-05-10 10:10:49,747] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 18: [2023-05-10 10:10:49,748] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 18: [2023-05-10 10:10:49,748] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 18: [2023-05-10 10:10:49,748] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 10: [2023-05-10 10:10:49,750] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 30: [2023-05-10 10:10:49,750] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 18: [2023-05-10 10:10:49,750] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 18: [2023-05-10 10:10:49,750] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 18: [2023-05-10 10:10:49,750] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 10: [2023-05-10 10:10:49,751] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 30: [2023-05-10 10:10:49,753] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 23: [2023-05-10 10:10:49,754] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 13: [2023-05-10 10:10:49,754] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 13: [2023-05-10 10:10:49,754] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 13: [2023-05-10 10:10:49,754] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 20: [2023-05-10 10:10:49,757] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 23: [2023-05-10 10:10:49,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 23: [2023-05-10 10:10:49,759] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 23: [2023-05-10 10:10:49,759] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 0: [2023-05-10 10:10:49,759] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 0: [2023-05-10 10:10:49,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 23: [2023-05-10 10:10:49,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 0: [2023-05-10 10:10:49,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 30: [2023-05-10 10:10:49,760] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 30: [2023-05-10 10:10:49,760] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 30: [2023-05-10 10:10:49,760] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 30: [2023-05-10 10:10:49,761] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 10: [2023-05-10 10:10:49,762] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 10: [2023-05-10 10:10:49,762] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 30: [2023-05-10 10:10:49,762] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 10: [2023-05-10 10:10:49,762] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 1: [2023-05-10 10:10:49,762] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 1: [2023-05-10 10:10:49,762] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 0: [2023-05-10 10:10:49,763] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 30: [2023-05-10 10:10:49,764] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 10: [2023-05-10 10:10:49,764] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 0: [2023-05-10 10:10:49,765] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 1: [2023-05-10 10:10:49,767] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 13: [2023-05-10 10:10:49,767] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 13: [2023-05-10 10:10:49,767] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 13: [2023-05-10 10:10:49,767] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 13: [2023-05-10 10:10:49,767] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 13: [2023-05-10 10:10:49,767] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 16: [2023-05-10 10:10:49,768] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 16: [2023-05-10 10:10:49,768] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 17: [2023-05-10 10:10:49,768] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 17: [2023-05-10 10:10:49,768] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 16: [2023-05-10 10:10:49,769] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 16: [2023-05-10 10:10:49,769] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 16: [2023-05-10 10:10:49,769] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 16: [2023-05-10 10:10:49,769] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 16: [2023-05-10 10:10:49,769] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 16: [2023-05-10 10:10:49,770] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 1: [2023-05-10 10:10:49,771] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 16: [2023-05-10 10:10:49,774] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 30: [2023-05-10 10:10:49,775] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 16: [2023-05-10 10:10:49,775] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 1: [2023-05-10 10:10:49,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 1: [2023-05-10 10:10:49,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 1: [2023-05-10 10:10:49,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 1: [2023-05-10 10:10:49,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 17: [2023-05-10 10:10:49,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 16: [2023-05-10 10:10:49,777] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 16: [2023-05-10 10:10:49,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 16: [2023-05-10 10:10:49,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 30: [2023-05-10 10:10:49,779] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 16: [2023-05-10 10:10:49,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 16: [2023-05-10 10:10:49,779] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 16: [2023-05-10 10:10:49,780] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 13: [2023-05-10 10:10:49,780] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 13: [2023-05-10 10:10:49,780] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 17: [2023-05-10 10:10:49,780] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 20: [2023-05-10 10:10:49,781] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 13: [2023-05-10 10:10:49,781] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 17: [2023-05-10 10:10:49,781] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 4: [2023-05-10 10:10:49,782] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 4: [2023-05-10 10:10:49,782] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 4: [2023-05-10 10:10:49,782] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 4: [2023-05-10 10:10:49,782] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 17: [2023-05-10 10:10:49,783] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 17: [2023-05-10 10:10:49,783] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 15: [2023-05-10 10:10:49,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 15: [2023-05-10 10:10:49,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 15: [2023-05-10 10:10:49,785] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 20: [2023-05-10 10:10:49,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 8: [2023-05-10 10:10:49,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 15: [2023-05-10 10:10:49,786] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 20: [2023-05-10 10:10:49,786] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 15: [2023-05-10 10:10:49,787] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 28: [2023-05-10 10:10:49,786] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 28: [2023-05-10 10:10:49,786] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 4: [2023-05-10 10:10:49,786] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 4: [2023-05-10 10:10:49,787] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 4: [2023-05-10 10:10:49,787] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 4: [2023-05-10 10:10:49,787] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 15: [2023-05-10 10:10:49,787] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 1: [2023-05-10 10:10:49,787] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 15: [2023-05-10 10:10:49,787] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 8: [2023-05-10 10:10:49,788] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 8: [2023-05-10 10:10:49,788] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 8: [2023-05-10 10:10:49,788] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 8: [2023-05-10 10:10:49,788] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 29: [2023-05-10 10:10:49,789] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 29: [2023-05-10 10:10:49,789] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 27: [2023-05-10 10:10:49,790] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 27: [2023-05-10 10:10:49,790] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 27: [2023-05-10 10:10:49,790] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 27: [2023-05-10 10:10:49,790] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 27: [2023-05-10 10:10:49,790] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 27: [2023-05-10 10:10:49,790] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 27: [2023-05-10 10:10:49,790] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 27: [2023-05-10 10:10:49,790] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 1: [2023-05-10 10:10:49,790] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 20: [2023-05-10 10:10:49,793] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 20: [2023-05-10 10:10:49,793] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 20: [2023-05-10 10:10:49,793] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 20: [2023-05-10 10:10:49,793] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 20: [2023-05-10 10:10:49,793] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 2: [2023-05-10 10:10:49,794] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 2: [2023-05-10 10:10:49,794] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 1: [2023-05-10 10:10:49,796] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 18: [2023-05-10 10:10:49,795] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 27: [2023-05-10 10:10:49,795] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 27: [2023-05-10 10:10:49,796] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 2: [2023-05-10 10:10:49,796] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 2: [2023-05-10 10:10:49,796] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 2: [2023-05-10 10:10:49,796] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 2: [2023-05-10 10:10:49,796] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 2: [2023-05-10 10:10:49,796] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 30: [2023-05-10 10:10:49,796] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 2: [2023-05-10 10:10:49,797] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 1: [2023-05-10 10:10:49,797] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 30: [2023-05-10 10:10:49,797] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 2: [2023-05-10 10:10:49,798] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 2: [2023-05-10 10:10:49,798] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 30: [2023-05-10 10:10:49,798] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 30: [2023-05-10 10:10:49,798] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 27: [2023-05-10 10:10:49,799] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 27: [2023-05-10 10:10:49,799] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 27: [2023-05-10 10:10:49,799] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 27: [2023-05-10 10:10:49,800] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 9: [2023-05-10 10:10:49,799] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 9: [2023-05-10 10:10:49,799] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 27: [2023-05-10 10:10:49,800] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 30: [2023-05-10 10:10:49,801] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 13: [2023-05-10 10:10:49,800] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 27: [2023-05-10 10:10:49,801] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 13: [2023-05-10 10:10:49,801] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 13: [2023-05-10 10:10:49,801] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 13: [2023-05-10 10:10:49,802] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 30: [2023-05-10 10:10:49,802] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 13: [2023-05-10 10:10:49,802] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 9: [2023-05-10 10:10:49,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 9: [2023-05-10 10:10:49,803] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 9: [2023-05-10 10:10:49,803] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 9: [2023-05-10 10:10:49,803] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 9: [2023-05-10 10:10:49,803] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 8: [2023-05-10 10:10:49,803] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 9: [2023-05-10 10:10:49,803] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 8: [2023-05-10 10:10:49,803] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 8: [2023-05-10 10:10:49,803] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 9: [2023-05-10 10:10:49,805] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 4: [2023-05-10 10:10:49,805] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 9: [2023-05-10 10:10:49,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 2: [2023-05-10 10:10:49,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 2: [2023-05-10 10:10:49,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 2: [2023-05-10 10:10:49,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 2: [2023-05-10 10:10:49,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 2: [2023-05-10 10:10:49,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 2: [2023-05-10 10:10:49,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 28: [2023-05-10 10:10:49,809] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 28: [2023-05-10 10:10:49,809] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 1: [2023-05-10 10:10:49,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 29: [2023-05-10 10:10:49,809] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 1: [2023-05-10 10:10:49,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 18: [2023-05-10 10:10:49,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 18: [2023-05-10 10:10:49,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 18: [2023-05-10 10:10:49,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 18: [2023-05-10 10:10:49,809] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 1: [2023-05-10 10:10:49,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 9: [2023-05-10 10:10:49,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 1: [2023-05-10 10:10:49,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 9: [2023-05-10 10:10:49,810] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 28: [2023-05-10 10:10:49,810] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 28: [2023-05-10 10:10:49,810] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 9: [2023-05-10 10:10:49,810] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 4: [2023-05-10 10:10:49,810] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 9: [2023-05-10 10:10:49,810] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 29: [2023-05-10 10:10:49,810] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 29: [2023-05-10 10:10:49,810] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 9: [2023-05-10 10:10:49,811] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 9: [2023-05-10 10:10:49,811] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 8: [2023-05-10 10:10:49,812] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 4: [2023-05-10 10:10:49,812] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 18: [2023-05-10 10:10:49,812] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 28: [2023-05-10 10:10:49,813] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 28: [2023-05-10 10:10:49,813] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 4: [2023-05-10 10:10:49,813] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 20: [2023-05-10 10:10:49,813] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 6: [2023-05-10 10:10:49,813] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 6: [2023-05-10 10:10:49,813] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 28: [2023-05-10 10:10:49,814] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 28: [2023-05-10 10:10:49,814] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 19: [2023-05-10 10:10:49,814] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 19: [2023-05-10 10:10:49,814] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 18: [2023-05-10 10:10:49,814] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 18: [2023-05-10 10:10:49,814] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 24: [2023-05-10 10:10:49,814] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 24: [2023-05-10 10:10:49,814] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 20: [2023-05-10 10:10:49,815] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 8: [2023-05-10 10:10:49,815] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 8: [2023-05-10 10:10:49,815] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 29: [2023-05-10 10:10:49,815] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 29: [2023-05-10 10:10:49,815] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 29: [2023-05-10 10:10:49,815] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 29: [2023-05-10 10:10:49,815] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 29: [2023-05-10 10:10:49,815] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 19: [2023-05-10 10:10:49,816] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 19: [2023-05-10 10:10:49,816] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 19: [2023-05-10 10:10:49,816] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 19: [2023-05-10 10:10:49,816] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 19: [2023-05-10 10:10:49,816] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 19: [2023-05-10 10:10:49,816] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 24: [2023-05-10 10:10:49,817] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 24: [2023-05-10 10:10:49,817] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 24: [2023-05-10 10:10:49,817] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 6: [2023-05-10 10:10:49,816] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 6: [2023-05-10 10:10:49,817] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 6: [2023-05-10 10:10:49,817] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 6: [2023-05-10 10:10:49,817] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 6: [2023-05-10 10:10:49,817] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 6: [2023-05-10 10:10:49,817] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 24: [2023-05-10 10:10:49,817] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 24: [2023-05-10 10:10:49,817] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 24: [2023-05-10 10:10:49,818] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 4: [2023-05-10 10:10:49,818] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 6: [2023-05-10 10:10:49,818] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 8: [2023-05-10 10:10:49,818] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 6: [2023-05-10 10:10:49,818] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 4: [2023-05-10 10:10:49,819] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 24: [2023-05-10 10:10:49,819] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 24: [2023-05-10 10:10:49,819] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 18: [2023-05-10 10:10:49,819] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 19: [2023-05-10 10:10:49,819] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 19: [2023-05-10 10:10:49,820] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 14: [2023-05-10 10:10:49,820] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 14: [2023-05-10 10:10:49,820] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 8: [2023-05-10 10:10:49,821] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 19: [2023-05-10 10:10:49,821] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 4: [2023-05-10 10:10:49,822] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 12: [2023-05-10 10:10:49,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 12: [2023-05-10 10:10:49,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 12: [2023-05-10 10:10:49,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 14: [2023-05-10 10:10:49,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 4: [2023-05-10 10:10:49,822] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 14: [2023-05-10 10:10:49,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 14: [2023-05-10 10:10:49,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 14: [2023-05-10 10:10:49,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 14: [2023-05-10 10:10:49,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 25: [2023-05-10 10:10:49,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 25: [2023-05-10 10:10:49,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 14: [2023-05-10 10:10:49,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 12: [2023-05-10 10:10:49,823] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 12: [2023-05-10 10:10:49,823] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 12: [2023-05-10 10:10:49,823] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 12: [2023-05-10 10:10:49,823] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 12: [2023-05-10 10:10:49,823] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 24: [2023-05-10 10:10:49,824] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 24: [2023-05-10 10:10:49,824] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 24: [2023-05-10 10:10:49,824] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 24: [2023-05-10 10:10:49,825] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 19: [2023-05-10 10:10:49,824] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 19: [2023-05-10 10:10:49,825] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 25: [2023-05-10 10:10:49,824] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 25: [2023-05-10 10:10:49,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 25: [2023-05-10 10:10:49,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 25: [2023-05-10 10:10:49,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 25: [2023-05-10 10:10:49,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 24: [2023-05-10 10:10:49,825] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 25: [2023-05-10 10:10:49,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 24: [2023-05-10 10:10:49,826] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 19: [2023-05-10 10:10:49,826] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 19: [2023-05-10 10:10:49,826] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 19: [2023-05-10 10:10:49,826] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 8: [2023-05-10 10:10:49,827] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 16: [2023-05-10 10:10:49,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 6: [2023-05-10 10:10:49,826] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 12: [2023-05-10 10:10:49,827] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 8: [2023-05-10 10:10:49,827] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 14: [2023-05-10 10:10:49,827] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 8: [2023-05-10 10:10:49,828] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 14: [2023-05-10 10:10:49,827] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 6: [2023-05-10 10:10:49,828] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 6: [2023-05-10 10:10:49,828] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 6: [2023-05-10 10:10:49,828] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 14: [2023-05-10 10:10:49,828] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 20: [2023-05-10 10:10:49,828] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 6: [2023-05-10 10:10:49,829] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 14: [2023-05-10 10:10:49,829] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 25: [2023-05-10 10:10:49,829] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 25: [2023-05-10 10:10:49,829] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 14: [2023-05-10 10:10:49,829] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 14: [2023-05-10 10:10:49,829] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 6: [2023-05-10 10:10:49,830] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 20: [2023-05-10 10:10:49,830] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 14: [2023-05-10 10:10:49,830] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 20: [2023-05-10 10:10:49,831] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 14: [2023-05-10 10:10:49,831] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 25: [2023-05-10 10:10:49,831] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 25: [2023-05-10 10:10:49,831] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 25: [2023-05-10 10:10:49,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 25: [2023-05-10 10:10:49,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 25: [2023-05-10 10:10:49,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 25: [2023-05-10 10:10:49,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 12: [2023-05-10 10:10:49,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 29: [2023-05-10 10:10:49,833] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 16: [2023-05-10 10:10:49,833] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 12: [2023-05-10 10:10:49,833] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 12: [2023-05-10 10:10:49,834] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 18: [2023-05-10 10:10:49,833] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 12: [2023-05-10 10:10:49,834] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 12: [2023-05-10 10:10:49,834] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 20: [2023-05-10 10:10:49,834] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 20: [2023-05-10 10:10:49,834] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 12: [2023-05-10 10:10:49,835] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 3: [2023-05-10 10:10:49,834] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 3: [2023-05-10 10:10:49,834] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 12: [2023-05-10 10:10:49,835] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 18: [2023-05-10 10:10:49,836] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 28: [2023-05-10 10:10:49,837] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 3: [2023-05-10 10:10:49,837] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 3: [2023-05-10 10:10:49,838] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 3: [2023-05-10 10:10:49,838] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 3: [2023-05-10 10:10:49,838] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 3: [2023-05-10 10:10:49,838] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 3: [2023-05-10 10:10:49,838] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 28: [2023-05-10 10:10:49,839] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 18: [2023-05-10 10:10:49,839] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 3: [2023-05-10 10:10:49,840] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 18: [2023-05-10 10:10:49,840] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 3: [2023-05-10 10:10:49,840] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 3: [2023-05-10 10:10:49,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 18: [2023-05-10 10:10:49,842] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 18: [2023-05-10 10:10:49,843] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 18: [2023-05-10 10:10:49,843] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 11: [2023-05-10 10:10:49,842] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 11: [2023-05-10 10:10:49,842] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 11: [2023-05-10 10:10:49,843] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 11: [2023-05-10 10:10:49,843] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 11: [2023-05-10 10:10:49,843] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 11: [2023-05-10 10:10:49,843] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 11: [2023-05-10 10:10:49,843] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 11: [2023-05-10 10:10:49,843] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 28: [2023-05-10 10:10:49,845] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 29: [2023-05-10 10:10:49,846] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 29: [2023-05-10 10:10:49,846] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 3: [2023-05-10 10:10:49,846] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 3: [2023-05-10 10:10:49,846] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 16: [2023-05-10 10:10:49,846] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 3: [2023-05-10 10:10:49,846] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 3: [2023-05-10 10:10:49,847] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 28: [2023-05-10 10:10:49,847] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 3: [2023-05-10 10:10:49,847] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 11: [2023-05-10 10:10:49,847] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 11: [2023-05-10 10:10:49,848] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 28: [2023-05-10 10:10:49,848] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 26: [2023-05-10 10:10:49,848] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 26: [2023-05-10 10:10:49,848] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 26: [2023-05-10 10:10:49,848] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 26: [2023-05-10 10:10:49,848] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 26: [2023-05-10 10:10:49,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 26: [2023-05-10 10:10:49,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 26: [2023-05-10 10:10:49,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 29: [2023-05-10 10:10:49,849] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 26: [2023-05-10 10:10:49,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 29: [2023-05-10 10:10:49,849] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 2: [2023-05-10 10:10:49,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 2: [2023-05-10 10:10:49,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 27: [2023-05-10 10:10:49,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 29: [2023-05-10 10:10:49,849] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 28: [2023-05-10 10:10:49,849] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 27: [2023-05-10 10:10:49,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 16: [2023-05-10 10:10:49,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 16: [2023-05-10 10:10:49,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 16: [2023-05-10 10:10:49,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 16: [2023-05-10 10:10:49,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 16: [2023-05-10 10:10:49,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 16: [2023-05-10 10:10:49,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 11: [2023-05-10 10:10:49,852] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 11: [2023-05-10 10:10:49,853] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 11: [2023-05-10 10:10:49,853] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 26: [2023-05-10 10:10:49,853] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 26: [2023-05-10 10:10:49,853] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 11: [2023-05-10 10:10:49,854] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 11: [2023-05-10 10:10:49,855] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 11: [2023-05-10 10:10:49,855] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 5: [2023-05-10 10:10:49,855] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 5: [2023-05-10 10:10:49,856] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 5: [2023-05-10 10:10:49,856] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 5: [2023-05-10 10:10:49,856] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 5: [2023-05-10 10:10:49,856] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 5: [2023-05-10 10:10:49,856] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 5: [2023-05-10 10:10:49,856] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 5: [2023-05-10 10:10:49,856] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 16: [2023-05-10 10:10:49,858] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 26: [2023-05-10 10:10:49,858] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 26: [2023-05-10 10:10:49,859] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 26: [2023-05-10 10:10:49,859] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 26: [2023-05-10 10:10:49,860] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 26: [2023-05-10 10:10:49,860] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 26: [2023-05-10 10:10:49,861] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 5: [2023-05-10 10:10:49,861] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 5: [2023-05-10 10:10:49,862] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 5: [2023-05-10 10:10:49,863] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 9: [2023-05-10 10:10:49,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 9: [2023-05-10 10:10:49,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 9: [2023-05-10 10:10:49,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 5: [2023-05-10 10:10:49,865] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 27: [2023-05-10 10:10:49,866] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 5: [2023-05-10 10:10:49,866] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 5: [2023-05-10 10:10:49,867] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 5: [2023-05-10 10:10:49,867] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 5: [2023-05-10 10:10:49,868] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 27: [2023-05-10 10:10:49,869] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 27: [2023-05-10 10:10:49,869] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 27: [2023-05-10 10:10:49,869] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 27: [2023-05-10 10:10:49,869] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 27: [2023-05-10 10:10:49,869] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 9: [2023-05-10 10:10:49,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 9: [2023-05-10 10:10:49,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 9: [2023-05-10 10:10:49,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 9: [2023-05-10 10:10:49,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 9: [2023-05-10 10:10:49,869] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 6: [2023-05-10 10:10:49,869] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 6: [2023-05-10 10:10:49,869] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 2: [2023-05-10 10:10:49,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 2: [2023-05-10 10:10:49,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 24: [2023-05-10 10:10:49,871] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 24: [2023-05-10 10:10:49,871] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 27: [2023-05-10 10:10:49,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 27: [2023-05-10 10:10:49,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 19: [2023-05-10 10:10:49,874] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 19: [2023-05-10 10:10:49,874] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 19: [2023-05-10 10:10:49,874] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 2: [2023-05-10 10:10:49,875] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 2: [2023-05-10 10:10:49,875] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 2: [2023-05-10 10:10:49,875] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 7: [2023-05-10 10:10:49,877] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 7: [2023-05-10 10:10:49,877] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 7: [2023-05-10 10:10:49,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 7: [2023-05-10 10:10:49,879] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 7: [2023-05-10 10:10:49,879] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 7: [2023-05-10 10:10:49,879] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 7: [2023-05-10 10:10:49,879] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 2: [2023-05-10 10:10:49,879] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 2: [2023-05-10 10:10:49,879] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 7: [2023-05-10 10:10:49,880] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 12: [2023-05-10 10:10:49,880] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 2: [2023-05-10 10:10:49,881] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 14: [2023-05-10 10:10:49,882] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 16: [2023-05-10 10:10:49,883] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 16: [2023-05-10 10:10:49,883] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 16: [2023-05-10 10:10:49,883] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 14: [2023-05-10 10:10:49,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 14: [2023-05-10 10:10:49,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 7: [2023-05-10 10:10:49,883] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 7: [2023-05-10 10:10:49,883] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 7: [2023-05-10 10:10:49,884] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 7: [2023-05-10 10:10:49,884] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 19: [2023-05-10 10:10:49,885] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 19: [2023-05-10 10:10:49,885] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 7: [2023-05-10 10:10:49,886] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 7: [2023-05-10 10:10:49,886] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 16: [2023-05-10 10:10:49,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 7: [2023-05-10 10:10:49,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 16: [2023-05-10 10:10:49,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 7: [2023-05-10 10:10:49,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt... 16: [2023-05-10 10:10:49,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 25: [2023-05-10 10:10:49,887] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 25: [2023-05-10 10:10:49,887] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 25: [2023-05-10 10:10:49,887] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 3: [2023-05-10 10:10:49,888] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 19: [2023-05-10 10:10:49,888] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 19: [2023-05-10 10:10:49,888] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 19: [2023-05-10 10:10:49,888] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 25: [2023-05-10 10:10:49,890] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 25: [2023-05-10 10:10:49,891] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 25: [2023-05-10 10:10:49,891] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 25: [2023-05-10 10:10:49,891] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 25: [2023-05-10 10:10:49,892] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 6: [2023-05-10 10:10:49,892] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 6: [2023-05-10 10:10:49,892] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 6: [2023-05-10 10:10:49,893] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 6: [2023-05-10 10:10:49,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 14: [2023-05-10 10:10:49,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 14: [2023-05-10 10:10:49,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 14: [2023-05-10 10:10:49,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 14: [2023-05-10 10:10:49,895] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 14: [2023-05-10 10:10:49,895] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 9: [2023-05-10 10:10:49,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 9: [2023-05-10 10:10:49,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 9: [2023-05-10 10:10:49,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 27: [2023-05-10 10:10:49,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 24: [2023-05-10 10:10:49,896] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 3: [2023-05-10 10:10:49,896] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 3: [2023-05-10 10:10:49,896] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 24: [2023-05-10 10:10:49,896] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 24: [2023-05-10 10:10:49,896] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 24: [2023-05-10 10:10:49,896] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 24: [2023-05-10 10:10:49,896] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 24: [2023-05-10 10:10:49,896] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 19: [2023-05-10 10:10:49,896] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 27: [2023-05-10 10:10:49,897] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 24: [2023-05-10 10:10:49,897] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 31: [2023-05-10 10:10:49,897] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 31: [2023-05-10 10:10:49,897] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 19: [2023-05-10 10:10:49,897] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 24: [2023-05-10 10:10:49,898] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 19: [2023-05-10 10:10:49,898] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 31: [2023-05-10 10:10:49,898] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 31: [2023-05-10 10:10:49,898] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 31: [2023-05-10 10:10:49,898] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 31: [2023-05-10 10:10:49,898] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 31: [2023-05-10 10:10:49,898] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 31: [2023-05-10 10:10:49,899] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 9: [2023-05-10 10:10:49,900] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 11: [2023-05-10 10:10:49,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 11: [2023-05-10 10:10:49,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 27: [2023-05-10 10:10:49,900] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 9: [2023-05-10 10:10:49,901] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 9: [2023-05-10 10:10:49,901] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 27: [2023-05-10 10:10:49,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 22: [2023-05-10 10:10:49,901] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 22: [2023-05-10 10:10:49,901] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 9: [2023-05-10 10:10:49,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 9: [2023-05-10 10:10:49,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 31: [2023-05-10 10:10:49,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 22: [2023-05-10 10:10:49,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 22: [2023-05-10 10:10:49,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 22: [2023-05-10 10:10:49,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 22: [2023-05-10 10:10:49,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 22: [2023-05-10 10:10:49,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 22: [2023-05-10 10:10:49,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 27: [2023-05-10 10:10:49,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 31: [2023-05-10 10:10:49,903] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 2: [2023-05-10 10:10:49,903] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 6: [2023-05-10 10:10:49,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 6: [2023-05-10 10:10:49,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 6: [2023-05-10 10:10:49,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 6: [2023-05-10 10:10:49,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 2: [2023-05-10 10:10:49,904] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 27: [2023-05-10 10:10:49,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 12: [2023-05-10 10:10:49,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 12: [2023-05-10 10:10:49,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 26: [2023-05-10 10:10:49,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 26: [2023-05-10 10:10:49,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 3: [2023-05-10 10:10:49,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 2: [2023-05-10 10:10:49,906] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 12: [2023-05-10 10:10:49,906] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 31: [2023-05-10 10:10:49,906] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 31: [2023-05-10 10:10:49,907] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 31: [2023-05-10 10:10:49,907] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 22: [2023-05-10 10:10:49,907] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 31: [2023-05-10 10:10:49,907] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 31: [2023-05-10 10:10:49,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 22: [2023-05-10 10:10:49,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 14: [2023-05-10 10:10:49,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 31: [2023-05-10 10:10:49,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 14: [2023-05-10 10:10:49,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 19: [2023-05-10 10:10:49,910] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 3: [2023-05-10 10:10:49,910] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 2: [2023-05-10 10:10:49,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 2: [2023-05-10 10:10:49,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 2: [2023-05-10 10:10:49,912] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 14: [2023-05-10 10:10:49,912] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 22: [2023-05-10 10:10:49,912] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 25: [2023-05-10 10:10:49,913] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 22: [2023-05-10 10:10:49,913] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 22: [2023-05-10 10:10:49,913] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 3: [2023-05-10 10:10:49,913] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 3: [2023-05-10 10:10:49,913] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 3: [2023-05-10 10:10:49,913] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 3: [2023-05-10 10:10:49,913] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 22: [2023-05-10 10:10:49,914] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 21: [2023-05-10 10:10:49,914] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 21: [2023-05-10 10:10:49,914] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 5: [2023-05-10 10:10:49,914] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 22: [2023-05-10 10:10:49,914] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 25: [2023-05-10 10:10:49,914] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 22: [2023-05-10 10:10:49,915] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 19: [2023-05-10 10:10:49,916] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 25: [2023-05-10 10:10:49,916] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 21: [2023-05-10 10:10:49,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 21: [2023-05-10 10:10:49,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 21: [2023-05-10 10:10:49,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 21: [2023-05-10 10:10:49,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 21: [2023-05-10 10:10:49,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 21: [2023-05-10 10:10:49,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 6: [2023-05-10 10:10:49,917] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 5: [2023-05-10 10:10:49,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 5: [2023-05-10 10:10:49,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 6: [2023-05-10 10:10:49,918] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 12: [2023-05-10 10:10:49,918] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 12: [2023-05-10 10:10:49,918] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 12: [2023-05-10 10:10:49,918] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 12: [2023-05-10 10:10:49,918] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 12: [2023-05-10 10:10:49,918] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 3: [2023-05-10 10:10:49,918] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 3: [2023-05-10 10:10:49,918] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 19: [2023-05-10 10:10:49,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 19: [2023-05-10 10:10:49,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 19: [2023-05-10 10:10:49,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 14: [2023-05-10 10:10:49,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 21: [2023-05-10 10:10:49,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 11: [2023-05-10 10:10:49,921] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 11: [2023-05-10 10:10:49,921] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 11: [2023-05-10 10:10:49,921] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 25: [2023-05-10 10:10:49,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 25: [2023-05-10 10:10:49,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 21: [2023-05-10 10:10:49,922] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 21: [2023-05-10 10:10:49,922] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 11: [2023-05-10 10:10:49,923] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 11: [2023-05-10 10:10:49,923] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 14: [2023-05-10 10:10:49,923] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 21: [2023-05-10 10:10:49,923] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 21: [2023-05-10 10:10:49,923] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 25: [2023-05-10 10:10:49,923] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 25: [2023-05-10 10:10:49,923] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 25: [2023-05-10 10:10:49,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 21: [2023-05-10 10:10:49,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 21: [2023-05-10 10:10:49,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 14: [2023-05-10 10:10:49,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 11: [2023-05-10 10:10:49,924] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 24: [2023-05-10 10:10:49,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 21: [2023-05-10 10:10:49,925] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 24: [2023-05-10 10:10:49,926] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 26: [2023-05-10 10:10:49,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 26: [2023-05-10 10:10:49,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 26: [2023-05-10 10:10:49,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 11: [2023-05-10 10:10:49,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 11: [2023-05-10 10:10:49,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 14: [2023-05-10 10:10:49,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 14: [2023-05-10 10:10:49,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 24: [2023-05-10 10:10:49,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 12: [2023-05-10 10:10:49,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 26: [2023-05-10 10:10:49,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 26: [2023-05-10 10:10:49,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 3: [2023-05-10 10:10:49,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 12: [2023-05-10 10:10:49,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 26: [2023-05-10 10:10:49,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 26: [2023-05-10 10:10:49,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 24: [2023-05-10 10:10:49,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 5: [2023-05-10 10:10:49,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 5: [2023-05-10 10:10:49,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 26: [2023-05-10 10:10:49,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 24: [2023-05-10 10:10:49,932] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 24: [2023-05-10 10:10:49,932] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 6: [2023-05-10 10:10:49,933] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 5: [2023-05-10 10:10:49,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 5: [2023-05-10 10:10:49,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 5: [2023-05-10 10:10:49,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 6: [2023-05-10 10:10:49,934] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 6: [2023-05-10 10:10:49,935] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 6: [2023-05-10 10:10:49,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 3: [2023-05-10 10:10:49,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 5: [2023-05-10 10:10:49,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 7: [2023-05-10 10:10:49,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 7: [2023-05-10 10:10:49,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 7: [2023-05-10 10:10:49,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 7: [2023-05-10 10:10:49,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 3: [2023-05-10 10:10:49,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 3: [2023-05-10 10:10:49,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 5: [2023-05-10 10:10:49,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 3: [2023-05-10 10:10:49,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 5: [2023-05-10 10:10:49,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 7: [2023-05-10 10:10:49,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 12: [2023-05-10 10:10:49,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 7: [2023-05-10 10:10:49,947] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 7: [2023-05-10 10:10:49,947] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 7: [2023-05-10 10:10:49,947] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_07-model_00-model_states.pt. 11: [2023-05-10 10:10:49,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 12: [2023-05-10 10:10:49,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 11: [2023-05-10 10:10:49,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 12: [2023-05-10 10:10:49,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 12: [2023-05-10 10:10:49,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 12: [2023-05-10 10:10:49,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 11: [2023-05-10 10:10:49,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 26: [2023-05-10 10:10:49,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 31: [2023-05-10 10:10:49,953] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 31: [2023-05-10 10:10:49,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 5: [2023-05-10 10:10:49,955] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 5: [2023-05-10 10:10:49,955] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 11: [2023-05-10 10:10:49,955] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 11: [2023-05-10 10:10:49,956] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 26: [2023-05-10 10:10:49,957] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 26: [2023-05-10 10:10:49,957] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 11: [2023-05-10 10:10:49,957] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 26: [2023-05-10 10:10:49,958] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 15: [2023-05-10 10:10:49,961] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 15: [2023-05-10 10:10:49,961] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 15: [2023-05-10 10:10:49,961] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 15: [2023-05-10 10:10:49,961] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 15: [2023-05-10 10:10:49,961] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 15: [2023-05-10 10:10:49,961] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 15: [2023-05-10 10:10:49,961] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 15: [2023-05-10 10:10:49,962] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 26: [2023-05-10 10:10:49,962] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 22: [2023-05-10 10:10:49,961] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 22: [2023-05-10 10:10:49,962] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 5: [2023-05-10 10:10:49,964] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 26: [2023-05-10 10:10:49,964] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 5: [2023-05-10 10:10:49,964] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 5: [2023-05-10 10:10:49,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 15: [2023-05-10 10:10:49,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 7: [2023-05-10 10:10:49,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 7: [2023-05-10 10:10:49,969] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 7: [2023-05-10 10:10:49,971] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 15: [2023-05-10 10:10:49,971] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 7: [2023-05-10 10:10:49,972] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 15: [2023-05-10 10:10:49,973] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 15: [2023-05-10 10:10:49,973] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 15: [2023-05-10 10:10:49,973] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 7: [2023-05-10 10:10:49,974] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 15: [2023-05-10 10:10:49,974] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 7: [2023-05-10 10:10:49,975] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 7: [2023-05-10 10:10:49,975] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 15: [2023-05-10 10:10:49,975] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 15: [2023-05-10 10:10:49,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 31: [2023-05-10 10:10:49,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 31: [2023-05-10 10:10:49,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 7: [2023-05-10 10:10:49,977] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 21: [2023-05-10 10:10:49,977] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 21: [2023-05-10 10:10:49,977] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 31: [2023-05-10 10:10:49,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 31: [2023-05-10 10:10:49,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 31: [2023-05-10 10:10:49,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 31: [2023-05-10 10:10:49,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 31: [2023-05-10 10:10:49,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 31: [2023-05-10 10:10:49,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 21: [2023-05-10 10:10:49,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 22: [2023-05-10 10:10:49,983] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 22: [2023-05-10 10:10:49,983] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 22: [2023-05-10 10:10:49,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 22: [2023-05-10 10:10:49,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 22: [2023-05-10 10:10:49,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 22: [2023-05-10 10:10:49,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 22: [2023-05-10 10:10:49,986] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 21: [2023-05-10 10:10:49,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 21: [2023-05-10 10:10:49,987] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 21: [2023-05-10 10:10:49,987] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 22: [2023-05-10 10:10:49,988] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 21: [2023-05-10 10:10:49,988] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 21: [2023-05-10 10:10:49,988] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 21: [2023-05-10 10:10:50,001] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 21: [2023-05-10 10:10:50,002] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 21: [2023-05-10 10:10:50,008] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 10: [2023-05-10 10:10:50,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 10: [2023-05-10 10:10:50,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 10: [2023-05-10 10:10:50,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 10: [2023-05-10 10:10:50,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 10: [2023-05-10 10:10:50,008] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 10: [2023-05-10 10:10:50,008] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 10: [2023-05-10 10:10:50,008] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 10: [2023-05-10 10:10:50,008] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 22: [2023-05-10 10:10:50,011] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 31: [2023-05-10 10:10:50,011] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 31: [2023-05-10 10:10:50,012] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 31: [2023-05-10 10:10:50,012] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 31: [2023-05-10 10:10:50,012] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 22: [2023-05-10 10:10:50,013] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 31: [2023-05-10 10:10:50,013] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 31: [2023-05-10 10:10:50,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 10: [2023-05-10 10:10:50,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 10: [2023-05-10 10:10:50,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 10: [2023-05-10 10:10:50,015] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 15: [2023-05-10 10:10:50,015] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 10: [2023-05-10 10:10:50,016] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 22: [2023-05-10 10:10:50,016] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 22: [2023-05-10 10:10:50,016] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 10: [2023-05-10 10:10:50,016] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 10: [2023-05-10 10:10:50,016] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 10: [2023-05-10 10:10:50,017] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 21: [2023-05-10 10:10:50,017] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 10: [2023-05-10 10:10:50,018] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 22: [2023-05-10 10:10:50,019] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 22: [2023-05-10 10:10:50,020] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 21: [2023-05-10 10:10:50,020] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 21: [2023-05-10 10:10:50,020] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 21: [2023-05-10 10:10:50,020] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 21: [2023-05-10 10:10:50,021] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 0: [2023-05-10 10:10:50,024] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 0: [2023-05-10 10:10:50,024] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 0: [2023-05-10 10:10:50,024] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 0: [2023-05-10 10:10:50,024] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 0: [2023-05-10 10:10:50,024] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 0: [2023-05-10 10:10:50,024] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 0: [2023-05-10 10:10:50,024] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 0: [2023-05-10 10:10:50,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 0: [2023-05-10 10:10:50,029] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 0: [2023-05-10 10:10:50,029] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 0: [2023-05-10 10:10:50,029] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 0: [2023-05-10 10:10:50,031] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 0: [2023-05-10 10:10:50,031] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 23: [2023-05-10 10:10:50,031] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 0: [2023-05-10 10:10:50,031] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 23: [2023-05-10 10:10:50,031] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 23: [2023-05-10 10:10:50,031] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 23: [2023-05-10 10:10:50,032] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 23: [2023-05-10 10:10:50,032] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 23: [2023-05-10 10:10:50,032] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 23: [2023-05-10 10:10:50,032] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 0: [2023-05-10 10:10:50,032] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 23: [2023-05-10 10:10:50,032] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 0: [2023-05-10 10:10:50,033] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 23: [2023-05-10 10:10:50,036] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 23: [2023-05-10 10:10:50,036] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 23: [2023-05-10 10:10:50,037] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 15: [2023-05-10 10:10:50,038] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 23: [2023-05-10 10:10:50,039] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 23: [2023-05-10 10:10:50,040] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 23: [2023-05-10 10:10:50,041] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 23: [2023-05-10 10:10:50,041] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 23: [2023-05-10 10:10:50,041] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 15: [2023-05-10 10:10:50,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 15: [2023-05-10 10:10:50,047] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 15: [2023-05-10 10:10:50,047] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 15: [2023-05-10 10:10:50,055] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 15: [2023-05-10 10:10:50,056] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 15: [2023-05-10 10:10:50,056] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 15: [2023-05-10 10:10:50,056] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 1: [2023-05-10 10:10:50,058] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 1: [2023-05-10 10:10:50,058] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 1: [2023-05-10 10:10:50,058] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 1: [2023-05-10 10:10:50,058] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 1: [2023-05-10 10:10:50,058] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 1: [2023-05-10 10:10:50,058] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 1: [2023-05-10 10:10:50,058] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 1: [2023-05-10 10:10:50,058] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 29: [2023-05-10 10:10:50,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 29: [2023-05-10 10:10:50,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 29: [2023-05-10 10:10:50,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 29: [2023-05-10 10:10:50,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 29: [2023-05-10 10:10:50,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 29: [2023-05-10 10:10:50,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 29: [2023-05-10 10:10:50,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 29: [2023-05-10 10:10:50,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 1: [2023-05-10 10:10:50,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 1: [2023-05-10 10:10:50,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 1: [2023-05-10 10:10:50,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 1: [2023-05-10 10:10:50,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 1: [2023-05-10 10:10:50,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 29: [2023-05-10 10:10:50,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 29: [2023-05-10 10:10:50,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 1: [2023-05-10 10:10:50,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 1: [2023-05-10 10:10:50,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 1: [2023-05-10 10:10:50,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 10: [2023-05-10 10:10:50,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 10: [2023-05-10 10:10:50,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 15: [2023-05-10 10:10:50,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 29: [2023-05-10 10:10:50,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 29: [2023-05-10 10:10:50,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 29: [2023-05-10 10:10:50,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 29: [2023-05-10 10:10:50,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 29: [2023-05-10 10:10:50,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 10: [2023-05-10 10:10:50,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 29: [2023-05-10 10:10:50,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 10: [2023-05-10 10:10:50,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 15: [2023-05-10 10:10:50,080] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 10: [2023-05-10 10:10:50,083] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 10: [2023-05-10 10:10:50,083] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 10: [2023-05-10 10:10:50,083] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 10: [2023-05-10 10:10:50,083] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 15: [2023-05-10 10:10:50,083] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 0: [2023-05-10 10:10:50,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 15: [2023-05-10 10:10:50,088] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 0: [2023-05-10 10:10:50,088] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 0: [2023-05-10 10:10:50,088] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 28: [2023-05-10 10:10:50,088] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 28: [2023-05-10 10:10:50,088] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 28: [2023-05-10 10:10:50,088] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 28: [2023-05-10 10:10:50,088] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 28: [2023-05-10 10:10:50,088] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 28: [2023-05-10 10:10:50,088] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 28: [2023-05-10 10:10:50,088] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 28: [2023-05-10 10:10:50,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 15: [2023-05-10 10:10:50,090] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 15: [2023-05-10 10:10:50,090] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 23: [2023-05-10 10:10:50,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 23: [2023-05-10 10:10:50,091] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 23: [2023-05-10 10:10:50,091] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 28: [2023-05-10 10:10:50,092] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 15: [2023-05-10 10:10:50,092] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 28: [2023-05-10 10:10:50,093] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 4: [2023-05-10 10:10:50,092] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 4: [2023-05-10 10:10:50,092] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 4: [2023-05-10 10:10:50,093] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 4: [2023-05-10 10:10:50,094] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 10: [2023-05-10 10:10:50,094] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 4: [2023-05-10 10:10:50,094] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 4: [2023-05-10 10:10:50,094] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 4: [2023-05-10 10:10:50,094] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 4: [2023-05-10 10:10:50,094] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 28: [2023-05-10 10:10:50,096] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 10: [2023-05-10 10:10:50,098] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 28: [2023-05-10 10:10:50,098] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 28: [2023-05-10 10:10:50,099] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 28: [2023-05-10 10:10:50,099] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 8: [2023-05-10 10:10:50,099] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 8: [2023-05-10 10:10:50,099] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 8: [2023-05-10 10:10:50,099] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 4: [2023-05-10 10:10:50,099] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 8: [2023-05-10 10:10:50,099] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 8: [2023-05-10 10:10:50,099] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 4: [2023-05-10 10:10:50,099] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 8: [2023-05-10 10:10:50,099] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 8: [2023-05-10 10:10:50,099] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 4: [2023-05-10 10:10:50,100] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 8: [2023-05-10 10:10:50,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 4: [2023-05-10 10:10:50,100] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 10: [2023-05-10 10:10:50,100] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 4: [2023-05-10 10:10:50,100] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 4: [2023-05-10 10:10:50,100] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 4: [2023-05-10 10:10:50,100] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 10: [2023-05-10 10:10:50,101] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 28: [2023-05-10 10:10:50,100] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 28: [2023-05-10 10:10:50,100] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 23: [2023-05-10 10:10:50,101] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 4: [2023-05-10 10:10:50,101] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 17: [2023-05-10 10:10:50,101] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 17: [2023-05-10 10:10:50,102] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 17: [2023-05-10 10:10:50,102] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 17: [2023-05-10 10:10:50,102] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 17: [2023-05-10 10:10:50,102] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 17: [2023-05-10 10:10:50,102] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 17: [2023-05-10 10:10:50,102] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 17: [2023-05-10 10:10:50,102] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 0: [2023-05-10 10:10:50,104] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 0: [2023-05-10 10:10:50,104] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 0: [2023-05-10 10:10:50,104] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 0: [2023-05-10 10:10:50,104] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 0: [2023-05-10 10:10:50,104] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 20: [2023-05-10 10:10:50,105] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 20: [2023-05-10 10:10:50,105] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 20: [2023-05-10 10:10:50,105] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 20: [2023-05-10 10:10:50,105] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 20: [2023-05-10 10:10:50,105] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 20: [2023-05-10 10:10:50,105] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 20: [2023-05-10 10:10:50,105] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 20: [2023-05-10 10:10:50,106] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 8: [2023-05-10 10:10:50,105] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 8: [2023-05-10 10:10:50,105] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 8: [2023-05-10 10:10:50,106] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 8: [2023-05-10 10:10:50,106] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 8: [2023-05-10 10:10:50,106] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 17: [2023-05-10 10:10:50,106] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 8: [2023-05-10 10:10:50,106] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 8: [2023-05-10 10:10:50,106] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 8: [2023-05-10 10:10:50,106] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 23: [2023-05-10 10:10:50,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 23: [2023-05-10 10:10:50,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 23: [2023-05-10 10:10:50,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 23: [2023-05-10 10:10:50,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 1: [2023-05-10 10:10:50,109] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 20: [2023-05-10 10:10:50,109] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 0: [2023-05-10 10:10:50,110] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 0: [2023-05-10 10:10:50,111] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 10: [2023-05-10 10:10:50,111] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 13: [2023-05-10 10:10:50,111] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 13: [2023-05-10 10:10:50,112] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 13: [2023-05-10 10:10:50,112] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 13: [2023-05-10 10:10:50,112] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 13: [2023-05-10 10:10:50,112] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 13: [2023-05-10 10:10:50,112] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 13: [2023-05-10 10:10:50,112] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 13: [2023-05-10 10:10:50,112] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 23: [2023-05-10 10:10:50,113] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 10: [2023-05-10 10:10:50,113] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 17: [2023-05-10 10:10:50,113] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 17: [2023-05-10 10:10:50,114] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 23: [2023-05-10 10:10:50,115] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 0: [2023-05-10 10:10:50,115] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 23: [2023-05-10 10:10:50,115] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 10: [2023-05-10 10:10:50,115] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 10: [2023-05-10 10:10:50,115] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 29: [2023-05-10 10:10:50,116] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 29: [2023-05-10 10:10:50,116] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 20: [2023-05-10 10:10:50,117] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 20: [2023-05-10 10:10:50,118] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 20: [2023-05-10 10:10:50,118] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 20: [2023-05-10 10:10:50,118] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 17: [2023-05-10 10:10:50,114] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 20: [2023-05-10 10:10:50,119] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 20: [2023-05-10 10:10:50,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 1: [2023-05-10 10:10:50,120] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 1: [2023-05-10 10:10:50,120] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 20: [2023-05-10 10:10:50,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 17: [2023-05-10 10:10:50,114] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 17: [2023-05-10 10:10:50,115] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 17: [2023-05-10 10:10:50,115] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 17: [2023-05-10 10:10:50,116] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 13: [2023-05-10 10:10:50,124] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 13: [2023-05-10 10:10:50,125] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 13: [2023-05-10 10:10:50,126] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 13: [2023-05-10 10:10:50,126] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 13: [2023-05-10 10:10:50,126] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 13: [2023-05-10 10:10:50,127] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 13: [2023-05-10 10:10:50,127] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 13: [2023-05-10 10:10:50,128] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 1: [2023-05-10 10:10:50,129] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 23: [2023-05-10 10:10:50,129] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 1: [2023-05-10 10:10:50,129] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 0: [2023-05-10 10:10:50,132] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 23: [2023-05-10 10:10:50,133] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 1: [2023-05-10 10:10:50,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 1: [2023-05-10 10:10:50,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 1: [2023-05-10 10:10:50,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 1: [2023-05-10 10:10:50,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 0: [2023-05-10 10:10:50,134] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 23: [2023-05-10 10:10:50,135] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 0: [2023-05-10 10:10:50,135] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 23: [2023-05-10 10:10:50,135] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 23: [2023-05-10 10:10:50,135] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 0: [2023-05-10 10:10:50,136] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 0: [2023-05-10 10:10:50,136] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 29: [2023-05-10 10:10:50,137] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 29: [2023-05-10 10:10:50,138] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 29: [2023-05-10 10:10:50,138] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 29: [2023-05-10 10:10:50,143] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 29: [2023-05-10 10:10:50,143] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 29: [2023-05-10 10:10:50,143] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 29: [2023-05-10 10:10:50,143] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 29: [2023-05-10 10:10:50,143] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 1: [2023-05-10 10:10:50,145] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 28: [2023-05-10 10:10:50,145] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 28: [2023-05-10 10:10:50,145] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 1: [2023-05-10 10:10:50,148] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 28: [2023-05-10 10:10:50,151] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 1: [2023-05-10 10:10:50,154] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 4: [2023-05-10 10:10:50,156] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 4: [2023-05-10 10:10:50,156] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 4: [2023-05-10 10:10:50,156] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 4: [2023-05-10 10:10:50,156] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 18: [2023-05-10 10:10:50,156] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 18: [2023-05-10 10:10:50,156] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 18: [2023-05-10 10:10:50,156] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 18: [2023-05-10 10:10:50,156] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 18: [2023-05-10 10:10:50,156] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 18: [2023-05-10 10:10:50,156] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 18: [2023-05-10 10:10:50,156] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 18: [2023-05-10 10:10:50,156] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 20: [2023-05-10 10:10:50,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 17: [2023-05-10 10:10:50,155] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 8: [2023-05-10 10:10:50,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 4: [2023-05-10 10:10:50,160] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 4: [2023-05-10 10:10:50,160] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 4: [2023-05-10 10:10:50,160] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 4: [2023-05-10 10:10:50,160] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 8: [2023-05-10 10:10:50,160] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 8: [2023-05-10 10:10:50,161] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 1: [2023-05-10 10:10:50,162] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 18: [2023-05-10 10:10:50,161] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 18: [2023-05-10 10:10:50,161] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 18: [2023-05-10 10:10:50,162] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 18: [2023-05-10 10:10:50,162] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 29: [2023-05-10 10:10:50,162] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 18: [2023-05-10 10:10:50,163] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 1: [2023-05-10 10:10:50,162] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 1: [2023-05-10 10:10:50,162] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 1: [2023-05-10 10:10:50,164] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 18: [2023-05-10 10:10:50,164] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 18: [2023-05-10 10:10:50,164] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 18: [2023-05-10 10:10:50,165] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 28: [2023-05-10 10:10:50,167] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 28: [2023-05-10 10:10:50,167] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 24: [2023-05-10 10:10:50,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 24: [2023-05-10 10:10:50,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 24: [2023-05-10 10:10:50,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 24: [2023-05-10 10:10:50,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 24: [2023-05-10 10:10:50,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 24: [2023-05-10 10:10:50,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 24: [2023-05-10 10:10:50,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 24: [2023-05-10 10:10:50,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 8: [2023-05-10 10:10:50,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 8: [2023-05-10 10:10:50,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 8: [2023-05-10 10:10:50,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 8: [2023-05-10 10:10:50,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 8: [2023-05-10 10:10:50,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 28: [2023-05-10 10:10:50,173] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 28: [2023-05-10 10:10:50,174] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 28: [2023-05-10 10:10:50,175] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 24: [2023-05-10 10:10:50,175] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 24: [2023-05-10 10:10:50,175] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 28: [2023-05-10 10:10:50,175] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 29: [2023-05-10 10:10:50,176] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 29: [2023-05-10 10:10:50,176] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 17: [2023-05-10 10:10:50,176] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 28: [2023-05-10 10:10:50,176] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 28: [2023-05-10 10:10:50,176] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 24: [2023-05-10 10:10:50,179] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 20: [2023-05-10 10:10:50,179] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 24: [2023-05-10 10:10:50,180] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 24: [2023-05-10 10:10:50,180] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 29: [2023-05-10 10:10:50,180] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 29: [2023-05-10 10:10:50,180] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 24: [2023-05-10 10:10:50,180] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 29: [2023-05-10 10:10:50,180] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 24: [2023-05-10 10:10:50,180] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 24: [2023-05-10 10:10:50,180] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 5: [2023-05-10 10:10:50,183] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 5: [2023-05-10 10:10:50,183] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 5: [2023-05-10 10:10:50,184] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 5: [2023-05-10 10:10:50,184] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 30: [2023-05-10 10:10:50,184] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 30: [2023-05-10 10:10:50,184] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 5: [2023-05-10 10:10:50,184] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 5: [2023-05-10 10:10:50,184] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 5: [2023-05-10 10:10:50,184] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 5: [2023-05-10 10:10:50,184] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 30: [2023-05-10 10:10:50,184] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 30: [2023-05-10 10:10:50,184] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 30: [2023-05-10 10:10:50,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 30: [2023-05-10 10:10:50,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 30: [2023-05-10 10:10:50,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 30: [2023-05-10 10:10:50,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 17: [2023-05-10 10:10:50,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 17: [2023-05-10 10:10:50,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 8: [2023-05-10 10:10:50,186] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 4: [2023-05-10 10:10:50,186] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 4: [2023-05-10 10:10:50,187] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 4: [2023-05-10 10:10:50,188] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 4: [2023-05-10 10:10:50,188] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 8: [2023-05-10 10:10:50,188] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 8: [2023-05-10 10:10:50,189] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 20: [2023-05-10 10:10:50,189] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 20: [2023-05-10 10:10:50,189] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 5: [2023-05-10 10:10:50,189] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 5: [2023-05-10 10:10:50,189] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 5: [2023-05-10 10:10:50,190] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 5: [2023-05-10 10:10:50,190] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 5: [2023-05-10 10:10:50,191] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 5: [2023-05-10 10:10:50,191] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 2: [2023-05-10 10:10:50,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 2: [2023-05-10 10:10:50,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 17: [2023-05-10 10:10:50,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 5: [2023-05-10 10:10:50,192] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 5: [2023-05-10 10:10:50,193] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 17: [2023-05-10 10:10:50,193] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 17: [2023-05-10 10:10:50,193] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 4: [2023-05-10 10:10:50,193] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 17: [2023-05-10 10:10:50,193] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 2: [2023-05-10 10:10:50,194] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 2: [2023-05-10 10:10:50,194] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 2: [2023-05-10 10:10:50,194] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 2: [2023-05-10 10:10:50,194] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 2: [2023-05-10 10:10:50,194] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 4: [2023-05-10 10:10:50,194] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 30: [2023-05-10 10:10:50,194] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 2: [2023-05-10 10:10:50,194] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 4: [2023-05-10 10:10:50,195] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 30: [2023-05-10 10:10:50,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 2: [2023-05-10 10:10:50,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 17: [2023-05-10 10:10:50,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 20: [2023-05-10 10:10:50,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 20: [2023-05-10 10:10:50,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 20: [2023-05-10 10:10:50,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 20: [2023-05-10 10:10:50,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 20: [2023-05-10 10:10:50,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 4: [2023-05-10 10:10:50,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 2: [2023-05-10 10:10:50,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 30: [2023-05-10 10:10:50,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 30: [2023-05-10 10:10:50,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 30: [2023-05-10 10:10:50,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 8: [2023-05-10 10:10:50,198] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 30: [2023-05-10 10:10:50,198] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 30: [2023-05-10 10:10:50,198] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 30: [2023-05-10 10:10:50,200] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 25: [2023-05-10 10:10:50,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 25: [2023-05-10 10:10:50,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 8: [2023-05-10 10:10:50,200] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 25: [2023-05-10 10:10:50,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 25: [2023-05-10 10:10:50,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 25: [2023-05-10 10:10:50,201] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 25: [2023-05-10 10:10:50,201] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 25: [2023-05-10 10:10:50,201] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 25: [2023-05-10 10:10:50,201] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 13: [2023-05-10 10:10:50,202] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 13: [2023-05-10 10:10:50,202] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 13: [2023-05-10 10:10:50,202] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 28: [2023-05-10 10:10:50,203] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 8: [2023-05-10 10:10:50,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 8: [2023-05-10 10:10:50,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 8: [2023-05-10 10:10:50,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 2: [2023-05-10 10:10:50,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 2: [2023-05-10 10:10:50,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 28: [2023-05-10 10:10:50,205] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 28: [2023-05-10 10:10:50,205] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 28: [2023-05-10 10:10:50,205] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 31: [2023-05-10 10:10:50,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 2: [2023-05-10 10:10:50,206] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 2: [2023-05-10 10:10:50,206] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 2: [2023-05-10 10:10:50,206] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 31: [2023-05-10 10:10:50,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 28: [2023-05-10 10:10:50,207] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 31: [2023-05-10 10:10:50,207] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 25: [2023-05-10 10:10:50,206] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 25: [2023-05-10 10:10:50,207] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 25: [2023-05-10 10:10:50,207] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 31: [2023-05-10 10:10:50,207] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 31: [2023-05-10 10:10:50,207] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 31: [2023-05-10 10:10:50,207] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 31: [2023-05-10 10:10:50,207] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 31: [2023-05-10 10:10:50,207] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 25: [2023-05-10 10:10:50,207] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 2: [2023-05-10 10:10:50,207] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 25: [2023-05-10 10:10:50,207] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 25: [2023-05-10 10:10:50,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 25: [2023-05-10 10:10:50,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 25: [2023-05-10 10:10:50,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 31: [2023-05-10 10:10:50,210] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 31: [2023-05-10 10:10:50,211] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 27: [2023-05-10 10:10:50,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 27: [2023-05-10 10:10:50,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 27: [2023-05-10 10:10:50,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 21: [2023-05-10 10:10:50,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 27: [2023-05-10 10:10:50,213] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 27: [2023-05-10 10:10:50,213] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 27: [2023-05-10 10:10:50,213] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 27: [2023-05-10 10:10:50,213] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 21: [2023-05-10 10:10:50,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 21: [2023-05-10 10:10:50,213] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 21: [2023-05-10 10:10:50,213] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 21: [2023-05-10 10:10:50,213] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 21: [2023-05-10 10:10:50,213] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 27: [2023-05-10 10:10:50,213] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 21: [2023-05-10 10:10:50,213] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 21: [2023-05-10 10:10:50,213] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 17: [2023-05-10 10:10:50,213] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 16: [2023-05-10 10:10:50,215] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 16: [2023-05-10 10:10:50,215] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 16: [2023-05-10 10:10:50,215] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 16: [2023-05-10 10:10:50,215] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 16: [2023-05-10 10:10:50,215] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 16: [2023-05-10 10:10:50,215] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 16: [2023-05-10 10:10:50,215] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 31: [2023-05-10 10:10:50,215] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 16: [2023-05-10 10:10:50,216] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 31: [2023-05-10 10:10:50,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 31: [2023-05-10 10:10:50,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 13: [2023-05-10 10:10:50,216] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 13: [2023-05-10 10:10:50,216] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 13: [2023-05-10 10:10:50,216] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 13: [2023-05-10 10:10:50,216] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 13: [2023-05-10 10:10:50,216] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 31: [2023-05-10 10:10:50,217] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 31: [2023-05-10 10:10:50,217] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 31: [2023-05-10 10:10:50,217] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 17: [2023-05-10 10:10:50,217] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 27: [2023-05-10 10:10:50,218] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 27: [2023-05-10 10:10:50,218] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 22: [2023-05-10 10:10:50,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 22: [2023-05-10 10:10:50,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 20: [2023-05-10 10:10:50,219] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 16: [2023-05-10 10:10:50,219] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 20: [2023-05-10 10:10:50,219] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 22: [2023-05-10 10:10:50,219] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 22: [2023-05-10 10:10:50,219] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 18: [2023-05-10 10:10:50,219] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 18: [2023-05-10 10:10:50,219] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 18: [2023-05-10 10:10:50,219] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 18: [2023-05-10 10:10:50,219] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 21: [2023-05-10 10:10:50,219] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 21: [2023-05-10 10:10:50,219] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 21: [2023-05-10 10:10:50,219] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 22: [2023-05-10 10:10:50,219] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 22: [2023-05-10 10:10:50,219] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 22: [2023-05-10 10:10:50,219] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 21: [2023-05-10 10:10:50,220] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 21: [2023-05-10 10:10:50,220] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 22: [2023-05-10 10:10:50,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 12: [2023-05-10 10:10:50,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 12: [2023-05-10 10:10:50,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 12: [2023-05-10 10:10:50,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 12: [2023-05-10 10:10:50,221] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 12: [2023-05-10 10:10:50,221] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 12: [2023-05-10 10:10:50,221] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 12: [2023-05-10 10:10:50,221] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 21: [2023-05-10 10:10:50,221] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 12: [2023-05-10 10:10:50,221] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 16: [2023-05-10 10:10:50,221] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 21: [2023-05-10 10:10:50,221] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 16: [2023-05-10 10:10:50,221] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 21: [2023-05-10 10:10:50,221] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 27: [2023-05-10 10:10:50,221] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 27: [2023-05-10 10:10:50,222] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 27: [2023-05-10 10:10:50,222] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 27: [2023-05-10 10:10:50,222] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 27: [2023-05-10 10:10:50,223] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 27: [2023-05-10 10:10:50,223] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 16: [2023-05-10 10:10:50,223] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 16: [2023-05-10 10:10:50,224] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 16: [2023-05-10 10:10:50,224] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 18: [2023-05-10 10:10:50,224] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 18: [2023-05-10 10:10:50,224] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 18: [2023-05-10 10:10:50,224] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 18: [2023-05-10 10:10:50,224] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 16: [2023-05-10 10:10:50,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 22: [2023-05-10 10:10:50,224] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 22: [2023-05-10 10:10:50,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 12: [2023-05-10 10:10:50,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 17: [2023-05-10 10:10:50,226] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 16: [2023-05-10 10:10:50,226] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 24: [2023-05-10 10:10:50,226] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 24: [2023-05-10 10:10:50,226] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 17: [2023-05-10 10:10:50,227] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 17: [2023-05-10 10:10:50,228] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 13: [2023-05-10 10:10:50,228] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 17: [2023-05-10 10:10:50,228] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 17: [2023-05-10 10:10:50,228] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 13: [2023-05-10 10:10:50,229] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 13: [2023-05-10 10:10:50,230] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 22: [2023-05-10 10:10:50,230] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 22: [2023-05-10 10:10:50,231] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 22: [2023-05-10 10:10:50,231] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 22: [2023-05-10 10:10:50,231] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 20: [2023-05-10 10:10:50,231] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 12: [2023-05-10 10:10:50,231] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 12: [2023-05-10 10:10:50,232] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 22: [2023-05-10 10:10:50,232] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 12: [2023-05-10 10:10:50,232] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 22: [2023-05-10 10:10:50,232] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 12: [2023-05-10 10:10:50,233] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 12: [2023-05-10 10:10:50,233] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 12: [2023-05-10 10:10:50,234] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 12: [2023-05-10 10:10:50,234] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 20: [2023-05-10 10:10:50,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 20: [2023-05-10 10:10:50,238] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 20: [2023-05-10 10:10:50,238] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 20: [2023-05-10 10:10:50,238] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 5: [2023-05-10 10:10:50,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 18: [2023-05-10 10:10:50,244] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 5: [2023-05-10 10:10:50,244] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 5: [2023-05-10 10:10:50,244] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 13: [2023-05-10 10:10:50,248] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 2: [2023-05-10 10:10:50,248] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 2: [2023-05-10 10:10:50,248] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 18: [2023-05-10 10:10:50,248] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 18: [2023-05-10 10:10:50,248] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 18: [2023-05-10 10:10:50,248] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 24: [2023-05-10 10:10:50,250] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 26: [2023-05-10 10:10:50,250] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 26: [2023-05-10 10:10:50,250] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 26: [2023-05-10 10:10:50,250] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 26: [2023-05-10 10:10:50,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 26: [2023-05-10 10:10:50,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 26: [2023-05-10 10:10:50,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 26: [2023-05-10 10:10:50,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 24: [2023-05-10 10:10:50,251] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 26: [2023-05-10 10:10:50,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 24: [2023-05-10 10:10:50,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 24: [2023-05-10 10:10:50,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 24: [2023-05-10 10:10:50,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 24: [2023-05-10 10:10:50,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 24: [2023-05-10 10:10:50,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 24: [2023-05-10 10:10:50,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 18: [2023-05-10 10:10:50,253] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 13: [2023-05-10 10:10:50,254] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 5: [2023-05-10 10:10:50,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 5: [2023-05-10 10:10:50,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 5: [2023-05-10 10:10:50,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 26: [2023-05-10 10:10:50,255] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 26: [2023-05-10 10:10:50,255] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 18: [2023-05-10 10:10:50,255] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 13: [2023-05-10 10:10:50,256] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 5: [2023-05-10 10:10:50,256] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 5: [2023-05-10 10:10:50,256] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 9: [2023-05-10 10:10:50,256] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 9: [2023-05-10 10:10:50,257] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 9: [2023-05-10 10:10:50,257] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 9: [2023-05-10 10:10:50,257] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 9: [2023-05-10 10:10:50,257] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 9: [2023-05-10 10:10:50,257] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 9: [2023-05-10 10:10:50,257] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 9: [2023-05-10 10:10:50,257] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 26: [2023-05-10 10:10:50,257] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 18: [2023-05-10 10:10:50,257] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 18: [2023-05-10 10:10:50,257] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 13: [2023-05-10 10:10:50,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 13: [2023-05-10 10:10:50,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 26: [2023-05-10 10:10:50,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 26: [2023-05-10 10:10:50,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 31: [2023-05-10 10:10:50,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 26: [2023-05-10 10:10:50,262] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 26: [2023-05-10 10:10:50,262] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 26: [2023-05-10 10:10:50,262] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 25: [2023-05-10 10:10:50,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 31: [2023-05-10 10:10:50,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 9: [2023-05-10 10:10:50,263] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 9: [2023-05-10 10:10:50,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 9: [2023-05-10 10:10:50,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 9: [2023-05-10 10:10:50,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 9: [2023-05-10 10:10:50,265] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 9: [2023-05-10 10:10:50,265] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 9: [2023-05-10 10:10:50,265] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 9: [2023-05-10 10:10:50,265] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 25: [2023-05-10 10:10:50,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 25: [2023-05-10 10:10:50,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 25: [2023-05-10 10:10:50,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 5: [2023-05-10 10:10:50,266] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 16: [2023-05-10 10:10:50,267] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 25: [2023-05-10 10:10:50,268] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 25: [2023-05-10 10:10:50,268] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 25: [2023-05-10 10:10:50,268] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 25: [2023-05-10 10:10:50,268] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 15: [2023-05-10 10:10:50,268] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 15: [2023-05-10 10:10:50,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 15: [2023-05-10 10:10:50,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 15: [2023-05-10 10:10:50,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 15: [2023-05-10 10:10:50,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 15: [2023-05-10 10:10:50,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 15: [2023-05-10 10:10:50,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 15: [2023-05-10 10:10:50,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 27: [2023-05-10 10:10:50,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 2: [2023-05-10 10:10:50,271] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 5: [2023-05-10 10:10:50,271] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 5: [2023-05-10 10:10:50,271] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 2: [2023-05-10 10:10:50,271] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 30: [2023-05-10 10:10:50,271] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 27: [2023-05-10 10:10:50,271] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 30: [2023-05-10 10:10:50,271] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 21: [2023-05-10 10:10:50,272] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 21: [2023-05-10 10:10:50,272] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 15: [2023-05-10 10:10:50,273] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 30: [2023-05-10 10:10:50,274] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 30: [2023-05-10 10:10:50,274] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 10: [2023-05-10 10:10:50,274] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 16: [2023-05-10 10:10:50,274] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 10: [2023-05-10 10:10:50,274] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 31: [2023-05-10 10:10:50,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 2: [2023-05-10 10:10:50,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 2: [2023-05-10 10:10:50,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 2: [2023-05-10 10:10:50,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 10: [2023-05-10 10:10:50,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 10: [2023-05-10 10:10:50,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 10: [2023-05-10 10:10:50,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 10: [2023-05-10 10:10:50,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 10: [2023-05-10 10:10:50,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 10: [2023-05-10 10:10:50,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 21: [2023-05-10 10:10:50,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 12: [2023-05-10 10:10:50,276] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 2: [2023-05-10 10:10:50,277] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 2: [2023-05-10 10:10:50,277] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 22: [2023-05-10 10:10:50,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 22: [2023-05-10 10:10:50,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 24: [2023-05-10 10:10:50,280] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 15: [2023-05-10 10:10:50,280] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 15: [2023-05-10 10:10:50,280] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 5: [2023-05-10 10:10:50,280] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 15: [2023-05-10 10:10:50,280] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 15: [2023-05-10 10:10:50,281] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 15: [2023-05-10 10:10:50,281] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 31: [2023-05-10 10:10:50,281] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 10: [2023-05-10 10:10:50,281] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 10: [2023-05-10 10:10:50,281] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 10: [2023-05-10 10:10:50,281] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 31: [2023-05-10 10:10:50,282] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 10: [2023-05-10 10:10:50,282] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 2: [2023-05-10 10:10:50,281] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 15: [2023-05-10 10:10:50,282] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 5: [2023-05-10 10:10:50,282] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 15: [2023-05-10 10:10:50,282] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 10: [2023-05-10 10:10:50,283] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 10: [2023-05-10 10:10:50,283] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 21: [2023-05-10 10:10:50,283] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 10: [2023-05-10 10:10:50,283] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 11: [2023-05-10 10:10:50,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 11: [2023-05-10 10:10:50,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 5: [2023-05-10 10:10:50,284] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 5: [2023-05-10 10:10:50,284] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 11: [2023-05-10 10:10:50,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 11: [2023-05-10 10:10:50,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 11: [2023-05-10 10:10:50,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 11: [2023-05-10 10:10:50,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 11: [2023-05-10 10:10:50,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 11: [2023-05-10 10:10:50,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 24: [2023-05-10 10:10:50,284] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 10: [2023-05-10 10:10:50,285] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 5: [2023-05-10 10:10:50,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 21: [2023-05-10 10:10:50,286] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 21: [2023-05-10 10:10:50,286] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 21: [2023-05-10 10:10:50,286] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 21: [2023-05-10 10:10:50,286] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 24: [2023-05-10 10:10:50,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 16: [2023-05-10 10:10:50,286] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 24: [2023-05-10 10:10:50,287] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 27: [2023-05-10 10:10:50,288] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 16: [2023-05-10 10:10:50,288] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 24: [2023-05-10 10:10:50,288] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 11: [2023-05-10 10:10:50,288] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 24: [2023-05-10 10:10:50,288] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 11: [2023-05-10 10:10:50,289] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 25: [2023-05-10 10:10:50,289] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 0: [2023-05-10 10:10:50,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 0: [2023-05-10 10:10:50,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 0: [2023-05-10 10:10:50,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 0: [2023-05-10 10:10:50,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 0: [2023-05-10 10:10:50,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 0: [2023-05-10 10:10:50,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 0: [2023-05-10 10:10:50,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 0: [2023-05-10 10:10:50,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 25: [2023-05-10 10:10:50,291] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 11: [2023-05-10 10:10:50,292] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 31: [2023-05-10 10:10:50,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 31: [2023-05-10 10:10:50,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 31: [2023-05-10 10:10:50,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 31: [2023-05-10 10:10:50,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 31: [2023-05-10 10:10:50,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 27: [2023-05-10 10:10:50,292] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 27: [2023-05-10 10:10:50,293] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 27: [2023-05-10 10:10:50,293] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 27: [2023-05-10 10:10:50,293] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 27: [2023-05-10 10:10:50,293] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 27: [2023-05-10 10:10:50,293] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 30: [2023-05-10 10:10:50,293] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 11: [2023-05-10 10:10:50,294] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 21: [2023-05-10 10:10:50,294] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 19: [2023-05-10 10:10:50,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 19: [2023-05-10 10:10:50,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 0: [2023-05-10 10:10:50,295] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 6: [2023-05-10 10:10:50,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 6: [2023-05-10 10:10:50,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 25: [2023-05-10 10:10:50,295] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 25: [2023-05-10 10:10:50,295] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 0: [2023-05-10 10:10:50,295] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 0: [2023-05-10 10:10:50,295] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 16: [2023-05-10 10:10:50,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 11: [2023-05-10 10:10:50,295] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 11: [2023-05-10 10:10:50,295] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 11: [2023-05-10 10:10:50,295] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 25: [2023-05-10 10:10:50,296] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 11: [2023-05-10 10:10:50,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 0: [2023-05-10 10:10:50,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 30: [2023-05-10 10:10:50,297] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 0: [2023-05-10 10:10:50,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 30: [2023-05-10 10:10:50,297] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 30: [2023-05-10 10:10:50,297] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 31: [2023-05-10 10:10:50,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 30: [2023-05-10 10:10:50,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 0: [2023-05-10 10:10:50,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 19: [2023-05-10 10:10:50,297] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 19: [2023-05-10 10:10:50,297] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 19: [2023-05-10 10:10:50,297] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 19: [2023-05-10 10:10:50,297] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 19: [2023-05-10 10:10:50,297] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 19: [2023-05-10 10:10:50,297] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 0: [2023-05-10 10:10:50,298] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 30: [2023-05-10 10:10:50,298] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 0: [2023-05-10 10:10:50,298] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 16: [2023-05-10 10:10:50,299] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 21: [2023-05-10 10:10:50,298] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 25: [2023-05-10 10:10:50,299] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 25: [2023-05-10 10:10:50,299] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 22: [2023-05-10 10:10:50,299] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 22: [2023-05-10 10:10:50,299] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 25: [2023-05-10 10:10:50,299] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 6: [2023-05-10 10:10:50,299] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 6: [2023-05-10 10:10:50,299] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 6: [2023-05-10 10:10:50,299] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 6: [2023-05-10 10:10:50,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 6: [2023-05-10 10:10:50,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 6: [2023-05-10 10:10:50,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 6: [2023-05-10 10:10:50,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 6: [2023-05-10 10:10:50,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 19: [2023-05-10 10:10:50,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 19: [2023-05-10 10:10:50,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 12: [2023-05-10 10:10:50,301] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 19: [2023-05-10 10:10:50,301] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 16: [2023-05-10 10:10:50,301] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 16: [2023-05-10 10:10:50,301] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 16: [2023-05-10 10:10:50,301] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 16: [2023-05-10 10:10:50,301] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 22: [2023-05-10 10:10:50,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 22: [2023-05-10 10:10:50,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 22: [2023-05-10 10:10:50,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 22: [2023-05-10 10:10:50,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 22: [2023-05-10 10:10:50,304] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 2: [2023-05-10 10:10:50,304] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 12: [2023-05-10 10:10:50,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 22: [2023-05-10 10:10:50,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 30: [2023-05-10 10:10:50,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 21: [2023-05-10 10:10:50,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 12: [2023-05-10 10:10:50,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 19: [2023-05-10 10:10:50,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 19: [2023-05-10 10:10:50,306] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 19: [2023-05-10 10:10:50,306] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 19: [2023-05-10 10:10:50,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 19: [2023-05-10 10:10:50,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 2: [2023-05-10 10:10:50,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 14: [2023-05-10 10:10:50,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 14: [2023-05-10 10:10:50,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 26: [2023-05-10 10:10:50,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 26: [2023-05-10 10:10:50,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 3: [2023-05-10 10:10:50,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 3: [2023-05-10 10:10:50,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 14: [2023-05-10 10:10:50,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 14: [2023-05-10 10:10:50,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 14: [2023-05-10 10:10:50,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 14: [2023-05-10 10:10:50,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 2: [2023-05-10 10:10:50,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 14: [2023-05-10 10:10:50,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 14: [2023-05-10 10:10:50,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 3: [2023-05-10 10:10:50,309] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 3: [2023-05-10 10:10:50,309] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 30: [2023-05-10 10:10:50,309] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 3: [2023-05-10 10:10:50,309] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 3: [2023-05-10 10:10:50,309] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 3: [2023-05-10 10:10:50,309] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 3: [2023-05-10 10:10:50,310] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 16: [2023-05-10 10:10:50,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 6: [2023-05-10 10:10:50,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 6: [2023-05-10 10:10:50,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 6: [2023-05-10 10:10:50,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 6: [2023-05-10 10:10:50,311] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 6: [2023-05-10 10:10:50,311] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 6: [2023-05-10 10:10:50,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 2: [2023-05-10 10:10:50,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 2: [2023-05-10 10:10:50,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 3: [2023-05-10 10:10:50,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 3: [2023-05-10 10:10:50,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 2: [2023-05-10 10:10:50,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 3: [2023-05-10 10:10:50,314] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 21: [2023-05-10 10:10:50,314] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 26: [2023-05-10 10:10:50,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 14: [2023-05-10 10:10:50,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 14: [2023-05-10 10:10:50,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 14: [2023-05-10 10:10:50,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 14: [2023-05-10 10:10:50,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 14: [2023-05-10 10:10:50,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 12: [2023-05-10 10:10:50,316] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 12: [2023-05-10 10:10:50,316] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 12: [2023-05-10 10:10:50,316] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 12: [2023-05-10 10:10:50,316] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 12: [2023-05-10 10:10:50,316] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 14: [2023-05-10 10:10:50,316] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 14: [2023-05-10 10:10:50,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 21: [2023-05-10 10:10:50,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 21: [2023-05-10 10:10:50,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 3: [2023-05-10 10:10:50,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 3: [2023-05-10 10:10:50,318] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 21: [2023-05-10 10:10:50,318] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 14: [2023-05-10 10:10:50,318] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 3: [2023-05-10 10:10:50,318] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 9: [2023-05-10 10:10:50,318] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 9: [2023-05-10 10:10:50,318] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 21: [2023-05-10 10:10:50,319] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 3: [2023-05-10 10:10:50,319] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 3: [2023-05-10 10:10:50,319] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 9: [2023-05-10 10:10:50,320] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 9: [2023-05-10 10:10:50,320] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 16: [2023-05-10 10:10:50,320] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 31: [2023-05-10 10:10:50,321] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 9: [2023-05-10 10:10:50,322] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 15: [2023-05-10 10:10:50,322] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 31: [2023-05-10 10:10:50,324] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 9: [2023-05-10 10:10:50,324] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 9: [2023-05-10 10:10:50,324] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 9: [2023-05-10 10:10:50,325] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 31: [2023-05-10 10:10:50,326] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 30: [2023-05-10 10:10:50,326] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 31: [2023-05-10 10:10:50,326] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 31: [2023-05-10 10:10:50,326] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 22: [2023-05-10 10:10:50,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 22: [2023-05-10 10:10:50,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 12: [2023-05-10 10:10:50,328] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 30: [2023-05-10 10:10:50,329] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 12: [2023-05-10 10:10:50,329] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 30: [2023-05-10 10:10:50,330] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 30: [2023-05-10 10:10:50,330] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 16: [2023-05-10 10:10:50,331] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 26: [2023-05-10 10:10:50,331] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 26: [2023-05-10 10:10:50,333] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 26: [2023-05-10 10:10:50,333] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 22: [2023-05-10 10:10:50,333] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 22: [2023-05-10 10:10:50,333] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 16: [2023-05-10 10:10:50,333] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 16: [2023-05-10 10:10:50,334] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 16: [2023-05-10 10:10:50,334] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 27: [2023-05-10 10:10:50,295] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 27: [2023-05-10 10:10:50,314] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 27: [2023-05-10 10:10:50,324] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 27: [2023-05-10 10:10:50,326] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 27: [2023-05-10 10:10:50,328] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 27: [2023-05-10 10:10:50,329] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 27: [2023-05-10 10:10:50,329] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 26: [2023-05-10 10:10:50,335] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 26: [2023-05-10 10:10:50,335] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 26: [2023-05-10 10:10:50,335] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 26: [2023-05-10 10:10:50,335] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 26: [2023-05-10 10:10:50,337] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 10: [2023-05-10 10:10:50,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 10: [2023-05-10 10:10:50,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 22: [2023-05-10 10:10:50,338] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 22: [2023-05-10 10:10:50,338] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 10: [2023-05-10 10:10:50,339] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 10: [2023-05-10 10:10:50,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 10: [2023-05-10 10:10:50,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 9: [2023-05-10 10:10:50,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 15: [2023-05-10 10:10:50,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 9: [2023-05-10 10:10:50,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 11: [2023-05-10 10:10:50,344] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 11: [2023-05-10 10:10:50,344] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 10: [2023-05-10 10:10:50,345] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 9: [2023-05-10 10:10:50,347] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 12: [2023-05-10 10:10:50,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 12: [2023-05-10 10:10:50,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 6: [2023-05-10 10:10:50,349] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 6: [2023-05-10 10:10:50,349] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 9: [2023-05-10 10:10:50,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 15: [2023-05-10 10:10:50,350] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 11: [2023-05-10 10:10:50,350] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 15: [2023-05-10 10:10:50,350] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 12: [2023-05-10 10:10:50,351] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 9: [2023-05-10 10:10:50,351] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 12: [2023-05-10 10:10:50,352] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 12: [2023-05-10 10:10:50,353] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 0: [2023-05-10 10:10:50,354] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 0: [2023-05-10 10:10:50,354] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 0: [2023-05-10 10:10:50,354] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 19: [2023-05-10 10:10:50,355] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 19: [2023-05-10 10:10:50,355] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 19: [2023-05-10 10:10:50,355] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 9: [2023-05-10 10:10:50,357] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 9: [2023-05-10 10:10:50,357] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 9: [2023-05-10 10:10:50,357] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 15: [2023-05-10 10:10:50,358] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 15: [2023-05-10 10:10:50,358] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 15: [2023-05-10 10:10:50,358] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 15: [2023-05-10 10:10:50,358] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 10: [2023-05-10 10:10:50,358] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 10: [2023-05-10 10:10:50,358] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 15: [2023-05-10 10:10:50,359] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 26: [2023-05-10 10:10:50,360] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 26: [2023-05-10 10:10:50,362] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 7: [2023-05-10 10:10:50,363] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 7: [2023-05-10 10:10:50,363] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 7: [2023-05-10 10:10:50,363] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 7: [2023-05-10 10:10:50,363] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 7: [2023-05-10 10:10:50,363] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 10: [2023-05-10 10:10:50,363] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 7: [2023-05-10 10:10:50,363] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 7: [2023-05-10 10:10:50,363] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 7: [2023-05-10 10:10:50,363] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 26: [2023-05-10 10:10:50,365] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 0: [2023-05-10 10:10:50,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 0: [2023-05-10 10:10:50,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 0: [2023-05-10 10:10:50,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 0: [2023-05-10 10:10:50,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 0: [2023-05-10 10:10:50,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 10: [2023-05-10 10:10:50,366] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 10: [2023-05-10 10:10:50,366] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 10: [2023-05-10 10:10:50,366] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 19: [2023-05-10 10:10:50,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 19: [2023-05-10 10:10:50,367] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 3: [2023-05-10 10:10:50,367] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 3: [2023-05-10 10:10:50,367] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 3: [2023-05-10 10:10:50,367] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 26: [2023-05-10 10:10:50,368] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 7: [2023-05-10 10:10:50,368] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 7: [2023-05-10 10:10:50,368] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 7: [2023-05-10 10:10:50,369] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 26: [2023-05-10 10:10:50,369] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 7: [2023-05-10 10:10:50,369] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 6: [2023-05-10 10:10:50,369] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 19: [2023-05-10 10:10:50,370] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 19: [2023-05-10 10:10:50,370] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 6: [2023-05-10 10:10:50,370] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 19: [2023-05-10 10:10:50,370] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 11: [2023-05-10 10:10:50,370] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 11: [2023-05-10 10:10:50,370] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 11: [2023-05-10 10:10:50,370] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 11: [2023-05-10 10:10:50,370] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 11: [2023-05-10 10:10:50,370] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 7: [2023-05-10 10:10:50,370] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 10: [2023-05-10 10:10:50,371] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 14: [2023-05-10 10:10:50,370] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 14: [2023-05-10 10:10:50,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 14: [2023-05-10 10:10:50,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 7: [2023-05-10 10:10:50,371] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 7: [2023-05-10 10:10:50,372] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 11: [2023-05-10 10:10:50,372] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 7: [2023-05-10 10:10:50,372] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt... 11: [2023-05-10 10:10:50,374] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 11: [2023-05-10 10:10:50,374] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 20: [2023-05-10 10:10:50,375] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 6: [2023-05-10 10:10:50,376] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 6: [2023-05-10 10:10:50,376] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 20: [2023-05-10 10:10:50,376] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 10: [2023-05-10 10:10:50,376] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 0: [2023-05-10 10:10:50,376] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 0: [2023-05-10 10:10:50,378] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 14: [2023-05-10 10:10:50,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 20: [2023-05-10 10:10:50,378] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 0: [2023-05-10 10:10:50,378] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 15: [2023-05-10 10:10:50,378] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 20: [2023-05-10 10:10:50,380] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 20: [2023-05-10 10:10:50,380] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 20: [2023-05-10 10:10:50,380] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 20: [2023-05-10 10:10:50,380] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 20: [2023-05-10 10:10:50,380] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 20: [2023-05-10 10:10:50,380] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 14: [2023-05-10 10:10:50,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 14: [2023-05-10 10:10:50,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 14: [2023-05-10 10:10:50,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 14: [2023-05-10 10:10:50,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 19: [2023-05-10 10:10:50,382] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 19: [2023-05-10 10:10:50,382] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 3: [2023-05-10 10:10:50,382] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 3: [2023-05-10 10:10:50,382] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 3: [2023-05-10 10:10:50,382] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 3: [2023-05-10 10:10:50,382] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 3: [2023-05-10 10:10:50,382] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 19: [2023-05-10 10:10:50,383] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 6: [2023-05-10 10:10:50,383] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 6: [2023-05-10 10:10:50,383] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 6: [2023-05-10 10:10:50,383] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 6: [2023-05-10 10:10:50,383] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 15: [2023-05-10 10:10:50,385] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 20: [2023-05-10 10:10:50,385] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 10: [2023-05-10 10:10:50,387] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 10: [2023-05-10 10:10:50,387] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 3: [2023-05-10 10:10:50,390] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 3: [2023-05-10 10:10:50,391] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 3: [2023-05-10 10:10:50,391] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 15: [2023-05-10 10:10:50,391] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 20: [2023-05-10 10:10:50,391] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 15: [2023-05-10 10:10:50,391] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 20: [2023-05-10 10:10:50,391] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 20: [2023-05-10 10:10:50,392] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 20: [2023-05-10 10:10:50,392] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 4: [2023-05-10 10:10:50,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 4: [2023-05-10 10:10:50,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 20: [2023-05-10 10:10:50,393] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 4: [2023-05-10 10:10:50,393] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 4: [2023-05-10 10:10:50,393] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 4: [2023-05-10 10:10:50,393] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 4: [2023-05-10 10:10:50,393] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 4: [2023-05-10 10:10:50,393] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 4: [2023-05-10 10:10:50,393] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 20: [2023-05-10 10:10:50,393] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 15: [2023-05-10 10:10:50,395] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 0: [2023-05-10 10:10:50,395] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 0: [2023-05-10 10:10:50,395] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 19: [2023-05-10 10:10:50,396] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 0: [2023-05-10 10:10:50,396] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 15: [2023-05-10 10:10:50,397] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 0: [2023-05-10 10:10:50,397] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 15: [2023-05-10 10:10:50,397] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 0: [2023-05-10 10:10:50,397] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 4: [2023-05-10 10:10:50,398] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 4: [2023-05-10 10:10:50,398] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 4: [2023-05-10 10:10:50,398] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 4: [2023-05-10 10:10:50,398] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 17: [2023-05-10 10:10:50,399] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 14: [2023-05-10 10:10:50,399] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 17: [2023-05-10 10:10:50,400] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 17: [2023-05-10 10:10:50,400] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 17: [2023-05-10 10:10:50,400] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 17: [2023-05-10 10:10:50,400] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 17: [2023-05-10 10:10:50,400] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 17: [2023-05-10 10:10:50,400] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 14: [2023-05-10 10:10:50,400] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 17: [2023-05-10 10:10:50,400] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 4: [2023-05-10 10:10:50,400] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 4: [2023-05-10 10:10:50,400] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 4: [2023-05-10 10:10:50,401] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 19: [2023-05-10 10:10:50,401] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 6: [2023-05-10 10:10:50,401] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 4: [2023-05-10 10:10:50,401] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 6: [2023-05-10 10:10:50,401] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 14: [2023-05-10 10:10:50,401] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 11: [2023-05-10 10:10:50,402] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 19: [2023-05-10 10:10:50,402] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 19: [2023-05-10 10:10:50,402] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 11: [2023-05-10 10:10:50,402] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 19: [2023-05-10 10:10:50,403] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 11: [2023-05-10 10:10:50,404] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 11: [2023-05-10 10:10:50,404] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 17: [2023-05-10 10:10:50,404] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 11: [2023-05-10 10:10:50,405] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 14: [2023-05-10 10:10:50,407] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 14: [2023-05-10 10:10:50,410] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 14: [2023-05-10 10:10:50,411] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 14: [2023-05-10 10:10:50,411] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 17: [2023-05-10 10:10:50,411] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 17: [2023-05-10 10:10:50,411] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 3: [2023-05-10 10:10:50,412] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 6: [2023-05-10 10:10:50,412] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 17: [2023-05-10 10:10:50,412] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 14: [2023-05-10 10:10:50,412] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 17: [2023-05-10 10:10:50,413] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 17: [2023-05-10 10:10:50,413] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 17: [2023-05-10 10:10:50,413] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 17: [2023-05-10 10:10:50,414] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 3: [2023-05-10 10:10:50,415] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 3: [2023-05-10 10:10:50,415] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 3: [2023-05-10 10:10:50,415] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 3: [2023-05-10 10:10:50,416] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 6: [2023-05-10 10:10:50,416] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 6: [2023-05-10 10:10:50,416] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 6: [2023-05-10 10:10:50,416] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 20: [2023-05-10 10:10:50,420] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 7: [2023-05-10 10:10:50,427] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 7: [2023-05-10 10:10:50,427] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 7: [2023-05-10 10:10:50,427] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 7: [2023-05-10 10:10:50,427] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 7: [2023-05-10 10:10:50,429] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 7: [2023-05-10 10:10:50,430] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 7: [2023-05-10 10:10:50,430] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 7: [2023-05-10 10:10:50,430] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_08-model_00-model_states.pt. 20: [2023-05-10 10:10:50,441] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 13: [2023-05-10 10:10:50,442] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 13: [2023-05-10 10:10:50,442] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 13: [2023-05-10 10:10:50,442] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 13: [2023-05-10 10:10:50,443] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 13: [2023-05-10 10:10:50,443] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 13: [2023-05-10 10:10:50,443] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 13: [2023-05-10 10:10:50,443] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 13: [2023-05-10 10:10:50,443] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 17: [2023-05-10 10:10:50,452] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 4: [2023-05-10 10:10:50,453] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 13: [2023-05-10 10:10:50,455] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 13: [2023-05-10 10:10:50,455] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 13: [2023-05-10 10:10:50,456] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 13: [2023-05-10 10:10:50,457] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 13: [2023-05-10 10:10:50,457] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 13: [2023-05-10 10:10:50,457] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 4: [2023-05-10 10:10:50,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 4: [2023-05-10 10:10:50,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 4: [2023-05-10 10:10:50,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 4: [2023-05-10 10:10:50,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 13: [2023-05-10 10:10:50,458] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 7: [2023-05-10 10:10:50,458] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 13: [2023-05-10 10:10:50,458] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 20: [2023-05-10 10:10:50,460] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 7: [2023-05-10 10:10:50,460] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 4: [2023-05-10 10:10:50,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 1: [2023-05-10 10:10:50,462] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 1: [2023-05-10 10:10:50,462] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 4: [2023-05-10 10:10:50,462] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 7: [2023-05-10 10:10:50,462] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 1: [2023-05-10 10:10:50,462] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 1: [2023-05-10 10:10:50,462] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 1: [2023-05-10 10:10:50,463] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 1: [2023-05-10 10:10:50,463] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 1: [2023-05-10 10:10:50,463] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 7: [2023-05-10 10:10:50,463] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 7: [2023-05-10 10:10:50,463] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 1: [2023-05-10 10:10:50,463] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 4: [2023-05-10 10:10:50,463] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 7: [2023-05-10 10:10:50,464] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 7: [2023-05-10 10:10:50,464] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 7: [2023-05-10 10:10:50,464] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 20: [2023-05-10 10:10:50,464] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 1: [2023-05-10 10:10:50,468] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 1: [2023-05-10 10:10:50,468] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 1: [2023-05-10 10:10:50,468] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 20: [2023-05-10 10:10:50,468] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 1: [2023-05-10 10:10:50,469] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 1: [2023-05-10 10:10:50,470] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 1: [2023-05-10 10:10:50,470] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 1: [2023-05-10 10:10:50,470] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 1: [2023-05-10 10:10:50,471] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 27: [2023-05-10 10:10:50,473] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 27: [2023-05-10 10:10:50,473] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 27: [2023-05-10 10:10:50,473] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 17: [2023-05-10 10:10:50,474] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 27: [2023-05-10 10:10:50,474] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 27: [2023-05-10 10:10:50,474] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 27: [2023-05-10 10:10:50,474] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 27: [2023-05-10 10:10:50,474] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 27: [2023-05-10 10:10:50,474] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 20: [2023-05-10 10:10:50,476] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 20: [2023-05-10 10:10:50,476] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 20: [2023-05-10 10:10:50,476] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 20: [2023-05-10 10:10:50,477] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 4: [2023-05-10 10:10:50,480] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 27: [2023-05-10 10:10:50,479] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 27: [2023-05-10 10:10:50,480] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 4: [2023-05-10 10:10:50,482] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 27: [2023-05-10 10:10:50,482] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 27: [2023-05-10 10:10:50,483] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 17: [2023-05-10 10:10:50,483] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 17: [2023-05-10 10:10:50,483] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 27: [2023-05-10 10:10:50,483] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 27: [2023-05-10 10:10:50,483] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 27: [2023-05-10 10:10:50,484] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 27: [2023-05-10 10:10:50,484] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 20: [2023-05-10 10:10:50,485] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 8: [2023-05-10 10:10:50,485] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 8: [2023-05-10 10:10:50,485] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 8: [2023-05-10 10:10:50,485] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 8: [2023-05-10 10:10:50,485] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 8: [2023-05-10 10:10:50,485] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 8: [2023-05-10 10:10:50,485] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 8: [2023-05-10 10:10:50,485] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 8: [2023-05-10 10:10:50,486] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 4: [2023-05-10 10:10:50,487] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 4: [2023-05-10 10:10:50,487] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 4: [2023-05-10 10:10:50,487] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 8: [2023-05-10 10:10:50,491] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 8: [2023-05-10 10:10:50,491] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 8: [2023-05-10 10:10:50,491] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 8: [2023-05-10 10:10:50,492] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 8: [2023-05-10 10:10:50,492] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 8: [2023-05-10 10:10:50,492] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 8: [2023-05-10 10:10:50,492] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 20: [2023-05-10 10:10:50,493] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 8: [2023-05-10 10:10:50,493] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 17: [2023-05-10 10:10:50,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 17: [2023-05-10 10:10:50,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 17: [2023-05-10 10:10:50,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 17: [2023-05-10 10:10:50,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 17: [2023-05-10 10:10:50,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 16: [2023-05-10 10:10:50,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 4: [2023-05-10 10:10:50,495] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 16: [2023-05-10 10:10:50,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 4: [2023-05-10 10:10:50,495] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 4: [2023-05-10 10:10:50,495] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 16: [2023-05-10 10:10:50,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 16: [2023-05-10 10:10:50,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 16: [2023-05-10 10:10:50,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 16: [2023-05-10 10:10:50,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 16: [2023-05-10 10:10:50,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 16: [2023-05-10 10:10:50,496] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 16: [2023-05-10 10:10:50,501] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 16: [2023-05-10 10:10:50,502] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 16: [2023-05-10 10:10:50,504] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 16: [2023-05-10 10:10:50,504] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 16: [2023-05-10 10:10:50,504] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 16: [2023-05-10 10:10:50,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 16: [2023-05-10 10:10:50,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 18: [2023-05-10 10:10:50,505] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 18: [2023-05-10 10:10:50,505] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 16: [2023-05-10 10:10:50,506] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 5: [2023-05-10 10:10:50,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 5: [2023-05-10 10:10:50,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 18: [2023-05-10 10:10:50,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 18: [2023-05-10 10:10:50,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 18: [2023-05-10 10:10:50,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 18: [2023-05-10 10:10:50,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 18: [2023-05-10 10:10:50,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 18: [2023-05-10 10:10:50,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 20: [2023-05-10 10:10:50,508] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 20: [2023-05-10 10:10:50,508] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 5: [2023-05-10 10:10:50,508] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 5: [2023-05-10 10:10:50,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 5: [2023-05-10 10:10:50,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 5: [2023-05-10 10:10:50,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 5: [2023-05-10 10:10:50,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 5: [2023-05-10 10:10:50,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 17: [2023-05-10 10:10:50,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 20: [2023-05-10 10:10:50,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 20: [2023-05-10 10:10:50,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 20: [2023-05-10 10:10:50,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 18: [2023-05-10 10:10:50,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 18: [2023-05-10 10:10:50,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 17: [2023-05-10 10:10:50,512] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 18: [2023-05-10 10:10:50,512] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 18: [2023-05-10 10:10:50,512] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 5: [2023-05-10 10:10:50,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 5: [2023-05-10 10:10:50,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 5: [2023-05-10 10:10:50,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 18: [2023-05-10 10:10:50,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 18: [2023-05-10 10:10:50,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 5: [2023-05-10 10:10:50,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 5: [2023-05-10 10:10:50,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 18: [2023-05-10 10:10:50,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 18: [2023-05-10 10:10:50,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 5: [2023-05-10 10:10:50,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 31: [2023-05-10 10:10:50,517] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 31: [2023-05-10 10:10:50,517] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 31: [2023-05-10 10:10:50,518] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 31: [2023-05-10 10:10:50,518] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 31: [2023-05-10 10:10:50,518] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 31: [2023-05-10 10:10:50,518] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 31: [2023-05-10 10:10:50,518] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 5: [2023-05-10 10:10:50,517] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 5: [2023-05-10 10:10:50,518] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 31: [2023-05-10 10:10:50,519] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 2: [2023-05-10 10:10:50,519] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 2: [2023-05-10 10:10:50,519] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 25: [2023-05-10 10:10:50,520] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 25: [2023-05-10 10:10:50,520] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 26: [2023-05-10 10:10:50,520] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 26: [2023-05-10 10:10:50,520] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 25: [2023-05-10 10:10:50,521] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 25: [2023-05-10 10:10:50,521] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 25: [2023-05-10 10:10:50,521] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 25: [2023-05-10 10:10:50,522] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 25: [2023-05-10 10:10:50,522] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 25: [2023-05-10 10:10:50,522] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 31: [2023-05-10 10:10:50,522] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 2: [2023-05-10 10:10:50,522] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 2: [2023-05-10 10:10:50,522] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 2: [2023-05-10 10:10:50,522] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 2: [2023-05-10 10:10:50,522] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 2: [2023-05-10 10:10:50,522] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 2: [2023-05-10 10:10:50,522] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 31: [2023-05-10 10:10:50,523] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 2: [2023-05-10 10:10:50,523] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 2: [2023-05-10 10:10:50,523] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 26: [2023-05-10 10:10:50,523] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 26: [2023-05-10 10:10:50,523] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 26: [2023-05-10 10:10:50,523] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 26: [2023-05-10 10:10:50,523] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 26: [2023-05-10 10:10:50,523] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 1: [2023-05-10 10:10:50,523] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 1: [2023-05-10 10:10:50,523] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 26: [2023-05-10 10:10:50,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 30: [2023-05-10 10:10:50,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 30: [2023-05-10 10:10:50,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 30: [2023-05-10 10:10:50,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 30: [2023-05-10 10:10:50,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 30: [2023-05-10 10:10:50,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 30: [2023-05-10 10:10:50,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 30: [2023-05-10 10:10:50,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 30: [2023-05-10 10:10:50,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 17: [2023-05-10 10:10:50,525] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 26: [2023-05-10 10:10:50,525] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 31: [2023-05-10 10:10:50,525] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 1: [2023-05-10 10:10:50,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 26: [2023-05-10 10:10:50,526] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 31: [2023-05-10 10:10:50,526] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 31: [2023-05-10 10:10:50,526] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 31: [2023-05-10 10:10:50,527] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 25: [2023-05-10 10:10:50,526] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 25: [2023-05-10 10:10:50,527] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 31: [2023-05-10 10:10:50,527] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 17: [2023-05-10 10:10:50,527] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 31: [2023-05-10 10:10:50,527] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 25: [2023-05-10 10:10:50,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 17: [2023-05-10 10:10:50,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 21: [2023-05-10 10:10:50,528] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 21: [2023-05-10 10:10:50,528] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 25: [2023-05-10 10:10:50,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 25: [2023-05-10 10:10:50,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 21: [2023-05-10 10:10:50,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 25: [2023-05-10 10:10:50,529] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 21: [2023-05-10 10:10:50,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 21: [2023-05-10 10:10:50,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 21: [2023-05-10 10:10:50,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 21: [2023-05-10 10:10:50,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 21: [2023-05-10 10:10:50,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 25: [2023-05-10 10:10:50,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 25: [2023-05-10 10:10:50,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 27: [2023-05-10 10:10:50,531] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 17: [2023-05-10 10:10:50,531] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 17: [2023-05-10 10:10:50,531] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 26: [2023-05-10 10:10:50,531] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 13: [2023-05-10 10:10:50,531] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 13: [2023-05-10 10:10:50,531] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 13: [2023-05-10 10:10:50,531] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 2: [2023-05-10 10:10:50,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 2: [2023-05-10 10:10:50,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 1: [2023-05-10 10:10:50,532] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 26: [2023-05-10 10:10:50,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 27: [2023-05-10 10:10:50,533] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 26: [2023-05-10 10:10:50,533] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 26: [2023-05-10 10:10:50,533] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 2: [2023-05-10 10:10:50,533] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 1: [2023-05-10 10:10:50,533] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 1: [2023-05-10 10:10:50,533] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 1: [2023-05-10 10:10:50,534] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 2: [2023-05-10 10:10:50,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 2: [2023-05-10 10:10:50,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 1: [2023-05-10 10:10:50,534] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 26: [2023-05-10 10:10:50,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 26: [2023-05-10 10:10:50,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 21: [2023-05-10 10:10:50,535] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 2: [2023-05-10 10:10:50,535] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 21: [2023-05-10 10:10:50,536] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 21: [2023-05-10 10:10:50,536] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 21: [2023-05-10 10:10:50,536] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 21: [2023-05-10 10:10:50,536] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 21: [2023-05-10 10:10:50,536] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 30: [2023-05-10 10:10:50,536] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 30: [2023-05-10 10:10:50,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 12: [2023-05-10 10:10:50,536] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 12: [2023-05-10 10:10:50,536] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 12: [2023-05-10 10:10:50,536] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 30: [2023-05-10 10:10:50,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 12: [2023-05-10 10:10:50,537] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 12: [2023-05-10 10:10:50,537] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 12: [2023-05-10 10:10:50,537] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 12: [2023-05-10 10:10:50,537] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 30: [2023-05-10 10:10:50,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 12: [2023-05-10 10:10:50,537] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 21: [2023-05-10 10:10:50,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 30: [2023-05-10 10:10:50,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 30: [2023-05-10 10:10:50,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 21: [2023-05-10 10:10:50,538] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 30: [2023-05-10 10:10:50,539] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 30: [2023-05-10 10:10:50,539] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 12: [2023-05-10 10:10:50,541] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 8: [2023-05-10 10:10:50,545] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 8: [2023-05-10 10:10:50,546] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 13: [2023-05-10 10:10:50,547] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 13: [2023-05-10 10:10:50,547] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 13: [2023-05-10 10:10:50,547] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 1: [2023-05-10 10:10:50,547] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 12: [2023-05-10 10:10:50,548] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 12: [2023-05-10 10:10:50,548] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 12: [2023-05-10 10:10:50,548] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 1: [2023-05-10 10:10:50,548] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 12: [2023-05-10 10:10:50,549] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 12: [2023-05-10 10:10:50,549] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 12: [2023-05-10 10:10:50,550] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 12: [2023-05-10 10:10:50,550] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 27: [2023-05-10 10:10:50,550] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 1: [2023-05-10 10:10:50,551] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 29: [2023-05-10 10:10:50,551] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 29: [2023-05-10 10:10:50,551] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 8: [2023-05-10 10:10:50,550] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 29: [2023-05-10 10:10:50,551] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 29: [2023-05-10 10:10:50,552] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 29: [2023-05-10 10:10:50,552] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 29: [2023-05-10 10:10:50,552] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 29: [2023-05-10 10:10:50,552] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 29: [2023-05-10 10:10:50,552] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 27: [2023-05-10 10:10:50,553] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 27: [2023-05-10 10:10:50,553] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 27: [2023-05-10 10:10:50,553] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 27: [2023-05-10 10:10:50,553] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 27: [2023-05-10 10:10:50,553] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 27: [2023-05-10 10:10:50,554] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 29: [2023-05-10 10:10:50,555] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 29: [2023-05-10 10:10:50,555] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 27: [2023-05-10 10:10:50,556] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 16: [2023-05-10 10:10:50,557] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 16: [2023-05-10 10:10:50,557] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 8: [2023-05-10 10:10:50,557] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 8: [2023-05-10 10:10:50,558] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 8: [2023-05-10 10:10:50,558] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 8: [2023-05-10 10:10:50,558] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 8: [2023-05-10 10:10:50,558] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 29: [2023-05-10 10:10:50,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 29: [2023-05-10 10:10:50,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 29: [2023-05-10 10:10:50,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 29: [2023-05-10 10:10:50,562] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 29: [2023-05-10 10:10:50,562] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 29: [2023-05-10 10:10:50,562] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 1: [2023-05-10 10:10:50,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 23: [2023-05-10 10:10:50,563] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 23: [2023-05-10 10:10:50,563] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 1: [2023-05-10 10:10:50,566] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 1: [2023-05-10 10:10:50,566] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 23: [2023-05-10 10:10:50,565] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 23: [2023-05-10 10:10:50,566] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 23: [2023-05-10 10:10:50,566] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 23: [2023-05-10 10:10:50,566] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 23: [2023-05-10 10:10:50,566] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 23: [2023-05-10 10:10:50,566] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 1: [2023-05-10 10:10:50,566] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 5: [2023-05-10 10:10:50,567] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 1: [2023-05-10 10:10:50,567] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 23: [2023-05-10 10:10:50,569] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 23: [2023-05-10 10:10:50,569] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 8: [2023-05-10 10:10:50,569] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 18: [2023-05-10 10:10:50,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 18: [2023-05-10 10:10:50,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 18: [2023-05-10 10:10:50,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 18: [2023-05-10 10:10:50,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 5: [2023-05-10 10:10:50,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 5: [2023-05-10 10:10:50,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 23: [2023-05-10 10:10:50,570] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 24: [2023-05-10 10:10:50,573] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 24: [2023-05-10 10:10:50,573] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 8: [2023-05-10 10:10:50,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 2: [2023-05-10 10:10:50,574] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 2: [2023-05-10 10:10:50,574] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 23: [2023-05-10 10:10:50,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 23: [2023-05-10 10:10:50,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 23: [2023-05-10 10:10:50,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 16: [2023-05-10 10:10:50,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 18: [2023-05-10 10:10:50,574] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 18: [2023-05-10 10:10:50,574] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 18: [2023-05-10 10:10:50,574] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 18: [2023-05-10 10:10:50,574] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 31: [2023-05-10 10:10:50,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 31: [2023-05-10 10:10:50,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 23: [2023-05-10 10:10:50,575] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 23: [2023-05-10 10:10:50,575] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 24: [2023-05-10 10:10:50,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 24: [2023-05-10 10:10:50,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 24: [2023-05-10 10:10:50,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 24: [2023-05-10 10:10:50,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 28: [2023-05-10 10:10:50,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 28: [2023-05-10 10:10:50,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 28: [2023-05-10 10:10:50,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 28: [2023-05-10 10:10:50,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 28: [2023-05-10 10:10:50,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 28: [2023-05-10 10:10:50,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 28: [2023-05-10 10:10:50,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 24: [2023-05-10 10:10:50,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 28: [2023-05-10 10:10:50,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 24: [2023-05-10 10:10:50,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 8: [2023-05-10 10:10:50,578] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 16: [2023-05-10 10:10:50,578] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 16: [2023-05-10 10:10:50,578] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 16: [2023-05-10 10:10:50,578] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 16: [2023-05-10 10:10:50,578] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 16: [2023-05-10 10:10:50,578] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 24: [2023-05-10 10:10:50,578] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 5: [2023-05-10 10:10:50,579] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 5: [2023-05-10 10:10:50,579] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 5: [2023-05-10 10:10:50,579] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 24: [2023-05-10 10:10:50,578] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 26: [2023-05-10 10:10:50,579] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 26: [2023-05-10 10:10:50,579] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 28: [2023-05-10 10:10:50,580] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 28: [2023-05-10 10:10:50,580] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 16: [2023-05-10 10:10:50,582] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 5: [2023-05-10 10:10:50,582] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 5: [2023-05-10 10:10:50,582] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 22: [2023-05-10 10:10:50,583] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 16: [2023-05-10 10:10:50,583] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 22: [2023-05-10 10:10:50,583] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 24: [2023-05-10 10:10:50,583] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 27: [2023-05-10 10:10:50,583] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 24: [2023-05-10 10:10:50,583] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 22: [2023-05-10 10:10:50,583] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 22: [2023-05-10 10:10:50,583] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 24: [2023-05-10 10:10:50,584] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 22: [2023-05-10 10:10:50,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 22: [2023-05-10 10:10:50,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 22: [2023-05-10 10:10:50,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 22: [2023-05-10 10:10:50,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 24: [2023-05-10 10:10:50,584] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 28: [2023-05-10 10:10:50,584] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 24: [2023-05-10 10:10:50,584] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 27: [2023-05-10 10:10:50,584] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 27: [2023-05-10 10:10:50,584] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 8: [2023-05-10 10:10:50,585] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 25: [2023-05-10 10:10:50,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 25: [2023-05-10 10:10:50,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 25: [2023-05-10 10:10:50,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 8: [2023-05-10 10:10:50,585] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 24: [2023-05-10 10:10:50,585] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 8: [2023-05-10 10:10:50,585] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 27: [2023-05-10 10:10:50,585] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 8: [2023-05-10 10:10:50,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 27: [2023-05-10 10:10:50,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 8: [2023-05-10 10:10:50,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 25: [2023-05-10 10:10:50,587] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 28: [2023-05-10 10:10:50,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 28: [2023-05-10 10:10:50,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 25: [2023-05-10 10:10:50,587] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 28: [2023-05-10 10:10:50,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 25: [2023-05-10 10:10:50,588] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 25: [2023-05-10 10:10:50,588] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 25: [2023-05-10 10:10:50,588] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 28: [2023-05-10 10:10:50,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 28: [2023-05-10 10:10:50,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 27: [2023-05-10 10:10:50,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 22: [2023-05-10 10:10:50,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 22: [2023-05-10 10:10:50,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 21: [2023-05-10 10:10:50,590] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 13: [2023-05-10 10:10:50,547] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 13: [2023-05-10 10:10:50,547] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 13: [2023-05-10 10:10:50,559] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 13: [2023-05-10 10:10:50,559] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 13: [2023-05-10 10:10:50,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 13: [2023-05-10 10:10:50,582] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 13: [2023-05-10 10:10:50,582] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 13: [2023-05-10 10:10:50,582] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 13: [2023-05-10 10:10:50,582] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 13: [2023-05-10 10:10:50,583] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 31: [2023-05-10 10:10:50,590] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 21: [2023-05-10 10:10:50,591] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 26: [2023-05-10 10:10:50,591] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 21: [2023-05-10 10:10:50,591] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 5: [2023-05-10 10:10:50,592] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 18: [2023-05-10 10:10:50,593] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 5: [2023-05-10 10:10:50,594] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 12: [2023-05-10 10:10:50,593] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 22: [2023-05-10 10:10:50,593] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 31: [2023-05-10 10:10:50,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 2: [2023-05-10 10:10:50,594] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 2: [2023-05-10 10:10:50,594] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 22: [2023-05-10 10:10:50,595] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 31: [2023-05-10 10:10:50,595] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 22: [2023-05-10 10:10:50,595] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 31: [2023-05-10 10:10:50,595] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 22: [2023-05-10 10:10:50,596] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 22: [2023-05-10 10:10:50,596] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 5: [2023-05-10 10:10:50,596] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 22: [2023-05-10 10:10:50,596] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 18: [2023-05-10 10:10:50,597] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 18: [2023-05-10 10:10:50,598] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 18: [2023-05-10 10:10:50,598] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 31: [2023-05-10 10:10:50,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 31: [2023-05-10 10:10:50,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 31: [2023-05-10 10:10:50,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 31: [2023-05-10 10:10:50,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 2: [2023-05-10 10:10:50,602] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 2: [2023-05-10 10:10:50,602] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 2: [2023-05-10 10:10:50,602] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 21: [2023-05-10 10:10:50,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 26: [2023-05-10 10:10:50,603] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 16: [2023-05-10 10:10:50,603] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 26: [2023-05-10 10:10:50,604] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 21: [2023-05-10 10:10:50,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 21: [2023-05-10 10:10:50,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 21: [2023-05-10 10:10:50,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 26: [2023-05-10 10:10:50,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 26: [2023-05-10 10:10:50,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 21: [2023-05-10 10:10:50,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 26: [2023-05-10 10:10:50,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 2: [2023-05-10 10:10:50,605] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 2: [2023-05-10 10:10:50,605] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 2: [2023-05-10 10:10:50,605] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 18: [2023-05-10 10:10:50,605] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 16: [2023-05-10 10:10:50,605] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 5: [2023-05-10 10:10:50,607] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 18: [2023-05-10 10:10:50,606] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 29: [2023-05-10 10:10:50,606] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 29: [2023-05-10 10:10:50,606] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 18: [2023-05-10 10:10:50,607] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 18: [2023-05-10 10:10:50,607] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 5: [2023-05-10 10:10:50,608] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 26: [2023-05-10 10:10:50,608] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 26: [2023-05-10 10:10:50,608] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 5: [2023-05-10 10:10:50,609] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 16: [2023-05-10 10:10:50,609] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 25: [2023-05-10 10:10:50,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 16: [2023-05-10 10:10:50,611] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 16: [2023-05-10 10:10:50,612] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 5: [2023-05-10 10:10:50,612] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 5: [2023-05-10 10:10:50,612] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 16: [2023-05-10 10:10:50,612] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 30: [2023-05-10 10:10:50,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 30: [2023-05-10 10:10:50,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 25: [2023-05-10 10:10:50,612] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 26: [2023-05-10 10:10:50,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 21: [2023-05-10 10:10:50,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 25: [2023-05-10 10:10:50,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 10: [2023-05-10 10:10:50,615] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 10: [2023-05-10 10:10:50,615] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 10: [2023-05-10 10:10:50,615] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 25: [2023-05-10 10:10:50,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 10: [2023-05-10 10:10:50,615] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 10: [2023-05-10 10:10:50,615] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 10: [2023-05-10 10:10:50,615] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 10: [2023-05-10 10:10:50,615] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 10: [2023-05-10 10:10:50,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 31: [2023-05-10 10:10:50,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 21: [2023-05-10 10:10:50,618] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 12: [2023-05-10 10:10:50,618] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 25: [2023-05-10 10:10:50,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 25: [2023-05-10 10:10:50,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 25: [2023-05-10 10:10:50,621] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 25: [2023-05-10 10:10:50,621] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 12: [2023-05-10 10:10:50,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 12: [2023-05-10 10:10:50,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 10: [2023-05-10 10:10:50,622] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 10: [2023-05-10 10:10:50,623] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 9: [2023-05-10 10:10:50,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 9: [2023-05-10 10:10:50,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 10: [2023-05-10 10:10:50,623] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 31: [2023-05-10 10:10:50,623] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 10: [2023-05-10 10:10:50,623] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 10: [2023-05-10 10:10:50,623] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 23: [2023-05-10 10:10:50,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 23: [2023-05-10 10:10:50,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 23: [2023-05-10 10:10:50,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 10: [2023-05-10 10:10:50,624] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 21: [2023-05-10 10:10:50,625] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 10: [2023-05-10 10:10:50,625] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 30: [2023-05-10 10:10:50,625] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 30: [2023-05-10 10:10:50,625] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 30: [2023-05-10 10:10:50,625] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 10: [2023-05-10 10:10:50,625] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 30: [2023-05-10 10:10:50,625] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 30: [2023-05-10 10:10:50,625] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 9: [2023-05-10 10:10:50,626] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 9: [2023-05-10 10:10:50,626] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 9: [2023-05-10 10:10:50,626] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 9: [2023-05-10 10:10:50,626] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 9: [2023-05-10 10:10:50,626] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 9: [2023-05-10 10:10:50,627] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 29: [2023-05-10 10:10:50,626] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 29: [2023-05-10 10:10:50,628] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 26: [2023-05-10 10:10:50,628] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 29: [2023-05-10 10:10:50,628] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 9: [2023-05-10 10:10:50,628] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 9: [2023-05-10 10:10:50,629] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 15: [2023-05-10 10:10:50,629] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 15: [2023-05-10 10:10:50,629] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 15: [2023-05-10 10:10:50,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 15: [2023-05-10 10:10:50,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 15: [2023-05-10 10:10:50,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 15: [2023-05-10 10:10:50,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 15: [2023-05-10 10:10:50,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 15: [2023-05-10 10:10:50,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 30: [2023-05-10 10:10:50,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 24: [2023-05-10 10:10:50,631] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 2: [2023-05-10 10:10:50,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 24: [2023-05-10 10:10:50,631] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 31: [2023-05-10 10:10:50,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 28: [2023-05-10 10:10:50,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 28: [2023-05-10 10:10:50,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 11: [2023-05-10 10:10:50,631] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 11: [2023-05-10 10:10:50,631] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 2: [2023-05-10 10:10:50,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 9: [2023-05-10 10:10:50,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 12: [2023-05-10 10:10:50,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 12: [2023-05-10 10:10:50,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 12: [2023-05-10 10:10:50,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 12: [2023-05-10 10:10:50,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 12: [2023-05-10 10:10:50,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 29: [2023-05-10 10:10:50,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 29: [2023-05-10 10:10:50,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 29: [2023-05-10 10:10:50,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 29: [2023-05-10 10:10:50,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 29: [2023-05-10 10:10:50,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 9: [2023-05-10 10:10:50,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 9: [2023-05-10 10:10:50,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 21: [2023-05-10 10:10:50,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 26: [2023-05-10 10:10:50,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 9: [2023-05-10 10:10:50,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 11: [2023-05-10 10:10:50,633] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 11: [2023-05-10 10:10:50,633] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 11: [2023-05-10 10:10:50,633] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 11: [2023-05-10 10:10:50,633] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 11: [2023-05-10 10:10:50,633] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 11: [2023-05-10 10:10:50,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 15: [2023-05-10 10:10:50,634] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 2: [2023-05-10 10:10:50,634] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 26: [2023-05-10 10:10:50,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 9: [2023-05-10 10:10:50,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 9: [2023-05-10 10:10:50,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 31: [2023-05-10 10:10:50,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 31: [2023-05-10 10:10:50,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 31: [2023-05-10 10:10:50,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 11: [2023-05-10 10:10:50,636] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 23: [2023-05-10 10:10:50,636] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 11: [2023-05-10 10:10:50,636] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 14: [2023-05-10 10:10:50,636] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 14: [2023-05-10 10:10:50,636] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 21: [2023-05-10 10:10:50,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 14: [2023-05-10 10:10:50,638] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 14: [2023-05-10 10:10:50,638] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 14: [2023-05-10 10:10:50,638] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 14: [2023-05-10 10:10:50,638] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 14: [2023-05-10 10:10:50,638] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 26: [2023-05-10 10:10:50,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 14: [2023-05-10 10:10:50,638] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 2: [2023-05-10 10:10:50,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 30: [2023-05-10 10:10:50,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 30: [2023-05-10 10:10:50,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 21: [2023-05-10 10:10:50,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 21: [2023-05-10 10:10:50,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 21: [2023-05-10 10:10:50,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 19: [2023-05-10 10:10:50,639] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 19: [2023-05-10 10:10:50,639] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 15: [2023-05-10 10:10:50,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 2: [2023-05-10 10:10:50,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 2: [2023-05-10 10:10:50,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 15: [2023-05-10 10:10:50,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 26: [2023-05-10 10:10:50,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 23: [2023-05-10 10:10:50,641] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 23: [2023-05-10 10:10:50,641] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 23: [2023-05-10 10:10:50,641] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 23: [2023-05-10 10:10:50,641] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 15: [2023-05-10 10:10:50,641] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 15: [2023-05-10 10:10:50,642] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 15: [2023-05-10 10:10:50,642] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 15: [2023-05-10 10:10:50,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 22: [2023-05-10 10:10:50,642] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 14: [2023-05-10 10:10:50,642] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 15: [2023-05-10 10:10:50,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 11: [2023-05-10 10:10:50,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 11: [2023-05-10 10:10:50,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 14: [2023-05-10 10:10:50,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 28: [2023-05-10 10:10:50,644] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 11: [2023-05-10 10:10:50,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 19: [2023-05-10 10:10:50,644] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 19: [2023-05-10 10:10:50,644] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 19: [2023-05-10 10:10:50,644] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 19: [2023-05-10 10:10:50,644] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 19: [2023-05-10 10:10:50,644] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 14: [2023-05-10 10:10:50,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 19: [2023-05-10 10:10:50,644] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 11: [2023-05-10 10:10:50,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 22: [2023-05-10 10:10:50,644] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 14: [2023-05-10 10:10:50,645] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 19: [2023-05-10 10:10:50,645] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 14: [2023-05-10 10:10:50,645] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 14: [2023-05-10 10:10:50,645] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 19: [2023-05-10 10:10:50,645] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 11: [2023-05-10 10:10:50,645] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 11: [2023-05-10 10:10:50,645] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 14: [2023-05-10 10:10:50,646] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 14: [2023-05-10 10:10:50,646] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 0: [2023-05-10 10:10:50,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 0: [2023-05-10 10:10:50,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 0: [2023-05-10 10:10:50,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 23: [2023-05-10 10:10:50,648] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 0: [2023-05-10 10:10:50,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 0: [2023-05-10 10:10:50,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 0: [2023-05-10 10:10:50,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 0: [2023-05-10 10:10:50,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 0: [2023-05-10 10:10:50,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 19: [2023-05-10 10:10:50,648] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 23: [2023-05-10 10:10:50,648] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 23: [2023-05-10 10:10:50,649] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 12: [2023-05-10 10:10:50,650] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 12: [2023-05-10 10:10:50,650] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 6: [2023-05-10 10:10:50,650] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 6: [2023-05-10 10:10:50,650] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 6: [2023-05-10 10:10:50,650] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 6: [2023-05-10 10:10:50,650] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 6: [2023-05-10 10:10:50,651] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 6: [2023-05-10 10:10:50,651] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 6: [2023-05-10 10:10:50,651] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 6: [2023-05-10 10:10:50,651] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 0: [2023-05-10 10:10:50,652] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 0: [2023-05-10 10:10:50,652] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 19: [2023-05-10 10:10:50,652] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 19: [2023-05-10 10:10:50,652] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 0: [2023-05-10 10:10:50,653] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 19: [2023-05-10 10:10:50,653] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 19: [2023-05-10 10:10:50,653] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 28: [2023-05-10 10:10:50,653] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 28: [2023-05-10 10:10:50,653] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 19: [2023-05-10 10:10:50,653] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 24: [2023-05-10 10:10:50,654] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 0: [2023-05-10 10:10:50,654] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 24: [2023-05-10 10:10:50,654] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 6: [2023-05-10 10:10:50,654] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 6: [2023-05-10 10:10:50,654] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 0: [2023-05-10 10:10:50,655] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 24: [2023-05-10 10:10:50,655] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 0: [2023-05-10 10:10:50,655] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 0: [2023-05-10 10:10:50,655] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 29: [2023-05-10 10:10:50,655] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 30: [2023-05-10 10:10:50,655] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 0: [2023-05-10 10:10:50,656] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 28: [2023-05-10 10:10:50,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 28: [2023-05-10 10:10:50,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 24: [2023-05-10 10:10:50,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 24: [2023-05-10 10:10:50,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 24: [2023-05-10 10:10:50,657] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 24: [2023-05-10 10:10:50,657] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 24: [2023-05-10 10:10:50,657] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 30: [2023-05-10 10:10:50,658] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 6: [2023-05-10 10:10:50,660] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 4: [2023-05-10 10:10:50,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 4: [2023-05-10 10:10:50,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 4: [2023-05-10 10:10:50,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 4: [2023-05-10 10:10:50,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 4: [2023-05-10 10:10:50,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 6: [2023-05-10 10:10:50,661] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 28: [2023-05-10 10:10:50,661] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 28: [2023-05-10 10:10:50,661] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 6: [2023-05-10 10:10:50,662] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 6: [2023-05-10 10:10:50,662] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 6: [2023-05-10 10:10:50,662] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 6: [2023-05-10 10:10:50,662] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 30: [2023-05-10 10:10:50,662] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 22: [2023-05-10 10:10:50,663] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 22: [2023-05-10 10:10:50,663] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 30: [2023-05-10 10:10:50,664] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 30: [2023-05-10 10:10:50,664] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 30: [2023-05-10 10:10:50,664] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 20: [2023-05-10 10:10:50,664] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 20: [2023-05-10 10:10:50,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 20: [2023-05-10 10:10:50,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 23: [2023-05-10 10:10:50,665] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 20: [2023-05-10 10:10:50,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 20: [2023-05-10 10:10:50,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 20: [2023-05-10 10:10:50,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 20: [2023-05-10 10:10:50,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 20: [2023-05-10 10:10:50,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 28: [2023-05-10 10:10:50,667] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 28: [2023-05-10 10:10:50,667] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 29: [2023-05-10 10:10:50,667] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 29: [2023-05-10 10:10:50,667] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 29: [2023-05-10 10:10:50,667] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 22: [2023-05-10 10:10:50,667] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 23: [2023-05-10 10:10:50,667] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 22: [2023-05-10 10:10:50,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 22: [2023-05-10 10:10:50,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 22: [2023-05-10 10:10:50,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 22: [2023-05-10 10:10:50,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 29: [2023-05-10 10:10:50,668] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 12: [2023-05-10 10:10:50,668] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 12: [2023-05-10 10:10:50,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 20: [2023-05-10 10:10:50,668] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 23: [2023-05-10 10:10:50,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 12: [2023-05-10 10:10:50,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 29: [2023-05-10 10:10:50,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 12: [2023-05-10 10:10:50,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 23: [2023-05-10 10:10:50,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 12: [2023-05-10 10:10:50,670] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 22: [2023-05-10 10:10:50,670] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 23: [2023-05-10 10:10:50,671] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 20: [2023-05-10 10:10:50,675] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 20: [2023-05-10 10:10:50,676] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 20: [2023-05-10 10:10:50,677] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 20: [2023-05-10 10:10:50,677] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 20: [2023-05-10 10:10:50,678] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 10: [2023-05-10 10:10:50,677] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 10: [2023-05-10 10:10:50,677] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 20: [2023-05-10 10:10:50,678] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 10: [2023-05-10 10:10:50,678] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 20: [2023-05-10 10:10:50,679] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 28: [2023-05-10 10:10:50,682] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 15: [2023-05-10 10:10:50,682] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 10: [2023-05-10 10:10:50,684] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 24: [2023-05-10 10:10:50,685] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 9: [2023-05-10 10:10:50,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 9: [2023-05-10 10:10:50,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 24: [2023-05-10 10:10:50,687] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 11: [2023-05-10 10:10:50,687] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 11: [2023-05-10 10:10:50,687] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 24: [2023-05-10 10:10:50,687] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 24: [2023-05-10 10:10:50,688] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 17: [2023-05-10 10:10:50,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 22: [2023-05-10 10:10:50,689] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 17: [2023-05-10 10:10:50,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 17: [2023-05-10 10:10:50,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 28: [2023-05-10 10:10:50,689] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 10: [2023-05-10 10:10:50,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 10: [2023-05-10 10:10:50,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 10: [2023-05-10 10:10:50,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 10: [2023-05-10 10:10:50,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 17: [2023-05-10 10:10:50,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 17: [2023-05-10 10:10:50,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 17: [2023-05-10 10:10:50,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 17: [2023-05-10 10:10:50,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 22: [2023-05-10 10:10:50,690] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 17: [2023-05-10 10:10:50,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 24: [2023-05-10 10:10:50,691] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 24: [2023-05-10 10:10:50,691] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 9: [2023-05-10 10:10:50,691] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 9: [2023-05-10 10:10:50,691] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 9: [2023-05-10 10:10:50,691] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 9: [2023-05-10 10:10:50,691] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 9: [2023-05-10 10:10:50,692] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 9: [2023-05-10 10:10:50,692] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 28: [2023-05-10 10:10:50,693] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 17: [2023-05-10 10:10:50,693] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 28: [2023-05-10 10:10:50,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 28: [2023-05-10 10:10:50,697] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 22: [2023-05-10 10:10:50,697] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 22: [2023-05-10 10:10:50,697] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 14: [2023-05-10 10:10:50,698] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 14: [2023-05-10 10:10:50,698] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 3: [2023-05-10 10:10:50,700] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 3: [2023-05-10 10:10:50,700] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 19: [2023-05-10 10:10:50,699] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 19: [2023-05-10 10:10:50,699] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 19: [2023-05-10 10:10:50,699] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 7: [2023-05-10 10:10:50,700] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 7: [2023-05-10 10:10:50,700] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 7: [2023-05-10 10:10:50,700] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 7: [2023-05-10 10:10:50,700] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 7: [2023-05-10 10:10:50,700] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 7: [2023-05-10 10:10:50,700] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 7: [2023-05-10 10:10:50,700] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 14: [2023-05-10 10:10:50,700] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 7: [2023-05-10 10:10:50,700] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 3: [2023-05-10 10:10:50,701] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 3: [2023-05-10 10:10:50,701] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 17: [2023-05-10 10:10:50,701] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 17: [2023-05-10 10:10:50,701] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 3: [2023-05-10 10:10:50,701] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 3: [2023-05-10 10:10:50,701] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 3: [2023-05-10 10:10:50,701] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 3: [2023-05-10 10:10:50,702] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 17: [2023-05-10 10:10:50,702] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 17: [2023-05-10 10:10:50,702] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 4: [2023-05-10 10:10:50,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 4: [2023-05-10 10:10:50,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 4: [2023-05-10 10:10:50,661] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 4: [2023-05-10 10:10:50,666] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 4: [2023-05-10 10:10:50,666] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 4: [2023-05-10 10:10:50,666] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 4: [2023-05-10 10:10:50,667] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 4: [2023-05-10 10:10:50,667] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 4: [2023-05-10 10:10:50,667] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 4: [2023-05-10 10:10:50,668] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 4: [2023-05-10 10:10:50,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 22: [2023-05-10 10:10:50,702] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 22: [2023-05-10 10:10:50,702] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 17: [2023-05-10 10:10:50,702] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 10: [2023-05-10 10:10:50,703] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 17: [2023-05-10 10:10:50,703] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 17: [2023-05-10 10:10:50,704] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 15: [2023-05-10 10:10:50,704] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 14: [2023-05-10 10:10:50,704] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 3: [2023-05-10 10:10:50,705] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 3: [2023-05-10 10:10:50,705] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 6: [2023-05-10 10:10:50,705] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 6: [2023-05-10 10:10:50,705] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 10: [2023-05-10 10:10:50,706] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 3: [2023-05-10 10:10:50,706] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 7: [2023-05-10 10:10:50,705] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 7: [2023-05-10 10:10:50,705] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 7: [2023-05-10 10:10:50,706] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 7: [2023-05-10 10:10:50,706] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 7: [2023-05-10 10:10:50,706] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 7: [2023-05-10 10:10:50,708] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 10: [2023-05-10 10:10:50,708] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 7: [2023-05-10 10:10:50,708] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 7: [2023-05-10 10:10:50,708] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 14: [2023-05-10 10:10:50,709] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 14: [2023-05-10 10:10:50,709] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 3: [2023-05-10 10:10:50,709] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 10: [2023-05-10 10:10:50,710] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 11: [2023-05-10 10:10:50,710] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 11: [2023-05-10 10:10:50,710] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 14: [2023-05-10 10:10:50,711] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 15: [2023-05-10 10:10:50,711] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 3: [2023-05-10 10:10:50,711] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 14: [2023-05-10 10:10:50,711] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 3: [2023-05-10 10:10:50,711] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 3: [2023-05-10 10:10:50,711] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 9: [2023-05-10 10:10:50,711] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 9: [2023-05-10 10:10:50,711] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 3: [2023-05-10 10:10:50,711] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt... 0: [2023-05-10 10:10:50,712] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 0: [2023-05-10 10:10:50,712] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 0: [2023-05-10 10:10:50,712] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 0: [2023-05-10 10:10:50,712] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 15: [2023-05-10 10:10:50,713] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 15: [2023-05-10 10:10:50,713] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 19: [2023-05-10 10:10:50,714] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 11: [2023-05-10 10:10:50,714] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 11: [2023-05-10 10:10:50,714] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 11: [2023-05-10 10:10:50,714] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 11: [2023-05-10 10:10:50,714] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 11: [2023-05-10 10:10:50,714] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 11: [2023-05-10 10:10:50,715] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 19: [2023-05-10 10:10:50,716] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 19: [2023-05-10 10:10:50,716] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 19: [2023-05-10 10:10:50,717] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 10: [2023-05-10 10:10:50,717] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 10: [2023-05-10 10:10:50,717] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 10: [2023-05-10 10:10:50,718] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 20: [2023-05-10 10:10:50,718] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 10: [2023-05-10 10:10:50,718] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 19: [2023-05-10 10:10:50,719] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 9: [2023-05-10 10:10:50,720] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 19: [2023-05-10 10:10:50,721] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 9: [2023-05-10 10:10:50,721] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 9: [2023-05-10 10:10:50,722] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 9: [2023-05-10 10:10:50,722] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 9: [2023-05-10 10:10:50,724] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 0: [2023-05-10 10:10:50,724] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 0: [2023-05-10 10:10:50,724] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 0: [2023-05-10 10:10:50,724] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 0: [2023-05-10 10:10:50,724] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 14: [2023-05-10 10:10:50,724] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 9: [2023-05-10 10:10:50,724] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 15: [2023-05-10 10:10:50,725] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 15: [2023-05-10 10:10:50,725] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 15: [2023-05-10 10:10:50,725] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 15: [2023-05-10 10:10:50,725] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 19: [2023-05-10 10:10:50,726] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 14: [2023-05-10 10:10:50,727] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 19: [2023-05-10 10:10:50,727] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 6: [2023-05-10 10:10:50,727] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 6: [2023-05-10 10:10:50,727] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 6: [2023-05-10 10:10:50,728] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 6: [2023-05-10 10:10:50,728] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 14: [2023-05-10 10:10:50,730] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 14: [2023-05-10 10:10:50,730] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 6: [2023-05-10 10:10:50,734] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 6: [2023-05-10 10:10:50,734] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 6: [2023-05-10 10:10:50,734] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 6: [2023-05-10 10:10:50,734] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 0: [2023-05-10 10:10:50,735] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 20: [2023-05-10 10:10:50,737] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 14: [2023-05-10 10:10:50,738] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 14: [2023-05-10 10:10:50,738] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 0: [2023-05-10 10:10:50,739] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 0: [2023-05-10 10:10:50,739] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 0: [2023-05-10 10:10:50,739] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 14: [2023-05-10 10:10:50,741] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 14: [2023-05-10 10:10:50,741] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 15: [2023-05-10 10:10:50,741] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 19: [2023-05-10 10:10:50,742] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 17: [2023-05-10 10:10:50,742] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 15: [2023-05-10 10:10:50,744] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 15: [2023-05-10 10:10:50,744] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 19: [2023-05-10 10:10:50,745] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 4: [2023-05-10 10:10:50,724] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 4: [2023-05-10 10:10:50,724] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 4: [2023-05-10 10:10:50,724] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 4: [2023-05-10 10:10:50,724] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 4: [2023-05-10 10:10:50,728] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 4: [2023-05-10 10:10:50,729] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 4: [2023-05-10 10:10:50,729] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 4: [2023-05-10 10:10:50,729] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 20: [2023-05-10 10:10:50,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 20: [2023-05-10 10:10:50,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 19: [2023-05-10 10:10:50,747] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 19: [2023-05-10 10:10:50,748] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 4: [2023-05-10 10:10:50,748] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 19: [2023-05-10 10:10:50,750] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 4: [2023-05-10 10:10:50,750] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 4: [2023-05-10 10:10:50,751] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 4: [2023-05-10 10:10:50,751] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 0: [2023-05-10 10:10:50,751] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 0: [2023-05-10 10:10:50,751] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 11: [2023-05-10 10:10:50,752] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 0: [2023-05-10 10:10:50,753] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 0: [2023-05-10 10:10:50,753] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 6: [2023-05-10 10:10:50,753] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 6: [2023-05-10 10:10:50,754] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 11: [2023-05-10 10:10:50,754] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 11: [2023-05-10 10:10:50,754] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 11: [2023-05-10 10:10:50,755] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 11: [2023-05-10 10:10:50,755] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 13: [2023-05-10 10:10:50,754] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 13: [2023-05-10 10:10:50,754] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 13: [2023-05-10 10:10:50,754] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 13: [2023-05-10 10:10:50,755] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 13: [2023-05-10 10:10:50,755] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 13: [2023-05-10 10:10:50,755] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 13: [2023-05-10 10:10:50,755] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 13: [2023-05-10 10:10:50,755] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 4: [2023-05-10 10:10:50,755] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 11: [2023-05-10 10:10:50,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 15: [2023-05-10 10:10:50,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 15: [2023-05-10 10:10:50,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 15: [2023-05-10 10:10:50,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 27: [2023-05-10 10:10:50,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 27: [2023-05-10 10:10:50,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 27: [2023-05-10 10:10:50,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 20: [2023-05-10 10:10:50,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 20: [2023-05-10 10:10:50,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 20: [2023-05-10 10:10:50,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 20: [2023-05-10 10:10:50,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 20: [2023-05-10 10:10:50,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 27: [2023-05-10 10:10:50,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 27: [2023-05-10 10:10:50,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 27: [2023-05-10 10:10:50,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 27: [2023-05-10 10:10:50,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 15: [2023-05-10 10:10:50,759] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 27: [2023-05-10 10:10:50,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 4: [2023-05-10 10:10:50,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 3: [2023-05-10 10:10:50,760] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 3: [2023-05-10 10:10:50,760] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 3: [2023-05-10 10:10:50,760] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 4: [2023-05-10 10:10:50,761] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 4: [2023-05-10 10:10:50,761] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 7: [2023-05-10 10:10:50,763] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 7: [2023-05-10 10:10:50,763] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 17: [2023-05-10 10:10:50,763] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 27: [2023-05-10 10:10:50,764] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 6: [2023-05-10 10:10:50,764] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 27: [2023-05-10 10:10:50,764] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 27: [2023-05-10 10:10:50,765] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 13: [2023-05-10 10:10:50,766] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 7: [2023-05-10 10:10:50,765] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 7: [2023-05-10 10:10:50,765] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 7: [2023-05-10 10:10:50,766] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 7: [2023-05-10 10:10:50,766] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 7: [2023-05-10 10:10:50,766] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 7: [2023-05-10 10:10:50,766] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 6: [2023-05-10 10:10:50,767] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 6: [2023-05-10 10:10:50,767] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 3: [2023-05-10 10:10:50,767] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 27: [2023-05-10 10:10:50,767] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 6: [2023-05-10 10:10:50,767] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 27: [2023-05-10 10:10:50,768] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 27: [2023-05-10 10:10:50,768] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 27: [2023-05-10 10:10:50,768] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 27: [2023-05-10 10:10:50,768] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 13: [2023-05-10 10:10:50,768] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 13: [2023-05-10 10:10:50,769] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 13: [2023-05-10 10:10:50,769] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 13: [2023-05-10 10:10:50,769] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 13: [2023-05-10 10:10:50,770] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 13: [2023-05-10 10:10:50,770] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 13: [2023-05-10 10:10:50,771] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 17: [2023-05-10 10:10:50,774] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 17: [2023-05-10 10:10:50,775] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 16: [2023-05-10 10:10:50,775] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 16: [2023-05-10 10:10:50,775] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 20: [2023-05-10 10:10:50,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 20: [2023-05-10 10:10:50,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 16: [2023-05-10 10:10:50,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 16: [2023-05-10 10:10:50,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 16: [2023-05-10 10:10:50,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 16: [2023-05-10 10:10:50,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 16: [2023-05-10 10:10:50,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 16: [2023-05-10 10:10:50,777] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 3: [2023-05-10 10:10:50,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 3: [2023-05-10 10:10:50,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 3: [2023-05-10 10:10:50,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 3: [2023-05-10 10:10:50,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_09-model_00-model_states.pt. 17: [2023-05-10 10:10:50,781] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 17: [2023-05-10 10:10:50,781] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 17: [2023-05-10 10:10:50,781] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 17: [2023-05-10 10:10:50,781] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 17: [2023-05-10 10:10:50,781] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 16: [2023-05-10 10:10:50,781] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 3: [2023-05-10 10:10:50,782] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 16: [2023-05-10 10:10:50,783] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 3: [2023-05-10 10:10:50,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 3: [2023-05-10 10:10:50,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 16: [2023-05-10 10:10:50,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 16: [2023-05-10 10:10:50,785] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 16: [2023-05-10 10:10:50,785] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 16: [2023-05-10 10:10:50,786] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 16: [2023-05-10 10:10:50,787] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 16: [2023-05-10 10:10:50,787] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 2: [2023-05-10 10:10:50,789] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 2: [2023-05-10 10:10:50,789] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 7: [2023-05-10 10:10:50,789] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 7: [2023-05-10 10:10:50,790] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 20: [2023-05-10 10:10:50,791] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 3: [2023-05-10 10:10:50,792] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 20: [2023-05-10 10:10:50,793] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 7: [2023-05-10 10:10:50,794] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 2: [2023-05-10 10:10:50,794] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 2: [2023-05-10 10:10:50,794] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 2: [2023-05-10 10:10:50,794] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 2: [2023-05-10 10:10:50,794] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 2: [2023-05-10 10:10:50,794] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 20: [2023-05-10 10:10:50,796] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 20: [2023-05-10 10:10:50,796] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 20: [2023-05-10 10:10:50,799] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 7: [2023-05-10 10:10:50,794] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 7: [2023-05-10 10:10:50,794] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 7: [2023-05-10 10:10:50,797] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 7: [2023-05-10 10:10:50,797] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 7: [2023-05-10 10:10:50,797] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 17: [2023-05-10 10:10:50,803] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 17: [2023-05-10 10:10:50,804] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 3: [2023-05-10 10:10:50,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 3: [2023-05-10 10:10:50,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 3: [2023-05-10 10:10:50,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 3: [2023-05-10 10:10:50,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 17: [2023-05-10 10:10:50,813] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 17: [2023-05-10 10:10:50,816] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 27: [2023-05-10 10:10:50,816] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 17: [2023-05-10 10:10:50,817] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 17: [2023-05-10 10:10:50,817] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 17: [2023-05-10 10:10:50,817] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 27: [2023-05-10 10:10:50,821] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 26: [2023-05-10 10:10:50,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 26: [2023-05-10 10:10:50,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 26: [2023-05-10 10:10:50,830] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 26: [2023-05-10 10:10:50,830] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 26: [2023-05-10 10:10:50,830] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 26: [2023-05-10 10:10:50,830] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 26: [2023-05-10 10:10:50,830] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 26: [2023-05-10 10:10:50,830] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 12: [2023-05-10 10:10:50,831] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 27: [2023-05-10 10:10:50,831] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 26: [2023-05-10 10:10:50,831] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 12: [2023-05-10 10:10:50,831] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 26: [2023-05-10 10:10:50,831] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 12: [2023-05-10 10:10:50,831] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 12: [2023-05-10 10:10:50,832] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 12: [2023-05-10 10:10:50,832] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 12: [2023-05-10 10:10:50,832] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 12: [2023-05-10 10:10:50,832] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 12: [2023-05-10 10:10:50,832] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 16: [2023-05-10 10:10:50,834] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 25: [2023-05-10 10:10:50,834] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 25: [2023-05-10 10:10:50,834] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 12: [2023-05-10 10:10:50,836] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 25: [2023-05-10 10:10:50,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 25: [2023-05-10 10:10:50,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 25: [2023-05-10 10:10:50,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 25: [2023-05-10 10:10:50,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 25: [2023-05-10 10:10:50,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 25: [2023-05-10 10:10:50,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 16: [2023-05-10 10:10:50,837] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 2: [2023-05-10 10:10:50,794] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 2: [2023-05-10 10:10:50,794] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 2: [2023-05-10 10:10:50,794] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 2: [2023-05-10 10:10:50,804] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 2: [2023-05-10 10:10:50,805] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 2: [2023-05-10 10:10:50,805] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 2: [2023-05-10 10:10:50,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 2: [2023-05-10 10:10:50,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 2: [2023-05-10 10:10:50,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 27: [2023-05-10 10:10:50,839] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 30: [2023-05-10 10:10:50,839] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 30: [2023-05-10 10:10:50,839] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 30: [2023-05-10 10:10:50,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 30: [2023-05-10 10:10:50,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 30: [2023-05-10 10:10:50,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 30: [2023-05-10 10:10:50,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 30: [2023-05-10 10:10:50,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 30: [2023-05-10 10:10:50,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 26: [2023-05-10 10:10:50,840] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 26: [2023-05-10 10:10:50,840] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 26: [2023-05-10 10:10:50,840] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 27: [2023-05-10 10:10:50,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 27: [2023-05-10 10:10:50,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 27: [2023-05-10 10:10:50,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 27: [2023-05-10 10:10:50,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 27: [2023-05-10 10:10:50,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 25: [2023-05-10 10:10:50,840] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 25: [2023-05-10 10:10:50,840] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 26: [2023-05-10 10:10:50,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 26: [2023-05-10 10:10:50,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 26: [2023-05-10 10:10:50,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 12: [2023-05-10 10:10:50,842] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 12: [2023-05-10 10:10:50,842] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 25: [2023-05-10 10:10:50,842] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 25: [2023-05-10 10:10:50,843] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 25: [2023-05-10 10:10:50,843] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 25: [2023-05-10 10:10:50,843] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 12: [2023-05-10 10:10:50,843] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 12: [2023-05-10 10:10:50,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 12: [2023-05-10 10:10:50,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 25: [2023-05-10 10:10:50,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 12: [2023-05-10 10:10:50,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 25: [2023-05-10 10:10:50,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 12: [2023-05-10 10:10:50,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 13: [2023-05-10 10:10:50,845] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 13: [2023-05-10 10:10:50,845] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 13: [2023-05-10 10:10:50,845] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 27: [2023-05-10 10:10:50,849] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 30: [2023-05-10 10:10:50,852] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 30: [2023-05-10 10:10:50,852] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 30: [2023-05-10 10:10:50,853] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 30: [2023-05-10 10:10:50,853] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 30: [2023-05-10 10:10:50,854] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 30: [2023-05-10 10:10:50,854] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 30: [2023-05-10 10:10:50,854] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 30: [2023-05-10 10:10:50,855] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 27: [2023-05-10 10:10:50,856] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 16: [2023-05-10 10:10:50,857] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 16: [2023-05-10 10:10:50,858] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 13: [2023-05-10 10:10:50,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 13: [2023-05-10 10:10:50,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 13: [2023-05-10 10:10:50,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 13: [2023-05-10 10:10:50,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 13: [2023-05-10 10:10:50,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 16: [2023-05-10 10:10:50,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 16: [2023-05-10 10:10:50,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 16: [2023-05-10 10:10:50,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 16: [2023-05-10 10:10:50,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 16: [2023-05-10 10:10:50,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 16: [2023-05-10 10:10:50,865] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 27: [2023-05-10 10:10:50,870] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 13: [2023-05-10 10:10:50,870] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 27: [2023-05-10 10:10:50,871] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 27: [2023-05-10 10:10:50,872] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 13: [2023-05-10 10:10:50,872] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 13: [2023-05-10 10:10:50,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 27: [2023-05-10 10:10:50,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 27: [2023-05-10 10:10:50,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 2: [2023-05-10 10:10:50,844] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 2: [2023-05-10 10:10:50,844] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 2: [2023-05-10 10:10:50,865] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 2: [2023-05-10 10:10:50,865] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 2: [2023-05-10 10:10:50,874] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 2: [2023-05-10 10:10:50,874] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 2: [2023-05-10 10:10:50,877] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 2: [2023-05-10 10:10:50,877] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 2: [2023-05-10 10:10:50,877] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 2: [2023-05-10 10:10:50,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 26: [2023-05-10 10:10:50,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 26: [2023-05-10 10:10:50,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 12: [2023-05-10 10:10:50,887] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 16: [2023-05-10 10:10:50,888] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 14: [2023-05-10 10:10:50,893] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 14: [2023-05-10 10:10:50,893] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 16: [2023-05-10 10:10:50,893] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 13: [2023-05-10 10:10:50,893] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 13: [2023-05-10 10:10:50,894] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 16: [2023-05-10 10:10:50,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 14: [2023-05-10 10:10:50,895] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 14: [2023-05-10 10:10:50,895] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 14: [2023-05-10 10:10:50,895] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 14: [2023-05-10 10:10:50,895] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 14: [2023-05-10 10:10:50,895] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 16: [2023-05-10 10:10:50,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 14: [2023-05-10 10:10:50,896] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 13: [2023-05-10 10:10:50,896] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 13: [2023-05-10 10:10:50,896] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 13: [2023-05-10 10:10:50,897] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 16: [2023-05-10 10:10:50,898] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 16: [2023-05-10 10:10:50,898] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 25: [2023-05-10 10:10:50,899] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 25: [2023-05-10 10:10:50,899] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 25: [2023-05-10 10:10:50,899] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 14: [2023-05-10 10:10:50,900] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 14: [2023-05-10 10:10:50,901] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 14: [2023-05-10 10:10:50,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 14: [2023-05-10 10:10:50,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 14: [2023-05-10 10:10:50,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 14: [2023-05-10 10:10:50,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 14: [2023-05-10 10:10:50,903] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 14: [2023-05-10 10:10:50,904] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 25: [2023-05-10 10:10:50,904] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 25: [2023-05-10 10:10:50,904] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 25: [2023-05-10 10:10:50,904] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 25: [2023-05-10 10:10:50,904] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 25: [2023-05-10 10:10:50,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 26: [2023-05-10 10:10:50,907] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 26: [2023-05-10 10:10:50,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 26: [2023-05-10 10:10:50,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 26: [2023-05-10 10:10:50,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 26: [2023-05-10 10:10:50,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 5: [2023-05-10 10:10:50,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 5: [2023-05-10 10:10:50,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 5: [2023-05-10 10:10:50,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 5: [2023-05-10 10:10:50,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 5: [2023-05-10 10:10:50,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 5: [2023-05-10 10:10:50,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 5: [2023-05-10 10:10:50,910] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 5: [2023-05-10 10:10:50,910] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 26: [2023-05-10 10:10:50,910] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 26: [2023-05-10 10:10:50,910] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 26: [2023-05-10 10:10:50,910] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 12: [2023-05-10 10:10:50,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 5: [2023-05-10 10:10:50,914] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 12: [2023-05-10 10:10:50,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 5: [2023-05-10 10:10:50,915] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 5: [2023-05-10 10:10:50,915] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 12: [2023-05-10 10:10:50,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 5: [2023-05-10 10:10:50,916] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 5: [2023-05-10 10:10:50,916] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 5: [2023-05-10 10:10:50,917] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 5: [2023-05-10 10:10:50,918] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 5: [2023-05-10 10:10:50,918] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 19: [2023-05-10 10:10:50,923] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 19: [2023-05-10 10:10:50,923] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 19: [2023-05-10 10:10:50,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 25: [2023-05-10 10:10:50,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 19: [2023-05-10 10:10:50,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 19: [2023-05-10 10:10:50,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 19: [2023-05-10 10:10:50,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 19: [2023-05-10 10:10:50,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 19: [2023-05-10 10:10:50,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 25: [2023-05-10 10:10:50,926] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 12: [2023-05-10 10:10:50,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 12: [2023-05-10 10:10:50,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 12: [2023-05-10 10:10:50,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 12: [2023-05-10 10:10:50,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 12: [2023-05-10 10:10:50,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 2: [2023-05-10 10:10:50,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 2: [2023-05-10 10:10:50,907] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 2: [2023-05-10 10:10:50,909] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 2: [2023-05-10 10:10:50,913] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 2: [2023-05-10 10:10:50,913] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 2: [2023-05-10 10:10:50,914] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 25: [2023-05-10 10:10:50,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 30: [2023-05-10 10:10:50,927] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 30: [2023-05-10 10:10:50,927] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 19: [2023-05-10 10:10:50,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 19: [2023-05-10 10:10:50,929] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 19: [2023-05-10 10:10:50,929] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 19: [2023-05-10 10:10:50,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 19: [2023-05-10 10:10:50,933] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 19: [2023-05-10 10:10:50,934] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 19: [2023-05-10 10:10:50,934] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 19: [2023-05-10 10:10:50,935] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 12: [2023-05-10 10:10:50,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 25: [2023-05-10 10:10:50,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 26: [2023-05-10 10:10:50,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 25: [2023-05-10 10:10:50,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 12: [2023-05-10 10:10:50,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 25: [2023-05-10 10:10:50,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 26: [2023-05-10 10:10:50,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 30: [2023-05-10 10:10:50,941] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 30: [2023-05-10 10:10:50,941] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 30: [2023-05-10 10:10:50,941] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 30: [2023-05-10 10:10:50,941] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 30: [2023-05-10 10:10:50,941] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 25: [2023-05-10 10:10:50,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 25: [2023-05-10 10:10:50,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 30: [2023-05-10 10:10:50,941] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 26: [2023-05-10 10:10:50,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 26: [2023-05-10 10:10:50,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 9: [2023-05-10 10:10:50,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 9: [2023-05-10 10:10:50,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 26: [2023-05-10 10:10:50,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 9: [2023-05-10 10:10:50,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 9: [2023-05-10 10:10:50,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 9: [2023-05-10 10:10:50,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 9: [2023-05-10 10:10:50,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 9: [2023-05-10 10:10:50,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 9: [2023-05-10 10:10:50,947] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 26: [2023-05-10 10:10:50,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 9: [2023-05-10 10:10:50,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 9: [2023-05-10 10:10:50,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 9: [2023-05-10 10:10:50,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 11: [2023-05-10 10:10:50,952] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 11: [2023-05-10 10:10:50,952] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 30: [2023-05-10 10:10:50,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 6: [2023-05-10 10:10:50,952] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 6: [2023-05-10 10:10:50,952] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 30: [2023-05-10 10:10:50,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 9: [2023-05-10 10:10:50,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 9: [2023-05-10 10:10:50,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 11: [2023-05-10 10:10:50,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 11: [2023-05-10 10:10:50,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 11: [2023-05-10 10:10:50,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 11: [2023-05-10 10:10:50,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 11: [2023-05-10 10:10:50,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 9: [2023-05-10 10:10:50,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 11: [2023-05-10 10:10:50,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 9: [2023-05-10 10:10:50,955] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 14: [2023-05-10 10:10:50,955] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 9: [2023-05-10 10:10:50,955] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 6: [2023-05-10 10:10:50,956] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 6: [2023-05-10 10:10:50,956] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 14: [2023-05-10 10:10:50,956] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 14: [2023-05-10 10:10:50,956] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 6: [2023-05-10 10:10:50,956] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 6: [2023-05-10 10:10:50,956] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 6: [2023-05-10 10:10:50,956] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 6: [2023-05-10 10:10:50,956] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 6: [2023-05-10 10:10:50,957] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 6: [2023-05-10 10:10:50,957] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 11: [2023-05-10 10:10:50,957] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 11: [2023-05-10 10:10:50,957] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 12: [2023-05-10 10:10:50,959] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 12: [2023-05-10 10:10:50,960] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 12: [2023-05-10 10:10:50,960] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 12: [2023-05-10 10:10:50,961] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 12: [2023-05-10 10:10:50,964] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 11: [2023-05-10 10:10:50,964] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 11: [2023-05-10 10:10:50,964] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 11: [2023-05-10 10:10:50,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 11: [2023-05-10 10:10:50,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 11: [2023-05-10 10:10:50,966] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 6: [2023-05-10 10:10:50,966] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 11: [2023-05-10 10:10:50,966] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 6: [2023-05-10 10:10:50,966] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 3: [2023-05-10 10:10:50,967] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 3: [2023-05-10 10:10:50,967] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 14: [2023-05-10 10:10:50,967] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 14: [2023-05-10 10:10:50,967] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 6: [2023-05-10 10:10:50,967] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 14: [2023-05-10 10:10:50,967] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 6: [2023-05-10 10:10:50,967] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 5: [2023-05-10 10:10:50,968] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 6: [2023-05-10 10:10:50,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 6: [2023-05-10 10:10:50,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 14: [2023-05-10 10:10:50,969] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 14: [2023-05-10 10:10:50,969] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 3: [2023-05-10 10:10:50,969] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 3: [2023-05-10 10:10:50,969] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 3: [2023-05-10 10:10:50,970] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 3: [2023-05-10 10:10:50,970] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 3: [2023-05-10 10:10:50,970] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 3: [2023-05-10 10:10:50,970] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 5: [2023-05-10 10:10:50,971] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 3: [2023-05-10 10:10:50,972] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 3: [2023-05-10 10:10:50,972] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 3: [2023-05-10 10:10:50,974] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 5: [2023-05-10 10:10:50,975] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 3: [2023-05-10 10:10:50,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 3: [2023-05-10 10:10:50,977] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 30: [2023-05-10 10:10:50,978] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 3: [2023-05-10 10:10:50,978] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 30: [2023-05-10 10:10:50,978] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 3: [2023-05-10 10:10:50,979] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 23: [2023-05-10 10:10:50,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 23: [2023-05-10 10:10:50,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 23: [2023-05-10 10:10:50,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 30: [2023-05-10 10:10:50,979] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 23: [2023-05-10 10:10:50,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 3: [2023-05-10 10:10:50,979] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 23: [2023-05-10 10:10:50,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 23: [2023-05-10 10:10:50,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 23: [2023-05-10 10:10:50,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 23: [2023-05-10 10:10:50,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 30: [2023-05-10 10:10:50,981] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 30: [2023-05-10 10:10:50,981] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 30: [2023-05-10 10:10:50,981] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 14: [2023-05-10 10:10:50,981] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 14: [2023-05-10 10:10:50,981] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 5: [2023-05-10 10:10:50,982] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 5: [2023-05-10 10:10:50,982] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 5: [2023-05-10 10:10:50,982] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 19: [2023-05-10 10:10:50,982] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 23: [2023-05-10 10:10:50,984] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 23: [2023-05-10 10:10:50,984] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 23: [2023-05-10 10:10:50,984] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 19: [2023-05-10 10:10:50,984] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 19: [2023-05-10 10:10:50,984] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 19: [2023-05-10 10:10:50,984] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 5: [2023-05-10 10:10:50,985] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 5: [2023-05-10 10:10:50,985] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 14: [2023-05-10 10:10:50,986] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 23: [2023-05-10 10:10:50,987] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 23: [2023-05-10 10:10:50,987] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 23: [2023-05-10 10:10:50,988] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 23: [2023-05-10 10:10:50,989] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 23: [2023-05-10 10:10:50,989] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 5: [2023-05-10 10:10:50,990] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 14: [2023-05-10 10:10:50,994] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 14: [2023-05-10 10:10:50,995] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 7: [2023-05-10 10:10:50,996] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 7: [2023-05-10 10:10:50,996] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 7: [2023-05-10 10:10:50,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 7: [2023-05-10 10:10:50,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 7: [2023-05-10 10:10:50,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 7: [2023-05-10 10:10:50,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 7: [2023-05-10 10:10:50,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 5: [2023-05-10 10:10:50,997] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 14: [2023-05-10 10:10:50,997] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 7: [2023-05-10 10:10:50,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 19: [2023-05-10 10:10:50,999] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 9: [2023-05-10 10:10:51,001] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 7: [2023-05-10 10:10:51,001] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 7: [2023-05-10 10:10:51,001] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 14: [2023-05-10 10:10:51,002] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 19: [2023-05-10 10:10:51,002] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 19: [2023-05-10 10:10:51,002] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 19: [2023-05-10 10:10:51,002] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 14: [2023-05-10 10:10:51,002] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 7: [2023-05-10 10:10:51,002] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 7: [2023-05-10 10:10:51,003] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 7: [2023-05-10 10:10:51,003] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 7: [2023-05-10 10:10:51,005] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 29: [2023-05-10 10:10:51,005] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 29: [2023-05-10 10:10:51,005] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 5: [2023-05-10 10:10:51,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 7: [2023-05-10 10:10:51,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 7: [2023-05-10 10:10:51,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt... 19: [2023-05-10 10:10:51,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 9: [2023-05-10 10:10:51,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 19: [2023-05-10 10:10:51,007] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 29: [2023-05-10 10:10:51,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 29: [2023-05-10 10:10:51,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 29: [2023-05-10 10:10:51,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 29: [2023-05-10 10:10:51,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 29: [2023-05-10 10:10:51,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 29: [2023-05-10 10:10:51,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 6: [2023-05-10 10:10:51,008] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 6: [2023-05-10 10:10:51,008] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 11: [2023-05-10 10:10:51,008] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 11: [2023-05-10 10:10:51,008] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 19: [2023-05-10 10:10:51,009] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 19: [2023-05-10 10:10:51,009] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 29: [2023-05-10 10:10:51,009] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 29: [2023-05-10 10:10:51,010] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 5: [2023-05-10 10:10:51,011] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 5: [2023-05-10 10:10:51,012] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 9: [2023-05-10 10:10:51,012] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 9: [2023-05-10 10:10:51,012] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 9: [2023-05-10 10:10:51,012] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 5: [2023-05-10 10:10:51,013] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 9: [2023-05-10 10:10:51,014] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 29: [2023-05-10 10:10:51,015] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 9: [2023-05-10 10:10:51,015] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 29: [2023-05-10 10:10:51,016] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 29: [2023-05-10 10:10:51,016] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 29: [2023-05-10 10:10:51,016] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 5: [2023-05-10 10:10:51,017] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 29: [2023-05-10 10:10:51,017] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 9: [2023-05-10 10:10:51,017] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 5: [2023-05-10 10:10:51,017] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 29: [2023-05-10 10:10:51,018] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 21: [2023-05-10 10:10:51,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 21: [2023-05-10 10:10:51,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 21: [2023-05-10 10:10:51,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 21: [2023-05-10 10:10:51,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 21: [2023-05-10 10:10:51,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 21: [2023-05-10 10:10:51,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 21: [2023-05-10 10:10:51,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 21: [2023-05-10 10:10:51,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 28: [2023-05-10 10:10:51,023] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 28: [2023-05-10 10:10:51,024] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 9: [2023-05-10 10:10:51,024] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 19: [2023-05-10 10:10:51,024] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 21: [2023-05-10 10:10:51,024] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 28: [2023-05-10 10:10:51,024] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 28: [2023-05-10 10:10:51,024] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 28: [2023-05-10 10:10:51,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 28: [2023-05-10 10:10:51,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 28: [2023-05-10 10:10:51,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 28: [2023-05-10 10:10:51,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 21: [2023-05-10 10:10:51,025] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 21: [2023-05-10 10:10:51,026] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 21: [2023-05-10 10:10:51,026] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 1: [2023-05-10 10:10:51,027] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 1: [2023-05-10 10:10:51,027] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 1: [2023-05-10 10:10:51,027] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 1: [2023-05-10 10:10:51,027] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 1: [2023-05-10 10:10:51,027] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 1: [2023-05-10 10:10:51,027] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 1: [2023-05-10 10:10:51,027] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 21: [2023-05-10 10:10:51,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 1: [2023-05-10 10:10:51,027] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 21: [2023-05-10 10:10:51,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 19: [2023-05-10 10:10:51,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 21: [2023-05-10 10:10:51,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 28: [2023-05-10 10:10:51,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 21: [2023-05-10 10:10:51,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 3: [2023-05-10 10:10:51,028] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 3: [2023-05-10 10:10:51,028] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 3: [2023-05-10 10:10:51,028] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 28: [2023-05-10 10:10:51,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 19: [2023-05-10 10:10:51,030] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 19: [2023-05-10 10:10:51,030] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 6: [2023-05-10 10:10:51,030] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 6: [2023-05-10 10:10:51,030] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 11: [2023-05-10 10:10:51,032] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 11: [2023-05-10 10:10:51,032] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 9: [2023-05-10 10:10:51,032] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 11: [2023-05-10 10:10:51,033] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 11: [2023-05-10 10:10:51,033] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 11: [2023-05-10 10:10:51,033] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 3: [2023-05-10 10:10:51,033] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 6: [2023-05-10 10:10:51,033] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 28: [2023-05-10 10:10:51,033] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 6: [2023-05-10 10:10:51,034] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 1: [2023-05-10 10:10:51,034] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 1: [2023-05-10 10:10:51,034] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 1: [2023-05-10 10:10:51,034] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 1: [2023-05-10 10:10:51,034] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 28: [2023-05-10 10:10:51,034] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 28: [2023-05-10 10:10:51,035] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 1: [2023-05-10 10:10:51,035] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 28: [2023-05-10 10:10:51,035] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 11: [2023-05-10 10:10:51,035] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 11: [2023-05-10 10:10:51,035] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 11: [2023-05-10 10:10:51,035] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 28: [2023-05-10 10:10:51,035] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 1: [2023-05-10 10:10:51,036] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 1: [2023-05-10 10:10:51,036] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 1: [2023-05-10 10:10:51,036] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 28: [2023-05-10 10:10:51,036] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 3: [2023-05-10 10:10:51,038] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 23: [2023-05-10 10:10:51,039] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 23: [2023-05-10 10:10:51,039] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 23: [2023-05-10 10:10:51,039] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 6: [2023-05-10 10:10:51,039] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 6: [2023-05-10 10:10:51,039] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 6: [2023-05-10 10:10:51,039] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 6: [2023-05-10 10:10:51,039] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 9: [2023-05-10 10:10:51,041] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 9: [2023-05-10 10:10:51,041] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 9: [2023-05-10 10:10:51,041] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 9: [2023-05-10 10:10:51,041] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 9: [2023-05-10 10:10:51,046] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 9: [2023-05-10 10:10:51,046] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 3: [2023-05-10 10:10:51,048] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 3: [2023-05-10 10:10:51,048] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 3: [2023-05-10 10:10:51,048] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 23: [2023-05-10 10:10:51,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 3: [2023-05-10 10:10:51,052] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 3: [2023-05-10 10:10:51,052] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 3: [2023-05-10 10:10:51,052] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 7: [2023-05-10 10:10:51,053] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 23: [2023-05-10 10:10:51,053] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 23: [2023-05-10 10:10:51,053] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 23: [2023-05-10 10:10:51,053] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 23: [2023-05-10 10:10:51,054] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 3: [2023-05-10 10:10:51,059] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 29: [2023-05-10 10:10:51,060] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 29: [2023-05-10 10:10:51,060] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 7: [2023-05-10 10:10:51,061] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 7: [2023-05-10 10:10:51,061] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 7: [2023-05-10 10:10:51,061] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 7: [2023-05-10 10:10:51,061] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 23: [2023-05-10 10:10:51,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 6: [2023-05-10 10:10:51,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 6: [2023-05-10 10:10:51,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 3: [2023-05-10 10:10:51,063] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 23: [2023-05-10 10:10:51,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 23: [2023-05-10 10:10:51,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 7: [2023-05-10 10:10:51,065] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 7: [2023-05-10 10:10:51,066] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 7: [2023-05-10 10:10:51,066] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_10-model_00-model_states.pt. 21: [2023-05-10 10:10:51,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 6: [2023-05-10 10:10:51,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 6: [2023-05-10 10:10:51,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 6: [2023-05-10 10:10:51,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 6: [2023-05-10 10:10:51,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 3: [2023-05-10 10:10:51,075] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 7: [2023-05-10 10:10:51,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 3: [2023-05-10 10:10:51,076] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 23: [2023-05-10 10:10:51,077] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 11: [2023-05-10 10:10:51,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 11: [2023-05-10 10:10:51,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 11: [2023-05-10 10:10:51,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 11: [2023-05-10 10:10:51,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 11: [2023-05-10 10:10:51,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 11: [2023-05-10 10:10:51,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 3: [2023-05-10 10:10:51,078] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 28: [2023-05-10 10:10:51,079] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 28: [2023-05-10 10:10:51,080] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 21: [2023-05-10 10:10:51,081] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 23: [2023-05-10 10:10:51,082] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 23: [2023-05-10 10:10:51,082] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 23: [2023-05-10 10:10:51,082] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 29: [2023-05-10 10:10:51,082] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 29: [2023-05-10 10:10:51,082] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 20: [2023-05-10 10:10:51,082] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 20: [2023-05-10 10:10:51,082] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 20: [2023-05-10 10:10:51,082] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 29: [2023-05-10 10:10:51,083] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 20: [2023-05-10 10:10:51,083] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 20: [2023-05-10 10:10:51,083] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 20: [2023-05-10 10:10:51,083] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 20: [2023-05-10 10:10:51,083] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 20: [2023-05-10 10:10:51,083] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 23: [2023-05-10 10:10:51,084] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 21: [2023-05-10 10:10:51,086] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 7: [2023-05-10 10:10:51,087] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 20: [2023-05-10 10:10:51,086] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 1: [2023-05-10 10:10:51,087] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 1: [2023-05-10 10:10:51,087] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 29: [2023-05-10 10:10:51,088] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 29: [2023-05-10 10:10:51,088] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 29: [2023-05-10 10:10:51,088] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 29: [2023-05-10 10:10:51,088] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 29: [2023-05-10 10:10:51,088] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 7: [2023-05-10 10:10:51,088] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 1: [2023-05-10 10:10:51,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 7: [2023-05-10 10:10:51,089] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 20: [2023-05-10 10:10:51,089] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 22: [2023-05-10 10:10:51,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 22: [2023-05-10 10:10:51,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 7: [2023-05-10 10:10:51,090] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 21: [2023-05-10 10:10:51,090] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 22: [2023-05-10 10:10:51,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 22: [2023-05-10 10:10:51,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 22: [2023-05-10 10:10:51,091] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 22: [2023-05-10 10:10:51,091] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 22: [2023-05-10 10:10:51,091] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 21: [2023-05-10 10:10:51,091] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 22: [2023-05-10 10:10:51,091] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 7: [2023-05-10 10:10:51,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 7: [2023-05-10 10:10:51,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 20: [2023-05-10 10:10:51,093] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 20: [2023-05-10 10:10:51,094] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 21: [2023-05-10 10:10:51,094] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 21: [2023-05-10 10:10:51,094] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 21: [2023-05-10 10:10:51,094] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 21: [2023-05-10 10:10:51,094] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 20: [2023-05-10 10:10:51,094] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 20: [2023-05-10 10:10:51,094] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 20: [2023-05-10 10:10:51,095] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 7: [2023-05-10 10:10:51,095] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 28: [2023-05-10 10:10:51,096] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 20: [2023-05-10 10:10:51,096] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 1: [2023-05-10 10:10:51,096] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 22: [2023-05-10 10:10:51,095] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 22: [2023-05-10 10:10:51,096] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 1: [2023-05-10 10:10:51,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 1: [2023-05-10 10:10:51,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 1: [2023-05-10 10:10:51,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 1: [2023-05-10 10:10:51,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 28: [2023-05-10 10:10:51,100] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 28: [2023-05-10 10:10:51,101] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 22: [2023-05-10 10:10:51,101] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 22: [2023-05-10 10:10:51,102] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 22: [2023-05-10 10:10:51,102] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 28: [2023-05-10 10:10:51,102] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 28: [2023-05-10 10:10:51,102] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 22: [2023-05-10 10:10:51,103] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 22: [2023-05-10 10:10:51,103] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 22: [2023-05-10 10:10:51,103] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 4: [2023-05-10 10:10:51,104] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 4: [2023-05-10 10:10:51,104] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 4: [2023-05-10 10:10:51,104] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 4: [2023-05-10 10:10:51,104] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 4: [2023-05-10 10:10:51,104] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 4: [2023-05-10 10:10:51,104] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 4: [2023-05-10 10:10:51,104] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 4: [2023-05-10 10:10:51,104] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 21: [2023-05-10 10:10:51,106] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 29: [2023-05-10 10:10:51,107] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 31: [2023-05-10 10:10:51,109] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 28: [2023-05-10 10:10:51,109] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 28: [2023-05-10 10:10:51,109] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 31: [2023-05-10 10:10:51,109] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 31: [2023-05-10 10:10:51,109] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 31: [2023-05-10 10:10:51,109] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 31: [2023-05-10 10:10:51,109] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 31: [2023-05-10 10:10:51,109] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 31: [2023-05-10 10:10:51,109] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 4: [2023-05-10 10:10:51,109] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 4: [2023-05-10 10:10:51,110] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 4: [2023-05-10 10:10:51,110] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 31: [2023-05-10 10:10:51,110] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 28: [2023-05-10 10:10:51,110] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 4: [2023-05-10 10:10:51,110] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 1: [2023-05-10 10:10:51,112] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 4: [2023-05-10 10:10:51,112] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 31: [2023-05-10 10:10:51,112] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 4: [2023-05-10 10:10:51,112] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 4: [2023-05-10 10:10:51,112] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 4: [2023-05-10 10:10:51,112] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 1: [2023-05-10 10:10:51,113] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 8: [2023-05-10 10:10:51,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 8: [2023-05-10 10:10:51,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 8: [2023-05-10 10:10:51,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 8: [2023-05-10 10:10:51,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 8: [2023-05-10 10:10:51,114] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 8: [2023-05-10 10:10:51,114] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 8: [2023-05-10 10:10:51,114] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 8: [2023-05-10 10:10:51,114] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 21: [2023-05-10 10:10:51,114] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 31: [2023-05-10 10:10:51,114] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 8: [2023-05-10 10:10:51,117] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 31: [2023-05-10 10:10:51,118] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 31: [2023-05-10 10:10:51,118] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 8: [2023-05-10 10:10:51,118] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 31: [2023-05-10 10:10:51,118] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 8: [2023-05-10 10:10:51,119] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 31: [2023-05-10 10:10:51,118] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 31: [2023-05-10 10:10:51,119] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 1: [2023-05-10 10:10:51,119] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 8: [2023-05-10 10:10:51,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 31: [2023-05-10 10:10:51,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 29: [2023-05-10 10:10:51,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 8: [2023-05-10 10:10:51,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 29: [2023-05-10 10:10:51,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 8: [2023-05-10 10:10:51,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 8: [2023-05-10 10:10:51,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 29: [2023-05-10 10:10:51,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 8: [2023-05-10 10:10:51,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 29: [2023-05-10 10:10:51,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 29: [2023-05-10 10:10:51,122] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 10: [2023-05-10 10:10:51,122] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 10: [2023-05-10 10:10:51,122] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 10: [2023-05-10 10:10:51,122] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 10: [2023-05-10 10:10:51,122] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 10: [2023-05-10 10:10:51,122] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 10: [2023-05-10 10:10:51,122] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 10: [2023-05-10 10:10:51,122] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 10: [2023-05-10 10:10:51,123] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 1: [2023-05-10 10:10:51,123] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 21: [2023-05-10 10:10:51,124] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 28: [2023-05-10 10:10:51,125] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 21: [2023-05-10 10:10:51,127] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 21: [2023-05-10 10:10:51,127] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 21: [2023-05-10 10:10:51,128] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 10: [2023-05-10 10:10:51,128] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 20: [2023-05-10 10:10:51,129] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 10: [2023-05-10 10:10:51,129] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 10: [2023-05-10 10:10:51,130] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 21: [2023-05-10 10:10:51,130] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 10: [2023-05-10 10:10:51,130] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 10: [2023-05-10 10:10:51,130] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 10: [2023-05-10 10:10:51,131] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 18: [2023-05-10 10:10:51,131] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 18: [2023-05-10 10:10:51,131] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 18: [2023-05-10 10:10:51,131] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 18: [2023-05-10 10:10:51,131] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 18: [2023-05-10 10:10:51,131] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 18: [2023-05-10 10:10:51,131] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 18: [2023-05-10 10:10:51,131] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 10: [2023-05-10 10:10:51,132] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 18: [2023-05-10 10:10:51,131] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 10: [2023-05-10 10:10:51,132] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 1: [2023-05-10 10:10:51,130] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 1: [2023-05-10 10:10:51,130] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 1: [2023-05-10 10:10:51,131] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 1: [2023-05-10 10:10:51,131] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 18: [2023-05-10 10:10:51,136] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 18: [2023-05-10 10:10:51,137] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 18: [2023-05-10 10:10:51,137] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 28: [2023-05-10 10:10:51,137] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 18: [2023-05-10 10:10:51,137] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 18: [2023-05-10 10:10:51,138] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 28: [2023-05-10 10:10:51,138] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 28: [2023-05-10 10:10:51,138] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 18: [2023-05-10 10:10:51,138] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 28: [2023-05-10 10:10:51,139] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 18: [2023-05-10 10:10:51,139] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 18: [2023-05-10 10:10:51,139] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 28: [2023-05-10 10:10:51,141] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 20: [2023-05-10 10:10:51,149] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 20: [2023-05-10 10:10:51,150] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 22: [2023-05-10 10:10:51,150] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 22: [2023-05-10 10:10:51,151] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 31: [2023-05-10 10:10:51,156] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 8: [2023-05-10 10:10:51,165] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 8: [2023-05-10 10:10:51,167] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 27: [2023-05-10 10:10:51,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 27: [2023-05-10 10:10:51,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 4: [2023-05-10 10:10:51,167] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 4: [2023-05-10 10:10:51,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 4: [2023-05-10 10:10:51,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 4: [2023-05-10 10:10:51,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 27: [2023-05-10 10:10:51,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 27: [2023-05-10 10:10:51,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 27: [2023-05-10 10:10:51,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 27: [2023-05-10 10:10:51,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 27: [2023-05-10 10:10:51,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 27: [2023-05-10 10:10:51,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 16: [2023-05-10 10:10:51,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 16: [2023-05-10 10:10:51,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 15: [2023-05-10 10:10:51,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 16: [2023-05-10 10:10:51,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 16: [2023-05-10 10:10:51,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 16: [2023-05-10 10:10:51,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 16: [2023-05-10 10:10:51,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 31: [2023-05-10 10:10:51,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 16: [2023-05-10 10:10:51,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 15: [2023-05-10 10:10:51,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 15: [2023-05-10 10:10:51,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 0: [2023-05-10 10:10:51,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 15: [2023-05-10 10:10:51,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 15: [2023-05-10 10:10:51,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 15: [2023-05-10 10:10:51,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 15: [2023-05-10 10:10:51,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 0: [2023-05-10 10:10:51,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 0: [2023-05-10 10:10:51,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 16: [2023-05-10 10:10:51,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 20: [2023-05-10 10:10:51,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 0: [2023-05-10 10:10:51,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 0: [2023-05-10 10:10:51,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 0: [2023-05-10 10:10:51,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 0: [2023-05-10 10:10:51,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 15: [2023-05-10 10:10:51,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 0: [2023-05-10 10:10:51,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 22: [2023-05-10 10:10:51,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 22: [2023-05-10 10:10:51,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 27: [2023-05-10 10:10:51,171] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 4: [2023-05-10 10:10:51,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 4: [2023-05-10 10:10:51,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 4: [2023-05-10 10:10:51,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 4: [2023-05-10 10:10:51,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 15: [2023-05-10 10:10:51,173] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 27: [2023-05-10 10:10:51,173] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 22: [2023-05-10 10:10:51,174] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 0: [2023-05-10 10:10:51,175] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 0: [2023-05-10 10:10:51,175] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 0: [2023-05-10 10:10:51,175] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 22: [2023-05-10 10:10:51,175] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 22: [2023-05-10 10:10:51,175] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 22: [2023-05-10 10:10:51,175] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 22: [2023-05-10 10:10:51,175] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 16: [2023-05-10 10:10:51,175] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 31: [2023-05-10 10:10:51,175] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 20: [2023-05-10 10:10:51,175] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 22: [2023-05-10 10:10:51,176] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 16: [2023-05-10 10:10:51,176] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 0: [2023-05-10 10:10:51,177] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 0: [2023-05-10 10:10:51,177] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 0: [2023-05-10 10:10:51,177] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 0: [2023-05-10 10:10:51,177] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 16: [2023-05-10 10:10:51,177] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 27: [2023-05-10 10:10:51,177] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 0: [2023-05-10 10:10:51,178] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 16: [2023-05-10 10:10:51,178] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 27: [2023-05-10 10:10:51,178] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 16: [2023-05-10 10:10:51,178] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 8: [2023-05-10 10:10:51,178] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 27: [2023-05-10 10:10:51,178] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 16: [2023-05-10 10:10:51,179] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 17: [2023-05-10 10:10:51,178] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 27: [2023-05-10 10:10:51,179] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 16: [2023-05-10 10:10:51,179] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 27: [2023-05-10 10:10:51,179] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 27: [2023-05-10 10:10:51,179] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 17: [2023-05-10 10:10:51,179] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 17: [2023-05-10 10:10:51,179] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 17: [2023-05-10 10:10:51,179] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 17: [2023-05-10 10:10:51,179] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 17: [2023-05-10 10:10:51,179] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 17: [2023-05-10 10:10:51,179] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 17: [2023-05-10 10:10:51,180] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 16: [2023-05-10 10:10:51,180] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 20: [2023-05-10 10:10:51,179] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 20: [2023-05-10 10:10:51,179] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 20: [2023-05-10 10:10:51,179] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 20: [2023-05-10 10:10:51,179] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 20: [2023-05-10 10:10:51,179] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 15: [2023-05-10 10:10:51,181] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 15: [2023-05-10 10:10:51,181] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 15: [2023-05-10 10:10:51,182] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 15: [2023-05-10 10:10:51,182] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 8: [2023-05-10 10:10:51,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 15: [2023-05-10 10:10:51,182] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 15: [2023-05-10 10:10:51,183] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 26: [2023-05-10 10:10:51,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 26: [2023-05-10 10:10:51,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 26: [2023-05-10 10:10:51,183] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 26: [2023-05-10 10:10:51,183] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 26: [2023-05-10 10:10:51,183] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 26: [2023-05-10 10:10:51,183] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 26: [2023-05-10 10:10:51,183] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 26: [2023-05-10 10:10:51,183] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 17: [2023-05-10 10:10:51,183] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 15: [2023-05-10 10:10:51,184] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 10: [2023-05-10 10:10:51,184] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 12: [2023-05-10 10:10:51,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 12: [2023-05-10 10:10:51,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 12: [2023-05-10 10:10:51,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 10: [2023-05-10 10:10:51,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 10: [2023-05-10 10:10:51,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 12: [2023-05-10 10:10:51,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 12: [2023-05-10 10:10:51,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 12: [2023-05-10 10:10:51,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 12: [2023-05-10 10:10:51,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 12: [2023-05-10 10:10:51,186] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 26: [2023-05-10 10:10:51,187] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 26: [2023-05-10 10:10:51,187] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 8: [2023-05-10 10:10:51,188] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 31: [2023-05-10 10:10:51,189] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 12: [2023-05-10 10:10:51,189] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 8: [2023-05-10 10:10:51,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 8: [2023-05-10 10:10:51,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 8: [2023-05-10 10:10:51,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 8: [2023-05-10 10:10:51,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 4: [2023-05-10 10:10:51,191] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 17: [2023-05-10 10:10:51,191] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 17: [2023-05-10 10:10:51,192] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 26: [2023-05-10 10:10:51,191] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 26: [2023-05-10 10:10:51,192] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 31: [2023-05-10 10:10:51,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 31: [2023-05-10 10:10:51,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 31: [2023-05-10 10:10:51,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 31: [2023-05-10 10:10:51,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 31: [2023-05-10 10:10:51,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 17: [2023-05-10 10:10:51,193] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 31: [2023-05-10 10:10:51,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 17: [2023-05-10 10:10:51,193] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 10: [2023-05-10 10:10:51,193] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 4: [2023-05-10 10:10:51,193] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 26: [2023-05-10 10:10:51,193] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 26: [2023-05-10 10:10:51,193] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 8: [2023-05-10 10:10:51,193] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 17: [2023-05-10 10:10:51,193] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 17: [2023-05-10 10:10:51,194] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 22: [2023-05-10 10:10:51,194] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 26: [2023-05-10 10:10:51,194] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 10: [2023-05-10 10:10:51,194] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 17: [2023-05-10 10:10:51,194] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 26: [2023-05-10 10:10:51,194] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 18: [2023-05-10 10:10:51,194] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 18: [2023-05-10 10:10:51,194] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 18: [2023-05-10 10:10:51,194] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 18: [2023-05-10 10:10:51,194] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 12: [2023-05-10 10:10:51,195] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 22: [2023-05-10 10:10:51,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 4: [2023-05-10 10:10:51,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 4: [2023-05-10 10:10:51,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 12: [2023-05-10 10:10:51,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 20: [2023-05-10 10:10:51,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 12: [2023-05-10 10:10:51,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 12: [2023-05-10 10:10:51,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 12: [2023-05-10 10:10:51,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 12: [2023-05-10 10:10:51,198] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 12: [2023-05-10 10:10:51,198] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 24: [2023-05-10 10:10:51,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 24: [2023-05-10 10:10:51,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 18: [2023-05-10 10:10:51,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 18: [2023-05-10 10:10:51,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 24: [2023-05-10 10:10:51,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 24: [2023-05-10 10:10:51,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 24: [2023-05-10 10:10:51,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 24: [2023-05-10 10:10:51,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 18: [2023-05-10 10:10:51,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 18: [2023-05-10 10:10:51,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 24: [2023-05-10 10:10:51,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 24: [2023-05-10 10:10:51,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 10: [2023-05-10 10:10:51,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 10: [2023-05-10 10:10:51,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 10: [2023-05-10 10:10:51,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 4: [2023-05-10 10:10:51,201] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 13: [2023-05-10 10:10:51,202] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 13: [2023-05-10 10:10:51,202] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 13: [2023-05-10 10:10:51,202] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 13: [2023-05-10 10:10:51,202] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 13: [2023-05-10 10:10:51,202] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 13: [2023-05-10 10:10:51,202] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 4: [2023-05-10 10:10:51,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 13: [2023-05-10 10:10:51,202] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 13: [2023-05-10 10:10:51,202] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 4: [2023-05-10 10:10:51,203] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 24: [2023-05-10 10:10:51,203] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 24: [2023-05-10 10:10:51,203] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 8: [2023-05-10 10:10:51,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 4: [2023-05-10 10:10:51,205] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 22: [2023-05-10 10:10:51,205] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 22: [2023-05-10 10:10:51,206] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 22: [2023-05-10 10:10:51,206] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 24: [2023-05-10 10:10:51,206] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 24: [2023-05-10 10:10:51,207] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 24: [2023-05-10 10:10:51,207] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 24: [2023-05-10 10:10:51,207] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 24: [2023-05-10 10:10:51,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 22: [2023-05-10 10:10:51,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 24: [2023-05-10 10:10:51,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 20: [2023-05-10 10:10:51,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 8: [2023-05-10 10:10:51,211] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 10: [2023-05-10 10:10:51,212] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 20: [2023-05-10 10:10:51,213] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 20: [2023-05-10 10:10:51,213] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 10: [2023-05-10 10:10:51,213] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 13: [2023-05-10 10:10:51,214] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 13: [2023-05-10 10:10:51,214] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 2: [2023-05-10 10:10:51,215] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 2: [2023-05-10 10:10:51,215] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 2: [2023-05-10 10:10:51,215] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 2: [2023-05-10 10:10:51,215] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 2: [2023-05-10 10:10:51,215] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 2: [2023-05-10 10:10:51,215] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 2: [2023-05-10 10:10:51,215] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 13: [2023-05-10 10:10:51,215] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 2: [2023-05-10 10:10:51,215] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 13: [2023-05-10 10:10:51,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 10: [2023-05-10 10:10:51,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 20: [2023-05-10 10:10:51,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 20: [2023-05-10 10:10:51,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 27: [2023-05-10 10:10:51,216] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 13: [2023-05-10 10:10:51,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 13: [2023-05-10 10:10:51,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 13: [2023-05-10 10:10:51,217] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 18: [2023-05-10 10:10:51,217] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 8: [2023-05-10 10:10:51,218] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 2: [2023-05-10 10:10:51,219] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 13: [2023-05-10 10:10:51,219] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 2: [2023-05-10 10:10:51,219] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 8: [2023-05-10 10:10:51,220] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 10: [2023-05-10 10:10:51,220] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 8: [2023-05-10 10:10:51,220] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 8: [2023-05-10 10:10:51,220] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 18: [2023-05-10 10:10:51,221] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 10: [2023-05-10 10:10:51,222] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 18: [2023-05-10 10:10:51,221] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 18: [2023-05-10 10:10:51,221] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 15: [2023-05-10 10:10:51,223] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 2: [2023-05-10 10:10:51,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 31: [2023-05-10 10:10:51,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 2: [2023-05-10 10:10:51,226] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 2: [2023-05-10 10:10:51,226] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 31: [2023-05-10 10:10:51,226] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 2: [2023-05-10 10:10:51,227] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 31: [2023-05-10 10:10:51,227] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 2: [2023-05-10 10:10:51,227] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 2: [2023-05-10 10:10:51,228] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 27: [2023-05-10 10:10:51,228] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 31: [2023-05-10 10:10:51,228] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 18: [2023-05-10 10:10:51,228] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 31: [2023-05-10 10:10:51,229] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 18: [2023-05-10 10:10:51,228] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 16: [2023-05-10 10:10:51,229] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 31: [2023-05-10 10:10:51,229] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 10: [2023-05-10 10:10:51,229] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 18: [2023-05-10 10:10:51,230] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 18: [2023-05-10 10:10:51,230] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 10: [2023-05-10 10:10:51,230] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 16: [2023-05-10 10:10:51,230] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 10: [2023-05-10 10:10:51,231] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 25: [2023-05-10 10:10:51,231] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 25: [2023-05-10 10:10:51,231] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 25: [2023-05-10 10:10:51,231] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 25: [2023-05-10 10:10:51,231] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 25: [2023-05-10 10:10:51,231] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 25: [2023-05-10 10:10:51,231] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 25: [2023-05-10 10:10:51,231] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 25: [2023-05-10 10:10:51,231] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 17: [2023-05-10 10:10:51,233] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 0: [2023-05-10 10:10:51,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 0: [2023-05-10 10:10:51,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 0: [2023-05-10 10:10:51,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 25: [2023-05-10 10:10:51,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 25: [2023-05-10 10:10:51,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 25: [2023-05-10 10:10:51,238] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 27: [2023-05-10 10:10:51,238] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 25: [2023-05-10 10:10:51,238] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 25: [2023-05-10 10:10:51,238] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 25: [2023-05-10 10:10:51,238] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 25: [2023-05-10 10:10:51,239] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 25: [2023-05-10 10:10:51,239] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 26: [2023-05-10 10:10:51,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 27: [2023-05-10 10:10:51,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 12: [2023-05-10 10:10:51,242] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 26: [2023-05-10 10:10:51,243] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 15: [2023-05-10 10:10:51,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 0: [2023-05-10 10:10:51,246] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 0: [2023-05-10 10:10:51,246] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 0: [2023-05-10 10:10:51,246] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 0: [2023-05-10 10:10:51,246] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 0: [2023-05-10 10:10:51,249] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 16: [2023-05-10 10:10:51,250] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 16: [2023-05-10 10:10:51,251] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 27: [2023-05-10 10:10:51,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 27: [2023-05-10 10:10:51,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 27: [2023-05-10 10:10:51,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 27: [2023-05-10 10:10:51,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 27: [2023-05-10 10:10:51,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 16: [2023-05-10 10:10:51,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 16: [2023-05-10 10:10:51,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 16: [2023-05-10 10:10:51,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 16: [2023-05-10 10:10:51,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 16: [2023-05-10 10:10:51,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 27: [2023-05-10 10:10:51,252] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 14: [2023-05-10 10:10:51,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 14: [2023-05-10 10:10:51,253] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 14: [2023-05-10 10:10:51,253] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 14: [2023-05-10 10:10:51,253] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 14: [2023-05-10 10:10:51,253] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 14: [2023-05-10 10:10:51,253] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 14: [2023-05-10 10:10:51,253] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 14: [2023-05-10 10:10:51,253] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 16: [2023-05-10 10:10:51,254] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 15: [2023-05-10 10:10:51,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 15: [2023-05-10 10:10:51,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 26: [2023-05-10 10:10:51,255] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 17: [2023-05-10 10:10:51,256] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 24: [2023-05-10 10:10:51,255] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 24: [2023-05-10 10:10:51,256] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 26: [2023-05-10 10:10:51,257] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 0: [2023-05-10 10:10:51,258] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 17: [2023-05-10 10:10:51,258] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 0: [2023-05-10 10:10:51,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 0: [2023-05-10 10:10:51,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 14: [2023-05-10 10:10:51,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 14: [2023-05-10 10:10:51,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 14: [2023-05-10 10:10:51,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 14: [2023-05-10 10:10:51,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 15: [2023-05-10 10:10:51,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 15: [2023-05-10 10:10:51,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 15: [2023-05-10 10:10:51,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 15: [2023-05-10 10:10:51,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 15: [2023-05-10 10:10:51,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 14: [2023-05-10 10:10:51,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 14: [2023-05-10 10:10:51,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 14: [2023-05-10 10:10:51,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 14: [2023-05-10 10:10:51,263] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 27: [2023-05-10 10:10:51,263] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 26: [2023-05-10 10:10:51,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 26: [2023-05-10 10:10:51,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 26: [2023-05-10 10:10:51,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 26: [2023-05-10 10:10:51,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 17: [2023-05-10 10:10:51,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 26: [2023-05-10 10:10:51,268] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 12: [2023-05-10 10:10:51,268] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 2: [2023-05-10 10:10:51,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 2: [2023-05-10 10:10:51,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 12: [2023-05-10 10:10:51,271] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 12: [2023-05-10 10:10:51,271] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 26: [2023-05-10 10:10:51,274] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 30: [2023-05-10 10:10:51,274] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 30: [2023-05-10 10:10:51,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 0: [2023-05-10 10:10:51,275] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 17: [2023-05-10 10:10:51,273] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 17: [2023-05-10 10:10:51,273] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 17: [2023-05-10 10:10:51,273] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 17: [2023-05-10 10:10:51,273] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 17: [2023-05-10 10:10:51,273] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 6: [2023-05-10 10:10:51,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 6: [2023-05-10 10:10:51,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 30: [2023-05-10 10:10:51,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 30: [2023-05-10 10:10:51,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 30: [2023-05-10 10:10:51,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 30: [2023-05-10 10:10:51,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 30: [2023-05-10 10:10:51,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 0: [2023-05-10 10:10:51,275] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 30: [2023-05-10 10:10:51,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 0: [2023-05-10 10:10:51,275] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 6: [2023-05-10 10:10:51,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 6: [2023-05-10 10:10:51,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 6: [2023-05-10 10:10:51,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 6: [2023-05-10 10:10:51,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 6: [2023-05-10 10:10:51,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 12: [2023-05-10 10:10:51,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 12: [2023-05-10 10:10:51,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 12: [2023-05-10 10:10:51,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 24: [2023-05-10 10:10:51,278] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 6: [2023-05-10 10:10:51,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 0: [2023-05-10 10:10:51,279] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 0: [2023-05-10 10:10:51,279] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 24: [2023-05-10 10:10:51,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 24: [2023-05-10 10:10:51,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 24: [2023-05-10 10:10:51,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 24: [2023-05-10 10:10:51,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 24: [2023-05-10 10:10:51,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 6: [2023-05-10 10:10:51,279] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 24: [2023-05-10 10:10:51,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 24: [2023-05-10 10:10:51,279] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 6: [2023-05-10 10:10:51,279] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 26: [2023-05-10 10:10:51,280] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 27: [2023-05-10 10:10:51,280] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 27: [2023-05-10 10:10:51,282] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 16: [2023-05-10 10:10:51,282] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 17: [2023-05-10 10:10:51,283] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 27: [2023-05-10 10:10:51,283] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 16: [2023-05-10 10:10:51,284] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 27: [2023-05-10 10:10:51,285] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 27: [2023-05-10 10:10:51,285] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 26: [2023-05-10 10:10:51,285] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 6: [2023-05-10 10:10:51,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 2: [2023-05-10 10:10:51,287] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 15: [2023-05-10 10:10:51,287] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 16: [2023-05-10 10:10:51,288] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 6: [2023-05-10 10:10:51,288] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 30: [2023-05-10 10:10:51,288] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 30: [2023-05-10 10:10:51,288] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 16: [2023-05-10 10:10:51,289] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 16: [2023-05-10 10:10:51,289] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 30: [2023-05-10 10:10:51,289] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 6: [2023-05-10 10:10:51,289] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 30: [2023-05-10 10:10:51,289] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 6: [2023-05-10 10:10:51,289] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 30: [2023-05-10 10:10:51,289] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 16: [2023-05-10 10:10:51,289] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 30: [2023-05-10 10:10:51,290] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 6: [2023-05-10 10:10:51,290] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 15: [2023-05-10 10:10:51,290] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 6: [2023-05-10 10:10:51,291] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 2: [2023-05-10 10:10:51,291] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 30: [2023-05-10 10:10:51,291] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 2: [2023-05-10 10:10:51,291] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 30: [2023-05-10 10:10:51,291] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 25: [2023-05-10 10:10:51,293] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 13: [2023-05-10 10:10:51,293] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 15: [2023-05-10 10:10:51,294] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 17: [2023-05-10 10:10:51,294] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 13: [2023-05-10 10:10:51,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 13: [2023-05-10 10:10:51,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 15: [2023-05-10 10:10:51,295] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 13: [2023-05-10 10:10:51,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 15: [2023-05-10 10:10:51,296] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 26: [2023-05-10 10:10:51,296] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 15: [2023-05-10 10:10:51,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 15: [2023-05-10 10:10:51,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 25: [2023-05-10 10:10:51,296] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 25: [2023-05-10 10:10:51,296] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 25: [2023-05-10 10:10:51,296] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 25: [2023-05-10 10:10:51,297] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 2: [2023-05-10 10:10:51,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 2: [2023-05-10 10:10:51,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 25: [2023-05-10 10:10:51,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 25: [2023-05-10 10:10:51,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 25: [2023-05-10 10:10:51,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 2: [2023-05-10 10:10:51,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 26: [2023-05-10 10:10:51,301] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 26: [2023-05-10 10:10:51,301] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 2: [2023-05-10 10:10:51,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 2: [2023-05-10 10:10:51,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 9: [2023-05-10 10:10:51,303] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 9: [2023-05-10 10:10:51,303] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 9: [2023-05-10 10:10:51,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 9: [2023-05-10 10:10:51,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 9: [2023-05-10 10:10:51,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 9: [2023-05-10 10:10:51,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 9: [2023-05-10 10:10:51,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 9: [2023-05-10 10:10:51,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 19: [2023-05-10 10:10:51,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 19: [2023-05-10 10:10:51,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 19: [2023-05-10 10:10:51,306] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 7: [2023-05-10 10:10:51,306] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 7: [2023-05-10 10:10:51,306] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 7: [2023-05-10 10:10:51,306] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 7: [2023-05-10 10:10:51,306] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 7: [2023-05-10 10:10:51,306] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 7: [2023-05-10 10:10:51,306] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 7: [2023-05-10 10:10:51,306] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 19: [2023-05-10 10:10:51,306] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 7: [2023-05-10 10:10:51,306] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 19: [2023-05-10 10:10:51,306] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 26: [2023-05-10 10:10:51,306] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 19: [2023-05-10 10:10:51,306] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 19: [2023-05-10 10:10:51,306] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 19: [2023-05-10 10:10:51,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 11: [2023-05-10 10:10:51,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 11: [2023-05-10 10:10:51,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 11: [2023-05-10 10:10:51,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 11: [2023-05-10 10:10:51,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 11: [2023-05-10 10:10:51,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 11: [2023-05-10 10:10:51,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 11: [2023-05-10 10:10:51,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 11: [2023-05-10 10:10:51,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 9: [2023-05-10 10:10:51,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 13: [2023-05-10 10:10:51,309] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 13: [2023-05-10 10:10:51,309] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 13: [2023-05-10 10:10:51,309] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 13: [2023-05-10 10:10:51,309] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 17: [2023-05-10 10:10:51,309] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 17: [2023-05-10 10:10:51,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 17: [2023-05-10 10:10:51,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 19: [2023-05-10 10:10:51,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 17: [2023-05-10 10:10:51,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 17: [2023-05-10 10:10:51,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 19: [2023-05-10 10:10:51,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 24: [2023-05-10 10:10:51,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 9: [2023-05-10 10:10:51,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 19: [2023-05-10 10:10:51,311] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 24: [2023-05-10 10:10:51,311] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 9: [2023-05-10 10:10:51,311] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 14: [2023-05-10 10:10:51,311] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 7: [2023-05-10 10:10:51,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 7: [2023-05-10 10:10:51,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 11: [2023-05-10 10:10:51,311] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 11: [2023-05-10 10:10:51,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 9: [2023-05-10 10:10:51,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 24: [2023-05-10 10:10:51,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 2: [2023-05-10 10:10:51,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 9: [2023-05-10 10:10:51,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 7: [2023-05-10 10:10:51,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 9: [2023-05-10 10:10:51,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 7: [2023-05-10 10:10:51,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 9: [2023-05-10 10:10:51,314] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 9: [2023-05-10 10:10:51,314] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 24: [2023-05-10 10:10:51,314] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 7: [2023-05-10 10:10:51,314] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 19: [2023-05-10 10:10:51,314] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 7: [2023-05-10 10:10:51,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 19: [2023-05-10 10:10:51,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 24: [2023-05-10 10:10:51,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 24: [2023-05-10 10:10:51,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 19: [2023-05-10 10:10:51,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 14: [2023-05-10 10:10:51,315] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 14: [2023-05-10 10:10:51,315] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 14: [2023-05-10 10:10:51,315] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 19: [2023-05-10 10:10:51,316] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 19: [2023-05-10 10:10:51,316] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 7: [2023-05-10 10:10:51,316] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 7: [2023-05-10 10:10:51,316] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 11: [2023-05-10 10:10:51,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 11: [2023-05-10 10:10:51,318] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 11: [2023-05-10 10:10:51,318] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 11: [2023-05-10 10:10:51,319] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 11: [2023-05-10 10:10:51,319] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 11: [2023-05-10 10:10:51,319] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 25: [2023-05-10 10:10:51,321] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 12: [2023-05-10 10:10:51,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 12: [2023-05-10 10:10:51,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 12: [2023-05-10 10:10:51,298] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 12: [2023-05-10 10:10:51,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 12: [2023-05-10 10:10:51,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 12: [2023-05-10 10:10:51,316] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 12: [2023-05-10 10:10:51,316] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 12: [2023-05-10 10:10:51,316] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 12: [2023-05-10 10:10:51,316] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 25: [2023-05-10 10:10:51,324] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 25: [2023-05-10 10:10:51,324] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 25: [2023-05-10 10:10:51,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 25: [2023-05-10 10:10:51,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 2: [2023-05-10 10:10:51,329] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 14: [2023-05-10 10:10:51,329] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 14: [2023-05-10 10:10:51,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 14: [2023-05-10 10:10:51,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 14: [2023-05-10 10:10:51,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 6: [2023-05-10 10:10:51,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 6: [2023-05-10 10:10:51,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 25: [2023-05-10 10:10:51,331] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 25: [2023-05-10 10:10:51,331] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 13: [2023-05-10 10:10:51,331] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 25: [2023-05-10 10:10:51,331] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 13: [2023-05-10 10:10:51,332] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 13: [2023-05-10 10:10:51,332] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 2: [2023-05-10 10:10:51,332] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 13: [2023-05-10 10:10:51,332] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 2: [2023-05-10 10:10:51,333] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 5: [2023-05-10 10:10:51,333] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 5: [2023-05-10 10:10:51,333] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 5: [2023-05-10 10:10:51,333] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 5: [2023-05-10 10:10:51,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 5: [2023-05-10 10:10:51,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 5: [2023-05-10 10:10:51,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 5: [2023-05-10 10:10:51,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 5: [2023-05-10 10:10:51,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 2: [2023-05-10 10:10:51,335] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 2: [2023-05-10 10:10:51,336] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 14: [2023-05-10 10:10:51,338] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 14: [2023-05-10 10:10:51,339] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 5: [2023-05-10 10:10:51,339] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 5: [2023-05-10 10:10:51,340] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 5: [2023-05-10 10:10:51,340] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 13: [2023-05-10 10:10:51,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 14: [2023-05-10 10:10:51,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 14: [2023-05-10 10:10:51,342] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 5: [2023-05-10 10:10:51,342] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 5: [2023-05-10 10:10:51,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 13: [2023-05-10 10:10:51,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 5: [2023-05-10 10:10:51,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 13: [2023-05-10 10:10:51,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 3: [2023-05-10 10:10:51,344] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 3: [2023-05-10 10:10:51,344] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 5: [2023-05-10 10:10:51,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 13: [2023-05-10 10:10:51,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 5: [2023-05-10 10:10:51,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 3: [2023-05-10 10:10:51,346] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 3: [2023-05-10 10:10:51,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 3: [2023-05-10 10:10:51,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 3: [2023-05-10 10:10:51,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 3: [2023-05-10 10:10:51,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 3: [2023-05-10 10:10:51,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 3: [2023-05-10 10:10:51,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 3: [2023-05-10 10:10:51,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 6: [2023-05-10 10:10:51,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 6: [2023-05-10 10:10:51,351] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 3: [2023-05-10 10:10:51,352] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 3: [2023-05-10 10:10:51,355] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 6: [2023-05-10 10:10:51,355] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 6: [2023-05-10 10:10:51,356] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 3: [2023-05-10 10:10:51,356] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 3: [2023-05-10 10:10:51,356] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 3: [2023-05-10 10:10:51,357] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 3: [2023-05-10 10:10:51,357] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt... 6: [2023-05-10 10:10:51,357] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 9: [2023-05-10 10:10:51,360] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 14: [2023-05-10 10:10:51,361] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 14: [2023-05-10 10:10:51,361] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 14: [2023-05-10 10:10:51,361] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 14: [2023-05-10 10:10:51,361] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 11: [2023-05-10 10:10:51,363] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 11: [2023-05-10 10:10:51,363] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 30: [2023-05-10 10:10:51,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 30: [2023-05-10 10:10:51,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 19: [2023-05-10 10:10:51,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 19: [2023-05-10 10:10:51,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 19: [2023-05-10 10:10:51,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 19: [2023-05-10 10:10:51,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 6: [2023-05-10 10:10:51,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 6: [2023-05-10 10:10:51,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 6: [2023-05-10 10:10:51,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 9: [2023-05-10 10:10:51,368] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 9: [2023-05-10 10:10:51,368] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 9: [2023-05-10 10:10:51,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 9: [2023-05-10 10:10:51,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 7: [2023-05-10 10:10:51,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 7: [2023-05-10 10:10:51,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 7: [2023-05-10 10:10:51,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 7: [2023-05-10 10:10:51,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 23: [2023-05-10 10:10:51,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 23: [2023-05-10 10:10:51,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 23: [2023-05-10 10:10:51,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 23: [2023-05-10 10:10:51,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 23: [2023-05-10 10:10:51,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 23: [2023-05-10 10:10:51,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 23: [2023-05-10 10:10:51,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 23: [2023-05-10 10:10:51,372] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 30: [2023-05-10 10:10:51,373] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 30: [2023-05-10 10:10:51,373] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 30: [2023-05-10 10:10:51,373] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 30: [2023-05-10 10:10:51,373] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 30: [2023-05-10 10:10:51,373] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 30: [2023-05-10 10:10:51,373] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 9: [2023-05-10 10:10:51,374] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 9: [2023-05-10 10:10:51,374] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 9: [2023-05-10 10:10:51,374] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 7: [2023-05-10 10:10:51,375] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 7: [2023-05-10 10:10:51,375] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 7: [2023-05-10 10:10:51,375] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 7: [2023-05-10 10:10:51,375] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 23: [2023-05-10 10:10:51,376] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 23: [2023-05-10 10:10:51,376] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 23: [2023-05-10 10:10:51,377] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 23: [2023-05-10 10:10:51,379] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 23: [2023-05-10 10:10:51,380] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 23: [2023-05-10 10:10:51,380] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 23: [2023-05-10 10:10:51,380] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 23: [2023-05-10 10:10:51,381] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 19: [2023-05-10 10:10:51,382] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 19: [2023-05-10 10:10:51,382] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 19: [2023-05-10 10:10:51,382] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 6: [2023-05-10 10:10:51,382] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 6: [2023-05-10 10:10:51,383] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 9: [2023-05-10 10:10:51,386] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 19: [2023-05-10 10:10:51,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 11: [2023-05-10 10:10:51,386] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 11: [2023-05-10 10:10:51,386] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 11: [2023-05-10 10:10:51,386] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 11: [2023-05-10 10:10:51,387] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 11: [2023-05-10 10:10:51,387] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 6: [2023-05-10 10:10:51,387] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 19: [2023-05-10 10:10:51,388] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 11: [2023-05-10 10:10:51,388] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 19: [2023-05-10 10:10:51,389] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 19: [2023-05-10 10:10:51,390] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 19: [2023-05-10 10:10:51,390] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 11: [2023-05-10 10:10:51,393] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 11: [2023-05-10 10:10:51,393] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 5: [2023-05-10 10:10:51,394] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 5: [2023-05-10 10:10:51,394] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 6: [2023-05-10 10:10:51,394] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 30: [2023-05-10 10:10:51,394] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 5: [2023-05-10 10:10:51,394] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 6: [2023-05-10 10:10:51,396] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 9: [2023-05-10 10:10:51,396] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 6: [2023-05-10 10:10:51,396] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 3: [2023-05-10 10:10:51,396] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 7: [2023-05-10 10:10:51,397] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 7: [2023-05-10 10:10:51,397] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 7: [2023-05-10 10:10:51,398] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 30: [2023-05-10 10:10:51,398] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 9: [2023-05-10 10:10:51,398] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 7: [2023-05-10 10:10:51,400] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 21: [2023-05-10 10:10:51,400] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 21: [2023-05-10 10:10:51,400] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 21: [2023-05-10 10:10:51,400] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 21: [2023-05-10 10:10:51,400] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 21: [2023-05-10 10:10:51,400] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 21: [2023-05-10 10:10:51,400] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 21: [2023-05-10 10:10:51,400] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 21: [2023-05-10 10:10:51,400] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 7: [2023-05-10 10:10:51,402] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 9: [2023-05-10 10:10:51,402] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 9: [2023-05-10 10:10:51,403] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 9: [2023-05-10 10:10:51,404] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 9: [2023-05-10 10:10:51,404] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 5: [2023-05-10 10:10:51,404] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 7: [2023-05-10 10:10:51,405] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 9: [2023-05-10 10:10:51,405] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 7: [2023-05-10 10:10:51,405] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 7: [2023-05-10 10:10:51,405] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 21: [2023-05-10 10:10:51,406] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 21: [2023-05-10 10:10:51,406] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 21: [2023-05-10 10:10:51,408] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 30: [2023-05-10 10:10:51,408] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 21: [2023-05-10 10:10:51,408] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 19: [2023-05-10 10:10:51,408] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 30: [2023-05-10 10:10:51,408] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 3: [2023-05-10 10:10:51,408] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 21: [2023-05-10 10:10:51,408] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 21: [2023-05-10 10:10:51,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 21: [2023-05-10 10:10:51,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 19: [2023-05-10 10:10:51,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 19: [2023-05-10 10:10:51,410] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 21: [2023-05-10 10:10:51,410] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 5: [2023-05-10 10:10:51,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 5: [2023-05-10 10:10:51,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 5: [2023-05-10 10:10:51,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 5: [2023-05-10 10:10:51,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 30: [2023-05-10 10:10:51,410] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 3: [2023-05-10 10:10:51,411] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 30: [2023-05-10 10:10:51,412] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 19: [2023-05-10 10:10:51,412] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 30: [2023-05-10 10:10:51,412] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 30: [2023-05-10 10:10:51,413] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 11: [2023-05-10 10:10:51,415] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 5: [2023-05-10 10:10:51,416] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 11: [2023-05-10 10:10:51,416] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 3: [2023-05-10 10:10:51,417] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 5: [2023-05-10 10:10:51,418] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 3: [2023-05-10 10:10:51,418] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 11: [2023-05-10 10:10:51,419] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 5: [2023-05-10 10:10:51,421] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 11: [2023-05-10 10:10:51,421] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 11: [2023-05-10 10:10:51,422] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 11: [2023-05-10 10:10:51,423] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 3: [2023-05-10 10:10:51,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 3: [2023-05-10 10:10:51,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 3: [2023-05-10 10:10:51,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 3: [2023-05-10 10:10:51,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_11-model_00-model_states.pt. 5: [2023-05-10 10:10:51,428] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 23: [2023-05-10 10:10:51,432] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 23: [2023-05-10 10:10:51,432] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 23: [2023-05-10 10:10:51,432] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 3: [2023-05-10 10:10:51,437] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 5: [2023-05-10 10:10:51,438] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 5: [2023-05-10 10:10:51,438] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 5: [2023-05-10 10:10:51,440] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 3: [2023-05-10 10:10:51,440] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 5: [2023-05-10 10:10:51,440] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 23: [2023-05-10 10:10:51,442] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 23: [2023-05-10 10:10:51,445] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 23: [2023-05-10 10:10:51,445] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 3: [2023-05-10 10:10:51,446] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 23: [2023-05-10 10:10:51,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 23: [2023-05-10 10:10:51,451] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 3: [2023-05-10 10:10:51,452] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 3: [2023-05-10 10:10:51,455] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 3: [2023-05-10 10:10:51,457] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 3: [2023-05-10 10:10:51,457] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 23: [2023-05-10 10:10:51,458] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 21: [2023-05-10 10:10:51,460] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 21: [2023-05-10 10:10:51,460] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 23: [2023-05-10 10:10:51,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 23: [2023-05-10 10:10:51,464] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 21: [2023-05-10 10:10:51,465] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 21: [2023-05-10 10:10:51,472] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 23: [2023-05-10 10:10:51,473] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 23: [2023-05-10 10:10:51,474] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 23: [2023-05-10 10:10:51,474] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 21: [2023-05-10 10:10:51,474] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 21: [2023-05-10 10:10:51,474] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 21: [2023-05-10 10:10:51,474] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 21: [2023-05-10 10:10:51,475] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 1: [2023-05-10 10:10:51,475] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 1: [2023-05-10 10:10:51,475] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 1: [2023-05-10 10:10:51,475] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 1: [2023-05-10 10:10:51,475] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 1: [2023-05-10 10:10:51,475] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 1: [2023-05-10 10:10:51,475] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 1: [2023-05-10 10:10:51,475] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 1: [2023-05-10 10:10:51,475] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 23: [2023-05-10 10:10:51,480] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 23: [2023-05-10 10:10:51,480] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 21: [2023-05-10 10:10:51,481] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 1: [2023-05-10 10:10:51,482] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 1: [2023-05-10 10:10:51,482] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 1: [2023-05-10 10:10:51,482] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 1: [2023-05-10 10:10:51,483] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 21: [2023-05-10 10:10:51,483] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 1: [2023-05-10 10:10:51,484] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 1: [2023-05-10 10:10:51,484] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 1: [2023-05-10 10:10:51,484] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 1: [2023-05-10 10:10:51,484] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 21: [2023-05-10 10:10:51,502] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 21: [2023-05-10 10:10:51,502] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 21: [2023-05-10 10:10:51,508] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 21: [2023-05-10 10:10:51,509] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 21: [2023-05-10 10:10:51,509] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 21: [2023-05-10 10:10:51,509] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 29: [2023-05-10 10:10:51,532] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 29: [2023-05-10 10:10:51,532] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 29: [2023-05-10 10:10:51,532] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 29: [2023-05-10 10:10:51,533] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 29: [2023-05-10 10:10:51,533] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 29: [2023-05-10 10:10:51,533] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 29: [2023-05-10 10:10:51,533] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 29: [2023-05-10 10:10:51,533] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 1: [2023-05-10 10:10:51,536] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 1: [2023-05-10 10:10:51,536] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 29: [2023-05-10 10:10:51,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 29: [2023-05-10 10:10:51,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 1: [2023-05-10 10:10:51,537] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 8: [2023-05-10 10:10:51,539] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 8: [2023-05-10 10:10:51,540] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 8: [2023-05-10 10:10:51,540] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 8: [2023-05-10 10:10:51,540] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 8: [2023-05-10 10:10:51,540] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 8: [2023-05-10 10:10:51,540] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 8: [2023-05-10 10:10:51,540] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 8: [2023-05-10 10:10:51,540] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 29: [2023-05-10 10:10:51,541] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 29: [2023-05-10 10:10:51,541] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 29: [2023-05-10 10:10:51,542] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 29: [2023-05-10 10:10:51,542] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 29: [2023-05-10 10:10:51,543] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 29: [2023-05-10 10:10:51,543] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 1: [2023-05-10 10:10:51,544] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 8: [2023-05-10 10:10:51,545] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 8: [2023-05-10 10:10:51,546] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 8: [2023-05-10 10:10:51,546] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 8: [2023-05-10 10:10:51,546] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 8: [2023-05-10 10:10:51,546] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 8: [2023-05-10 10:10:51,547] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 8: [2023-05-10 10:10:51,547] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 8: [2023-05-10 10:10:51,547] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 1: [2023-05-10 10:10:51,549] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 1: [2023-05-10 10:10:51,549] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 1: [2023-05-10 10:10:51,549] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 1: [2023-05-10 10:10:51,549] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 1: [2023-05-10 10:10:51,559] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 1: [2023-05-10 10:10:51,559] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 1: [2023-05-10 10:10:51,565] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 1: [2023-05-10 10:10:51,569] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 20: [2023-05-10 10:10:51,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 20: [2023-05-10 10:10:51,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 20: [2023-05-10 10:10:51,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 20: [2023-05-10 10:10:51,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 20: [2023-05-10 10:10:51,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 20: [2023-05-10 10:10:51,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 20: [2023-05-10 10:10:51,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 20: [2023-05-10 10:10:51,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 1: [2023-05-10 10:10:51,578] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 1: [2023-05-10 10:10:51,579] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 1: [2023-05-10 10:10:51,579] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 1: [2023-05-10 10:10:51,579] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 20: [2023-05-10 10:10:51,579] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 20: [2023-05-10 10:10:51,585] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 20: [2023-05-10 10:10:51,585] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 20: [2023-05-10 10:10:51,586] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 20: [2023-05-10 10:10:51,586] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 4: [2023-05-10 10:10:51,587] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 4: [2023-05-10 10:10:51,587] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 4: [2023-05-10 10:10:51,587] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 4: [2023-05-10 10:10:51,587] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 4: [2023-05-10 10:10:51,587] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 4: [2023-05-10 10:10:51,587] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 29: [2023-05-10 10:10:51,587] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 29: [2023-05-10 10:10:51,587] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 4: [2023-05-10 10:10:51,587] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 20: [2023-05-10 10:10:51,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 4: [2023-05-10 10:10:51,587] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 20: [2023-05-10 10:10:51,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 20: [2023-05-10 10:10:51,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 4: [2023-05-10 10:10:51,593] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 4: [2023-05-10 10:10:51,593] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 4: [2023-05-10 10:10:51,593] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 4: [2023-05-10 10:10:51,594] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 4: [2023-05-10 10:10:51,595] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 4: [2023-05-10 10:10:51,595] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 4: [2023-05-10 10:10:51,595] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 4: [2023-05-10 10:10:51,595] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 8: [2023-05-10 10:10:51,599] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 8: [2023-05-10 10:10:51,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 8: [2023-05-10 10:10:51,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 27: [2023-05-10 10:10:51,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 27: [2023-05-10 10:10:51,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 27: [2023-05-10 10:10:51,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 27: [2023-05-10 10:10:51,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 27: [2023-05-10 10:10:51,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 27: [2023-05-10 10:10:51,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 27: [2023-05-10 10:10:51,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 27: [2023-05-10 10:10:51,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 18: [2023-05-10 10:10:51,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 18: [2023-05-10 10:10:51,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 18: [2023-05-10 10:10:51,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 18: [2023-05-10 10:10:51,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 18: [2023-05-10 10:10:51,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 18: [2023-05-10 10:10:51,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 18: [2023-05-10 10:10:51,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 18: [2023-05-10 10:10:51,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 29: [2023-05-10 10:10:51,608] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 29: [2023-05-10 10:10:51,608] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 29: [2023-05-10 10:10:51,609] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 18: [2023-05-10 10:10:51,609] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 18: [2023-05-10 10:10:51,609] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 18: [2023-05-10 10:10:51,609] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 27: [2023-05-10 10:10:51,609] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 18: [2023-05-10 10:10:51,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 27: [2023-05-10 10:10:51,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 27: [2023-05-10 10:10:51,611] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 18: [2023-05-10 10:10:51,611] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 18: [2023-05-10 10:10:51,612] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 29: [2023-05-10 10:10:51,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 29: [2023-05-10 10:10:51,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 29: [2023-05-10 10:10:51,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 29: [2023-05-10 10:10:51,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 29: [2023-05-10 10:10:51,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 8: [2023-05-10 10:10:51,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 8: [2023-05-10 10:10:51,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 8: [2023-05-10 10:10:51,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 8: [2023-05-10 10:10:51,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 8: [2023-05-10 10:10:51,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 18: [2023-05-10 10:10:51,612] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 18: [2023-05-10 10:10:51,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 27: [2023-05-10 10:10:51,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 27: [2023-05-10 10:10:51,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 27: [2023-05-10 10:10:51,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 27: [2023-05-10 10:10:51,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 27: [2023-05-10 10:10:51,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 8: [2023-05-10 10:10:51,621] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 22: [2023-05-10 10:10:51,625] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 22: [2023-05-10 10:10:51,625] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 22: [2023-05-10 10:10:51,626] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 22: [2023-05-10 10:10:51,626] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 22: [2023-05-10 10:10:51,626] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 22: [2023-05-10 10:10:51,626] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 22: [2023-05-10 10:10:51,626] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 22: [2023-05-10 10:10:51,626] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 22: [2023-05-10 10:10:51,629] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 22: [2023-05-10 10:10:51,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 28: [2023-05-10 10:10:51,631] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 28: [2023-05-10 10:10:51,631] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 28: [2023-05-10 10:10:51,631] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 20: [2023-05-10 10:10:51,631] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 28: [2023-05-10 10:10:51,631] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 28: [2023-05-10 10:10:51,631] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 28: [2023-05-10 10:10:51,631] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 28: [2023-05-10 10:10:51,631] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 28: [2023-05-10 10:10:51,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 8: [2023-05-10 10:10:51,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 8: [2023-05-10 10:10:51,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 28: [2023-05-10 10:10:51,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 28: [2023-05-10 10:10:51,636] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 22: [2023-05-10 10:10:51,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 29: [2023-05-10 10:10:51,636] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 22: [2023-05-10 10:10:51,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 10: [2023-05-10 10:10:51,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 10: [2023-05-10 10:10:51,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 22: [2023-05-10 10:10:51,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 10: [2023-05-10 10:10:51,638] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 10: [2023-05-10 10:10:51,638] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 10: [2023-05-10 10:10:51,638] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 10: [2023-05-10 10:10:51,638] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 10: [2023-05-10 10:10:51,638] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 22: [2023-05-10 10:10:51,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 10: [2023-05-10 10:10:51,638] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 22: [2023-05-10 10:10:51,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 22: [2023-05-10 10:10:51,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 8: [2023-05-10 10:10:51,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 8: [2023-05-10 10:10:51,641] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 29: [2023-05-10 10:10:51,641] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 28: [2023-05-10 10:10:51,642] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 28: [2023-05-10 10:10:51,642] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 28: [2023-05-10 10:10:51,642] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 8: [2023-05-10 10:10:51,642] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 28: [2023-05-10 10:10:51,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 28: [2023-05-10 10:10:51,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 29: [2023-05-10 10:10:51,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 28: [2023-05-10 10:10:51,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 8: [2023-05-10 10:10:51,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 4: [2023-05-10 10:10:51,644] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 8: [2023-05-10 10:10:51,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 10: [2023-05-10 10:10:51,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 10: [2023-05-10 10:10:51,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 10: [2023-05-10 10:10:51,645] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 10: [2023-05-10 10:10:51,645] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 10: [2023-05-10 10:10:51,646] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 10: [2023-05-10 10:10:51,646] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 10: [2023-05-10 10:10:51,647] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 10: [2023-05-10 10:10:51,647] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 29: [2023-05-10 10:10:51,648] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 29: [2023-05-10 10:10:51,648] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 29: [2023-05-10 10:10:51,650] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 20: [2023-05-10 10:10:51,650] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 4: [2023-05-10 10:10:51,654] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 4: [2023-05-10 10:10:51,654] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 4: [2023-05-10 10:10:51,654] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 20: [2023-05-10 10:10:51,657] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 20: [2023-05-10 10:10:51,657] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 20: [2023-05-10 10:10:51,657] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 20: [2023-05-10 10:10:51,657] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 4: [2023-05-10 10:10:51,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 4: [2023-05-10 10:10:51,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 4: [2023-05-10 10:10:51,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 4: [2023-05-10 10:10:51,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 31: [2023-05-10 10:10:51,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 31: [2023-05-10 10:10:51,662] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 31: [2023-05-10 10:10:51,662] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 31: [2023-05-10 10:10:51,662] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 31: [2023-05-10 10:10:51,662] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 31: [2023-05-10 10:10:51,662] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 31: [2023-05-10 10:10:51,662] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 31: [2023-05-10 10:10:51,662] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 27: [2023-05-10 10:10:51,663] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 27: [2023-05-10 10:10:51,664] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 31: [2023-05-10 10:10:51,665] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 31: [2023-05-10 10:10:51,666] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 4: [2023-05-10 10:10:51,667] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 18: [2023-05-10 10:10:51,667] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 18: [2023-05-10 10:10:51,667] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 18: [2023-05-10 10:10:51,667] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 18: [2023-05-10 10:10:51,667] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 31: [2023-05-10 10:10:51,670] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 31: [2023-05-10 10:10:51,671] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 31: [2023-05-10 10:10:51,671] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 31: [2023-05-10 10:10:51,672] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 31: [2023-05-10 10:10:51,672] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 27: [2023-05-10 10:10:51,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 18: [2023-05-10 10:10:51,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 18: [2023-05-10 10:10:51,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 18: [2023-05-10 10:10:51,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 18: [2023-05-10 10:10:51,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 31: [2023-05-10 10:10:51,673] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 15: [2023-05-10 10:10:51,674] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 15: [2023-05-10 10:10:51,674] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 15: [2023-05-10 10:10:51,674] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 22: [2023-05-10 10:10:51,674] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 15: [2023-05-10 10:10:51,674] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 15: [2023-05-10 10:10:51,674] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 15: [2023-05-10 10:10:51,674] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 15: [2023-05-10 10:10:51,674] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 15: [2023-05-10 10:10:51,675] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 20: [2023-05-10 10:10:51,678] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 20: [2023-05-10 10:10:51,678] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 20: [2023-05-10 10:10:51,678] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 15: [2023-05-10 10:10:51,678] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 4: [2023-05-10 10:10:51,678] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 27: [2023-05-10 10:10:51,682] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 4: [2023-05-10 10:10:51,683] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 4: [2023-05-10 10:10:51,683] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 20: [2023-05-10 10:10:51,684] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 27: [2023-05-10 10:10:51,685] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 20: [2023-05-10 10:10:51,686] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 4: [2023-05-10 10:10:51,685] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 15: [2023-05-10 10:10:51,685] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 15: [2023-05-10 10:10:51,685] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 15: [2023-05-10 10:10:51,686] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 15: [2023-05-10 10:10:51,686] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 15: [2023-05-10 10:10:51,686] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 15: [2023-05-10 10:10:51,686] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 15: [2023-05-10 10:10:51,688] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 28: [2023-05-10 10:10:51,687] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 28: [2023-05-10 10:10:51,687] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 27: [2023-05-10 10:10:51,688] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 22: [2023-05-10 10:10:51,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 18: [2023-05-10 10:10:51,689] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 27: [2023-05-10 10:10:51,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 27: [2023-05-10 10:10:51,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 27: [2023-05-10 10:10:51,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 27: [2023-05-10 10:10:51,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 20: [2023-05-10 10:10:51,691] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 4: [2023-05-10 10:10:51,690] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 20: [2023-05-10 10:10:51,691] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 4: [2023-05-10 10:10:51,691] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 4: [2023-05-10 10:10:51,692] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 16: [2023-05-10 10:10:51,693] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 16: [2023-05-10 10:10:51,693] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 16: [2023-05-10 10:10:51,693] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 16: [2023-05-10 10:10:51,694] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 16: [2023-05-10 10:10:51,694] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 16: [2023-05-10 10:10:51,694] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 16: [2023-05-10 10:10:51,694] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 16: [2023-05-10 10:10:51,694] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 18: [2023-05-10 10:10:51,696] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 18: [2023-05-10 10:10:51,696] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 27: [2023-05-10 10:10:51,696] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 18: [2023-05-10 10:10:51,696] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 16: [2023-05-10 10:10:51,699] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 10: [2023-05-10 10:10:51,699] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 10: [2023-05-10 10:10:51,699] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 16: [2023-05-10 10:10:51,700] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 22: [2023-05-10 10:10:51,700] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 10: [2023-05-10 10:10:51,700] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 16: [2023-05-10 10:10:51,701] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 16: [2023-05-10 10:10:51,702] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 16: [2023-05-10 10:10:51,702] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 18: [2023-05-10 10:10:51,702] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 16: [2023-05-10 10:10:51,703] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 18: [2023-05-10 10:10:51,703] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 16: [2023-05-10 10:10:51,704] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 16: [2023-05-10 10:10:51,704] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 10: [2023-05-10 10:10:51,704] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 18: [2023-05-10 10:10:51,705] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 18: [2023-05-10 10:10:51,705] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 26: [2023-05-10 10:10:51,706] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 26: [2023-05-10 10:10:51,706] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 22: [2023-05-10 10:10:51,706] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 22: [2023-05-10 10:10:51,706] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 20: [2023-05-10 10:10:51,708] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 20: [2023-05-10 10:10:51,709] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 20: [2023-05-10 10:10:51,710] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 26: [2023-05-10 10:10:51,710] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 26: [2023-05-10 10:10:51,710] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 26: [2023-05-10 10:10:51,710] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 26: [2023-05-10 10:10:51,710] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 26: [2023-05-10 10:10:51,710] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 28: [2023-05-10 10:10:51,710] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 28: [2023-05-10 10:10:51,710] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 26: [2023-05-10 10:10:51,710] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 27: [2023-05-10 10:10:51,710] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 28: [2023-05-10 10:10:51,711] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 28: [2023-05-10 10:10:51,711] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 22: [2023-05-10 10:10:51,711] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 22: [2023-05-10 10:10:51,711] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 22: [2023-05-10 10:10:51,711] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 22: [2023-05-10 10:10:51,711] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 26: [2023-05-10 10:10:51,711] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 26: [2023-05-10 10:10:51,711] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 28: [2023-05-10 10:10:51,712] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 28: [2023-05-10 10:10:51,712] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 28: [2023-05-10 10:10:51,712] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 28: [2023-05-10 10:10:51,712] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 10: [2023-05-10 10:10:51,713] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 10: [2023-05-10 10:10:51,713] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 10: [2023-05-10 10:10:51,713] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 10: [2023-05-10 10:10:51,713] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 22: [2023-05-10 10:10:51,714] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 31: [2023-05-10 10:10:51,717] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 27: [2023-05-10 10:10:51,718] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 27: [2023-05-10 10:10:51,719] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 26: [2023-05-10 10:10:51,719] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 26: [2023-05-10 10:10:51,720] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 26: [2023-05-10 10:10:51,720] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 31: [2023-05-10 10:10:51,720] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 26: [2023-05-10 10:10:51,721] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 27: [2023-05-10 10:10:51,721] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 27: [2023-05-10 10:10:51,721] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 26: [2023-05-10 10:10:51,721] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 26: [2023-05-10 10:10:51,722] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 0: [2023-05-10 10:10:51,726] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 10: [2023-05-10 10:10:51,726] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 24: [2023-05-10 10:10:51,726] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 0: [2023-05-10 10:10:51,726] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 24: [2023-05-10 10:10:51,726] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 0: [2023-05-10 10:10:51,726] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 0: [2023-05-10 10:10:51,726] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 0: [2023-05-10 10:10:51,726] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 0: [2023-05-10 10:10:51,726] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 0: [2023-05-10 10:10:51,726] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 0: [2023-05-10 10:10:51,726] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 24: [2023-05-10 10:10:51,727] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 24: [2023-05-10 10:10:51,727] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 24: [2023-05-10 10:10:51,727] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 24: [2023-05-10 10:10:51,727] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 15: [2023-05-10 10:10:51,726] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 24: [2023-05-10 10:10:51,727] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 24: [2023-05-10 10:10:51,727] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 25: [2023-05-10 10:10:51,727] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 25: [2023-05-10 10:10:51,727] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 25: [2023-05-10 10:10:51,729] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 25: [2023-05-10 10:10:51,729] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 12: [2023-05-10 10:10:51,729] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 12: [2023-05-10 10:10:51,729] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 12: [2023-05-10 10:10:51,729] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 25: [2023-05-10 10:10:51,729] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 25: [2023-05-10 10:10:51,729] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 25: [2023-05-10 10:10:51,729] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 10: [2023-05-10 10:10:51,730] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 25: [2023-05-10 10:10:51,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 12: [2023-05-10 10:10:51,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 12: [2023-05-10 10:10:51,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 12: [2023-05-10 10:10:51,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 12: [2023-05-10 10:10:51,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 0: [2023-05-10 10:10:51,730] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 12: [2023-05-10 10:10:51,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 10: [2023-05-10 10:10:51,730] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 10: [2023-05-10 10:10:51,731] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 24: [2023-05-10 10:10:51,731] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 0: [2023-05-10 10:10:51,731] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 0: [2023-05-10 10:10:51,731] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 24: [2023-05-10 10:10:51,731] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 0: [2023-05-10 10:10:51,732] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 0: [2023-05-10 10:10:51,733] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 0: [2023-05-10 10:10:51,733] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 0: [2023-05-10 10:10:51,733] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 0: [2023-05-10 10:10:51,734] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 24: [2023-05-10 10:10:51,734] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 12: [2023-05-10 10:10:51,734] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 25: [2023-05-10 10:10:51,734] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 25: [2023-05-10 10:10:51,734] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 24: [2023-05-10 10:10:51,734] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 24: [2023-05-10 10:10:51,735] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 24: [2023-05-10 10:10:51,735] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 24: [2023-05-10 10:10:51,735] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 24: [2023-05-10 10:10:51,735] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 25: [2023-05-10 10:10:51,735] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 22: [2023-05-10 10:10:51,736] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 22: [2023-05-10 10:10:51,736] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 31: [2023-05-10 10:10:51,736] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 25: [2023-05-10 10:10:51,736] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 25: [2023-05-10 10:10:51,737] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 25: [2023-05-10 10:10:51,737] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 31: [2023-05-10 10:10:51,737] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 25: [2023-05-10 10:10:51,737] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 25: [2023-05-10 10:10:51,738] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 12: [2023-05-10 10:10:51,740] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 31: [2023-05-10 10:10:51,740] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 10: [2023-05-10 10:10:51,740] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 28: [2023-05-10 10:10:51,740] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 12: [2023-05-10 10:10:51,740] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 12: [2023-05-10 10:10:51,741] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 31: [2023-05-10 10:10:51,741] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 12: [2023-05-10 10:10:51,741] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 12: [2023-05-10 10:10:51,742] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 22: [2023-05-10 10:10:51,742] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 22: [2023-05-10 10:10:51,742] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 12: [2023-05-10 10:10:51,742] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 14: [2023-05-10 10:10:51,742] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 14: [2023-05-10 10:10:51,742] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 12: [2023-05-10 10:10:51,743] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 10: [2023-05-10 10:10:51,743] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 22: [2023-05-10 10:10:51,743] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 22: [2023-05-10 10:10:51,743] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 10: [2023-05-10 10:10:51,743] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 10: [2023-05-10 10:10:51,744] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 14: [2023-05-10 10:10:51,745] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 14: [2023-05-10 10:10:51,745] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 14: [2023-05-10 10:10:51,745] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 14: [2023-05-10 10:10:51,745] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 14: [2023-05-10 10:10:51,745] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 14: [2023-05-10 10:10:51,746] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 28: [2023-05-10 10:10:51,747] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 28: [2023-05-10 10:10:51,747] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 31: [2023-05-10 10:10:51,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 31: [2023-05-10 10:10:51,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 31: [2023-05-10 10:10:51,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 31: [2023-05-10 10:10:51,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 28: [2023-05-10 10:10:51,748] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 15: [2023-05-10 10:10:51,748] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 28: [2023-05-10 10:10:51,748] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 14: [2023-05-10 10:10:51,748] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 14: [2023-05-10 10:10:51,748] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 28: [2023-05-10 10:10:51,750] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 14: [2023-05-10 10:10:51,751] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 16: [2023-05-10 10:10:51,753] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 16: [2023-05-10 10:10:51,754] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 14: [2023-05-10 10:10:51,754] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 14: [2023-05-10 10:10:51,754] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 14: [2023-05-10 10:10:51,755] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 14: [2023-05-10 10:10:51,755] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 14: [2023-05-10 10:10:51,756] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 15: [2023-05-10 10:10:51,757] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 15: [2023-05-10 10:10:51,757] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 9: [2023-05-10 10:10:51,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 9: [2023-05-10 10:10:51,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 9: [2023-05-10 10:10:51,762] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 9: [2023-05-10 10:10:51,763] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 9: [2023-05-10 10:10:51,763] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 9: [2023-05-10 10:10:51,763] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 9: [2023-05-10 10:10:51,763] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 9: [2023-05-10 10:10:51,763] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 31: [2023-05-10 10:10:51,764] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 9: [2023-05-10 10:10:51,764] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 15: [2023-05-10 10:10:51,763] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 15: [2023-05-10 10:10:51,763] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 15: [2023-05-10 10:10:51,763] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 15: [2023-05-10 10:10:51,763] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 15: [2023-05-10 10:10:51,763] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 2: [2023-05-10 10:10:51,764] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 2: [2023-05-10 10:10:51,764] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 26: [2023-05-10 10:10:51,764] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 26: [2023-05-10 10:10:51,764] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 9: [2023-05-10 10:10:51,766] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 2: [2023-05-10 10:10:51,768] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 2: [2023-05-10 10:10:51,768] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 2: [2023-05-10 10:10:51,768] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 2: [2023-05-10 10:10:51,768] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 2: [2023-05-10 10:10:51,768] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 2: [2023-05-10 10:10:51,769] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 2: [2023-05-10 10:10:51,769] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 2: [2023-05-10 10:10:51,769] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 31: [2023-05-10 10:10:51,769] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 9: [2023-05-10 10:10:51,769] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 9: [2023-05-10 10:10:51,770] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 13: [2023-05-10 10:10:51,770] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 9: [2023-05-10 10:10:51,770] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 13: [2023-05-10 10:10:51,770] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 13: [2023-05-10 10:10:51,770] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 13: [2023-05-10 10:10:51,771] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 13: [2023-05-10 10:10:51,771] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 9: [2023-05-10 10:10:51,771] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 13: [2023-05-10 10:10:51,771] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 13: [2023-05-10 10:10:51,771] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 13: [2023-05-10 10:10:51,771] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 9: [2023-05-10 10:10:51,771] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 9: [2023-05-10 10:10:51,772] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 16: [2023-05-10 10:10:51,772] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 16: [2023-05-10 10:10:51,774] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 16: [2023-05-10 10:10:51,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 16: [2023-05-10 10:10:51,777] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 16: [2023-05-10 10:10:51,777] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 16: [2023-05-10 10:10:51,777] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 16: [2023-05-10 10:10:51,777] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 6: [2023-05-10 10:10:51,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 6: [2023-05-10 10:10:51,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 13: [2023-05-10 10:10:51,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 13: [2023-05-10 10:10:51,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 31: [2023-05-10 10:10:51,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 2: [2023-05-10 10:10:51,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 2: [2023-05-10 10:10:51,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 16: [2023-05-10 10:10:51,779] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 2: [2023-05-10 10:10:51,779] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 6: [2023-05-10 10:10:51,779] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 2: [2023-05-10 10:10:51,780] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 6: [2023-05-10 10:10:51,780] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 6: [2023-05-10 10:10:51,780] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 6: [2023-05-10 10:10:51,780] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 6: [2023-05-10 10:10:51,780] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 2: [2023-05-10 10:10:51,780] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 6: [2023-05-10 10:10:51,780] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 2: [2023-05-10 10:10:51,781] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 31: [2023-05-10 10:10:51,781] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 31: [2023-05-10 10:10:51,781] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 31: [2023-05-10 10:10:51,782] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 6: [2023-05-10 10:10:51,782] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 6: [2023-05-10 10:10:51,782] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 5: [2023-05-10 10:10:51,783] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 5: [2023-05-10 10:10:51,783] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 5: [2023-05-10 10:10:51,784] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 24: [2023-05-10 10:10:51,784] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 0: [2023-05-10 10:10:51,784] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 5: [2023-05-10 10:10:51,784] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 5: [2023-05-10 10:10:51,784] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 24: [2023-05-10 10:10:51,784] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 13: [2023-05-10 10:10:51,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 13: [2023-05-10 10:10:51,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 13: [2023-05-10 10:10:51,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 5: [2023-05-10 10:10:51,784] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 5: [2023-05-10 10:10:51,784] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 13: [2023-05-10 10:10:51,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 5: [2023-05-10 10:10:51,784] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 15: [2023-05-10 10:10:51,785] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 13: [2023-05-10 10:10:51,785] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 13: [2023-05-10 10:10:51,785] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 12: [2023-05-10 10:10:51,786] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 23: [2023-05-10 10:10:51,787] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 23: [2023-05-10 10:10:51,787] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 23: [2023-05-10 10:10:51,787] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 23: [2023-05-10 10:10:51,787] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 23: [2023-05-10 10:10:51,787] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 23: [2023-05-10 10:10:51,787] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 23: [2023-05-10 10:10:51,787] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 23: [2023-05-10 10:10:51,788] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 15: [2023-05-10 10:10:51,788] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 26: [2023-05-10 10:10:51,789] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 26: [2023-05-10 10:10:51,789] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 26: [2023-05-10 10:10:51,789] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 6: [2023-05-10 10:10:51,789] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 26: [2023-05-10 10:10:51,789] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 6: [2023-05-10 10:10:51,790] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 6: [2023-05-10 10:10:51,790] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 6: [2023-05-10 10:10:51,791] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 5: [2023-05-10 10:10:51,790] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 5: [2023-05-10 10:10:51,790] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 5: [2023-05-10 10:10:51,791] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 26: [2023-05-10 10:10:51,791] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 5: [2023-05-10 10:10:51,791] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 0: [2023-05-10 10:10:51,792] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 26: [2023-05-10 10:10:51,792] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 23: [2023-05-10 10:10:51,792] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 23: [2023-05-10 10:10:51,792] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 23: [2023-05-10 10:10:51,792] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 5: [2023-05-10 10:10:51,792] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 23: [2023-05-10 10:10:51,793] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 23: [2023-05-10 10:10:51,793] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 6: [2023-05-10 10:10:51,792] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 6: [2023-05-10 10:10:51,792] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 0: [2023-05-10 10:10:51,793] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 0: [2023-05-10 10:10:51,793] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 25: [2023-05-10 10:10:51,792] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 25: [2023-05-10 10:10:51,792] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 25: [2023-05-10 10:10:51,792] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 5: [2023-05-10 10:10:51,793] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 5: [2023-05-10 10:10:51,793] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 5: [2023-05-10 10:10:51,794] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 26: [2023-05-10 10:10:51,795] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 26: [2023-05-10 10:10:51,795] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 23: [2023-05-10 10:10:51,795] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 23: [2023-05-10 10:10:51,795] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 23: [2023-05-10 10:10:51,795] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 25: [2023-05-10 10:10:51,797] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 25: [2023-05-10 10:10:51,797] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 25: [2023-05-10 10:10:51,797] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 25: [2023-05-10 10:10:51,797] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 25: [2023-05-10 10:10:51,797] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 15: [2023-05-10 10:10:51,798] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 17: [2023-05-10 10:10:51,798] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 17: [2023-05-10 10:10:51,799] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 17: [2023-05-10 10:10:51,799] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 17: [2023-05-10 10:10:51,799] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 17: [2023-05-10 10:10:51,799] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 17: [2023-05-10 10:10:51,799] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 17: [2023-05-10 10:10:51,799] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 17: [2023-05-10 10:10:51,799] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 15: [2023-05-10 10:10:51,800] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 15: [2023-05-10 10:10:51,800] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 15: [2023-05-10 10:10:51,801] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 15: [2023-05-10 10:10:51,801] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 17: [2023-05-10 10:10:51,802] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 14: [2023-05-10 10:10:51,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 14: [2023-05-10 10:10:51,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 16: [2023-05-10 10:10:51,803] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 16: [2023-05-10 10:10:51,804] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 16: [2023-05-10 10:10:51,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 0: [2023-05-10 10:10:51,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 0: [2023-05-10 10:10:51,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 0: [2023-05-10 10:10:51,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 0: [2023-05-10 10:10:51,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 0: [2023-05-10 10:10:51,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 24: [2023-05-10 10:10:51,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 14: [2023-05-10 10:10:51,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 24: [2023-05-10 10:10:51,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 24: [2023-05-10 10:10:51,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 24: [2023-05-10 10:10:51,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 24: [2023-05-10 10:10:51,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 24: [2023-05-10 10:10:51,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 24: [2023-05-10 10:10:51,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 24: [2023-05-10 10:10:51,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 12: [2023-05-10 10:10:51,810] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 17: [2023-05-10 10:10:51,810] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 17: [2023-05-10 10:10:51,810] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 17: [2023-05-10 10:10:51,811] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 17: [2023-05-10 10:10:51,811] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 17: [2023-05-10 10:10:51,812] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 17: [2023-05-10 10:10:51,812] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 16: [2023-05-10 10:10:51,812] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 16: [2023-05-10 10:10:51,813] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 16: [2023-05-10 10:10:51,813] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 17: [2023-05-10 10:10:51,813] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 12: [2023-05-10 10:10:51,814] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 12: [2023-05-10 10:10:51,815] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 9: [2023-05-10 10:10:51,815] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 26: [2023-05-10 10:10:51,816] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 26: [2023-05-10 10:10:51,818] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 0: [2023-05-10 10:10:51,819] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 25: [2023-05-10 10:10:51,818] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 14: [2023-05-10 10:10:51,819] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 0: [2023-05-10 10:10:51,819] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 0: [2023-05-10 10:10:51,820] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 14: [2023-05-10 10:10:51,820] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 14: [2023-05-10 10:10:51,820] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 14: [2023-05-10 10:10:51,820] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 2: [2023-05-10 10:10:51,820] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 2: [2023-05-10 10:10:51,820] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 25: [2023-05-10 10:10:51,820] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 25: [2023-05-10 10:10:51,821] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 26: [2023-05-10 10:10:51,821] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 14: [2023-05-10 10:10:51,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 12: [2023-05-10 10:10:51,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 12: [2023-05-10 10:10:51,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 12: [2023-05-10 10:10:51,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 12: [2023-05-10 10:10:51,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 12: [2023-05-10 10:10:51,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 25: [2023-05-10 10:10:51,824] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 26: [2023-05-10 10:10:51,825] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 14: [2023-05-10 10:10:51,825] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 25: [2023-05-10 10:10:51,825] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 14: [2023-05-10 10:10:51,827] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 25: [2023-05-10 10:10:51,827] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 25: [2023-05-10 10:10:51,828] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 25: [2023-05-10 10:10:51,828] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 26: [2023-05-10 10:10:51,829] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 26: [2023-05-10 10:10:51,829] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 9: [2023-05-10 10:10:51,829] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 9: [2023-05-10 10:10:51,829] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 9: [2023-05-10 10:10:51,829] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 9: [2023-05-10 10:10:51,829] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 30: [2023-05-10 10:10:51,829] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 30: [2023-05-10 10:10:51,829] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 30: [2023-05-10 10:10:51,829] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 30: [2023-05-10 10:10:51,829] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 30: [2023-05-10 10:10:51,829] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 30: [2023-05-10 10:10:51,829] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 30: [2023-05-10 10:10:51,829] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 9: [2023-05-10 10:10:51,829] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 30: [2023-05-10 10:10:51,829] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 9: [2023-05-10 10:10:51,831] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 0: [2023-05-10 10:10:51,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 6: [2023-05-10 10:10:51,832] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 6: [2023-05-10 10:10:51,832] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 0: [2023-05-10 10:10:51,833] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 21: [2023-05-10 10:10:51,834] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 21: [2023-05-10 10:10:51,834] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 7: [2023-05-10 10:10:51,834] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 7: [2023-05-10 10:10:51,834] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 21: [2023-05-10 10:10:51,834] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 21: [2023-05-10 10:10:51,834] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 9: [2023-05-10 10:10:51,834] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 21: [2023-05-10 10:10:51,834] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 21: [2023-05-10 10:10:51,834] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 21: [2023-05-10 10:10:51,834] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 0: [2023-05-10 10:10:51,835] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 21: [2023-05-10 10:10:51,835] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 0: [2023-05-10 10:10:51,835] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 7: [2023-05-10 10:10:51,835] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 7: [2023-05-10 10:10:51,835] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 7: [2023-05-10 10:10:51,835] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 7: [2023-05-10 10:10:51,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 7: [2023-05-10 10:10:51,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 7: [2023-05-10 10:10:51,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 14: [2023-05-10 10:10:51,836] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 9: [2023-05-10 10:10:51,836] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 24: [2023-05-10 10:10:51,839] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 11: [2023-05-10 10:10:51,839] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 11: [2023-05-10 10:10:51,839] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 24: [2023-05-10 10:10:51,840] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 7: [2023-05-10 10:10:51,840] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 7: [2023-05-10 10:10:51,840] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 2: [2023-05-10 10:10:51,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 24: [2023-05-10 10:10:51,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 2: [2023-05-10 10:10:51,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 7: [2023-05-10 10:10:51,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 24: [2023-05-10 10:10:51,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 21: [2023-05-10 10:10:51,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 21: [2023-05-10 10:10:51,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 21: [2023-05-10 10:10:51,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 7: [2023-05-10 10:10:51,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 30: [2023-05-10 10:10:51,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 30: [2023-05-10 10:10:51,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 21: [2023-05-10 10:10:51,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 12: [2023-05-10 10:10:51,842] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 30: [2023-05-10 10:10:51,842] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 21: [2023-05-10 10:10:51,842] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 21: [2023-05-10 10:10:51,842] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 30: [2023-05-10 10:10:51,842] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 11: [2023-05-10 10:10:51,842] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 11: [2023-05-10 10:10:51,842] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 11: [2023-05-10 10:10:51,842] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 11: [2023-05-10 10:10:51,842] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 11: [2023-05-10 10:10:51,842] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 24: [2023-05-10 10:10:51,842] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 11: [2023-05-10 10:10:51,843] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 30: [2023-05-10 10:10:51,843] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 21: [2023-05-10 10:10:51,843] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 30: [2023-05-10 10:10:51,843] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 7: [2023-05-10 10:10:51,843] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 21: [2023-05-10 10:10:51,843] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 30: [2023-05-10 10:10:51,843] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 24: [2023-05-10 10:10:51,843] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 12: [2023-05-10 10:10:51,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 11: [2023-05-10 10:10:51,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 30: [2023-05-10 10:10:51,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 13: [2023-05-10 10:10:51,844] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 7: [2023-05-10 10:10:51,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 11: [2023-05-10 10:10:51,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 7: [2023-05-10 10:10:51,845] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 13: [2023-05-10 10:10:51,844] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 23: [2023-05-10 10:10:51,845] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 5: [2023-05-10 10:10:51,844] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 7: [2023-05-10 10:10:51,845] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 3: [2023-05-10 10:10:51,845] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 3: [2023-05-10 10:10:51,845] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 5: [2023-05-10 10:10:51,845] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 5: [2023-05-10 10:10:51,845] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 3: [2023-05-10 10:10:51,845] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 3: [2023-05-10 10:10:51,845] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 3: [2023-05-10 10:10:51,846] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 3: [2023-05-10 10:10:51,846] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 3: [2023-05-10 10:10:51,846] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 3: [2023-05-10 10:10:51,846] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 5: [2023-05-10 10:10:51,848] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 23: [2023-05-10 10:10:51,848] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 23: [2023-05-10 10:10:51,848] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 23: [2023-05-10 10:10:51,848] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 14: [2023-05-10 10:10:51,848] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 14: [2023-05-10 10:10:51,849] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 2: [2023-05-10 10:10:51,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 2: [2023-05-10 10:10:51,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 2: [2023-05-10 10:10:51,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 14: [2023-05-10 10:10:51,850] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 3: [2023-05-10 10:10:51,850] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 3: [2023-05-10 10:10:51,850] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 11: [2023-05-10 10:10:51,850] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 2: [2023-05-10 10:10:51,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 2: [2023-05-10 10:10:51,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 3: [2023-05-10 10:10:51,851] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 2: [2023-05-10 10:10:51,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 17: [2023-05-10 10:10:51,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 11: [2023-05-10 10:10:51,852] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 14: [2023-05-10 10:10:51,852] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 6: [2023-05-10 10:10:51,853] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 6: [2023-05-10 10:10:51,853] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 11: [2023-05-10 10:10:51,853] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 3: [2023-05-10 10:10:51,854] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 11: [2023-05-10 10:10:51,854] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 9: [2023-05-10 10:10:51,854] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 11: [2023-05-10 10:10:51,854] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 12: [2023-05-10 10:10:51,855] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 3: [2023-05-10 10:10:51,855] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 14: [2023-05-10 10:10:51,855] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 3: [2023-05-10 10:10:51,855] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 12: [2023-05-10 10:10:51,855] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 11: [2023-05-10 10:10:51,855] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 3: [2023-05-10 10:10:51,855] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 3: [2023-05-10 10:10:51,855] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 12: [2023-05-10 10:10:51,856] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 12: [2023-05-10 10:10:51,857] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 6: [2023-05-10 10:10:51,856] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 6: [2023-05-10 10:10:51,856] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 12: [2023-05-10 10:10:51,857] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 9: [2023-05-10 10:10:51,857] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 9: [2023-05-10 10:10:51,857] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 19: [2023-05-10 10:10:51,857] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 19: [2023-05-10 10:10:51,857] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 9: [2023-05-10 10:10:51,859] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 29: [2023-05-10 10:10:51,858] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 29: [2023-05-10 10:10:51,858] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 29: [2023-05-10 10:10:51,858] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 29: [2023-05-10 10:10:51,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 29: [2023-05-10 10:10:51,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 29: [2023-05-10 10:10:51,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 29: [2023-05-10 10:10:51,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 29: [2023-05-10 10:10:51,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 5: [2023-05-10 10:10:51,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 19: [2023-05-10 10:10:51,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 19: [2023-05-10 10:10:51,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 19: [2023-05-10 10:10:51,861] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 19: [2023-05-10 10:10:51,861] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 19: [2023-05-10 10:10:51,861] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 19: [2023-05-10 10:10:51,861] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 9: [2023-05-10 10:10:51,861] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 9: [2023-05-10 10:10:51,861] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 19: [2023-05-10 10:10:51,862] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 19: [2023-05-10 10:10:51,863] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 29: [2023-05-10 10:10:51,863] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 29: [2023-05-10 10:10:51,863] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 6: [2023-05-10 10:10:51,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 6: [2023-05-10 10:10:51,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 6: [2023-05-10 10:10:51,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 6: [2023-05-10 10:10:51,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 9: [2023-05-10 10:10:51,864] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 23: [2023-05-10 10:10:51,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 23: [2023-05-10 10:10:51,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 23: [2023-05-10 10:10:51,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 5: [2023-05-10 10:10:51,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 5: [2023-05-10 10:10:51,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 5: [2023-05-10 10:10:51,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 23: [2023-05-10 10:10:51,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 19: [2023-05-10 10:10:51,865] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 19: [2023-05-10 10:10:51,865] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 5: [2023-05-10 10:10:51,866] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 29: [2023-05-10 10:10:51,868] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 29: [2023-05-10 10:10:51,868] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 29: [2023-05-10 10:10:51,868] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 23: [2023-05-10 10:10:51,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 29: [2023-05-10 10:10:51,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 19: [2023-05-10 10:10:51,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 29: [2023-05-10 10:10:51,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 19: [2023-05-10 10:10:51,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 19: [2023-05-10 10:10:51,870] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 29: [2023-05-10 10:10:51,870] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 5: [2023-05-10 10:10:51,870] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 5: [2023-05-10 10:10:51,870] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 19: [2023-05-10 10:10:51,870] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt... 23: [2023-05-10 10:10:51,870] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 23: [2023-05-10 10:10:51,872] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 13: [2023-05-10 10:10:51,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 23: [2023-05-10 10:10:51,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 13: [2023-05-10 10:10:51,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 13: [2023-05-10 10:10:51,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 17: [2023-05-10 10:10:51,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 13: [2023-05-10 10:10:51,876] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 13: [2023-05-10 10:10:51,876] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 13: [2023-05-10 10:10:51,876] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 13: [2023-05-10 10:10:51,877] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 13: [2023-05-10 10:10:51,877] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 2: [2023-05-10 10:10:51,877] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 5: [2023-05-10 10:10:51,877] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 2: [2023-05-10 10:10:51,882] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 6: [2023-05-10 10:10:51,883] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 6: [2023-05-10 10:10:51,883] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 2: [2023-05-10 10:10:51,884] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 2: [2023-05-10 10:10:51,885] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 17: [2023-05-10 10:10:51,884] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 17: [2023-05-10 10:10:51,884] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 5: [2023-05-10 10:10:51,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 2: [2023-05-10 10:10:51,888] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 2: [2023-05-10 10:10:51,888] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 17: [2023-05-10 10:10:51,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 17: [2023-05-10 10:10:51,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 17: [2023-05-10 10:10:51,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 17: [2023-05-10 10:10:51,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 17: [2023-05-10 10:10:51,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 23: [2023-05-10 10:10:51,890] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 23: [2023-05-10 10:10:51,891] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 5: [2023-05-10 10:10:51,893] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 23: [2023-05-10 10:10:51,893] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 5: [2023-05-10 10:10:51,893] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 7: [2023-05-10 10:10:51,893] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 5: [2023-05-10 10:10:51,894] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 6: [2023-05-10 10:10:51,894] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 23: [2023-05-10 10:10:51,894] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 6: [2023-05-10 10:10:51,894] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 6: [2023-05-10 10:10:51,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 11: [2023-05-10 10:10:51,895] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 11: [2023-05-10 10:10:51,895] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 6: [2023-05-10 10:10:51,896] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 21: [2023-05-10 10:10:51,896] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 21: [2023-05-10 10:10:51,897] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 21: [2023-05-10 10:10:51,897] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 7: [2023-05-10 10:10:51,901] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 7: [2023-05-10 10:10:51,901] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 7: [2023-05-10 10:10:51,901] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 7: [2023-05-10 10:10:51,901] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 13: [2023-05-10 10:10:51,904] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 7: [2023-05-10 10:10:51,904] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 7: [2023-05-10 10:10:51,904] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 7: [2023-05-10 10:10:51,904] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 3: [2023-05-10 10:10:51,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 3: [2023-05-10 10:10:51,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 3: [2023-05-10 10:10:51,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 21: [2023-05-10 10:10:51,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 21: [2023-05-10 10:10:51,907] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 21: [2023-05-10 10:10:51,907] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 21: [2023-05-10 10:10:51,907] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 21: [2023-05-10 10:10:51,907] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 13: [2023-05-10 10:10:51,910] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 13: [2023-05-10 10:10:51,910] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 11: [2023-05-10 10:10:51,910] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 13: [2023-05-10 10:10:51,912] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 13: [2023-05-10 10:10:51,912] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 13: [2023-05-10 10:10:51,912] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 3: [2023-05-10 10:10:51,913] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 17: [2023-05-10 10:10:51,914] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 29: [2023-05-10 10:10:51,914] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 29: [2023-05-10 10:10:51,914] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 7: [2023-05-10 10:10:51,915] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 19: [2023-05-10 10:10:51,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 30: [2023-05-10 10:10:51,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 30: [2023-05-10 10:10:51,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 17: [2023-05-10 10:10:51,918] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 19: [2023-05-10 10:10:51,918] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 19: [2023-05-10 10:10:51,918] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 30: [2023-05-10 10:10:51,919] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 11: [2023-05-10 10:10:51,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 11: [2023-05-10 10:10:51,921] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 11: [2023-05-10 10:10:51,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 19: [2023-05-10 10:10:51,920] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 3: [2023-05-10 10:10:51,923] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 3: [2023-05-10 10:10:51,923] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 3: [2023-05-10 10:10:51,923] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 17: [2023-05-10 10:10:51,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 21: [2023-05-10 10:10:51,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 3: [2023-05-10 10:10:51,923] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 21: [2023-05-10 10:10:51,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 11: [2023-05-10 10:10:51,924] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 11: [2023-05-10 10:10:51,924] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 21: [2023-05-10 10:10:51,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 11: [2023-05-10 10:10:51,924] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 17: [2023-05-10 10:10:51,926] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 17: [2023-05-10 10:10:51,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 3: [2023-05-10 10:10:51,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 7: [2023-05-10 10:10:51,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 3: [2023-05-10 10:10:51,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 3: [2023-05-10 10:10:51,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 17: [2023-05-10 10:10:51,929] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 17: [2023-05-10 10:10:51,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 7: [2023-05-10 10:10:51,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 7: [2023-05-10 10:10:51,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 7: [2023-05-10 10:10:51,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 7: [2023-05-10 10:10:51,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 7: [2023-05-10 10:10:51,932] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 7: [2023-05-10 10:10:51,933] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 11: [2023-05-10 10:10:51,933] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 29: [2023-05-10 10:10:51,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 29: [2023-05-10 10:10:51,935] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 21: [2023-05-10 10:10:51,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 21: [2023-05-10 10:10:51,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 29: [2023-05-10 10:10:51,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 21: [2023-05-10 10:10:51,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 21: [2023-05-10 10:10:51,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 30: [2023-05-10 10:10:51,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 30: [2023-05-10 10:10:51,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 30: [2023-05-10 10:10:51,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 30: [2023-05-10 10:10:51,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 21: [2023-05-10 10:10:51,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 3: [2023-05-10 10:10:51,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 19: [2023-05-10 10:10:51,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 19: [2023-05-10 10:10:51,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 19: [2023-05-10 10:10:51,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 19: [2023-05-10 10:10:51,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 29: [2023-05-10 10:10:51,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 29: [2023-05-10 10:10:51,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 29: [2023-05-10 10:10:51,940] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 29: [2023-05-10 10:10:51,940] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 19: [2023-05-10 10:10:51,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 29: [2023-05-10 10:10:51,940] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 11: [2023-05-10 10:10:51,941] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 19: [2023-05-10 10:10:51,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 19: [2023-05-10 10:10:51,942] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 30: [2023-05-10 10:10:51,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 30: [2023-05-10 10:10:51,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 19: [2023-05-10 10:10:51,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 30: [2023-05-10 10:10:51,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_12-model_00-model_states.pt. 3: [2023-05-10 10:10:51,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 11: [2023-05-10 10:10:51,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 11: [2023-05-10 10:10:51,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 11: [2023-05-10 10:10:51,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 11: [2023-05-10 10:10:51,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 30: [2023-05-10 10:10:51,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 3: [2023-05-10 10:10:51,955] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 3: [2023-05-10 10:10:51,956] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 3: [2023-05-10 10:10:51,956] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 29: [2023-05-10 10:10:51,960] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 11: [2023-05-10 10:10:51,966] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 19: [2023-05-10 10:10:51,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 19: [2023-05-10 10:10:51,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 19: [2023-05-10 10:10:51,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 19: [2023-05-10 10:10:51,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 29: [2023-05-10 10:10:51,969] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 30: [2023-05-10 10:10:51,970] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 30: [2023-05-10 10:10:51,974] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 30: [2023-05-10 10:10:51,975] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 8: [2023-05-10 10:10:51,975] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 8: [2023-05-10 10:10:51,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 8: [2023-05-10 10:10:51,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 8: [2023-05-10 10:10:51,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 8: [2023-05-10 10:10:51,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 8: [2023-05-10 10:10:51,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 8: [2023-05-10 10:10:51,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 29: [2023-05-10 10:10:51,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 8: [2023-05-10 10:10:51,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 29: [2023-05-10 10:10:51,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 29: [2023-05-10 10:10:51,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 29: [2023-05-10 10:10:51,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 30: [2023-05-10 10:10:51,978] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 30: [2023-05-10 10:10:51,978] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 8: [2023-05-10 10:10:51,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 8: [2023-05-10 10:10:51,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 8: [2023-05-10 10:10:51,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 8: [2023-05-10 10:10:51,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 8: [2023-05-10 10:10:51,983] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 8: [2023-05-10 10:10:51,983] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 8: [2023-05-10 10:10:51,983] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 8: [2023-05-10 10:10:51,983] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 20: [2023-05-10 10:10:52,009] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 20: [2023-05-10 10:10:52,010] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 20: [2023-05-10 10:10:52,010] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 20: [2023-05-10 10:10:52,010] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 20: [2023-05-10 10:10:52,010] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 20: [2023-05-10 10:10:52,010] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 20: [2023-05-10 10:10:52,010] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 20: [2023-05-10 10:10:52,010] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 20: [2023-05-10 10:10:52,012] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 20: [2023-05-10 10:10:52,020] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 20: [2023-05-10 10:10:52,021] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 20: [2023-05-10 10:10:52,021] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 20: [2023-05-10 10:10:52,022] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 20: [2023-05-10 10:10:52,022] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 20: [2023-05-10 10:10:52,022] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 20: [2023-05-10 10:10:52,023] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 8: [2023-05-10 10:10:52,036] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 8: [2023-05-10 10:10:52,039] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 8: [2023-05-10 10:10:52,039] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 8: [2023-05-10 10:10:52,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 8: [2023-05-10 10:10:52,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 8: [2023-05-10 10:10:52,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 8: [2023-05-10 10:10:52,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 8: [2023-05-10 10:10:52,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 4: [2023-05-10 10:10:52,051] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 4: [2023-05-10 10:10:52,051] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 4: [2023-05-10 10:10:52,052] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 4: [2023-05-10 10:10:52,052] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 31: [2023-05-10 10:10:52,052] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 4: [2023-05-10 10:10:52,052] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 31: [2023-05-10 10:10:52,053] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 4: [2023-05-10 10:10:52,053] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 4: [2023-05-10 10:10:52,053] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 4: [2023-05-10 10:10:52,053] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 31: [2023-05-10 10:10:52,053] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 31: [2023-05-10 10:10:52,053] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 31: [2023-05-10 10:10:52,053] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 31: [2023-05-10 10:10:52,053] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 31: [2023-05-10 10:10:52,053] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 31: [2023-05-10 10:10:52,054] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 20: [2023-05-10 10:10:52,054] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 31: [2023-05-10 10:10:52,056] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 4: [2023-05-10 10:10:52,057] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 4: [2023-05-10 10:10:52,057] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 31: [2023-05-10 10:10:52,058] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 4: [2023-05-10 10:10:52,058] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 4: [2023-05-10 10:10:52,058] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 31: [2023-05-10 10:10:52,059] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 4: [2023-05-10 10:10:52,060] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 4: [2023-05-10 10:10:52,060] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 4: [2023-05-10 10:10:52,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 4: [2023-05-10 10:10:52,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 31: [2023-05-10 10:10:52,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 31: [2023-05-10 10:10:52,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 31: [2023-05-10 10:10:52,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 31: [2023-05-10 10:10:52,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 31: [2023-05-10 10:10:52,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 8: [2023-05-10 10:10:52,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 8: [2023-05-10 10:10:52,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 8: [2023-05-10 10:10:52,075] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 20: [2023-05-10 10:10:52,076] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 8: [2023-05-10 10:10:52,078] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 8: [2023-05-10 10:10:52,081] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 8: [2023-05-10 10:10:52,081] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 8: [2023-05-10 10:10:52,081] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 1: [2023-05-10 10:10:52,082] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 1: [2023-05-10 10:10:52,082] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 1: [2023-05-10 10:10:52,082] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 1: [2023-05-10 10:10:52,082] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 1: [2023-05-10 10:10:52,082] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 1: [2023-05-10 10:10:52,082] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 1: [2023-05-10 10:10:52,082] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 8: [2023-05-10 10:10:52,082] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 1: [2023-05-10 10:10:52,082] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 26: [2023-05-10 10:10:52,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 26: [2023-05-10 10:10:52,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 26: [2023-05-10 10:10:52,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 26: [2023-05-10 10:10:52,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 26: [2023-05-10 10:10:52,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 26: [2023-05-10 10:10:52,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 26: [2023-05-10 10:10:52,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 26: [2023-05-10 10:10:52,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 1: [2023-05-10 10:10:52,088] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 1: [2023-05-10 10:10:52,088] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 26: [2023-05-10 10:10:52,089] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 1: [2023-05-10 10:10:52,089] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 26: [2023-05-10 10:10:52,089] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 1: [2023-05-10 10:10:52,090] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 1: [2023-05-10 10:10:52,090] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 1: [2023-05-10 10:10:52,090] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 1: [2023-05-10 10:10:52,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 26: [2023-05-10 10:10:52,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 1: [2023-05-10 10:10:52,092] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 20: [2023-05-10 10:10:52,093] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 20: [2023-05-10 10:10:52,094] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 26: [2023-05-10 10:10:52,095] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 26: [2023-05-10 10:10:52,095] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 26: [2023-05-10 10:10:52,096] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 26: [2023-05-10 10:10:52,096] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 26: [2023-05-10 10:10:52,097] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 20: [2023-05-10 10:10:52,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 20: [2023-05-10 10:10:52,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 20: [2023-05-10 10:10:52,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 20: [2023-05-10 10:10:52,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 20: [2023-05-10 10:10:52,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 31: [2023-05-10 10:10:52,106] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 31: [2023-05-10 10:10:52,108] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 4: [2023-05-10 10:10:52,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 4: [2023-05-10 10:10:52,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 4: [2023-05-10 10:10:52,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 4: [2023-05-10 10:10:52,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 31: [2023-05-10 10:10:52,117] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 20: [2023-05-10 10:10:52,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 20: [2023-05-10 10:10:52,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 4: [2023-05-10 10:10:52,121] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 4: [2023-05-10 10:10:52,121] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 4: [2023-05-10 10:10:52,121] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 4: [2023-05-10 10:10:52,121] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 31: [2023-05-10 10:10:52,130] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 31: [2023-05-10 10:10:52,131] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 20: [2023-05-10 10:10:52,131] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 20: [2023-05-10 10:10:52,132] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 20: [2023-05-10 10:10:52,135] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 20: [2023-05-10 10:10:52,136] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 20: [2023-05-10 10:10:52,136] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 25: [2023-05-10 10:10:52,136] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 25: [2023-05-10 10:10:52,137] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 25: [2023-05-10 10:10:52,137] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 25: [2023-05-10 10:10:52,137] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 25: [2023-05-10 10:10:52,137] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 25: [2023-05-10 10:10:52,137] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 25: [2023-05-10 10:10:52,137] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 25: [2023-05-10 10:10:52,138] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 31: [2023-05-10 10:10:52,140] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 31: [2023-05-10 10:10:52,140] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 31: [2023-05-10 10:10:52,140] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 31: [2023-05-10 10:10:52,140] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 31: [2023-05-10 10:10:52,140] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 31: [2023-05-10 10:10:52,141] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 4: [2023-05-10 10:10:52,141] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 4: [2023-05-10 10:10:52,141] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 1: [2023-05-10 10:10:52,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 1: [2023-05-10 10:10:52,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 26: [2023-05-10 10:10:52,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 25: [2023-05-10 10:10:52,143] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 25: [2023-05-10 10:10:52,143] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 25: [2023-05-10 10:10:52,144] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 26: [2023-05-10 10:10:52,144] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 25: [2023-05-10 10:10:52,145] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 25: [2023-05-10 10:10:52,145] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 25: [2023-05-10 10:10:52,145] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 25: [2023-05-10 10:10:52,145] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 4: [2023-05-10 10:10:52,145] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 4: [2023-05-10 10:10:52,145] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 1: [2023-05-10 10:10:52,146] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 25: [2023-05-10 10:10:52,145] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 4: [2023-05-10 10:10:52,150] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 4: [2023-05-10 10:10:52,152] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 4: [2023-05-10 10:10:52,153] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 1: [2023-05-10 10:10:52,154] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 1: [2023-05-10 10:10:52,154] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 1: [2023-05-10 10:10:52,154] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 1: [2023-05-10 10:10:52,154] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 1: [2023-05-10 10:10:52,155] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 4: [2023-05-10 10:10:52,156] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 26: [2023-05-10 10:10:52,156] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 26: [2023-05-10 10:10:52,163] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 26: [2023-05-10 10:10:52,166] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 26: [2023-05-10 10:10:52,167] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 26: [2023-05-10 10:10:52,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 26: [2023-05-10 10:10:52,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 31: [2023-05-10 10:10:52,169] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 31: [2023-05-10 10:10:52,169] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 1: [2023-05-10 10:10:52,170] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 26: [2023-05-10 10:10:52,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 1: [2023-05-10 10:10:52,171] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 28: [2023-05-10 10:10:52,171] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 28: [2023-05-10 10:10:52,171] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 31: [2023-05-10 10:10:52,171] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 26: [2023-05-10 10:10:52,171] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 31: [2023-05-10 10:10:52,171] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 31: [2023-05-10 10:10:52,171] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 12: [2023-05-10 10:10:52,171] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 12: [2023-05-10 10:10:52,171] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 12: [2023-05-10 10:10:52,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 28: [2023-05-10 10:10:52,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 28: [2023-05-10 10:10:52,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 28: [2023-05-10 10:10:52,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 28: [2023-05-10 10:10:52,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 12: [2023-05-10 10:10:52,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 12: [2023-05-10 10:10:52,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 12: [2023-05-10 10:10:52,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 12: [2023-05-10 10:10:52,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 28: [2023-05-10 10:10:52,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 12: [2023-05-10 10:10:52,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 28: [2023-05-10 10:10:52,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 1: [2023-05-10 10:10:52,173] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 28: [2023-05-10 10:10:52,175] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 28: [2023-05-10 10:10:52,175] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 12: [2023-05-10 10:10:52,176] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 26: [2023-05-10 10:10:52,179] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 1: [2023-05-10 10:10:52,183] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 12: [2023-05-10 10:10:52,183] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 12: [2023-05-10 10:10:52,183] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 28: [2023-05-10 10:10:52,182] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 28: [2023-05-10 10:10:52,182] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 1: [2023-05-10 10:10:52,184] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 28: [2023-05-10 10:10:52,183] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 28: [2023-05-10 10:10:52,184] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 28: [2023-05-10 10:10:52,184] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 28: [2023-05-10 10:10:52,184] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 1: [2023-05-10 10:10:52,184] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 12: [2023-05-10 10:10:52,184] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 12: [2023-05-10 10:10:52,184] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 12: [2023-05-10 10:10:52,185] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 12: [2023-05-10 10:10:52,185] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 12: [2023-05-10 10:10:52,186] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 1: [2023-05-10 10:10:52,186] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 1: [2023-05-10 10:10:52,187] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 26: [2023-05-10 10:10:52,190] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 22: [2023-05-10 10:10:52,191] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 22: [2023-05-10 10:10:52,191] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 22: [2023-05-10 10:10:52,191] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 22: [2023-05-10 10:10:52,191] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 22: [2023-05-10 10:10:52,191] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 22: [2023-05-10 10:10:52,191] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 22: [2023-05-10 10:10:52,191] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 22: [2023-05-10 10:10:52,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 22: [2023-05-10 10:10:52,195] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 22: [2023-05-10 10:10:52,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 26: [2023-05-10 10:10:52,198] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 25: [2023-05-10 10:10:52,201] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 22: [2023-05-10 10:10:52,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 25: [2023-05-10 10:10:52,201] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 25: [2023-05-10 10:10:52,201] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 26: [2023-05-10 10:10:52,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 22: [2023-05-10 10:10:52,203] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 22: [2023-05-10 10:10:52,203] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 25: [2023-05-10 10:10:52,203] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 22: [2023-05-10 10:10:52,203] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 25: [2023-05-10 10:10:52,204] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 22: [2023-05-10 10:10:52,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 25: [2023-05-10 10:10:52,204] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 25: [2023-05-10 10:10:52,204] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 25: [2023-05-10 10:10:52,204] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 26: [2023-05-10 10:10:52,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 22: [2023-05-10 10:10:52,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 26: [2023-05-10 10:10:52,207] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 29: [2023-05-10 10:10:52,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 29: [2023-05-10 10:10:52,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 29: [2023-05-10 10:10:52,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 29: [2023-05-10 10:10:52,207] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 29: [2023-05-10 10:10:52,207] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 29: [2023-05-10 10:10:52,207] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 29: [2023-05-10 10:10:52,207] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 29: [2023-05-10 10:10:52,207] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 29: [2023-05-10 10:10:52,211] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 29: [2023-05-10 10:10:52,211] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 14: [2023-05-10 10:10:52,213] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 14: [2023-05-10 10:10:52,213] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 14: [2023-05-10 10:10:52,215] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 14: [2023-05-10 10:10:52,216] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 14: [2023-05-10 10:10:52,216] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 14: [2023-05-10 10:10:52,216] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 14: [2023-05-10 10:10:52,216] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 29: [2023-05-10 10:10:52,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 29: [2023-05-10 10:10:52,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 29: [2023-05-10 10:10:52,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 14: [2023-05-10 10:10:52,216] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 29: [2023-05-10 10:10:52,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 29: [2023-05-10 10:10:52,217] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 29: [2023-05-10 10:10:52,217] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 18: [2023-05-10 10:10:52,219] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 18: [2023-05-10 10:10:52,219] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 27: [2023-05-10 10:10:52,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 27: [2023-05-10 10:10:52,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 18: [2023-05-10 10:10:52,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 18: [2023-05-10 10:10:52,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 27: [2023-05-10 10:10:52,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 18: [2023-05-10 10:10:52,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 18: [2023-05-10 10:10:52,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 18: [2023-05-10 10:10:52,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 27: [2023-05-10 10:10:52,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 27: [2023-05-10 10:10:52,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 27: [2023-05-10 10:10:52,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 27: [2023-05-10 10:10:52,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 18: [2023-05-10 10:10:52,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 27: [2023-05-10 10:10:52,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 14: [2023-05-10 10:10:52,221] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 14: [2023-05-10 10:10:52,221] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 14: [2023-05-10 10:10:52,221] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 14: [2023-05-10 10:10:52,222] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 14: [2023-05-10 10:10:52,222] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 14: [2023-05-10 10:10:52,223] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 10: [2023-05-10 10:10:52,223] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 10: [2023-05-10 10:10:52,223] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 14: [2023-05-10 10:10:52,224] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 10: [2023-05-10 10:10:52,224] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 10: [2023-05-10 10:10:52,224] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 10: [2023-05-10 10:10:52,224] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 14: [2023-05-10 10:10:52,224] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 23: [2023-05-10 10:10:52,224] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 23: [2023-05-10 10:10:52,224] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 10: [2023-05-10 10:10:52,224] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 10: [2023-05-10 10:10:52,224] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 23: [2023-05-10 10:10:52,224] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 10: [2023-05-10 10:10:52,224] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 23: [2023-05-10 10:10:52,225] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 18: [2023-05-10 10:10:52,224] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 18: [2023-05-10 10:10:52,224] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 23: [2023-05-10 10:10:52,225] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 23: [2023-05-10 10:10:52,225] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 23: [2023-05-10 10:10:52,225] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 18: [2023-05-10 10:10:52,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 18: [2023-05-10 10:10:52,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 27: [2023-05-10 10:10:52,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 28: [2023-05-10 10:10:52,226] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 28: [2023-05-10 10:10:52,226] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 23: [2023-05-10 10:10:52,226] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 27: [2023-05-10 10:10:52,226] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 18: [2023-05-10 10:10:52,227] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 18: [2023-05-10 10:10:52,227] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 18: [2023-05-10 10:10:52,228] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 15: [2023-05-10 10:10:52,229] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 15: [2023-05-10 10:10:52,229] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 10: [2023-05-10 10:10:52,228] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 18: [2023-05-10 10:10:52,229] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 27: [2023-05-10 10:10:52,229] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 15: [2023-05-10 10:10:52,229] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 27: [2023-05-10 10:10:52,229] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 0: [2023-05-10 10:10:52,229] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 15: [2023-05-10 10:10:52,229] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 15: [2023-05-10 10:10:52,229] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 12: [2023-05-10 10:10:52,229] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 27: [2023-05-10 10:10:52,229] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 15: [2023-05-10 10:10:52,229] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 0: [2023-05-10 10:10:52,229] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 15: [2023-05-10 10:10:52,229] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 0: [2023-05-10 10:10:52,229] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 27: [2023-05-10 10:10:52,230] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 0: [2023-05-10 10:10:52,230] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 0: [2023-05-10 10:10:52,230] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 0: [2023-05-10 10:10:52,230] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 0: [2023-05-10 10:10:52,230] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 15: [2023-05-10 10:10:52,230] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 25: [2023-05-10 10:10:52,229] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 23: [2023-05-10 10:10:52,229] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 23: [2023-05-10 10:10:52,230] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 0: [2023-05-10 10:10:52,230] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 23: [2023-05-10 10:10:52,230] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 16: [2023-05-10 10:10:52,230] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 16: [2023-05-10 10:10:52,230] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 27: [2023-05-10 10:10:52,230] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 10: [2023-05-10 10:10:52,230] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 27: [2023-05-10 10:10:52,231] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 16: [2023-05-10 10:10:52,231] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 16: [2023-05-10 10:10:52,231] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 16: [2023-05-10 10:10:52,231] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 16: [2023-05-10 10:10:52,231] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 16: [2023-05-10 10:10:52,231] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 25: [2023-05-10 10:10:52,231] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 10: [2023-05-10 10:10:52,231] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 25: [2023-05-10 10:10:52,231] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 10: [2023-05-10 10:10:52,231] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 16: [2023-05-10 10:10:52,231] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 10: [2023-05-10 10:10:52,231] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 10: [2023-05-10 10:10:52,232] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 25: [2023-05-10 10:10:52,232] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 10: [2023-05-10 10:10:52,233] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 7: [2023-05-10 10:10:52,233] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 7: [2023-05-10 10:10:52,233] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 23: [2023-05-10 10:10:52,233] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 23: [2023-05-10 10:10:52,233] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 23: [2023-05-10 10:10:52,233] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 15: [2023-05-10 10:10:52,233] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 10: [2023-05-10 10:10:52,234] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 7: [2023-05-10 10:10:52,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 23: [2023-05-10 10:10:52,234] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 7: [2023-05-10 10:10:52,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 0: [2023-05-10 10:10:52,234] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 7: [2023-05-10 10:10:52,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 7: [2023-05-10 10:10:52,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 7: [2023-05-10 10:10:52,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 23: [2023-05-10 10:10:52,234] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 0: [2023-05-10 10:10:52,234] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 7: [2023-05-10 10:10:52,235] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 25: [2023-05-10 10:10:52,235] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 0: [2023-05-10 10:10:52,235] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 25: [2023-05-10 10:10:52,235] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 16: [2023-05-10 10:10:52,236] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 0: [2023-05-10 10:10:52,236] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 0: [2023-05-10 10:10:52,236] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 25: [2023-05-10 10:10:52,236] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 25: [2023-05-10 10:10:52,236] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 0: [2023-05-10 10:10:52,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 16: [2023-05-10 10:10:52,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 0: [2023-05-10 10:10:52,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 0: [2023-05-10 10:10:52,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 7: [2023-05-10 10:10:52,238] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 7: [2023-05-10 10:10:52,238] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 16: [2023-05-10 10:10:52,239] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 13: [2023-05-10 10:10:52,239] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 13: [2023-05-10 10:10:52,239] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 13: [2023-05-10 10:10:52,239] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 13: [2023-05-10 10:10:52,239] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 13: [2023-05-10 10:10:52,239] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 7: [2023-05-10 10:10:52,239] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 16: [2023-05-10 10:10:52,240] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 13: [2023-05-10 10:10:52,239] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 13: [2023-05-10 10:10:52,239] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 22: [2023-05-10 10:10:52,239] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 7: [2023-05-10 10:10:52,240] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 13: [2023-05-10 10:10:52,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 15: [2023-05-10 10:10:52,239] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 16: [2023-05-10 10:10:52,240] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 16: [2023-05-10 10:10:52,240] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 15: [2023-05-10 10:10:52,240] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 16: [2023-05-10 10:10:52,240] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 15: [2023-05-10 10:10:52,241] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 15: [2023-05-10 10:10:52,241] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 15: [2023-05-10 10:10:52,241] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 16: [2023-05-10 10:10:52,242] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 15: [2023-05-10 10:10:52,242] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 7: [2023-05-10 10:10:52,242] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 15: [2023-05-10 10:10:52,243] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 7: [2023-05-10 10:10:52,243] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 7: [2023-05-10 10:10:52,244] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 7: [2023-05-10 10:10:52,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 28: [2023-05-10 10:10:52,246] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 28: [2023-05-10 10:10:52,246] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 21: [2023-05-10 10:10:52,248] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 21: [2023-05-10 10:10:52,248] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 21: [2023-05-10 10:10:52,248] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 21: [2023-05-10 10:10:52,248] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 21: [2023-05-10 10:10:52,248] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 21: [2023-05-10 10:10:52,248] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 21: [2023-05-10 10:10:52,248] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 21: [2023-05-10 10:10:52,248] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 13: [2023-05-10 10:10:52,250] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 28: [2023-05-10 10:10:52,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 28: [2023-05-10 10:10:52,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 12: [2023-05-10 10:10:52,251] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 13: [2023-05-10 10:10:52,251] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 13: [2023-05-10 10:10:52,252] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 28: [2023-05-10 10:10:52,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 28: [2023-05-10 10:10:52,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 28: [2023-05-10 10:10:52,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 13: [2023-05-10 10:10:52,252] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 28: [2023-05-10 10:10:52,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 13: [2023-05-10 10:10:52,253] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 13: [2023-05-10 10:10:52,253] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 13: [2023-05-10 10:10:52,254] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 13: [2023-05-10 10:10:52,254] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 21: [2023-05-10 10:10:52,254] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 21: [2023-05-10 10:10:52,255] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 21: [2023-05-10 10:10:52,255] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 21: [2023-05-10 10:10:52,255] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 12: [2023-05-10 10:10:52,256] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 12: [2023-05-10 10:10:52,256] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 21: [2023-05-10 10:10:52,256] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 21: [2023-05-10 10:10:52,257] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 21: [2023-05-10 10:10:52,257] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 22: [2023-05-10 10:10:52,258] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 21: [2023-05-10 10:10:52,258] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 17: [2023-05-10 10:10:52,259] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 17: [2023-05-10 10:10:52,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 17: [2023-05-10 10:10:52,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 17: [2023-05-10 10:10:52,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 17: [2023-05-10 10:10:52,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 17: [2023-05-10 10:10:52,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 17: [2023-05-10 10:10:52,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 17: [2023-05-10 10:10:52,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 22: [2023-05-10 10:10:52,263] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 29: [2023-05-10 10:10:52,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 29: [2023-05-10 10:10:52,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 17: [2023-05-10 10:10:52,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 19: [2023-05-10 10:10:52,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 19: [2023-05-10 10:10:52,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 19: [2023-05-10 10:10:52,267] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 19: [2023-05-10 10:10:52,267] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 19: [2023-05-10 10:10:52,267] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 19: [2023-05-10 10:10:52,267] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 19: [2023-05-10 10:10:52,267] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 19: [2023-05-10 10:10:52,267] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 12: [2023-05-10 10:10:52,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 12: [2023-05-10 10:10:52,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 12: [2023-05-10 10:10:52,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 12: [2023-05-10 10:10:52,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 12: [2023-05-10 10:10:52,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 19: [2023-05-10 10:10:52,270] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 9: [2023-05-10 10:10:52,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 9: [2023-05-10 10:10:52,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 19: [2023-05-10 10:10:52,271] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 19: [2023-05-10 10:10:52,272] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 19: [2023-05-10 10:10:52,273] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 17: [2023-05-10 10:10:52,272] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 17: [2023-05-10 10:10:52,273] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 17: [2023-05-10 10:10:52,273] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 17: [2023-05-10 10:10:52,273] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 22: [2023-05-10 10:10:52,273] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 22: [2023-05-10 10:10:52,273] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 9: [2023-05-10 10:10:52,273] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 9: [2023-05-10 10:10:52,273] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 17: [2023-05-10 10:10:52,273] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 9: [2023-05-10 10:10:52,274] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 9: [2023-05-10 10:10:52,274] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 9: [2023-05-10 10:10:52,274] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 9: [2023-05-10 10:10:52,274] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 17: [2023-05-10 10:10:52,275] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 17: [2023-05-10 10:10:52,275] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 19: [2023-05-10 10:10:52,276] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 19: [2023-05-10 10:10:52,276] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 19: [2023-05-10 10:10:52,276] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 9: [2023-05-10 10:10:52,276] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 14: [2023-05-10 10:10:52,277] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 9: [2023-05-10 10:10:52,277] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 19: [2023-05-10 10:10:52,277] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 22: [2023-05-10 10:10:52,277] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 22: [2023-05-10 10:10:52,277] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 22: [2023-05-10 10:10:52,277] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 22: [2023-05-10 10:10:52,277] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 14: [2023-05-10 10:10:52,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 14: [2023-05-10 10:10:52,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 27: [2023-05-10 10:10:52,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 27: [2023-05-10 10:10:52,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 12: [2023-05-10 10:10:52,281] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 9: [2023-05-10 10:10:52,280] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 9: [2023-05-10 10:10:52,281] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 22: [2023-05-10 10:10:52,281] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 15: [2023-05-10 10:10:52,281] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 9: [2023-05-10 10:10:52,281] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 9: [2023-05-10 10:10:52,281] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 12: [2023-05-10 10:10:52,282] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 9: [2023-05-10 10:10:52,282] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 9: [2023-05-10 10:10:52,283] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 29: [2023-05-10 10:10:52,283] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 29: [2023-05-10 10:10:52,283] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 29: [2023-05-10 10:10:52,284] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 18: [2023-05-10 10:10:52,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 18: [2023-05-10 10:10:52,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 18: [2023-05-10 10:10:52,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 18: [2023-05-10 10:10:52,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 23: [2023-05-10 10:10:52,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 23: [2023-05-10 10:10:52,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 23: [2023-05-10 10:10:52,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 28: [2023-05-10 10:10:52,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 28: [2023-05-10 10:10:52,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 10: [2023-05-10 10:10:52,286] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 10: [2023-05-10 10:10:52,286] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 28: [2023-05-10 10:10:52,287] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 28: [2023-05-10 10:10:52,287] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 10: [2023-05-10 10:10:52,287] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 29: [2023-05-10 10:10:52,287] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 29: [2023-05-10 10:10:52,287] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 29: [2023-05-10 10:10:52,287] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 29: [2023-05-10 10:10:52,287] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 29: [2023-05-10 10:10:52,287] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 10: [2023-05-10 10:10:52,288] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 14: [2023-05-10 10:10:52,288] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 14: [2023-05-10 10:10:52,288] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 14: [2023-05-10 10:10:52,288] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 14: [2023-05-10 10:10:52,288] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 14: [2023-05-10 10:10:52,288] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 18: [2023-05-10 10:10:52,288] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 18: [2023-05-10 10:10:52,288] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 30: [2023-05-10 10:10:52,289] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 18: [2023-05-10 10:10:52,288] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 18: [2023-05-10 10:10:52,288] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 28: [2023-05-10 10:10:52,290] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 30: [2023-05-10 10:10:52,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 30: [2023-05-10 10:10:52,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 30: [2023-05-10 10:10:52,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 30: [2023-05-10 10:10:52,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 30: [2023-05-10 10:10:52,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 30: [2023-05-10 10:10:52,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 3: [2023-05-10 10:10:52,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 3: [2023-05-10 10:10:52,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 30: [2023-05-10 10:10:52,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 16: [2023-05-10 10:10:52,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 10: [2023-05-10 10:10:52,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 6: [2023-05-10 10:10:52,291] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 6: [2023-05-10 10:10:52,291] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 16: [2023-05-10 10:10:52,291] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 28: [2023-05-10 10:10:52,292] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 0: [2023-05-10 10:10:52,293] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 0: [2023-05-10 10:10:52,293] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 0: [2023-05-10 10:10:52,293] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 30: [2023-05-10 10:10:52,293] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 3: [2023-05-10 10:10:52,293] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 3: [2023-05-10 10:10:52,293] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 3: [2023-05-10 10:10:52,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 3: [2023-05-10 10:10:52,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 3: [2023-05-10 10:10:52,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 3: [2023-05-10 10:10:52,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 6: [2023-05-10 10:10:52,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 6: [2023-05-10 10:10:52,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 6: [2023-05-10 10:10:52,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 6: [2023-05-10 10:10:52,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 6: [2023-05-10 10:10:52,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 6: [2023-05-10 10:10:52,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 3: [2023-05-10 10:10:52,295] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 3: [2023-05-10 10:10:52,296] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 6: [2023-05-10 10:10:52,296] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 27: [2023-05-10 10:10:52,296] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 6: [2023-05-10 10:10:52,296] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 10: [2023-05-10 10:10:52,296] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 24: [2023-05-10 10:10:52,297] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 24: [2023-05-10 10:10:52,297] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 7: [2023-05-10 10:10:52,296] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 7: [2023-05-10 10:10:52,296] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 7: [2023-05-10 10:10:52,296] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 7: [2023-05-10 10:10:52,296] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 2: [2023-05-10 10:10:52,297] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 2: [2023-05-10 10:10:52,297] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 24: [2023-05-10 10:10:52,297] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 24: [2023-05-10 10:10:52,297] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 24: [2023-05-10 10:10:52,297] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 24: [2023-05-10 10:10:52,297] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 24: [2023-05-10 10:10:52,297] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 23: [2023-05-10 10:10:52,297] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 23: [2023-05-10 10:10:52,297] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 23: [2023-05-10 10:10:52,297] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 24: [2023-05-10 10:10:52,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 23: [2023-05-10 10:10:52,297] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 3: [2023-05-10 10:10:52,298] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 3: [2023-05-10 10:10:52,299] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 27: [2023-05-10 10:10:52,299] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 27: [2023-05-10 10:10:52,299] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 27: [2023-05-10 10:10:52,299] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 27: [2023-05-10 10:10:52,299] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 27: [2023-05-10 10:10:52,299] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 12: [2023-05-10 10:10:52,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 30: [2023-05-10 10:10:52,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 2: [2023-05-10 10:10:52,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 2: [2023-05-10 10:10:52,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 2: [2023-05-10 10:10:52,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 2: [2023-05-10 10:10:52,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 2: [2023-05-10 10:10:52,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 23: [2023-05-10 10:10:52,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 2: [2023-05-10 10:10:52,301] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 12: [2023-05-10 10:10:52,301] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 2: [2023-05-10 10:10:52,302] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 6: [2023-05-10 10:10:52,302] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 24: [2023-05-10 10:10:52,301] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 24: [2023-05-10 10:10:52,302] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 2: [2023-05-10 10:10:52,302] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 3: [2023-05-10 10:10:52,302] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 3: [2023-05-10 10:10:52,302] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 30: [2023-05-10 10:10:52,302] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 14: [2023-05-10 10:10:52,302] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 3: [2023-05-10 10:10:52,302] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 30: [2023-05-10 10:10:52,302] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 15: [2023-05-10 10:10:52,302] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 7: [2023-05-10 10:10:52,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 7: [2023-05-10 10:10:52,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 7: [2023-05-10 10:10:52,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 7: [2023-05-10 10:10:52,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 3: [2023-05-10 10:10:52,303] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 14: [2023-05-10 10:10:52,303] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 30: [2023-05-10 10:10:52,303] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 27: [2023-05-10 10:10:52,303] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 30: [2023-05-10 10:10:52,303] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 30: [2023-05-10 10:10:52,304] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 27: [2023-05-10 10:10:52,304] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 24: [2023-05-10 10:10:52,304] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 24: [2023-05-10 10:10:52,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 24: [2023-05-10 10:10:52,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 14: [2023-05-10 10:10:52,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 0: [2023-05-10 10:10:52,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 0: [2023-05-10 10:10:52,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 0: [2023-05-10 10:10:52,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 0: [2023-05-10 10:10:52,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 24: [2023-05-10 10:10:52,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 0: [2023-05-10 10:10:52,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 30: [2023-05-10 10:10:52,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 24: [2023-05-10 10:10:52,306] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 6: [2023-05-10 10:10:52,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 6: [2023-05-10 10:10:52,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 6: [2023-05-10 10:10:52,306] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 22: [2023-05-10 10:10:52,306] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 24: [2023-05-10 10:10:52,306] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 12: [2023-05-10 10:10:52,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 12: [2023-05-10 10:10:52,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 6: [2023-05-10 10:10:52,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 23: [2023-05-10 10:10:52,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 6: [2023-05-10 10:10:52,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 11: [2023-05-10 10:10:52,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 11: [2023-05-10 10:10:52,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 22: [2023-05-10 10:10:52,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 23: [2023-05-10 10:10:52,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 22: [2023-05-10 10:10:52,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 10: [2023-05-10 10:10:52,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 22: [2023-05-10 10:10:52,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 21: [2023-05-10 10:10:52,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 21: [2023-05-10 10:10:52,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 12: [2023-05-10 10:10:52,309] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 22: [2023-05-10 10:10:52,309] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 23: [2023-05-10 10:10:52,309] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 10: [2023-05-10 10:10:52,309] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 11: [2023-05-10 10:10:52,310] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 11: [2023-05-10 10:10:52,310] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 11: [2023-05-10 10:10:52,310] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 11: [2023-05-10 10:10:52,310] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 11: [2023-05-10 10:10:52,310] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 11: [2023-05-10 10:10:52,310] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 18: [2023-05-10 10:10:52,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 18: [2023-05-10 10:10:52,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 18: [2023-05-10 10:10:52,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 2: [2023-05-10 10:10:52,311] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 2: [2023-05-10 10:10:52,311] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 2: [2023-05-10 10:10:52,311] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 16: [2023-05-10 10:10:52,312] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 2: [2023-05-10 10:10:52,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 15: [2023-05-10 10:10:52,312] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 15: [2023-05-10 10:10:52,312] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 2: [2023-05-10 10:10:52,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 11: [2023-05-10 10:10:52,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 11: [2023-05-10 10:10:52,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 2: [2023-05-10 10:10:52,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 17: [2023-05-10 10:10:52,313] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 29: [2023-05-10 10:10:52,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 18: [2023-05-10 10:10:52,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 16: [2023-05-10 10:10:52,314] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 21: [2023-05-10 10:10:52,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 21: [2023-05-10 10:10:52,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 16: [2023-05-10 10:10:52,314] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 0: [2023-05-10 10:10:52,314] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 16: [2023-05-10 10:10:52,315] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 16: [2023-05-10 10:10:52,315] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 16: [2023-05-10 10:10:52,315] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 16: [2023-05-10 10:10:52,315] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 16: [2023-05-10 10:10:52,315] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 22: [2023-05-10 10:10:52,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 0: [2023-05-10 10:10:52,316] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 0: [2023-05-10 10:10:52,316] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 14: [2023-05-10 10:10:52,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 29: [2023-05-10 10:10:52,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 11: [2023-05-10 10:10:52,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 15: [2023-05-10 10:10:52,318] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 15: [2023-05-10 10:10:52,318] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 15: [2023-05-10 10:10:52,318] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 15: [2023-05-10 10:10:52,318] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 10: [2023-05-10 10:10:52,318] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 14: [2023-05-10 10:10:52,318] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 14: [2023-05-10 10:10:52,319] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 18: [2023-05-10 10:10:52,319] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 18: [2023-05-10 10:10:52,319] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 11: [2023-05-10 10:10:52,320] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 10: [2023-05-10 10:10:52,320] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 29: [2023-05-10 10:10:52,320] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 11: [2023-05-10 10:10:52,320] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 18: [2023-05-10 10:10:52,320] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 11: [2023-05-10 10:10:52,321] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 19: [2023-05-10 10:10:52,320] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 10: [2023-05-10 10:10:52,321] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 11: [2023-05-10 10:10:52,321] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 10: [2023-05-10 10:10:52,321] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 11: [2023-05-10 10:10:52,321] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt... 10: [2023-05-10 10:10:52,322] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 18: [2023-05-10 10:10:52,322] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 14: [2023-05-10 10:10:52,322] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 29: [2023-05-10 10:10:52,322] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 29: [2023-05-10 10:10:52,322] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 29: [2023-05-10 10:10:52,322] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 14: [2023-05-10 10:10:52,322] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 21: [2023-05-10 10:10:52,322] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 19: [2023-05-10 10:10:52,323] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 7: [2023-05-10 10:10:52,323] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 23: [2023-05-10 10:10:52,325] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 7: [2023-05-10 10:10:52,326] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 7: [2023-05-10 10:10:52,326] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 7: [2023-05-10 10:10:52,326] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 21: [2023-05-10 10:10:52,326] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 21: [2023-05-10 10:10:52,326] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 21: [2023-05-10 10:10:52,326] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 15: [2023-05-10 10:10:52,326] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 10: [2023-05-10 10:10:52,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 23: [2023-05-10 10:10:52,329] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 19: [2023-05-10 10:10:52,329] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 19: [2023-05-10 10:10:52,329] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 5: [2023-05-10 10:10:52,329] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 5: [2023-05-10 10:10:52,329] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 13: [2023-05-10 10:10:52,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 23: [2023-05-10 10:10:52,330] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 13: [2023-05-10 10:10:52,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 13: [2023-05-10 10:10:52,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 27: [2023-05-10 10:10:52,331] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 21: [2023-05-10 10:10:52,331] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 27: [2023-05-10 10:10:52,331] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 7: [2023-05-10 10:10:52,332] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 0: [2023-05-10 10:10:52,332] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 7: [2023-05-10 10:10:52,332] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 21: [2023-05-10 10:10:52,332] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 23: [2023-05-10 10:10:52,333] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 23: [2023-05-10 10:10:52,333] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 17: [2023-05-10 10:10:52,334] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 5: [2023-05-10 10:10:52,333] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 7: [2023-05-10 10:10:52,334] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 5: [2023-05-10 10:10:52,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 5: [2023-05-10 10:10:52,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 5: [2023-05-10 10:10:52,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 5: [2023-05-10 10:10:52,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 5: [2023-05-10 10:10:52,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 5: [2023-05-10 10:10:52,334] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 7: [2023-05-10 10:10:52,334] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 5: [2023-05-10 10:10:52,334] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 10: [2023-05-10 10:10:52,335] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 27: [2023-05-10 10:10:52,335] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 10: [2023-05-10 10:10:52,335] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 9: [2023-05-10 10:10:52,335] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 9: [2023-05-10 10:10:52,335] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 0: [2023-05-10 10:10:52,336] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 9: [2023-05-10 10:10:52,336] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 0: [2023-05-10 10:10:52,336] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 27: [2023-05-10 10:10:52,337] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 5: [2023-05-10 10:10:52,337] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 27: [2023-05-10 10:10:52,338] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 27: [2023-05-10 10:10:52,338] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 0: [2023-05-10 10:10:52,339] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 0: [2023-05-10 10:10:52,339] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 16: [2023-05-10 10:10:52,339] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 9: [2023-05-10 10:10:52,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 9: [2023-05-10 10:10:52,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 9: [2023-05-10 10:10:52,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 9: [2023-05-10 10:10:52,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 9: [2023-05-10 10:10:52,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 30: [2023-05-10 10:10:52,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 21: [2023-05-10 10:10:52,342] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 19: [2023-05-10 10:10:52,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 13: [2023-05-10 10:10:52,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 13: [2023-05-10 10:10:52,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 13: [2023-05-10 10:10:52,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 13: [2023-05-10 10:10:52,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 13: [2023-05-10 10:10:52,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 5: [2023-05-10 10:10:52,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 5: [2023-05-10 10:10:52,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 5: [2023-05-10 10:10:52,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 15: [2023-05-10 10:10:52,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 5: [2023-05-10 10:10:52,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 5: [2023-05-10 10:10:52,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 16: [2023-05-10 10:10:52,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 21: [2023-05-10 10:10:52,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 19: [2023-05-10 10:10:52,345] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 17: [2023-05-10 10:10:52,346] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 19: [2023-05-10 10:10:52,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 17: [2023-05-10 10:10:52,346] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 19: [2023-05-10 10:10:52,346] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 19: [2023-05-10 10:10:52,346] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 19: [2023-05-10 10:10:52,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 15: [2023-05-10 10:10:52,347] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 6: [2023-05-10 10:10:52,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 6: [2023-05-10 10:10:52,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 16: [2023-05-10 10:10:52,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 16: [2023-05-10 10:10:52,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 3: [2023-05-10 10:10:52,348] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 16: [2023-05-10 10:10:52,351] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 16: [2023-05-10 10:10:52,351] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 3: [2023-05-10 10:10:52,351] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 3: [2023-05-10 10:10:52,351] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 3: [2023-05-10 10:10:52,351] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 19: [2023-05-10 10:10:52,353] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 19: [2023-05-10 10:10:52,353] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 15: [2023-05-10 10:10:52,353] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 17: [2023-05-10 10:10:52,353] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 17: [2023-05-10 10:10:52,353] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 17: [2023-05-10 10:10:52,353] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 17: [2023-05-10 10:10:52,353] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 17: [2023-05-10 10:10:52,353] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 24: [2023-05-10 10:10:52,353] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 24: [2023-05-10 10:10:52,353] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 2: [2023-05-10 10:10:52,353] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 2: [2023-05-10 10:10:52,353] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 15: [2023-05-10 10:10:52,354] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 21: [2023-05-10 10:10:52,354] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 15: [2023-05-10 10:10:52,354] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 13: [2023-05-10 10:10:52,354] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 13: [2023-05-10 10:10:52,355] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 15: [2023-05-10 10:10:52,355] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 21: [2023-05-10 10:10:52,355] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 21: [2023-05-10 10:10:52,356] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 13: [2023-05-10 10:10:52,356] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 15: [2023-05-10 10:10:52,358] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 21: [2023-05-10 10:10:52,359] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 9: [2023-05-10 10:10:52,359] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 6: [2023-05-10 10:10:52,358] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 30: [2023-05-10 10:10:52,362] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 9: [2023-05-10 10:10:52,365] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 9: [2023-05-10 10:10:52,365] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 11: [2023-05-10 10:10:52,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 11: [2023-05-10 10:10:52,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 6: [2023-05-10 10:10:52,367] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 6: [2023-05-10 10:10:52,367] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 9: [2023-05-10 10:10:52,368] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 9: [2023-05-10 10:10:52,370] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 9: [2023-05-10 10:10:52,370] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 9: [2023-05-10 10:10:52,370] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 9: [2023-05-10 10:10:52,372] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 30: [2023-05-10 10:10:52,372] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 3: [2023-05-10 10:10:52,373] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 3: [2023-05-10 10:10:52,373] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 3: [2023-05-10 10:10:52,373] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 3: [2023-05-10 10:10:52,373] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 2: [2023-05-10 10:10:52,373] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 2: [2023-05-10 10:10:52,374] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 19: [2023-05-10 10:10:52,374] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 3: [2023-05-10 10:10:52,374] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 11: [2023-05-10 10:10:52,375] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 24: [2023-05-10 10:10:52,375] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 19: [2023-05-10 10:10:52,375] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 6: [2023-05-10 10:10:52,375] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 17: [2023-05-10 10:10:52,375] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 3: [2023-05-10 10:10:52,376] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 3: [2023-05-10 10:10:52,376] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 3: [2023-05-10 10:10:52,376] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 13: [2023-05-10 10:10:52,376] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 19: [2023-05-10 10:10:52,377] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 19: [2023-05-10 10:10:52,377] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 20: [2023-05-10 10:10:52,377] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 20: [2023-05-10 10:10:52,377] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 20: [2023-05-10 10:10:52,377] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 20: [2023-05-10 10:10:52,377] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 20: [2023-05-10 10:10:52,377] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 13: [2023-05-10 10:10:52,377] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 20: [2023-05-10 10:10:52,377] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 20: [2023-05-10 10:10:52,377] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 20: [2023-05-10 10:10:52,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 24: [2023-05-10 10:10:52,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 24: [2023-05-10 10:10:52,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 24: [2023-05-10 10:10:52,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 24: [2023-05-10 10:10:52,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 24: [2023-05-10 10:10:52,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 24: [2023-05-10 10:10:52,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 24: [2023-05-10 10:10:52,378] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 13: [2023-05-10 10:10:52,378] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 13: [2023-05-10 10:10:52,379] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 30: [2023-05-10 10:10:52,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 30: [2023-05-10 10:10:52,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 13: [2023-05-10 10:10:52,380] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 17: [2023-05-10 10:10:52,381] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 20: [2023-05-10 10:10:52,381] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 5: [2023-05-10 10:10:52,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 2: [2023-05-10 10:10:52,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 2: [2023-05-10 10:10:52,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 2: [2023-05-10 10:10:52,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 6: [2023-05-10 10:10:52,382] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 2: [2023-05-10 10:10:52,383] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 2: [2023-05-10 10:10:52,383] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 6: [2023-05-10 10:10:52,384] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 6: [2023-05-10 10:10:52,384] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 6: [2023-05-10 10:10:52,384] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 6: [2023-05-10 10:10:52,384] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 20: [2023-05-10 10:10:52,385] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 17: [2023-05-10 10:10:52,386] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 11: [2023-05-10 10:10:52,388] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 11: [2023-05-10 10:10:52,388] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 2: [2023-05-10 10:10:52,388] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 20: [2023-05-10 10:10:52,388] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 20: [2023-05-10 10:10:52,388] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 20: [2023-05-10 10:10:52,389] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 20: [2023-05-10 10:10:52,389] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 20: [2023-05-10 10:10:52,389] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 30: [2023-05-10 10:10:52,389] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 30: [2023-05-10 10:10:52,389] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 30: [2023-05-10 10:10:52,389] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 20: [2023-05-10 10:10:52,390] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 17: [2023-05-10 10:10:52,390] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 17: [2023-05-10 10:10:52,390] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 5: [2023-05-10 10:10:52,391] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 5: [2023-05-10 10:10:52,391] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 17: [2023-05-10 10:10:52,392] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 17: [2023-05-10 10:10:52,392] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 30: [2023-05-10 10:10:52,393] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 11: [2023-05-10 10:10:52,394] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 11: [2023-05-10 10:10:52,395] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 11: [2023-05-10 10:10:52,397] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 11: [2023-05-10 10:10:52,398] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 11: [2023-05-10 10:10:52,398] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 3: [2023-05-10 10:10:52,400] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 11: [2023-05-10 10:10:52,400] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_13-model_00-model_states.pt. 3: [2023-05-10 10:10:52,400] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 6: [2023-05-10 10:10:52,400] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 30: [2023-05-10 10:10:52,401] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 3: [2023-05-10 10:10:52,403] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 5: [2023-05-10 10:10:52,403] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 3: [2023-05-10 10:10:52,405] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 24: [2023-05-10 10:10:52,408] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 5: [2023-05-10 10:10:52,409] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 5: [2023-05-10 10:10:52,409] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 24: [2023-05-10 10:10:52,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 24: [2023-05-10 10:10:52,410] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 5: [2023-05-10 10:10:52,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 5: [2023-05-10 10:10:52,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 5: [2023-05-10 10:10:52,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 24: [2023-05-10 10:10:52,411] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 6: [2023-05-10 10:10:52,412] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 2: [2023-05-10 10:10:52,413] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 30: [2023-05-10 10:10:52,413] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 6: [2023-05-10 10:10:52,413] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 24: [2023-05-10 10:10:52,413] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 30: [2023-05-10 10:10:52,414] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 24: [2023-05-10 10:10:52,414] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 2: [2023-05-10 10:10:52,415] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 6: [2023-05-10 10:10:52,415] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 6: [2023-05-10 10:10:52,416] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 2: [2023-05-10 10:10:52,416] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 2: [2023-05-10 10:10:52,417] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 5: [2023-05-10 10:10:52,417] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 5: [2023-05-10 10:10:52,417] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 2: [2023-05-10 10:10:52,418] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 2: [2023-05-10 10:10:52,418] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 11: [2023-05-10 10:10:52,422] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 11: [2023-05-10 10:10:52,424] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 30: [2023-05-10 10:10:52,425] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 30: [2023-05-10 10:10:52,427] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 11: [2023-05-10 10:10:52,428] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 30: [2023-05-10 10:10:52,429] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 30: [2023-05-10 10:10:52,429] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 11: [2023-05-10 10:10:52,431] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 11: [2023-05-10 10:10:52,432] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 20: [2023-05-10 10:10:52,433] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 5: [2023-05-10 10:10:52,436] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 5: [2023-05-10 10:10:52,437] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 5: [2023-05-10 10:10:52,439] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 5: [2023-05-10 10:10:52,442] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 5: [2023-05-10 10:10:52,444] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 20: [2023-05-10 10:10:52,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 1: [2023-05-10 10:10:52,449] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 1: [2023-05-10 10:10:52,449] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 1: [2023-05-10 10:10:52,449] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 1: [2023-05-10 10:10:52,449] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 1: [2023-05-10 10:10:52,449] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 1: [2023-05-10 10:10:52,449] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 1: [2023-05-10 10:10:52,449] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 1: [2023-05-10 10:10:52,449] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 20: [2023-05-10 10:10:52,452] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 1: [2023-05-10 10:10:52,455] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 1: [2023-05-10 10:10:52,456] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 1: [2023-05-10 10:10:52,456] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 1: [2023-05-10 10:10:52,457] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 1: [2023-05-10 10:10:52,457] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 1: [2023-05-10 10:10:52,458] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 1: [2023-05-10 10:10:52,458] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 1: [2023-05-10 10:10:52,458] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 20: [2023-05-10 10:10:52,463] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 20: [2023-05-10 10:10:52,467] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 20: [2023-05-10 10:10:52,472] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 20: [2023-05-10 10:10:52,477] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 20: [2023-05-10 10:10:52,477] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 20: [2023-05-10 10:10:52,477] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 20: [2023-05-10 10:10:52,477] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 20: [2023-05-10 10:10:52,492] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 20: [2023-05-10 10:10:52,497] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 4: [2023-05-10 10:10:52,505] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 4: [2023-05-10 10:10:52,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 4: [2023-05-10 10:10:52,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 4: [2023-05-10 10:10:52,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 4: [2023-05-10 10:10:52,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 4: [2023-05-10 10:10:52,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 4: [2023-05-10 10:10:52,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 4: [2023-05-10 10:10:52,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 20: [2023-05-10 10:10:52,506] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 20: [2023-05-10 10:10:52,507] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 1: [2023-05-10 10:10:52,508] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 1: [2023-05-10 10:10:52,508] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 8: [2023-05-10 10:10:52,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 8: [2023-05-10 10:10:52,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 8: [2023-05-10 10:10:52,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 8: [2023-05-10 10:10:52,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 8: [2023-05-10 10:10:52,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 8: [2023-05-10 10:10:52,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 8: [2023-05-10 10:10:52,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 8: [2023-05-10 10:10:52,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 20: [2023-05-10 10:10:52,510] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 1: [2023-05-10 10:10:52,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 20: [2023-05-10 10:10:52,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 4: [2023-05-10 10:10:52,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 4: [2023-05-10 10:10:52,512] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 4: [2023-05-10 10:10:52,512] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 4: [2023-05-10 10:10:52,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 4: [2023-05-10 10:10:52,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 4: [2023-05-10 10:10:52,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 4: [2023-05-10 10:10:52,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 4: [2023-05-10 10:10:52,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 8: [2023-05-10 10:10:52,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 8: [2023-05-10 10:10:52,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 8: [2023-05-10 10:10:52,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 8: [2023-05-10 10:10:52,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 8: [2023-05-10 10:10:52,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 8: [2023-05-10 10:10:52,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 8: [2023-05-10 10:10:52,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 1: [2023-05-10 10:10:52,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 8: [2023-05-10 10:10:52,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 1: [2023-05-10 10:10:52,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 1: [2023-05-10 10:10:52,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 1: [2023-05-10 10:10:52,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 1: [2023-05-10 10:10:52,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 25: [2023-05-10 10:10:52,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 25: [2023-05-10 10:10:52,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 25: [2023-05-10 10:10:52,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 25: [2023-05-10 10:10:52,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 25: [2023-05-10 10:10:52,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 25: [2023-05-10 10:10:52,526] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 25: [2023-05-10 10:10:52,526] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 25: [2023-05-10 10:10:52,526] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 1: [2023-05-10 10:10:52,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 25: [2023-05-10 10:10:52,531] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 25: [2023-05-10 10:10:52,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 1: [2023-05-10 10:10:52,533] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 25: [2023-05-10 10:10:52,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 25: [2023-05-10 10:10:52,533] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 25: [2023-05-10 10:10:52,533] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 25: [2023-05-10 10:10:52,533] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 25: [2023-05-10 10:10:52,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 25: [2023-05-10 10:10:52,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 1: [2023-05-10 10:10:52,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 1: [2023-05-10 10:10:52,541] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 12: [2023-05-10 10:10:52,548] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 12: [2023-05-10 10:10:52,548] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 12: [2023-05-10 10:10:52,548] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 12: [2023-05-10 10:10:52,548] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 12: [2023-05-10 10:10:52,548] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 12: [2023-05-10 10:10:52,548] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 12: [2023-05-10 10:10:52,548] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 12: [2023-05-10 10:10:52,548] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 12: [2023-05-10 10:10:52,553] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 1: [2023-05-10 10:10:52,554] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 1: [2023-05-10 10:10:52,555] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 1: [2023-05-10 10:10:52,555] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 1: [2023-05-10 10:10:52,556] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 12: [2023-05-10 10:10:52,558] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 12: [2023-05-10 10:10:52,559] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 12: [2023-05-10 10:10:52,560] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 12: [2023-05-10 10:10:52,560] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 12: [2023-05-10 10:10:52,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 12: [2023-05-10 10:10:52,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 12: [2023-05-10 10:10:52,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 8: [2023-05-10 10:10:52,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 4: [2023-05-10 10:10:52,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 4: [2023-05-10 10:10:52,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 4: [2023-05-10 10:10:52,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 4: [2023-05-10 10:10:52,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 8: [2023-05-10 10:10:52,571] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 8: [2023-05-10 10:10:52,571] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 4: [2023-05-10 10:10:52,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 8: [2023-05-10 10:10:52,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 4: [2023-05-10 10:10:52,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 4: [2023-05-10 10:10:52,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 4: [2023-05-10 10:10:52,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 14: [2023-05-10 10:10:52,583] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 14: [2023-05-10 10:10:52,583] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 14: [2023-05-10 10:10:52,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 14: [2023-05-10 10:10:52,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 14: [2023-05-10 10:10:52,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 14: [2023-05-10 10:10:52,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 14: [2023-05-10 10:10:52,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 14: [2023-05-10 10:10:52,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 8: [2023-05-10 10:10:52,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 8: [2023-05-10 10:10:52,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 8: [2023-05-10 10:10:52,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 8: [2023-05-10 10:10:52,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 25: [2023-05-10 10:10:52,589] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 25: [2023-05-10 10:10:52,589] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 25: [2023-05-10 10:10:52,589] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 14: [2023-05-10 10:10:52,590] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 14: [2023-05-10 10:10:52,590] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 14: [2023-05-10 10:10:52,591] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 14: [2023-05-10 10:10:52,592] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 14: [2023-05-10 10:10:52,592] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 14: [2023-05-10 10:10:52,593] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 8: [2023-05-10 10:10:52,593] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 8: [2023-05-10 10:10:52,594] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 14: [2023-05-10 10:10:52,594] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 14: [2023-05-10 10:10:52,594] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 25: [2023-05-10 10:10:52,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 25: [2023-05-10 10:10:52,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 25: [2023-05-10 10:10:52,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 25: [2023-05-10 10:10:52,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 25: [2023-05-10 10:10:52,595] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 4: [2023-05-10 10:10:52,597] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 4: [2023-05-10 10:10:52,600] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 26: [2023-05-10 10:10:52,600] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 26: [2023-05-10 10:10:52,600] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 8: [2023-05-10 10:10:52,601] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 8: [2023-05-10 10:10:52,602] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 4: [2023-05-10 10:10:52,603] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 4: [2023-05-10 10:10:52,603] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 26: [2023-05-10 10:10:52,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 26: [2023-05-10 10:10:52,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 26: [2023-05-10 10:10:52,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 26: [2023-05-10 10:10:52,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 26: [2023-05-10 10:10:52,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 26: [2023-05-10 10:10:52,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 12: [2023-05-10 10:10:52,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 26: [2023-05-10 10:10:52,605] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 26: [2023-05-10 10:10:52,605] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 4: [2023-05-10 10:10:52,605] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 4: [2023-05-10 10:10:52,609] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 4: [2023-05-10 10:10:52,611] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 25: [2023-05-10 10:10:52,612] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 4: [2023-05-10 10:10:52,612] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 26: [2023-05-10 10:10:52,612] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 26: [2023-05-10 10:10:52,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 26: [2023-05-10 10:10:52,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 25: [2023-05-10 10:10:52,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 8: [2023-05-10 10:10:52,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 8: [2023-05-10 10:10:52,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 8: [2023-05-10 10:10:52,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 8: [2023-05-10 10:10:52,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 26: [2023-05-10 10:10:52,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 25: [2023-05-10 10:10:52,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 26: [2023-05-10 10:10:52,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 26: [2023-05-10 10:10:52,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 29: [2023-05-10 10:10:52,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 29: [2023-05-10 10:10:52,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 29: [2023-05-10 10:10:52,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 29: [2023-05-10 10:10:52,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 29: [2023-05-10 10:10:52,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 29: [2023-05-10 10:10:52,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 29: [2023-05-10 10:10:52,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 29: [2023-05-10 10:10:52,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 25: [2023-05-10 10:10:52,623] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 25: [2023-05-10 10:10:52,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 25: [2023-05-10 10:10:52,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 29: [2023-05-10 10:10:52,627] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 29: [2023-05-10 10:10:52,627] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 12: [2023-05-10 10:10:52,627] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 31: [2023-05-10 10:10:52,628] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 31: [2023-05-10 10:10:52,629] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 31: [2023-05-10 10:10:52,629] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 31: [2023-05-10 10:10:52,629] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 31: [2023-05-10 10:10:52,629] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 31: [2023-05-10 10:10:52,629] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 31: [2023-05-10 10:10:52,629] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 17: [2023-05-10 10:10:52,629] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 25: [2023-05-10 10:10:52,629] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 25: [2023-05-10 10:10:52,629] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 17: [2023-05-10 10:10:52,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 17: [2023-05-10 10:10:52,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 31: [2023-05-10 10:10:52,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 17: [2023-05-10 10:10:52,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 17: [2023-05-10 10:10:52,631] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 17: [2023-05-10 10:10:52,631] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 17: [2023-05-10 10:10:52,631] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 17: [2023-05-10 10:10:52,631] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 29: [2023-05-10 10:10:52,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 29: [2023-05-10 10:10:52,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 29: [2023-05-10 10:10:52,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 12: [2023-05-10 10:10:52,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 12: [2023-05-10 10:10:52,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 29: [2023-05-10 10:10:52,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 29: [2023-05-10 10:10:52,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 31: [2023-05-10 10:10:52,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 29: [2023-05-10 10:10:52,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 17: [2023-05-10 10:10:52,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 31: [2023-05-10 10:10:52,634] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 23: [2023-05-10 10:10:52,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 23: [2023-05-10 10:10:52,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 23: [2023-05-10 10:10:52,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 23: [2023-05-10 10:10:52,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 23: [2023-05-10 10:10:52,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 23: [2023-05-10 10:10:52,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 23: [2023-05-10 10:10:52,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 23: [2023-05-10 10:10:52,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 21: [2023-05-10 10:10:52,636] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 21: [2023-05-10 10:10:52,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 21: [2023-05-10 10:10:52,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 21: [2023-05-10 10:10:52,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 21: [2023-05-10 10:10:52,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 21: [2023-05-10 10:10:52,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 21: [2023-05-10 10:10:52,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 21: [2023-05-10 10:10:52,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 31: [2023-05-10 10:10:52,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 31: [2023-05-10 10:10:52,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 31: [2023-05-10 10:10:52,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 31: [2023-05-10 10:10:52,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 31: [2023-05-10 10:10:52,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 31: [2023-05-10 10:10:52,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 23: [2023-05-10 10:10:52,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 23: [2023-05-10 10:10:52,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 23: [2023-05-10 10:10:52,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 23: [2023-05-10 10:10:52,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 23: [2023-05-10 10:10:52,641] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 17: [2023-05-10 10:10:52,641] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 17: [2023-05-10 10:10:52,642] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 17: [2023-05-10 10:10:52,642] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 17: [2023-05-10 10:10:52,642] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 12: [2023-05-10 10:10:52,642] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 12: [2023-05-10 10:10:52,642] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 12: [2023-05-10 10:10:52,642] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 23: [2023-05-10 10:10:52,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 17: [2023-05-10 10:10:52,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 23: [2023-05-10 10:10:52,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 12: [2023-05-10 10:10:52,643] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 12: [2023-05-10 10:10:52,643] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 23: [2023-05-10 10:10:52,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 17: [2023-05-10 10:10:52,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 21: [2023-05-10 10:10:52,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 17: [2023-05-10 10:10:52,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 21: [2023-05-10 10:10:52,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 21: [2023-05-10 10:10:52,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 21: [2023-05-10 10:10:52,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 21: [2023-05-10 10:10:52,645] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 21: [2023-05-10 10:10:52,645] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 21: [2023-05-10 10:10:52,646] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 14: [2023-05-10 10:10:52,646] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 14: [2023-05-10 10:10:52,646] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 21: [2023-05-10 10:10:52,646] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 14: [2023-05-10 10:10:52,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 14: [2023-05-10 10:10:52,650] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 26: [2023-05-10 10:10:52,650] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 26: [2023-05-10 10:10:52,650] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 27: [2023-05-10 10:10:52,653] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 27: [2023-05-10 10:10:52,653] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 27: [2023-05-10 10:10:52,653] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 27: [2023-05-10 10:10:52,653] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 27: [2023-05-10 10:10:52,653] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 27: [2023-05-10 10:10:52,653] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 27: [2023-05-10 10:10:52,653] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 27: [2023-05-10 10:10:52,653] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 12: [2023-05-10 10:10:52,657] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 27: [2023-05-10 10:10:52,658] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 14: [2023-05-10 10:10:52,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 27: [2023-05-10 10:10:52,659] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 19: [2023-05-10 10:10:52,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 19: [2023-05-10 10:10:52,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 14: [2023-05-10 10:10:52,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 12: [2023-05-10 10:10:52,660] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 19: [2023-05-10 10:10:52,661] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 14: [2023-05-10 10:10:52,661] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 14: [2023-05-10 10:10:52,661] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 19: [2023-05-10 10:10:52,661] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 19: [2023-05-10 10:10:52,661] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 19: [2023-05-10 10:10:52,661] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 19: [2023-05-10 10:10:52,661] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 19: [2023-05-10 10:10:52,662] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 27: [2023-05-10 10:10:52,662] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 27: [2023-05-10 10:10:52,663] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 27: [2023-05-10 10:10:52,663] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 27: [2023-05-10 10:10:52,663] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 27: [2023-05-10 10:10:52,664] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 19: [2023-05-10 10:10:52,665] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 27: [2023-05-10 10:10:52,665] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 19: [2023-05-10 10:10:52,665] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 19: [2023-05-10 10:10:52,666] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 19: [2023-05-10 10:10:52,667] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 19: [2023-05-10 10:10:52,670] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 19: [2023-05-10 10:10:52,670] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 19: [2023-05-10 10:10:52,670] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 19: [2023-05-10 10:10:52,671] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 14: [2023-05-10 10:10:52,673] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 26: [2023-05-10 10:10:52,674] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 26: [2023-05-10 10:10:52,675] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 14: [2023-05-10 10:10:52,675] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 14: [2023-05-10 10:10:52,676] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 12: [2023-05-10 10:10:52,676] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 14: [2023-05-10 10:10:52,676] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 12: [2023-05-10 10:10:52,677] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 29: [2023-05-10 10:10:52,677] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 29: [2023-05-10 10:10:52,677] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 12: [2023-05-10 10:10:52,678] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 26: [2023-05-10 10:10:52,678] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 12: [2023-05-10 10:10:52,679] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 12: [2023-05-10 10:10:52,680] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 5: [2023-05-10 10:10:52,680] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 5: [2023-05-10 10:10:52,680] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 7: [2023-05-10 10:10:52,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 7: [2023-05-10 10:10:52,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 5: [2023-05-10 10:10:52,682] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 7: [2023-05-10 10:10:52,682] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 7: [2023-05-10 10:10:52,682] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 7: [2023-05-10 10:10:52,682] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 5: [2023-05-10 10:10:52,682] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 5: [2023-05-10 10:10:52,683] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 7: [2023-05-10 10:10:52,682] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 7: [2023-05-10 10:10:52,682] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 5: [2023-05-10 10:10:52,683] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 5: [2023-05-10 10:10:52,683] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 7: [2023-05-10 10:10:52,683] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 5: [2023-05-10 10:10:52,683] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 17: [2023-05-10 10:10:52,682] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 10: [2023-05-10 10:10:52,683] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 10: [2023-05-10 10:10:52,683] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 10: [2023-05-10 10:10:52,684] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 26: [2023-05-10 10:10:52,684] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 26: [2023-05-10 10:10:52,684] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 26: [2023-05-10 10:10:52,684] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 10: [2023-05-10 10:10:52,684] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 10: [2023-05-10 10:10:52,684] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 10: [2023-05-10 10:10:52,684] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 10: [2023-05-10 10:10:52,684] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 10: [2023-05-10 10:10:52,684] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 31: [2023-05-10 10:10:52,684] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 26: [2023-05-10 10:10:52,685] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 31: [2023-05-10 10:10:52,685] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 7: [2023-05-10 10:10:52,686] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 5: [2023-05-10 10:10:52,686] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 5: [2023-05-10 10:10:52,686] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 14: [2023-05-10 10:10:52,687] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 7: [2023-05-10 10:10:52,687] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 26: [2023-05-10 10:10:52,687] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 14: [2023-05-10 10:10:52,688] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 7: [2023-05-10 10:10:52,688] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 5: [2023-05-10 10:10:52,688] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 7: [2023-05-10 10:10:52,688] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 6: [2023-05-10 10:10:52,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 6: [2023-05-10 10:10:52,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 10: [2023-05-10 10:10:52,690] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 15: [2023-05-10 10:10:52,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 15: [2023-05-10 10:10:52,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 15: [2023-05-10 10:10:52,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 14: [2023-05-10 10:10:52,690] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 15: [2023-05-10 10:10:52,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 15: [2023-05-10 10:10:52,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 15: [2023-05-10 10:10:52,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 15: [2023-05-10 10:10:52,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 15: [2023-05-10 10:10:52,691] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 10: [2023-05-10 10:10:52,691] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 10: [2023-05-10 10:10:52,691] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 10: [2023-05-10 10:10:52,691] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 10: [2023-05-10 10:10:52,691] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 7: [2023-05-10 10:10:52,691] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 7: [2023-05-10 10:10:52,691] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 7: [2023-05-10 10:10:52,691] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 5: [2023-05-10 10:10:52,692] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 7: [2023-05-10 10:10:52,692] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 5: [2023-05-10 10:10:52,693] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 23: [2023-05-10 10:10:52,692] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 14: [2023-05-10 10:10:52,693] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 5: [2023-05-10 10:10:52,693] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 10: [2023-05-10 10:10:52,693] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 10: [2023-05-10 10:10:52,693] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 10: [2023-05-10 10:10:52,693] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 6: [2023-05-10 10:10:52,693] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 5: [2023-05-10 10:10:52,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 6: [2023-05-10 10:10:52,694] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 5: [2023-05-10 10:10:52,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 6: [2023-05-10 10:10:52,694] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 6: [2023-05-10 10:10:52,694] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 6: [2023-05-10 10:10:52,694] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 6: [2023-05-10 10:10:52,694] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 15: [2023-05-10 10:10:52,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 6: [2023-05-10 10:10:52,695] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 6: [2023-05-10 10:10:52,695] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 29: [2023-05-10 10:10:52,696] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 23: [2023-05-10 10:10:52,696] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 23: [2023-05-10 10:10:52,696] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 23: [2023-05-10 10:10:52,696] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 21: [2023-05-10 10:10:52,698] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 21: [2023-05-10 10:10:52,698] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 29: [2023-05-10 10:10:52,699] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 29: [2023-05-10 10:10:52,699] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 21: [2023-05-10 10:10:52,699] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 15: [2023-05-10 10:10:52,701] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 15: [2023-05-10 10:10:52,701] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 21: [2023-05-10 10:10:52,701] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 15: [2023-05-10 10:10:52,702] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 15: [2023-05-10 10:10:52,702] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 15: [2023-05-10 10:10:52,703] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 6: [2023-05-10 10:10:52,703] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 17: [2023-05-10 10:10:52,703] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 15: [2023-05-10 10:10:52,703] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 15: [2023-05-10 10:10:52,703] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 23: [2023-05-10 10:10:52,703] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 6: [2023-05-10 10:10:52,704] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 26: [2023-05-10 10:10:52,704] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 6: [2023-05-10 10:10:52,704] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 9: [2023-05-10 10:10:52,705] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 9: [2023-05-10 10:10:52,705] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 6: [2023-05-10 10:10:52,705] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 29: [2023-05-10 10:10:52,705] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 29: [2023-05-10 10:10:52,705] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 29: [2023-05-10 10:10:52,705] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 29: [2023-05-10 10:10:52,705] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 29: [2023-05-10 10:10:52,706] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 6: [2023-05-10 10:10:52,706] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 31: [2023-05-10 10:10:52,706] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 6: [2023-05-10 10:10:52,706] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 31: [2023-05-10 10:10:52,707] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 31: [2023-05-10 10:10:52,707] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 9: [2023-05-10 10:10:52,708] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 9: [2023-05-10 10:10:52,708] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 9: [2023-05-10 10:10:52,708] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 9: [2023-05-10 10:10:52,708] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 9: [2023-05-10 10:10:52,708] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 9: [2023-05-10 10:10:52,708] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 21: [2023-05-10 10:10:52,709] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 31: [2023-05-10 10:10:52,710] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 31: [2023-05-10 10:10:52,710] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 31: [2023-05-10 10:10:52,710] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 31: [2023-05-10 10:10:52,710] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 31: [2023-05-10 10:10:52,710] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 9: [2023-05-10 10:10:52,711] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 28: [2023-05-10 10:10:52,711] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 28: [2023-05-10 10:10:52,711] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 9: [2023-05-10 10:10:52,711] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 28: [2023-05-10 10:10:52,711] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 28: [2023-05-10 10:10:52,711] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 28: [2023-05-10 10:10:52,711] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 28: [2023-05-10 10:10:52,711] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 28: [2023-05-10 10:10:52,711] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 26: [2023-05-10 10:10:52,711] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 28: [2023-05-10 10:10:52,711] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 27: [2023-05-10 10:10:52,711] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 27: [2023-05-10 10:10:52,712] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 21: [2023-05-10 10:10:52,713] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 21: [2023-05-10 10:10:52,713] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 17: [2023-05-10 10:10:52,713] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 17: [2023-05-10 10:10:52,713] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 9: [2023-05-10 10:10:52,714] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 9: [2023-05-10 10:10:52,714] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 21: [2023-05-10 10:10:52,714] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 28: [2023-05-10 10:10:52,715] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 28: [2023-05-10 10:10:52,715] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 23: [2023-05-10 10:10:52,716] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 26: [2023-05-10 10:10:52,716] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 9: [2023-05-10 10:10:52,716] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 9: [2023-05-10 10:10:52,716] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 9: [2023-05-10 10:10:52,717] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 9: [2023-05-10 10:10:52,717] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 19: [2023-05-10 10:10:52,717] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 26: [2023-05-10 10:10:52,718] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 23: [2023-05-10 10:10:52,718] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 23: [2023-05-10 10:10:52,718] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 23: [2023-05-10 10:10:52,718] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 26: [2023-05-10 10:10:52,719] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 29: [2023-05-10 10:10:52,721] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 28: [2023-05-10 10:10:52,721] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 23: [2023-05-10 10:10:52,721] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 23: [2023-05-10 10:10:52,721] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 19: [2023-05-10 10:10:52,721] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 19: [2023-05-10 10:10:52,721] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 19: [2023-05-10 10:10:52,721] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 17: [2023-05-10 10:10:52,721] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 28: [2023-05-10 10:10:52,721] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 28: [2023-05-10 10:10:52,721] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 28: [2023-05-10 10:10:52,721] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 17: [2023-05-10 10:10:52,722] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 17: [2023-05-10 10:10:52,722] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 17: [2023-05-10 10:10:52,722] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 26: [2023-05-10 10:10:52,722] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 17: [2023-05-10 10:10:52,723] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 28: [2023-05-10 10:10:52,723] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 28: [2023-05-10 10:10:52,723] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 21: [2023-05-10 10:10:52,723] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 23: [2023-05-10 10:10:52,724] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 21: [2023-05-10 10:10:52,725] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 21: [2023-05-10 10:10:52,728] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 21: [2023-05-10 10:10:52,729] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 20: [2023-05-10 10:10:52,729] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 20: [2023-05-10 10:10:52,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 20: [2023-05-10 10:10:52,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 20: [2023-05-10 10:10:52,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 20: [2023-05-10 10:10:52,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 20: [2023-05-10 10:10:52,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 20: [2023-05-10 10:10:52,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 20: [2023-05-10 10:10:52,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 20: [2023-05-10 10:10:52,732] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 23: [2023-05-10 10:10:52,733] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 27: [2023-05-10 10:10:52,735] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 27: [2023-05-10 10:10:52,735] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 27: [2023-05-10 10:10:52,735] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 27: [2023-05-10 10:10:52,735] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 27: [2023-05-10 10:10:52,735] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 29: [2023-05-10 10:10:52,736] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 27: [2023-05-10 10:10:52,737] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 29: [2023-05-10 10:10:52,737] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 19: [2023-05-10 10:10:52,737] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 19: [2023-05-10 10:10:52,737] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 19: [2023-05-10 10:10:52,737] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 27: [2023-05-10 10:10:52,737] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 29: [2023-05-10 10:10:52,737] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 19: [2023-05-10 10:10:52,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 27: [2023-05-10 10:10:52,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 29: [2023-05-10 10:10:52,738] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 29: [2023-05-10 10:10:52,739] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 21: [2023-05-10 10:10:52,739] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 7: [2023-05-10 10:10:52,739] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 20: [2023-05-10 10:10:52,740] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 5: [2023-05-10 10:10:52,741] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 5: [2023-05-10 10:10:52,741] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 20: [2023-05-10 10:10:52,742] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 21: [2023-05-10 10:10:52,742] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 20: [2023-05-10 10:10:52,742] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 20: [2023-05-10 10:10:52,742] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 20: [2023-05-10 10:10:52,743] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 31: [2023-05-10 10:10:52,743] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 20: [2023-05-10 10:10:52,743] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 19: [2023-05-10 10:10:52,743] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 23: [2023-05-10 10:10:52,743] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 15: [2023-05-10 10:10:52,743] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 17: [2023-05-10 10:10:52,743] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 5: [2023-05-10 10:10:52,743] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 17: [2023-05-10 10:10:52,743] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 19: [2023-05-10 10:10:52,744] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 23: [2023-05-10 10:10:52,744] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 20: [2023-05-10 10:10:52,744] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 21: [2023-05-10 10:10:52,744] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 19: [2023-05-10 10:10:52,744] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 21: [2023-05-10 10:10:52,744] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 19: [2023-05-10 10:10:52,745] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 10: [2023-05-10 10:10:52,745] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 23: [2023-05-10 10:10:52,746] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 10: [2023-05-10 10:10:52,746] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 10: [2023-05-10 10:10:52,746] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 6: [2023-05-10 10:10:52,746] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 6: [2023-05-10 10:10:52,746] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 18: [2023-05-10 10:10:52,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 18: [2023-05-10 10:10:52,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 18: [2023-05-10 10:10:52,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 18: [2023-05-10 10:10:52,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 18: [2023-05-10 10:10:52,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 18: [2023-05-10 10:10:52,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 18: [2023-05-10 10:10:52,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 18: [2023-05-10 10:10:52,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 7: [2023-05-10 10:10:52,748] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 7: [2023-05-10 10:10:52,748] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 7: [2023-05-10 10:10:52,748] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 31: [2023-05-10 10:10:52,748] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 31: [2023-05-10 10:10:52,748] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 31: [2023-05-10 10:10:52,748] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 31: [2023-05-10 10:10:52,749] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 31: [2023-05-10 10:10:52,749] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 7: [2023-05-10 10:10:52,752] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 7: [2023-05-10 10:10:52,752] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 7: [2023-05-10 10:10:52,752] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 7: [2023-05-10 10:10:52,752] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 0: [2023-05-10 10:10:52,752] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 18: [2023-05-10 10:10:52,752] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 18: [2023-05-10 10:10:52,752] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 0: [2023-05-10 10:10:52,753] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 0: [2023-05-10 10:10:52,753] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 18: [2023-05-10 10:10:52,752] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 18: [2023-05-10 10:10:52,752] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 0: [2023-05-10 10:10:52,753] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 0: [2023-05-10 10:10:52,753] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 0: [2023-05-10 10:10:52,753] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 0: [2023-05-10 10:10:52,753] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 0: [2023-05-10 10:10:52,753] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 18: [2023-05-10 10:10:52,754] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 17: [2023-05-10 10:10:52,754] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 10: [2023-05-10 10:10:52,754] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 18: [2023-05-10 10:10:52,754] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 17: [2023-05-10 10:10:52,755] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 18: [2023-05-10 10:10:52,755] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 18: [2023-05-10 10:10:52,755] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 0: [2023-05-10 10:10:52,756] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 0: [2023-05-10 10:10:52,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 17: [2023-05-10 10:10:52,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 17: [2023-05-10 10:10:52,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 17: [2023-05-10 10:10:52,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 0: [2023-05-10 10:10:52,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 5: [2023-05-10 10:10:52,758] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 5: [2023-05-10 10:10:52,758] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 0: [2023-05-10 10:10:52,759] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 10: [2023-05-10 10:10:52,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 10: [2023-05-10 10:10:52,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 10: [2023-05-10 10:10:52,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 10: [2023-05-10 10:10:52,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 0: [2023-05-10 10:10:52,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 0: [2023-05-10 10:10:52,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 0: [2023-05-10 10:10:52,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 5: [2023-05-10 10:10:52,760] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 5: [2023-05-10 10:10:52,760] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 5: [2023-05-10 10:10:52,760] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 0: [2023-05-10 10:10:52,761] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 7: [2023-05-10 10:10:52,761] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 15: [2023-05-10 10:10:52,764] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 19: [2023-05-10 10:10:52,765] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 5: [2023-05-10 10:10:52,765] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 27: [2023-05-10 10:10:52,766] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 5: [2023-05-10 10:10:52,766] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 5: [2023-05-10 10:10:52,766] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 28: [2023-05-10 10:10:52,766] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 28: [2023-05-10 10:10:52,766] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 6: [2023-05-10 10:10:52,766] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 9: [2023-05-10 10:10:52,766] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 9: [2023-05-10 10:10:52,766] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 19: [2023-05-10 10:10:52,769] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 19: [2023-05-10 10:10:52,769] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 19: [2023-05-10 10:10:52,769] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 9: [2023-05-10 10:10:52,769] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 6: [2023-05-10 10:10:52,770] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 6: [2023-05-10 10:10:52,770] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 9: [2023-05-10 10:10:52,771] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 9: [2023-05-10 10:10:52,771] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 27: [2023-05-10 10:10:52,771] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 6: [2023-05-10 10:10:52,771] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 6: [2023-05-10 10:10:52,771] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 27: [2023-05-10 10:10:52,771] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 10: [2023-05-10 10:10:52,771] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 27: [2023-05-10 10:10:52,772] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 22: [2023-05-10 10:10:52,772] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 27: [2023-05-10 10:10:52,772] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 27: [2023-05-10 10:10:52,772] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 22: [2023-05-10 10:10:52,773] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 22: [2023-05-10 10:10:52,773] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 22: [2023-05-10 10:10:52,773] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 22: [2023-05-10 10:10:52,773] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 22: [2023-05-10 10:10:52,773] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 22: [2023-05-10 10:10:52,773] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 22: [2023-05-10 10:10:52,773] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 10: [2023-05-10 10:10:52,773] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 15: [2023-05-10 10:10:52,774] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 15: [2023-05-10 10:10:52,774] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 20: [2023-05-10 10:10:52,774] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 10: [2023-05-10 10:10:52,775] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 7: [2023-05-10 10:10:52,775] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 7: [2023-05-10 10:10:52,775] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 7: [2023-05-10 10:10:52,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 22: [2023-05-10 10:10:52,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 9: [2023-05-10 10:10:52,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 10: [2023-05-10 10:10:52,779] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 22: [2023-05-10 10:10:52,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 15: [2023-05-10 10:10:52,780] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 15: [2023-05-10 10:10:52,780] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 15: [2023-05-10 10:10:52,780] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 15: [2023-05-10 10:10:52,780] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 15: [2023-05-10 10:10:52,780] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 9: [2023-05-10 10:10:52,782] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 9: [2023-05-10 10:10:52,782] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 7: [2023-05-10 10:10:52,783] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 7: [2023-05-10 10:10:52,783] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 22: [2023-05-10 10:10:52,783] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 22: [2023-05-10 10:10:52,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 22: [2023-05-10 10:10:52,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 22: [2023-05-10 10:10:52,785] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 6: [2023-05-10 10:10:52,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 6: [2023-05-10 10:10:52,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 6: [2023-05-10 10:10:52,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 22: [2023-05-10 10:10:52,785] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 22: [2023-05-10 10:10:52,786] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 3: [2023-05-10 10:10:52,787] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 3: [2023-05-10 10:10:52,787] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 7: [2023-05-10 10:10:52,787] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 7: [2023-05-10 10:10:52,787] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 11: [2023-05-10 10:10:52,787] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 11: [2023-05-10 10:10:52,787] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 28: [2023-05-10 10:10:52,787] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 28: [2023-05-10 10:10:52,787] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 5: [2023-05-10 10:10:52,787] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 11: [2023-05-10 10:10:52,787] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 11: [2023-05-10 10:10:52,787] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 11: [2023-05-10 10:10:52,787] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 30: [2023-05-10 10:10:52,787] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 11: [2023-05-10 10:10:52,788] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 11: [2023-05-10 10:10:52,788] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 11: [2023-05-10 10:10:52,788] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 5: [2023-05-10 10:10:52,788] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 30: [2023-05-10 10:10:52,788] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 10: [2023-05-10 10:10:52,788] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 30: [2023-05-10 10:10:52,788] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 10: [2023-05-10 10:10:52,788] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 30: [2023-05-10 10:10:52,788] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 30: [2023-05-10 10:10:52,788] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 30: [2023-05-10 10:10:52,789] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 30: [2023-05-10 10:10:52,789] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 3: [2023-05-10 10:10:52,789] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 3: [2023-05-10 10:10:52,789] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 30: [2023-05-10 10:10:52,789] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 16: [2023-05-10 10:10:52,789] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 3: [2023-05-10 10:10:52,789] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 3: [2023-05-10 10:10:52,789] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 3: [2023-05-10 10:10:52,789] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 16: [2023-05-10 10:10:52,789] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 3: [2023-05-10 10:10:52,789] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 16: [2023-05-10 10:10:52,790] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 16: [2023-05-10 10:10:52,790] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 16: [2023-05-10 10:10:52,790] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 16: [2023-05-10 10:10:52,790] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 16: [2023-05-10 10:10:52,790] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 16: [2023-05-10 10:10:52,790] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 10: [2023-05-10 10:10:52,791] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 5: [2023-05-10 10:10:52,791] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 10: [2023-05-10 10:10:52,791] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 11: [2023-05-10 10:10:52,791] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 28: [2023-05-10 10:10:52,791] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 28: [2023-05-10 10:10:52,791] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 11: [2023-05-10 10:10:52,792] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 3: [2023-05-10 10:10:52,792] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 28: [2023-05-10 10:10:52,792] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 3: [2023-05-10 10:10:52,792] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 5: [2023-05-10 10:10:52,792] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 30: [2023-05-10 10:10:52,792] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 28: [2023-05-10 10:10:52,792] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 5: [2023-05-10 10:10:52,792] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 28: [2023-05-10 10:10:52,792] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 28: [2023-05-10 10:10:52,792] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 9: [2023-05-10 10:10:52,793] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 6: [2023-05-10 10:10:52,793] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 3: [2023-05-10 10:10:52,793] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 16: [2023-05-10 10:10:52,793] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 11: [2023-05-10 10:10:52,794] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 20: [2023-05-10 10:10:52,794] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 16: [2023-05-10 10:10:52,795] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 11: [2023-05-10 10:10:52,796] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 9: [2023-05-10 10:10:52,796] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 6: [2023-05-10 10:10:52,796] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 11: [2023-05-10 10:10:52,797] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 3: [2023-05-10 10:10:52,797] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 3: [2023-05-10 10:10:52,798] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 11: [2023-05-10 10:10:52,798] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 3: [2023-05-10 10:10:52,798] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 16: [2023-05-10 10:10:52,798] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 16: [2023-05-10 10:10:52,798] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 3: [2023-05-10 10:10:52,799] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 11: [2023-05-10 10:10:52,799] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 11: [2023-05-10 10:10:52,799] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 16: [2023-05-10 10:10:52,799] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 3: [2023-05-10 10:10:52,799] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 9: [2023-05-10 10:10:52,799] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 16: [2023-05-10 10:10:52,799] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 9: [2023-05-10 10:10:52,799] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 16: [2023-05-10 10:10:52,800] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 6: [2023-05-10 10:10:52,800] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 30: [2023-05-10 10:10:52,800] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 30: [2023-05-10 10:10:52,800] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 30: [2023-05-10 10:10:52,801] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 30: [2023-05-10 10:10:52,801] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 16: [2023-05-10 10:10:52,801] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 30: [2023-05-10 10:10:52,801] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 1: [2023-05-10 10:10:52,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 1: [2023-05-10 10:10:52,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 1: [2023-05-10 10:10:52,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 1: [2023-05-10 10:10:52,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 1: [2023-05-10 10:10:52,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 1: [2023-05-10 10:10:52,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 1: [2023-05-10 10:10:52,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 1: [2023-05-10 10:10:52,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 9: [2023-05-10 10:10:52,803] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 30: [2023-05-10 10:10:52,802] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 30: [2023-05-10 10:10:52,803] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 4: [2023-05-10 10:10:52,804] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 4: [2023-05-10 10:10:52,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 4: [2023-05-10 10:10:52,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 4: [2023-05-10 10:10:52,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 4: [2023-05-10 10:10:52,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 4: [2023-05-10 10:10:52,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 4: [2023-05-10 10:10:52,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 4: [2023-05-10 10:10:52,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 15: [2023-05-10 10:10:52,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 18: [2023-05-10 10:10:52,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 1: [2023-05-10 10:10:52,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 1: [2023-05-10 10:10:52,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 9: [2023-05-10 10:10:52,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 15: [2023-05-10 10:10:52,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 1: [2023-05-10 10:10:52,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 1: [2023-05-10 10:10:52,810] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 0: [2023-05-10 10:10:52,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 4: [2023-05-10 10:10:52,810] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 1: [2023-05-10 10:10:52,810] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 4: [2023-05-10 10:10:52,810] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 18: [2023-05-10 10:10:52,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 18: [2023-05-10 10:10:52,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 18: [2023-05-10 10:10:52,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 1: [2023-05-10 10:10:52,811] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 4: [2023-05-10 10:10:52,811] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 4: [2023-05-10 10:10:52,811] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 4: [2023-05-10 10:10:52,811] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 1: [2023-05-10 10:10:52,812] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 4: [2023-05-10 10:10:52,812] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 1: [2023-05-10 10:10:52,812] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 4: [2023-05-10 10:10:52,812] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 9: [2023-05-10 10:10:52,813] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 9: [2023-05-10 10:10:52,813] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 4: [2023-05-10 10:10:52,813] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 20: [2023-05-10 10:10:52,813] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 20: [2023-05-10 10:10:52,813] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 6: [2023-05-10 10:10:52,815] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 18: [2023-05-10 10:10:52,816] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 18: [2023-05-10 10:10:52,816] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 18: [2023-05-10 10:10:52,816] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 18: [2023-05-10 10:10:52,816] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 6: [2023-05-10 10:10:52,816] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 25: [2023-05-10 10:10:52,816] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 25: [2023-05-10 10:10:52,816] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 25: [2023-05-10 10:10:52,816] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 25: [2023-05-10 10:10:52,816] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 25: [2023-05-10 10:10:52,816] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 25: [2023-05-10 10:10:52,816] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 25: [2023-05-10 10:10:52,816] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 15: [2023-05-10 10:10:52,816] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 25: [2023-05-10 10:10:52,816] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 6: [2023-05-10 10:10:52,817] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 15: [2023-05-10 10:10:52,818] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 15: [2023-05-10 10:10:52,819] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 15: [2023-05-10 10:10:52,819] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 15: [2023-05-10 10:10:52,819] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 0: [2023-05-10 10:10:52,819] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 0: [2023-05-10 10:10:52,820] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 0: [2023-05-10 10:10:52,820] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 22: [2023-05-10 10:10:52,821] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 25: [2023-05-10 10:10:52,823] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 25: [2023-05-10 10:10:52,823] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 25: [2023-05-10 10:10:52,824] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 20: [2023-05-10 10:10:52,824] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 20: [2023-05-10 10:10:52,824] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 20: [2023-05-10 10:10:52,824] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 20: [2023-05-10 10:10:52,824] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 25: [2023-05-10 10:10:52,824] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 25: [2023-05-10 10:10:52,825] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 25: [2023-05-10 10:10:52,825] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 25: [2023-05-10 10:10:52,825] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 25: [2023-05-10 10:10:52,825] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 28: [2023-05-10 10:10:52,825] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 28: [2023-05-10 10:10:52,826] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 18: [2023-05-10 10:10:52,826] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 28: [2023-05-10 10:10:52,828] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 28: [2023-05-10 10:10:52,828] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 20: [2023-05-10 10:10:52,830] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 28: [2023-05-10 10:10:52,831] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 28: [2023-05-10 10:10:52,831] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 0: [2023-05-10 10:10:52,831] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 0: [2023-05-10 10:10:52,833] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 0: [2023-05-10 10:10:52,833] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 0: [2023-05-10 10:10:52,833] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 0: [2023-05-10 10:10:52,833] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 22: [2023-05-10 10:10:52,833] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 16: [2023-05-10 10:10:52,838] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 18: [2023-05-10 10:10:52,838] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 18: [2023-05-10 10:10:52,838] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 18: [2023-05-10 10:10:52,838] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 20: [2023-05-10 10:10:52,838] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 30: [2023-05-10 10:10:52,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 20: [2023-05-10 10:10:52,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 8: [2023-05-10 10:10:52,842] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 8: [2023-05-10 10:10:52,842] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 8: [2023-05-10 10:10:52,842] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 8: [2023-05-10 10:10:52,842] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 8: [2023-05-10 10:10:52,842] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 8: [2023-05-10 10:10:52,842] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 8: [2023-05-10 10:10:52,842] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 8: [2023-05-10 10:10:52,843] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 0: [2023-05-10 10:10:52,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 0: [2023-05-10 10:10:52,845] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 11: [2023-05-10 10:10:52,845] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 11: [2023-05-10 10:10:52,845] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 0: [2023-05-10 10:10:52,846] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 18: [2023-05-10 10:10:52,846] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 22: [2023-05-10 10:10:52,847] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 3: [2023-05-10 10:10:52,847] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 18: [2023-05-10 10:10:52,847] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 2: [2023-05-10 10:10:52,848] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 2: [2023-05-10 10:10:52,848] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 18: [2023-05-10 10:10:52,848] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 18: [2023-05-10 10:10:52,848] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 8: [2023-05-10 10:10:52,848] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 8: [2023-05-10 10:10:52,848] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 8: [2023-05-10 10:10:52,849] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 8: [2023-05-10 10:10:52,849] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 8: [2023-05-10 10:10:52,849] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 8: [2023-05-10 10:10:52,849] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 8: [2023-05-10 10:10:52,849] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 3: [2023-05-10 10:10:52,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 3: [2023-05-10 10:10:52,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 8: [2023-05-10 10:10:52,850] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 16: [2023-05-10 10:10:52,852] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 2: [2023-05-10 10:10:52,852] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 2: [2023-05-10 10:10:52,853] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 2: [2023-05-10 10:10:52,853] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 2: [2023-05-10 10:10:52,853] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 2: [2023-05-10 10:10:52,853] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 2: [2023-05-10 10:10:52,853] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 2: [2023-05-10 10:10:52,853] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 2: [2023-05-10 10:10:52,853] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 22: [2023-05-10 10:10:52,853] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 22: [2023-05-10 10:10:52,853] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 24: [2023-05-10 10:10:52,853] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 24: [2023-05-10 10:10:52,853] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 24: [2023-05-10 10:10:52,854] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 24: [2023-05-10 10:10:52,854] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 24: [2023-05-10 10:10:52,854] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 24: [2023-05-10 10:10:52,854] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 24: [2023-05-10 10:10:52,854] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 24: [2023-05-10 10:10:52,854] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 11: [2023-05-10 10:10:52,855] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 20: [2023-05-10 10:10:52,856] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 22: [2023-05-10 10:10:52,856] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 22: [2023-05-10 10:10:52,856] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 22: [2023-05-10 10:10:52,856] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 22: [2023-05-10 10:10:52,857] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 20: [2023-05-10 10:10:52,858] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 24: [2023-05-10 10:10:52,858] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 22: [2023-05-10 10:10:52,858] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 24: [2023-05-10 10:10:52,858] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 20: [2023-05-10 10:10:52,859] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 13: [2023-05-10 10:10:52,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 13: [2023-05-10 10:10:52,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 13: [2023-05-10 10:10:52,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 13: [2023-05-10 10:10:52,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 13: [2023-05-10 10:10:52,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 13: [2023-05-10 10:10:52,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 13: [2023-05-10 10:10:52,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 13: [2023-05-10 10:10:52,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 11: [2023-05-10 10:10:52,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 16: [2023-05-10 10:10:52,860] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 20: [2023-05-10 10:10:52,860] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 11: [2023-05-10 10:10:52,861] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 0: [2023-05-10 10:10:52,861] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 24: [2023-05-10 10:10:52,861] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 30: [2023-05-10 10:10:52,862] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 1: [2023-05-10 10:10:52,862] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 1: [2023-05-10 10:10:52,862] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 24: [2023-05-10 10:10:52,862] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 24: [2023-05-10 10:10:52,862] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 24: [2023-05-10 10:10:52,862] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 20: [2023-05-10 10:10:52,862] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 24: [2023-05-10 10:10:52,862] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 24: [2023-05-10 10:10:52,863] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 2: [2023-05-10 10:10:52,863] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 2: [2023-05-10 10:10:52,863] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 3: [2023-05-10 10:10:52,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 3: [2023-05-10 10:10:52,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 3: [2023-05-10 10:10:52,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 3: [2023-05-10 10:10:52,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 0: [2023-05-10 10:10:52,863] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 3: [2023-05-10 10:10:52,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 0: [2023-05-10 10:10:52,864] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 0: [2023-05-10 10:10:52,864] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 2: [2023-05-10 10:10:52,864] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 2: [2023-05-10 10:10:52,864] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 2: [2023-05-10 10:10:52,865] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 2: [2023-05-10 10:10:52,865] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 1: [2023-05-10 10:10:52,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 4: [2023-05-10 10:10:52,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 4: [2023-05-10 10:10:52,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 4: [2023-05-10 10:10:52,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 4: [2023-05-10 10:10:52,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 11: [2023-05-10 10:10:52,868] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 11: [2023-05-10 10:10:52,868] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 3: [2023-05-10 10:10:52,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 13: [2023-05-10 10:10:52,871] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 12: [2023-05-10 10:10:52,871] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 16: [2023-05-10 10:10:52,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 13: [2023-05-10 10:10:52,872] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 12: [2023-05-10 10:10:52,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 16: [2023-05-10 10:10:52,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 16: [2023-05-10 10:10:52,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 16: [2023-05-10 10:10:52,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 16: [2023-05-10 10:10:52,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 12: [2023-05-10 10:10:52,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 16: [2023-05-10 10:10:52,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 12: [2023-05-10 10:10:52,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 12: [2023-05-10 10:10:52,873] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 12: [2023-05-10 10:10:52,873] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 12: [2023-05-10 10:10:52,873] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 4: [2023-05-10 10:10:52,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 4: [2023-05-10 10:10:52,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 4: [2023-05-10 10:10:52,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 4: [2023-05-10 10:10:52,873] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 3: [2023-05-10 10:10:52,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 3: [2023-05-10 10:10:52,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 12: [2023-05-10 10:10:52,873] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 13: [2023-05-10 10:10:52,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 13: [2023-05-10 10:10:52,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 30: [2023-05-10 10:10:52,873] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 30: [2023-05-10 10:10:52,873] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 13: [2023-05-10 10:10:52,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 13: [2023-05-10 10:10:52,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 13: [2023-05-10 10:10:52,875] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 16: [2023-05-10 10:10:52,875] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 13: [2023-05-10 10:10:52,876] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt... 12: [2023-05-10 10:10:52,876] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 14: [2023-05-10 10:10:52,876] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 14: [2023-05-10 10:10:52,876] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 1: [2023-05-10 10:10:52,877] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 1: [2023-05-10 10:10:52,877] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 1: [2023-05-10 10:10:52,877] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 1: [2023-05-10 10:10:52,877] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 1: [2023-05-10 10:10:52,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 14: [2023-05-10 10:10:52,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 14: [2023-05-10 10:10:52,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 14: [2023-05-10 10:10:52,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 14: [2023-05-10 10:10:52,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 14: [2023-05-10 10:10:52,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 14: [2023-05-10 10:10:52,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 30: [2023-05-10 10:10:52,880] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 30: [2023-05-10 10:10:52,880] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 30: [2023-05-10 10:10:52,880] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 11: [2023-05-10 10:10:52,879] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 22: [2023-05-10 10:10:52,880] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 25: [2023-05-10 10:10:52,880] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 30: [2023-05-10 10:10:52,881] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 26: [2023-05-10 10:10:52,880] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 26: [2023-05-10 10:10:52,881] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 26: [2023-05-10 10:10:52,881] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 26: [2023-05-10 10:10:52,881] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 26: [2023-05-10 10:10:52,881] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 26: [2023-05-10 10:10:52,881] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 26: [2023-05-10 10:10:52,881] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 11: [2023-05-10 10:10:52,881] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 26: [2023-05-10 10:10:52,881] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 14: [2023-05-10 10:10:52,882] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 22: [2023-05-10 10:10:52,882] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 25: [2023-05-10 10:10:52,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 25: [2023-05-10 10:10:52,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 25: [2023-05-10 10:10:52,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 11: [2023-05-10 10:10:52,884] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 12: [2023-05-10 10:10:52,884] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 12: [2023-05-10 10:10:52,884] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 12: [2023-05-10 10:10:52,885] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 14: [2023-05-10 10:10:52,884] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 14: [2023-05-10 10:10:52,885] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 14: [2023-05-10 10:10:52,885] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 25: [2023-05-10 10:10:52,885] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 12: [2023-05-10 10:10:52,885] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 14: [2023-05-10 10:10:52,885] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 12: [2023-05-10 10:10:52,885] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 1: [2023-05-10 10:10:52,886] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 25: [2023-05-10 10:10:52,885] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 25: [2023-05-10 10:10:52,885] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 25: [2023-05-10 10:10:52,885] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 12: [2023-05-10 10:10:52,886] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 26: [2023-05-10 10:10:52,886] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 26: [2023-05-10 10:10:52,886] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 14: [2023-05-10 10:10:52,886] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 22: [2023-05-10 10:10:52,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 12: [2023-05-10 10:10:52,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 22: [2023-05-10 10:10:52,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 14: [2023-05-10 10:10:52,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 26: [2023-05-10 10:10:52,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 14: [2023-05-10 10:10:52,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 1: [2023-05-10 10:10:52,889] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 11: [2023-05-10 10:10:52,890] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 11: [2023-05-10 10:10:52,890] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 11: [2023-05-10 10:10:52,891] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 22: [2023-05-10 10:10:52,891] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 22: [2023-05-10 10:10:52,891] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 26: [2023-05-10 10:10:52,891] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 26: [2023-05-10 10:10:52,892] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 26: [2023-05-10 10:10:52,892] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 1: [2023-05-10 10:10:52,893] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 3: [2023-05-10 10:10:52,893] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 26: [2023-05-10 10:10:52,893] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 26: [2023-05-10 10:10:52,894] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 30: [2023-05-10 10:10:52,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 3: [2023-05-10 10:10:52,894] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 3: [2023-05-10 10:10:52,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 3: [2023-05-10 10:10:52,896] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 3: [2023-05-10 10:10:52,896] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 8: [2023-05-10 10:10:52,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 30: [2023-05-10 10:10:52,903] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 30: [2023-05-10 10:10:52,903] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 2: [2023-05-10 10:10:52,903] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 2: [2023-05-10 10:10:52,903] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 16: [2023-05-10 10:10:52,904] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 16: [2023-05-10 10:10:52,904] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 8: [2023-05-10 10:10:52,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 8: [2023-05-10 10:10:52,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 11: [2023-05-10 10:10:52,906] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 1: [2023-05-10 10:10:52,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 1: [2023-05-10 10:10:52,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 16: [2023-05-10 10:10:52,909] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 16: [2023-05-10 10:10:52,909] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 24: [2023-05-10 10:10:52,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 25: [2023-05-10 10:10:52,910] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 24: [2023-05-10 10:10:52,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 16: [2023-05-10 10:10:52,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 11: [2023-05-10 10:10:52,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 16: [2023-05-10 10:10:52,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 1: [2023-05-10 10:10:52,912] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 1: [2023-05-10 10:10:52,912] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 30: [2023-05-10 10:10:52,912] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 25: [2023-05-10 10:10:52,912] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 1: [2023-05-10 10:10:52,913] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 4: [2023-05-10 10:10:52,894] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 4: [2023-05-10 10:10:52,894] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 4: [2023-05-10 10:10:52,901] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 4: [2023-05-10 10:10:52,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 4: [2023-05-10 10:10:52,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 4: [2023-05-10 10:10:52,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 4: [2023-05-10 10:10:52,903] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 4: [2023-05-10 10:10:52,904] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 25: [2023-05-10 10:10:52,915] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 8: [2023-05-10 10:10:52,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 8: [2023-05-10 10:10:52,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 8: [2023-05-10 10:10:52,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 8: [2023-05-10 10:10:52,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 25: [2023-05-10 10:10:52,916] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 25: [2023-05-10 10:10:52,916] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 30: [2023-05-10 10:10:52,916] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 8: [2023-05-10 10:10:52,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 30: [2023-05-10 10:10:52,917] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 11: [2023-05-10 10:10:52,917] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 25: [2023-05-10 10:10:52,917] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 25: [2023-05-10 10:10:52,917] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 25: [2023-05-10 10:10:52,917] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 30: [2023-05-10 10:10:52,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 2: [2023-05-10 10:10:52,923] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 2: [2023-05-10 10:10:52,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 30: [2023-05-10 10:10:52,925] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 8: [2023-05-10 10:10:52,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 12: [2023-05-10 10:10:52,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 17: [2023-05-10 10:10:52,932] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 24: [2023-05-10 10:10:52,932] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 2: [2023-05-10 10:10:52,932] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 2: [2023-05-10 10:10:52,932] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 17: [2023-05-10 10:10:52,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 17: [2023-05-10 10:10:52,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 24: [2023-05-10 10:10:52,933] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 17: [2023-05-10 10:10:52,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 17: [2023-05-10 10:10:52,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 17: [2023-05-10 10:10:52,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 17: [2023-05-10 10:10:52,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 24: [2023-05-10 10:10:52,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 24: [2023-05-10 10:10:52,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 24: [2023-05-10 10:10:52,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 24: [2023-05-10 10:10:52,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 17: [2023-05-10 10:10:52,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 24: [2023-05-10 10:10:52,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 24: [2023-05-10 10:10:52,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 8: [2023-05-10 10:10:52,933] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 2: [2023-05-10 10:10:52,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 2: [2023-05-10 10:10:52,936] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 2: [2023-05-10 10:10:52,936] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 2: [2023-05-10 10:10:52,936] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 17: [2023-05-10 10:10:52,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 14: [2023-05-10 10:10:52,936] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 8: [2023-05-10 10:10:52,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 14: [2023-05-10 10:10:52,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 14: [2023-05-10 10:10:52,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 14: [2023-05-10 10:10:52,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 26: [2023-05-10 10:10:52,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 26: [2023-05-10 10:10:52,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 17: [2023-05-10 10:10:52,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 8: [2023-05-10 10:10:52,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 8: [2023-05-10 10:10:52,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 17: [2023-05-10 10:10:52,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 17: [2023-05-10 10:10:52,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 8: [2023-05-10 10:10:52,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 17: [2023-05-10 10:10:52,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 17: [2023-05-10 10:10:52,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 26: [2023-05-10 10:10:52,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 17: [2023-05-10 10:10:52,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 8: [2023-05-10 10:10:52,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 17: [2023-05-10 10:10:52,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 8: [2023-05-10 10:10:52,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 12: [2023-05-10 10:10:52,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 13: [2023-05-10 10:10:52,951] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 13: [2023-05-10 10:10:52,951] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 13: [2023-05-10 10:10:52,951] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 31: [2023-05-10 10:10:52,952] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 31: [2023-05-10 10:10:52,952] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 31: [2023-05-10 10:10:52,953] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 31: [2023-05-10 10:10:52,953] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 31: [2023-05-10 10:10:52,953] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 31: [2023-05-10 10:10:52,953] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 31: [2023-05-10 10:10:52,953] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 14: [2023-05-10 10:10:52,953] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 14: [2023-05-10 10:10:52,953] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 13: [2023-05-10 10:10:52,953] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 14: [2023-05-10 10:10:52,953] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 31: [2023-05-10 10:10:52,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 14: [2023-05-10 10:10:52,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 31: [2023-05-10 10:10:52,957] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 12: [2023-05-10 10:10:52,957] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 12: [2023-05-10 10:10:52,957] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 31: [2023-05-10 10:10:52,958] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 31: [2023-05-10 10:10:52,961] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 31: [2023-05-10 10:10:52,961] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 13: [2023-05-10 10:10:52,961] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 26: [2023-05-10 10:10:52,962] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 24: [2023-05-10 10:10:52,962] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 13: [2023-05-10 10:10:52,962] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 31: [2023-05-10 10:10:52,963] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 2: [2023-05-10 10:10:52,963] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 31: [2023-05-10 10:10:52,963] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 31: [2023-05-10 10:10:52,963] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 31: [2023-05-10 10:10:52,963] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 14: [2023-05-10 10:10:52,964] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 24: [2023-05-10 10:10:52,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 2: [2023-05-10 10:10:52,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 26: [2023-05-10 10:10:52,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 26: [2023-05-10 10:10:52,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 26: [2023-05-10 10:10:52,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 26: [2023-05-10 10:10:52,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 14: [2023-05-10 10:10:52,966] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 24: [2023-05-10 10:10:52,966] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 24: [2023-05-10 10:10:52,966] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 2: [2023-05-10 10:10:52,967] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 14: [2023-05-10 10:10:52,967] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 14: [2023-05-10 10:10:52,967] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 24: [2023-05-10 10:10:52,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 2: [2023-05-10 10:10:52,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 26: [2023-05-10 10:10:52,968] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 24: [2023-05-10 10:10:52,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 2: [2023-05-10 10:10:52,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 12: [2023-05-10 10:10:52,968] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 12: [2023-05-10 10:10:52,968] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 12: [2023-05-10 10:10:52,968] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 12: [2023-05-10 10:10:52,968] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 12: [2023-05-10 10:10:52,968] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 27: [2023-05-10 10:10:52,968] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 27: [2023-05-10 10:10:52,968] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 27: [2023-05-10 10:10:52,968] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 27: [2023-05-10 10:10:52,968] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 27: [2023-05-10 10:10:52,968] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 27: [2023-05-10 10:10:52,968] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 27: [2023-05-10 10:10:52,968] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 27: [2023-05-10 10:10:52,968] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 26: [2023-05-10 10:10:52,970] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 26: [2023-05-10 10:10:52,970] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 2: [2023-05-10 10:10:52,971] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 10: [2023-05-10 10:10:52,973] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 10: [2023-05-10 10:10:52,973] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 10: [2023-05-10 10:10:52,973] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 10: [2023-05-10 10:10:52,973] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 10: [2023-05-10 10:10:52,973] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 10: [2023-05-10 10:10:52,973] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 10: [2023-05-10 10:10:52,973] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 10: [2023-05-10 10:10:52,974] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 27: [2023-05-10 10:10:52,974] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 27: [2023-05-10 10:10:52,974] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 13: [2023-05-10 10:10:52,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 13: [2023-05-10 10:10:52,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_14-model_00-model_states.pt. 27: [2023-05-10 10:10:52,977] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 27: [2023-05-10 10:10:52,977] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 27: [2023-05-10 10:10:52,978] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 27: [2023-05-10 10:10:52,978] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 27: [2023-05-10 10:10:52,978] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 27: [2023-05-10 10:10:52,979] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 10: [2023-05-10 10:10:52,979] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 10: [2023-05-10 10:10:52,980] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 10: [2023-05-10 10:10:52,980] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 12: [2023-05-10 10:10:52,981] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 14: [2023-05-10 10:10:52,981] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 10: [2023-05-10 10:10:52,981] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 10: [2023-05-10 10:10:52,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 14: [2023-05-10 10:10:52,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 10: [2023-05-10 10:10:52,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 10: [2023-05-10 10:10:52,983] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 12: [2023-05-10 10:10:52,983] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 10: [2023-05-10 10:10:52,983] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 14: [2023-05-10 10:10:52,984] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 13: [2023-05-10 10:10:52,985] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 14: [2023-05-10 10:10:52,985] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 13: [2023-05-10 10:10:52,985] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 13: [2023-05-10 10:10:52,985] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 17: [2023-05-10 10:10:52,985] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 13: [2023-05-10 10:10:52,987] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 26: [2023-05-10 10:10:52,997] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 26: [2023-05-10 10:10:52,997] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 13: [2023-05-10 10:10:52,998] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 13: [2023-05-10 10:10:52,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 26: [2023-05-10 10:10:52,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 26: [2023-05-10 10:10:53,001] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 26: [2023-05-10 10:10:53,001] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 12: [2023-05-10 10:10:53,001] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 12: [2023-05-10 10:10:53,001] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 12: [2023-05-10 10:10:53,003] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 12: [2023-05-10 10:10:53,003] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 12: [2023-05-10 10:10:53,004] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 19: [2023-05-10 10:10:53,004] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 19: [2023-05-10 10:10:53,004] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 19: [2023-05-10 10:10:53,005] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 19: [2023-05-10 10:10:53,005] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 19: [2023-05-10 10:10:53,005] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 19: [2023-05-10 10:10:53,005] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 19: [2023-05-10 10:10:53,005] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 19: [2023-05-10 10:10:53,005] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 13: [2023-05-10 10:10:53,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 13: [2023-05-10 10:10:53,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 17: [2023-05-10 10:10:53,008] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 19: [2023-05-10 10:10:53,008] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 19: [2023-05-10 10:10:53,009] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 31: [2023-05-10 10:10:53,009] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 19: [2023-05-10 10:10:53,010] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 31: [2023-05-10 10:10:53,011] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 19: [2023-05-10 10:10:53,011] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 19: [2023-05-10 10:10:53,013] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 19: [2023-05-10 10:10:53,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 19: [2023-05-10 10:10:53,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 19: [2023-05-10 10:10:53,015] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 17: [2023-05-10 10:10:53,017] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 17: [2023-05-10 10:10:53,017] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 23: [2023-05-10 10:10:53,022] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 23: [2023-05-10 10:10:53,022] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 23: [2023-05-10 10:10:53,022] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 23: [2023-05-10 10:10:53,022] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 23: [2023-05-10 10:10:53,022] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 23: [2023-05-10 10:10:53,022] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 23: [2023-05-10 10:10:53,022] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 23: [2023-05-10 10:10:53,022] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 17: [2023-05-10 10:10:53,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 17: [2023-05-10 10:10:53,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 17: [2023-05-10 10:10:53,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 17: [2023-05-10 10:10:53,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 17: [2023-05-10 10:10:53,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 31: [2023-05-10 10:10:53,026] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 31: [2023-05-10 10:10:53,026] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 7: [2023-05-10 10:10:53,026] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 7: [2023-05-10 10:10:53,026] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 7: [2023-05-10 10:10:53,026] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 7: [2023-05-10 10:10:53,026] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 7: [2023-05-10 10:10:53,026] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 7: [2023-05-10 10:10:53,026] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 7: [2023-05-10 10:10:53,026] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 7: [2023-05-10 10:10:53,027] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 23: [2023-05-10 10:10:53,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 23: [2023-05-10 10:10:53,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 23: [2023-05-10 10:10:53,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 27: [2023-05-10 10:10:53,027] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 27: [2023-05-10 10:10:53,029] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 6: [2023-05-10 10:10:53,029] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 6: [2023-05-10 10:10:53,029] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 6: [2023-05-10 10:10:53,029] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 6: [2023-05-10 10:10:53,029] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 6: [2023-05-10 10:10:53,029] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 6: [2023-05-10 10:10:53,029] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 6: [2023-05-10 10:10:53,029] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 6: [2023-05-10 10:10:53,030] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 23: [2023-05-10 10:10:53,030] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 7: [2023-05-10 10:10:53,030] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 23: [2023-05-10 10:10:53,031] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 23: [2023-05-10 10:10:53,031] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 15: [2023-05-10 10:10:53,031] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 23: [2023-05-10 10:10:53,031] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 15: [2023-05-10 10:10:53,031] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 23: [2023-05-10 10:10:53,031] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 15: [2023-05-10 10:10:53,031] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 7: [2023-05-10 10:10:53,032] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 7: [2023-05-10 10:10:53,032] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 31: [2023-05-10 10:10:53,032] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 7: [2023-05-10 10:10:53,032] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 15: [2023-05-10 10:10:53,032] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 15: [2023-05-10 10:10:53,032] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 15: [2023-05-10 10:10:53,032] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 15: [2023-05-10 10:10:53,032] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 7: [2023-05-10 10:10:53,032] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 15: [2023-05-10 10:10:53,032] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 31: [2023-05-10 10:10:53,033] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 28: [2023-05-10 10:10:53,033] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 28: [2023-05-10 10:10:53,033] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 31: [2023-05-10 10:10:53,033] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 28: [2023-05-10 10:10:53,033] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 28: [2023-05-10 10:10:53,033] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 9: [2023-05-10 10:10:53,033] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 9: [2023-05-10 10:10:53,033] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 7: [2023-05-10 10:10:53,033] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 28: [2023-05-10 10:10:53,033] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 28: [2023-05-10 10:10:53,033] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 28: [2023-05-10 10:10:53,033] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 9: [2023-05-10 10:10:53,033] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 9: [2023-05-10 10:10:53,034] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 28: [2023-05-10 10:10:53,034] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 6: [2023-05-10 10:10:53,034] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 9: [2023-05-10 10:10:53,034] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 9: [2023-05-10 10:10:53,034] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 9: [2023-05-10 10:10:53,034] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 6: [2023-05-10 10:10:53,034] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 9: [2023-05-10 10:10:53,034] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 7: [2023-05-10 10:10:53,035] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 7: [2023-05-10 10:10:53,035] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 10: [2023-05-10 10:10:53,034] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 10: [2023-05-10 10:10:53,035] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 10: [2023-05-10 10:10:53,035] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 15: [2023-05-10 10:10:53,035] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 6: [2023-05-10 10:10:53,037] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 28: [2023-05-10 10:10:53,037] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 28: [2023-05-10 10:10:53,037] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 9: [2023-05-10 10:10:53,039] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 9: [2023-05-10 10:10:53,039] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 6: [2023-05-10 10:10:53,039] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 9: [2023-05-10 10:10:53,040] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 6: [2023-05-10 10:10:53,040] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 9: [2023-05-10 10:10:53,040] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 6: [2023-05-10 10:10:53,040] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 6: [2023-05-10 10:10:53,041] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 6: [2023-05-10 10:10:53,041] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 9: [2023-05-10 10:10:53,042] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 10: [2023-05-10 10:10:53,042] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 9: [2023-05-10 10:10:53,042] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 9: [2023-05-10 10:10:53,042] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 9: [2023-05-10 10:10:53,043] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 15: [2023-05-10 10:10:53,043] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 15: [2023-05-10 10:10:53,043] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 15: [2023-05-10 10:10:53,043] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 15: [2023-05-10 10:10:53,044] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 28: [2023-05-10 10:10:53,044] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 28: [2023-05-10 10:10:53,044] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 15: [2023-05-10 10:10:53,044] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 31: [2023-05-10 10:10:53,044] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 31: [2023-05-10 10:10:53,044] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 31: [2023-05-10 10:10:53,044] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 28: [2023-05-10 10:10:53,044] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 28: [2023-05-10 10:10:53,044] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 15: [2023-05-10 10:10:53,044] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 28: [2023-05-10 10:10:53,045] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 15: [2023-05-10 10:10:53,045] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 17: [2023-05-10 10:10:53,046] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 28: [2023-05-10 10:10:53,046] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 17: [2023-05-10 10:10:53,046] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 27: [2023-05-10 10:10:53,046] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 27: [2023-05-10 10:10:53,050] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 27: [2023-05-10 10:10:53,050] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 27: [2023-05-10 10:10:53,050] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 27: [2023-05-10 10:10:53,050] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 27: [2023-05-10 10:10:53,050] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 27: [2023-05-10 10:10:53,050] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 10: [2023-05-10 10:10:53,051] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 10: [2023-05-10 10:10:53,051] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 10: [2023-05-10 10:10:53,051] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 10: [2023-05-10 10:10:53,051] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 31: [2023-05-10 10:10:53,052] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 31: [2023-05-10 10:10:53,053] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 27: [2023-05-10 10:10:53,053] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 17: [2023-05-10 10:10:53,055] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 19: [2023-05-10 10:10:53,056] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 17: [2023-05-10 10:10:53,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 31: [2023-05-10 10:10:53,060] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 10: [2023-05-10 10:10:53,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 17: [2023-05-10 10:10:53,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 17: [2023-05-10 10:10:53,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 17: [2023-05-10 10:10:53,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 19: [2023-05-10 10:10:53,063] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 10: [2023-05-10 10:10:53,063] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 10: [2023-05-10 10:10:53,063] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 19: [2023-05-10 10:10:53,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 19: [2023-05-10 10:10:53,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 10: [2023-05-10 10:10:53,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 31: [2023-05-10 10:10:53,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 31: [2023-05-10 10:10:53,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 31: [2023-05-10 10:10:53,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 29: [2023-05-10 10:10:53,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 29: [2023-05-10 10:10:53,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 29: [2023-05-10 10:10:53,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 29: [2023-05-10 10:10:53,077] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 29: [2023-05-10 10:10:53,077] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 29: [2023-05-10 10:10:53,077] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 29: [2023-05-10 10:10:53,077] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 29: [2023-05-10 10:10:53,077] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 19: [2023-05-10 10:10:53,080] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 29: [2023-05-10 10:10:53,080] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 27: [2023-05-10 10:10:53,080] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 29: [2023-05-10 10:10:53,081] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 23: [2023-05-10 10:10:53,081] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 23: [2023-05-10 10:10:53,081] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 23: [2023-05-10 10:10:53,081] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 27: [2023-05-10 10:10:53,083] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 0: [2023-05-10 10:10:53,083] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 0: [2023-05-10 10:10:53,083] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 0: [2023-05-10 10:10:53,083] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 7: [2023-05-10 10:10:53,083] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 19: [2023-05-10 10:10:53,083] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 0: [2023-05-10 10:10:53,083] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 0: [2023-05-10 10:10:53,083] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 0: [2023-05-10 10:10:53,083] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 0: [2023-05-10 10:10:53,083] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 10: [2023-05-10 10:10:53,083] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 0: [2023-05-10 10:10:53,083] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 10: [2023-05-10 10:10:53,083] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 10: [2023-05-10 10:10:53,084] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 10: [2023-05-10 10:10:53,084] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 7: [2023-05-10 10:10:53,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 19: [2023-05-10 10:10:53,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 19: [2023-05-10 10:10:53,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 19: [2023-05-10 10:10:53,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 15: [2023-05-10 10:10:53,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 27: [2023-05-10 10:10:53,085] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 6: [2023-05-10 10:10:53,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 6: [2023-05-10 10:10:53,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 29: [2023-05-10 10:10:53,085] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 29: [2023-05-10 10:10:53,086] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 29: [2023-05-10 10:10:53,086] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 29: [2023-05-10 10:10:53,086] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 19: [2023-05-10 10:10:53,086] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 29: [2023-05-10 10:10:53,086] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 9: [2023-05-10 10:10:53,086] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 23: [2023-05-10 10:10:53,087] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 29: [2023-05-10 10:10:53,087] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 7: [2023-05-10 10:10:53,087] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 0: [2023-05-10 10:10:53,088] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 0: [2023-05-10 10:10:53,088] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 27: [2023-05-10 10:10:53,089] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 0: [2023-05-10 10:10:53,089] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 28: [2023-05-10 10:10:53,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 28: [2023-05-10 10:10:53,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 27: [2023-05-10 10:10:53,089] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 27: [2023-05-10 10:10:53,089] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 0: [2023-05-10 10:10:53,090] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 0: [2023-05-10 10:10:53,090] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 0: [2023-05-10 10:10:53,090] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 0: [2023-05-10 10:10:53,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 0: [2023-05-10 10:10:53,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 19: [2023-05-10 10:10:53,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 18: [2023-05-10 10:10:53,092] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 18: [2023-05-10 10:10:53,092] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 18: [2023-05-10 10:10:53,093] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 18: [2023-05-10 10:10:53,093] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 18: [2023-05-10 10:10:53,093] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 18: [2023-05-10 10:10:53,093] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 18: [2023-05-10 10:10:53,093] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 23: [2023-05-10 10:10:53,094] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 18: [2023-05-10 10:10:53,093] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 19: [2023-05-10 10:10:53,094] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 6: [2023-05-10 10:10:53,096] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 7: [2023-05-10 10:10:53,096] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 7: [2023-05-10 10:10:53,096] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 7: [2023-05-10 10:10:53,096] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 18: [2023-05-10 10:10:53,097] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 18: [2023-05-10 10:10:53,097] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 18: [2023-05-10 10:10:53,098] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 18: [2023-05-10 10:10:53,098] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 23: [2023-05-10 10:10:53,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 23: [2023-05-10 10:10:53,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 23: [2023-05-10 10:10:53,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 18: [2023-05-10 10:10:53,100] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 9: [2023-05-10 10:10:53,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 9: [2023-05-10 10:10:53,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 9: [2023-05-10 10:10:53,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 18: [2023-05-10 10:10:53,100] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 7: [2023-05-10 10:10:53,101] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 7: [2023-05-10 10:10:53,101] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 16: [2023-05-10 10:10:53,101] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 16: [2023-05-10 10:10:53,101] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 9: [2023-05-10 10:10:53,102] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 16: [2023-05-10 10:10:53,102] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 16: [2023-05-10 10:10:53,102] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 16: [2023-05-10 10:10:53,102] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 16: [2023-05-10 10:10:53,102] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 18: [2023-05-10 10:10:53,101] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 16: [2023-05-10 10:10:53,102] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 18: [2023-05-10 10:10:53,102] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 16: [2023-05-10 10:10:53,102] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 9: [2023-05-10 10:10:53,103] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 9: [2023-05-10 10:10:53,104] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 9: [2023-05-10 10:10:53,104] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 23: [2023-05-10 10:10:53,104] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 15: [2023-05-10 10:10:53,105] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 6: [2023-05-10 10:10:53,105] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 23: [2023-05-10 10:10:53,105] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 6: [2023-05-10 10:10:53,106] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 23: [2023-05-10 10:10:53,106] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 7: [2023-05-10 10:10:53,106] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 11: [2023-05-10 10:10:53,106] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 11: [2023-05-10 10:10:53,106] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 9: [2023-05-10 10:10:53,107] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 16: [2023-05-10 10:10:53,107] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 16: [2023-05-10 10:10:53,108] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 16: [2023-05-10 10:10:53,110] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 19: [2023-05-10 10:10:53,109] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 28: [2023-05-10 10:10:53,109] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 28: [2023-05-10 10:10:53,109] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 11: [2023-05-10 10:10:53,110] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 11: [2023-05-10 10:10:53,110] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 11: [2023-05-10 10:10:53,110] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 11: [2023-05-10 10:10:53,110] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 11: [2023-05-10 10:10:53,110] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 11: [2023-05-10 10:10:53,110] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 16: [2023-05-10 10:10:53,110] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 16: [2023-05-10 10:10:53,111] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 6: [2023-05-10 10:10:53,110] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 11: [2023-05-10 10:10:53,111] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 28: [2023-05-10 10:10:53,111] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 16: [2023-05-10 10:10:53,111] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 7: [2023-05-10 10:10:53,111] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 11: [2023-05-10 10:10:53,111] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 16: [2023-05-10 10:10:53,112] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 16: [2023-05-10 10:10:53,112] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 28: [2023-05-10 10:10:53,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 28: [2023-05-10 10:10:53,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 28: [2023-05-10 10:10:53,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 5: [2023-05-10 10:10:53,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 28: [2023-05-10 10:10:53,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 5: [2023-05-10 10:10:53,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 28: [2023-05-10 10:10:53,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 15: [2023-05-10 10:10:53,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 15: [2023-05-10 10:10:53,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 5: [2023-05-10 10:10:53,114] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 19: [2023-05-10 10:10:53,114] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 5: [2023-05-10 10:10:53,114] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 5: [2023-05-10 10:10:53,114] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 5: [2023-05-10 10:10:53,114] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 5: [2023-05-10 10:10:53,114] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 7: [2023-05-10 10:10:53,114] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 19: [2023-05-10 10:10:53,115] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 5: [2023-05-10 10:10:53,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 19: [2023-05-10 10:10:53,115] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 23: [2023-05-10 10:10:53,117] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 5: [2023-05-10 10:10:53,118] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 5: [2023-05-10 10:10:53,118] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 5: [2023-05-10 10:10:53,118] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 11: [2023-05-10 10:10:53,118] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 11: [2023-05-10 10:10:53,119] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 6: [2023-05-10 10:10:53,119] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 6: [2023-05-10 10:10:53,119] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 6: [2023-05-10 10:10:53,119] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 6: [2023-05-10 10:10:53,119] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 6: [2023-05-10 10:10:53,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 5: [2023-05-10 10:10:53,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 11: [2023-05-10 10:10:53,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 5: [2023-05-10 10:10:53,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 23: [2023-05-10 10:10:53,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 15: [2023-05-10 10:10:53,120] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 15: [2023-05-10 10:10:53,120] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 15: [2023-05-10 10:10:53,120] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 15: [2023-05-10 10:10:53,120] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 15: [2023-05-10 10:10:53,120] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 11: [2023-05-10 10:10:53,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 11: [2023-05-10 10:10:53,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 5: [2023-05-10 10:10:53,122] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 11: [2023-05-10 10:10:53,122] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 30: [2023-05-10 10:10:53,122] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 30: [2023-05-10 10:10:53,122] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 30: [2023-05-10 10:10:53,122] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 30: [2023-05-10 10:10:53,123] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 30: [2023-05-10 10:10:53,123] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 30: [2023-05-10 10:10:53,123] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 30: [2023-05-10 10:10:53,123] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 30: [2023-05-10 10:10:53,123] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 21: [2023-05-10 10:10:53,122] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 21: [2023-05-10 10:10:53,122] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 21: [2023-05-10 10:10:53,122] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 21: [2023-05-10 10:10:53,122] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 21: [2023-05-10 10:10:53,122] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 21: [2023-05-10 10:10:53,122] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 21: [2023-05-10 10:10:53,122] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 21: [2023-05-10 10:10:53,123] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 5: [2023-05-10 10:10:53,123] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 5: [2023-05-10 10:10:53,123] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 30: [2023-05-10 10:10:53,126] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 23: [2023-05-10 10:10:53,127] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 23: [2023-05-10 10:10:53,127] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 7: [2023-05-10 10:10:53,127] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 21: [2023-05-10 10:10:53,127] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 9: [2023-05-10 10:10:53,128] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 9: [2023-05-10 10:10:53,128] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 21: [2023-05-10 10:10:53,128] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 23: [2023-05-10 10:10:53,129] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 21: [2023-05-10 10:10:53,129] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 21: [2023-05-10 10:10:53,129] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 21: [2023-05-10 10:10:53,130] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 7: [2023-05-10 10:10:53,130] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 7: [2023-05-10 10:10:53,130] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 21: [2023-05-10 10:10:53,130] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 29: [2023-05-10 10:10:53,131] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 21: [2023-05-10 10:10:53,131] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 21: [2023-05-10 10:10:53,132] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 7: [2023-05-10 10:10:53,132] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 9: [2023-05-10 10:10:53,133] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 7: [2023-05-10 10:10:53,133] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 9: [2023-05-10 10:10:53,133] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 30: [2023-05-10 10:10:53,133] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 9: [2023-05-10 10:10:53,133] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 29: [2023-05-10 10:10:53,133] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 9: [2023-05-10 10:10:53,134] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 30: [2023-05-10 10:10:53,135] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 30: [2023-05-10 10:10:53,135] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 9: [2023-05-10 10:10:53,135] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 30: [2023-05-10 10:10:53,136] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 30: [2023-05-10 10:10:53,136] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 30: [2023-05-10 10:10:53,137] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 6: [2023-05-10 10:10:53,137] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 3: [2023-05-10 10:10:53,137] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 3: [2023-05-10 10:10:53,137] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 2: [2023-05-10 10:10:53,138] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 2: [2023-05-10 10:10:53,138] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 2: [2023-05-10 10:10:53,138] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 2: [2023-05-10 10:10:53,138] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 2: [2023-05-10 10:10:53,138] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 2: [2023-05-10 10:10:53,138] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 2: [2023-05-10 10:10:53,138] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 3: [2023-05-10 10:10:53,138] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 3: [2023-05-10 10:10:53,138] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 22: [2023-05-10 10:10:53,138] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 22: [2023-05-10 10:10:53,138] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 2: [2023-05-10 10:10:53,138] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 3: [2023-05-10 10:10:53,139] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 3: [2023-05-10 10:10:53,139] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 3: [2023-05-10 10:10:53,139] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 3: [2023-05-10 10:10:53,139] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 22: [2023-05-10 10:10:53,139] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 22: [2023-05-10 10:10:53,139] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 30: [2023-05-10 10:10:53,139] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 22: [2023-05-10 10:10:53,139] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 22: [2023-05-10 10:10:53,139] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 22: [2023-05-10 10:10:53,139] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 22: [2023-05-10 10:10:53,139] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 15: [2023-05-10 10:10:53,141] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 3: [2023-05-10 10:10:53,141] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 15: [2023-05-10 10:10:53,142] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 1: [2023-05-10 10:10:53,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 1: [2023-05-10 10:10:53,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 1: [2023-05-10 10:10:53,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 3: [2023-05-10 10:10:53,142] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 1: [2023-05-10 10:10:53,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 1: [2023-05-10 10:10:53,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 1: [2023-05-10 10:10:53,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 1: [2023-05-10 10:10:53,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 2: [2023-05-10 10:10:53,142] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 2: [2023-05-10 10:10:53,142] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 1: [2023-05-10 10:10:53,143] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 3: [2023-05-10 10:10:53,143] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 3: [2023-05-10 10:10:53,143] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 28: [2023-05-10 10:10:53,144] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 22: [2023-05-10 10:10:53,144] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 22: [2023-05-10 10:10:53,145] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 28: [2023-05-10 10:10:53,146] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 0: [2023-05-10 10:10:53,146] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 0: [2023-05-10 10:10:53,146] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 0: [2023-05-10 10:10:53,146] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 28: [2023-05-10 10:10:53,146] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 28: [2023-05-10 10:10:53,147] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 28: [2023-05-10 10:10:53,147] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 3: [2023-05-10 10:10:53,147] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 3: [2023-05-10 10:10:53,147] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 3: [2023-05-10 10:10:53,148] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 1: [2023-05-10 10:10:53,148] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 3: [2023-05-10 10:10:53,148] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 1: [2023-05-10 10:10:53,148] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 6: [2023-05-10 10:10:53,148] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 2: [2023-05-10 10:10:53,148] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 2: [2023-05-10 10:10:53,149] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 22: [2023-05-10 10:10:53,149] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 2: [2023-05-10 10:10:53,149] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 1: [2023-05-10 10:10:53,150] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 2: [2023-05-10 10:10:53,150] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 22: [2023-05-10 10:10:53,150] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 1: [2023-05-10 10:10:53,150] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 2: [2023-05-10 10:10:53,150] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 22: [2023-05-10 10:10:53,151] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 1: [2023-05-10 10:10:53,151] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 2: [2023-05-10 10:10:53,151] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 22: [2023-05-10 10:10:53,151] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 1: [2023-05-10 10:10:53,151] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 22: [2023-05-10 10:10:53,151] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 1: [2023-05-10 10:10:53,151] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 22: [2023-05-10 10:10:53,151] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 28: [2023-05-10 10:10:53,152] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 1: [2023-05-10 10:10:53,152] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 29: [2023-05-10 10:10:53,152] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 29: [2023-05-10 10:10:53,152] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 29: [2023-05-10 10:10:53,154] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 15: [2023-05-10 10:10:53,154] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 6: [2023-05-10 10:10:53,154] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 6: [2023-05-10 10:10:53,155] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 6: [2023-05-10 10:10:53,155] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 15: [2023-05-10 10:10:53,155] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 15: [2023-05-10 10:10:53,156] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 18: [2023-05-10 10:10:53,156] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 18: [2023-05-10 10:10:53,156] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 18: [2023-05-10 10:10:53,156] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 18: [2023-05-10 10:10:53,156] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 29: [2023-05-10 10:10:53,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 29: [2023-05-10 10:10:53,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 29: [2023-05-10 10:10:53,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 29: [2023-05-10 10:10:53,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 15: [2023-05-10 10:10:53,157] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 29: [2023-05-10 10:10:53,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 0: [2023-05-10 10:10:53,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 0: [2023-05-10 10:10:53,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 0: [2023-05-10 10:10:53,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 0: [2023-05-10 10:10:53,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 0: [2023-05-10 10:10:53,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 18: [2023-05-10 10:10:53,161] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 18: [2023-05-10 10:10:53,161] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 18: [2023-05-10 10:10:53,161] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 18: [2023-05-10 10:10:53,161] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 15: [2023-05-10 10:10:53,161] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 16: [2023-05-10 10:10:53,162] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 11: [2023-05-10 10:10:53,163] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 11: [2023-05-10 10:10:53,163] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 16: [2023-05-10 10:10:53,164] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 5: [2023-05-10 10:10:53,164] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 0: [2023-05-10 10:10:53,171] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 24: [2023-05-10 10:10:53,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 20: [2023-05-10 10:10:53,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 20: [2023-05-10 10:10:53,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 20: [2023-05-10 10:10:53,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 24: [2023-05-10 10:10:53,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 24: [2023-05-10 10:10:53,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 24: [2023-05-10 10:10:53,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 24: [2023-05-10 10:10:53,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 24: [2023-05-10 10:10:53,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 24: [2023-05-10 10:10:53,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 24: [2023-05-10 10:10:53,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 20: [2023-05-10 10:10:53,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 20: [2023-05-10 10:10:53,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 20: [2023-05-10 10:10:53,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 20: [2023-05-10 10:10:53,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 20: [2023-05-10 10:10:53,174] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 0: [2023-05-10 10:10:53,174] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 20: [2023-05-10 10:10:53,175] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 0: [2023-05-10 10:10:53,176] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 5: [2023-05-10 10:10:53,176] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 30: [2023-05-10 10:10:53,177] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 24: [2023-05-10 10:10:53,177] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 24: [2023-05-10 10:10:53,178] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 5: [2023-05-10 10:10:53,180] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 5: [2023-05-10 10:10:53,180] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 20: [2023-05-10 10:10:53,180] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 24: [2023-05-10 10:10:53,180] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 24: [2023-05-10 10:10:53,181] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 24: [2023-05-10 10:10:53,181] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 24: [2023-05-10 10:10:53,181] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 29: [2023-05-10 10:10:53,181] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 24: [2023-05-10 10:10:53,181] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 13: [2023-05-10 10:10:53,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 13: [2023-05-10 10:10:53,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 13: [2023-05-10 10:10:53,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 24: [2023-05-10 10:10:53,182] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 13: [2023-05-10 10:10:53,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 13: [2023-05-10 10:10:53,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 13: [2023-05-10 10:10:53,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 13: [2023-05-10 10:10:53,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 13: [2023-05-10 10:10:53,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 16: [2023-05-10 10:10:53,183] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 18: [2023-05-10 10:10:53,183] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 16: [2023-05-10 10:10:53,184] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 16: [2023-05-10 10:10:53,184] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 16: [2023-05-10 10:10:53,184] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 16: [2023-05-10 10:10:53,184] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 16: [2023-05-10 10:10:53,184] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 20: [2023-05-10 10:10:53,184] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 20: [2023-05-10 10:10:53,184] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 20: [2023-05-10 10:10:53,184] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 20: [2023-05-10 10:10:53,185] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 21: [2023-05-10 10:10:53,184] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 21: [2023-05-10 10:10:53,184] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 0: [2023-05-10 10:10:53,185] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 21: [2023-05-10 10:10:53,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 21: [2023-05-10 10:10:53,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 0: [2023-05-10 10:10:53,185] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 20: [2023-05-10 10:10:53,185] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 5: [2023-05-10 10:10:53,185] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 11: [2023-05-10 10:10:53,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 18: [2023-05-10 10:10:53,185] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 18: [2023-05-10 10:10:53,185] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 16: [2023-05-10 10:10:53,186] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 18: [2023-05-10 10:10:53,186] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 20: [2023-05-10 10:10:53,186] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 16: [2023-05-10 10:10:53,187] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 4: [2023-05-10 10:10:53,188] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 11: [2023-05-10 10:10:53,188] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 11: [2023-05-10 10:10:53,188] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 0: [2023-05-10 10:10:53,188] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 4: [2023-05-10 10:10:53,188] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 4: [2023-05-10 10:10:53,188] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 4: [2023-05-10 10:10:53,188] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 4: [2023-05-10 10:10:53,188] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 4: [2023-05-10 10:10:53,188] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 4: [2023-05-10 10:10:53,188] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 4: [2023-05-10 10:10:53,188] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 11: [2023-05-10 10:10:53,189] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 11: [2023-05-10 10:10:53,189] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 11: [2023-05-10 10:10:53,189] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 0: [2023-05-10 10:10:53,189] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 0: [2023-05-10 10:10:53,189] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 3: [2023-05-10 10:10:53,189] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 5: [2023-05-10 10:10:53,191] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 5: [2023-05-10 10:10:53,191] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 29: [2023-05-10 10:10:53,192] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 29: [2023-05-10 10:10:53,192] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 13: [2023-05-10 10:10:53,191] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 5: [2023-05-10 10:10:53,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 29: [2023-05-10 10:10:53,192] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 29: [2023-05-10 10:10:53,192] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 2: [2023-05-10 10:10:53,193] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 2: [2023-05-10 10:10:53,193] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 11: [2023-05-10 10:10:53,193] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 11: [2023-05-10 10:10:53,194] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 18: [2023-05-10 10:10:53,194] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 18: [2023-05-10 10:10:53,194] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 18: [2023-05-10 10:10:53,194] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 4: [2023-05-10 10:10:53,194] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 4: [2023-05-10 10:10:53,194] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 4: [2023-05-10 10:10:53,194] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 4: [2023-05-10 10:10:53,194] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 29: [2023-05-10 10:10:53,194] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 13: [2023-05-10 10:10:53,194] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 4: [2023-05-10 10:10:53,195] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 13: [2023-05-10 10:10:53,195] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 13: [2023-05-10 10:10:53,195] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 4: [2023-05-10 10:10:53,195] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 13: [2023-05-10 10:10:53,195] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 4: [2023-05-10 10:10:53,195] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 18: [2023-05-10 10:10:53,195] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 30: [2023-05-10 10:10:53,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 3: [2023-05-10 10:10:53,195] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 13: [2023-05-10 10:10:53,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 4: [2023-05-10 10:10:53,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 5: [2023-05-10 10:10:53,197] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 13: [2023-05-10 10:10:53,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 21: [2023-05-10 10:10:53,197] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 13: [2023-05-10 10:10:53,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt... 22: [2023-05-10 10:10:53,198] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 22: [2023-05-10 10:10:53,198] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 5: [2023-05-10 10:10:53,199] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 21: [2023-05-10 10:10:53,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 21: [2023-05-10 10:10:53,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 21: [2023-05-10 10:10:53,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 1: [2023-05-10 10:10:53,202] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 1: [2023-05-10 10:10:53,202] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 3: [2023-05-10 10:10:53,203] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 3: [2023-05-10 10:10:53,203] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 1: [2023-05-10 10:10:53,205] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 5: [2023-05-10 10:10:53,205] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 5: [2023-05-10 10:10:53,205] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 30: [2023-05-10 10:10:53,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 3: [2023-05-10 10:10:53,210] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 1: [2023-05-10 10:10:53,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 21: [2023-05-10 10:10:53,212] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 30: [2023-05-10 10:10:53,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 30: [2023-05-10 10:10:53,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 21: [2023-05-10 10:10:53,214] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 21: [2023-05-10 10:10:53,214] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 21: [2023-05-10 10:10:53,214] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 1: [2023-05-10 10:10:53,215] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 1: [2023-05-10 10:10:53,215] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 1: [2023-05-10 10:10:53,215] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 1: [2023-05-10 10:10:53,215] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 11: [2023-05-10 10:10:53,215] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 16: [2023-05-10 10:10:53,215] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 2: [2023-05-10 10:10:53,215] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 2: [2023-05-10 10:10:53,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 16: [2023-05-10 10:10:53,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 3: [2023-05-10 10:10:53,217] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 22: [2023-05-10 10:10:53,219] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 22: [2023-05-10 10:10:53,219] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 5: [2023-05-10 10:10:53,219] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 5: [2023-05-10 10:10:53,219] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 2: [2023-05-10 10:10:53,219] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 2: [2023-05-10 10:10:53,219] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 3: [2023-05-10 10:10:53,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 3: [2023-05-10 10:10:53,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 3: [2023-05-10 10:10:53,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 11: [2023-05-10 10:10:53,220] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 30: [2023-05-10 10:10:53,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 2: [2023-05-10 10:10:53,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 2: [2023-05-10 10:10:53,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 2: [2023-05-10 10:10:53,221] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 16: [2023-05-10 10:10:53,221] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 11: [2023-05-10 10:10:53,221] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 16: [2023-05-10 10:10:53,221] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 22: [2023-05-10 10:10:53,221] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 5: [2023-05-10 10:10:53,222] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 16: [2023-05-10 10:10:53,222] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 16: [2023-05-10 10:10:53,222] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 22: [2023-05-10 10:10:53,222] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 22: [2023-05-10 10:10:53,222] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 22: [2023-05-10 10:10:53,222] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 22: [2023-05-10 10:10:53,222] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 2: [2023-05-10 10:10:53,222] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 3: [2023-05-10 10:10:53,223] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 5: [2023-05-10 10:10:53,224] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 11: [2023-05-10 10:10:53,224] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 22: [2023-05-10 10:10:53,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 20: [2023-05-10 10:10:53,225] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 1: [2023-05-10 10:10:53,226] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 1: [2023-05-10 10:10:53,226] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 21: [2023-05-10 10:10:53,226] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 11: [2023-05-10 10:10:53,227] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 30: [2023-05-10 10:10:53,227] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 30: [2023-05-10 10:10:53,227] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 30: [2023-05-10 10:10:53,227] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 11: [2023-05-10 10:10:53,227] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 3: [2023-05-10 10:10:53,227] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 3: [2023-05-10 10:10:53,228] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 21: [2023-05-10 10:10:53,228] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 21: [2023-05-10 10:10:53,229] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 24: [2023-05-10 10:10:53,229] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 24: [2023-05-10 10:10:53,229] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 21: [2023-05-10 10:10:53,229] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 1: [2023-05-10 10:10:53,233] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 30: [2023-05-10 10:10:53,236] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 1: [2023-05-10 10:10:53,239] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 30: [2023-05-10 10:10:53,240] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 20: [2023-05-10 10:10:53,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 30: [2023-05-10 10:10:53,242] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 1: [2023-05-10 10:10:53,243] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 20: [2023-05-10 10:10:53,244] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 1: [2023-05-10 10:10:53,244] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 4: [2023-05-10 10:10:53,244] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 1: [2023-05-10 10:10:53,246] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 3: [2023-05-10 10:10:53,246] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 1: [2023-05-10 10:10:53,247] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 3: [2023-05-10 10:10:53,247] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 22: [2023-05-10 10:10:53,249] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 3: [2023-05-10 10:10:53,250] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 22: [2023-05-10 10:10:53,250] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 3: [2023-05-10 10:10:53,250] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 22: [2023-05-10 10:10:53,251] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 24: [2023-05-10 10:10:53,252] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 22: [2023-05-10 10:10:53,252] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 2: [2023-05-10 10:10:53,252] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 4: [2023-05-10 10:10:53,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 4: [2023-05-10 10:10:53,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 4: [2023-05-10 10:10:53,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 4: [2023-05-10 10:10:53,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 24: [2023-05-10 10:10:53,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 24: [2023-05-10 10:10:53,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 24: [2023-05-10 10:10:53,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 24: [2023-05-10 10:10:53,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 24: [2023-05-10 10:10:53,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 24: [2023-05-10 10:10:53,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 24: [2023-05-10 10:10:53,255] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 30: [2023-05-10 10:10:53,255] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 22: [2023-05-10 10:10:53,256] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 2: [2023-05-10 10:10:53,256] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 22: [2023-05-10 10:10:53,257] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 4: [2023-05-10 10:10:53,258] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 30: [2023-05-10 10:10:53,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 4: [2023-05-10 10:10:53,259] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 4: [2023-05-10 10:10:53,259] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 2: [2023-05-10 10:10:53,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 2: [2023-05-10 10:10:53,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 13: [2023-05-10 10:10:53,259] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 30: [2023-05-10 10:10:53,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 2: [2023-05-10 10:10:53,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 20: [2023-05-10 10:10:53,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 2: [2023-05-10 10:10:53,262] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 30: [2023-05-10 10:10:53,263] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 4: [2023-05-10 10:10:53,266] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 20: [2023-05-10 10:10:53,267] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 20: [2023-05-10 10:10:53,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 20: [2023-05-10 10:10:53,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 20: [2023-05-10 10:10:53,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 20: [2023-05-10 10:10:53,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 20: [2023-05-10 10:10:53,269] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 13: [2023-05-10 10:10:53,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 13: [2023-05-10 10:10:53,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 4: [2023-05-10 10:10:53,279] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 4: [2023-05-10 10:10:53,280] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 4: [2023-05-10 10:10:53,280] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 4: [2023-05-10 10:10:53,280] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 24: [2023-05-10 10:10:53,281] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 24: [2023-05-10 10:10:53,285] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 13: [2023-05-10 10:10:53,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 24: [2023-05-10 10:10:53,287] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 24: [2023-05-10 10:10:53,288] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 24: [2023-05-10 10:10:53,288] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 24: [2023-05-10 10:10:53,288] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 4: [2023-05-10 10:10:53,289] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 13: [2023-05-10 10:10:53,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 13: [2023-05-10 10:10:53,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 13: [2023-05-10 10:10:53,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 13: [2023-05-10 10:10:53,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 13: [2023-05-10 10:10:53,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_15-model_00-model_states.pt. 4: [2023-05-10 10:10:53,290] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 20: [2023-05-10 10:10:53,291] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 4: [2023-05-10 10:10:53,293] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 20: [2023-05-10 10:10:53,299] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 20: [2023-05-10 10:10:53,302] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 20: [2023-05-10 10:10:53,302] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 20: [2023-05-10 10:10:53,306] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 20: [2023-05-10 10:10:53,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 13: [2023-05-10 10:10:53,309] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 13: [2023-05-10 10:10:53,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 13: [2023-05-10 10:10:53,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 13: [2023-05-10 10:10:53,328] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 13: [2023-05-10 10:10:53,328] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 13: [2023-05-10 10:10:53,330] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 13: [2023-05-10 10:10:53,331] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 25: [2023-05-10 10:10:53,335] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 25: [2023-05-10 10:10:53,335] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 25: [2023-05-10 10:10:53,336] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 25: [2023-05-10 10:10:53,335] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 25: [2023-05-10 10:10:53,336] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 25: [2023-05-10 10:10:53,336] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 25: [2023-05-10 10:10:53,336] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 25: [2023-05-10 10:10:53,336] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 8: [2023-05-10 10:10:53,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 8: [2023-05-10 10:10:53,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 8: [2023-05-10 10:10:53,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 8: [2023-05-10 10:10:53,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 8: [2023-05-10 10:10:53,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 8: [2023-05-10 10:10:53,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 8: [2023-05-10 10:10:53,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 8: [2023-05-10 10:10:53,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 25: [2023-05-10 10:10:53,342] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 25: [2023-05-10 10:10:53,342] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 25: [2023-05-10 10:10:53,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 25: [2023-05-10 10:10:53,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 25: [2023-05-10 10:10:53,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 8: [2023-05-10 10:10:53,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 25: [2023-05-10 10:10:53,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 25: [2023-05-10 10:10:53,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 25: [2023-05-10 10:10:53,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 8: [2023-05-10 10:10:53,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 8: [2023-05-10 10:10:53,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 8: [2023-05-10 10:10:53,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 8: [2023-05-10 10:10:53,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 8: [2023-05-10 10:10:53,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 8: [2023-05-10 10:10:53,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 8: [2023-05-10 10:10:53,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 27: [2023-05-10 10:10:53,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 27: [2023-05-10 10:10:53,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 27: [2023-05-10 10:10:53,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 12: [2023-05-10 10:10:53,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 27: [2023-05-10 10:10:53,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 12: [2023-05-10 10:10:53,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 12: [2023-05-10 10:10:53,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 27: [2023-05-10 10:10:53,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 27: [2023-05-10 10:10:53,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 27: [2023-05-10 10:10:53,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 27: [2023-05-10 10:10:53,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 12: [2023-05-10 10:10:53,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 12: [2023-05-10 10:10:53,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 12: [2023-05-10 10:10:53,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 12: [2023-05-10 10:10:53,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 12: [2023-05-10 10:10:53,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 27: [2023-05-10 10:10:53,368] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 14: [2023-05-10 10:10:53,369] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 14: [2023-05-10 10:10:53,370] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 14: [2023-05-10 10:10:53,370] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 14: [2023-05-10 10:10:53,370] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 14: [2023-05-10 10:10:53,370] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 14: [2023-05-10 10:10:53,370] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 14: [2023-05-10 10:10:53,370] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 12: [2023-05-10 10:10:53,369] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 14: [2023-05-10 10:10:53,370] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 27: [2023-05-10 10:10:53,370] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 27: [2023-05-10 10:10:53,371] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 27: [2023-05-10 10:10:53,373] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 27: [2023-05-10 10:10:53,374] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 27: [2023-05-10 10:10:53,374] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 27: [2023-05-10 10:10:53,374] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 27: [2023-05-10 10:10:53,375] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 12: [2023-05-10 10:10:53,376] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 14: [2023-05-10 10:10:53,376] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 12: [2023-05-10 10:10:53,376] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 14: [2023-05-10 10:10:53,376] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 14: [2023-05-10 10:10:53,377] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 12: [2023-05-10 10:10:53,377] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 12: [2023-05-10 10:10:53,377] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 12: [2023-05-10 10:10:53,378] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 12: [2023-05-10 10:10:53,378] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 12: [2023-05-10 10:10:53,379] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 14: [2023-05-10 10:10:53,379] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 14: [2023-05-10 10:10:53,380] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 14: [2023-05-10 10:10:53,380] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 14: [2023-05-10 10:10:53,381] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 14: [2023-05-10 10:10:53,381] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 15: [2023-05-10 10:10:53,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 15: [2023-05-10 10:10:53,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 15: [2023-05-10 10:10:53,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 15: [2023-05-10 10:10:53,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 15: [2023-05-10 10:10:53,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 15: [2023-05-10 10:10:53,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 15: [2023-05-10 10:10:53,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 15: [2023-05-10 10:10:53,393] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 26: [2023-05-10 10:10:53,395] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 26: [2023-05-10 10:10:53,395] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 26: [2023-05-10 10:10:53,396] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 26: [2023-05-10 10:10:53,396] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 26: [2023-05-10 10:10:53,396] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 26: [2023-05-10 10:10:53,396] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 26: [2023-05-10 10:10:53,396] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 26: [2023-05-10 10:10:53,396] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 15: [2023-05-10 10:10:53,396] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 15: [2023-05-10 10:10:53,396] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 26: [2023-05-10 10:10:53,398] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 25: [2023-05-10 10:10:53,399] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 25: [2023-05-10 10:10:53,399] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 25: [2023-05-10 10:10:53,399] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 26: [2023-05-10 10:10:53,400] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 8: [2023-05-10 10:10:53,400] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 8: [2023-05-10 10:10:53,400] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 25: [2023-05-10 10:10:53,401] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 15: [2023-05-10 10:10:53,402] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 15: [2023-05-10 10:10:53,402] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 25: [2023-05-10 10:10:53,403] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 25: [2023-05-10 10:10:53,403] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 25: [2023-05-10 10:10:53,403] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 25: [2023-05-10 10:10:53,403] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 15: [2023-05-10 10:10:53,403] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 8: [2023-05-10 10:10:53,403] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 8: [2023-05-10 10:10:53,403] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 15: [2023-05-10 10:10:53,404] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 15: [2023-05-10 10:10:53,404] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 15: [2023-05-10 10:10:53,404] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 26: [2023-05-10 10:10:53,404] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 26: [2023-05-10 10:10:53,405] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 26: [2023-05-10 10:10:53,406] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 26: [2023-05-10 10:10:53,406] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 26: [2023-05-10 10:10:53,406] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 26: [2023-05-10 10:10:53,407] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 10: [2023-05-10 10:10:53,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 10: [2023-05-10 10:10:53,411] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 10: [2023-05-10 10:10:53,411] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 10: [2023-05-10 10:10:53,411] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 10: [2023-05-10 10:10:53,411] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 10: [2023-05-10 10:10:53,411] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 10: [2023-05-10 10:10:53,411] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 10: [2023-05-10 10:10:53,411] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 8: [2023-05-10 10:10:53,416] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 8: [2023-05-10 10:10:53,416] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 8: [2023-05-10 10:10:53,416] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 10: [2023-05-10 10:10:53,417] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 10: [2023-05-10 10:10:53,418] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 10: [2023-05-10 10:10:53,419] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 10: [2023-05-10 10:10:53,419] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 10: [2023-05-10 10:10:53,419] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 27: [2023-05-10 10:10:53,419] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 10: [2023-05-10 10:10:53,420] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 10: [2023-05-10 10:10:53,420] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 8: [2023-05-10 10:10:53,421] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 10: [2023-05-10 10:10:53,421] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 12: [2023-05-10 10:10:53,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 25: [2023-05-10 10:10:53,423] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 8: [2023-05-10 10:10:53,424] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 8: [2023-05-10 10:10:53,426] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 25: [2023-05-10 10:10:53,426] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 25: [2023-05-10 10:10:53,426] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 27: [2023-05-10 10:10:53,429] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 27: [2023-05-10 10:10:53,429] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 14: [2023-05-10 10:10:53,430] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 8: [2023-05-10 10:10:53,430] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 8: [2023-05-10 10:10:53,430] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 14: [2023-05-10 10:10:53,432] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 14: [2023-05-10 10:10:53,432] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 25: [2023-05-10 10:10:53,434] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 25: [2023-05-10 10:10:53,434] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 25: [2023-05-10 10:10:53,435] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 25: [2023-05-10 10:10:53,435] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 25: [2023-05-10 10:10:53,435] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 27: [2023-05-10 10:10:53,441] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 27: [2023-05-10 10:10:53,442] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 8: [2023-05-10 10:10:53,442] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 27: [2023-05-10 10:10:53,443] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 27: [2023-05-10 10:10:53,443] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 27: [2023-05-10 10:10:53,443] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 27: [2023-05-10 10:10:53,443] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 8: [2023-05-10 10:10:53,443] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 26: [2023-05-10 10:10:53,443] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 8: [2023-05-10 10:10:53,445] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 9: [2023-05-10 10:10:53,445] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 9: [2023-05-10 10:10:53,445] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 14: [2023-05-10 10:10:53,445] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 6: [2023-05-10 10:10:53,446] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 6: [2023-05-10 10:10:53,446] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 12: [2023-05-10 10:10:53,446] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 9: [2023-05-10 10:10:53,446] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 9: [2023-05-10 10:10:53,446] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 14: [2023-05-10 10:10:53,446] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 9: [2023-05-10 10:10:53,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 9: [2023-05-10 10:10:53,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 9: [2023-05-10 10:10:53,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 14: [2023-05-10 10:10:53,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 14: [2023-05-10 10:10:53,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 14: [2023-05-10 10:10:53,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 9: [2023-05-10 10:10:53,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 6: [2023-05-10 10:10:53,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 6: [2023-05-10 10:10:53,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 8: [2023-05-10 10:10:53,448] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 6: [2023-05-10 10:10:53,448] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 6: [2023-05-10 10:10:53,448] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 6: [2023-05-10 10:10:53,448] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 6: [2023-05-10 10:10:53,448] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 15: [2023-05-10 10:10:53,448] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 6: [2023-05-10 10:10:53,450] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 6: [2023-05-10 10:10:53,450] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 12: [2023-05-10 10:10:53,450] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 12: [2023-05-10 10:10:53,450] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 15: [2023-05-10 10:10:53,450] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 9: [2023-05-10 10:10:53,451] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 9: [2023-05-10 10:10:53,452] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 9: [2023-05-10 10:10:53,453] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 26: [2023-05-10 10:10:53,453] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 27: [2023-05-10 10:10:53,454] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 27: [2023-05-10 10:10:53,454] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 9: [2023-05-10 10:10:53,454] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 9: [2023-05-10 10:10:53,455] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 9: [2023-05-10 10:10:53,455] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 9: [2023-05-10 10:10:53,455] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 9: [2023-05-10 10:10:53,456] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 6: [2023-05-10 10:10:53,457] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 14: [2023-05-10 10:10:53,457] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 14: [2023-05-10 10:10:53,457] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 6: [2023-05-10 10:10:53,458] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 6: [2023-05-10 10:10:53,459] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 6: [2023-05-10 10:10:53,459] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 6: [2023-05-10 10:10:53,460] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 14: [2023-05-10 10:10:53,460] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 6: [2023-05-10 10:10:53,460] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 12: [2023-05-10 10:10:53,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 12: [2023-05-10 10:10:53,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 12: [2023-05-10 10:10:53,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 12: [2023-05-10 10:10:53,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 12: [2023-05-10 10:10:53,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 31: [2023-05-10 10:10:53,464] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 31: [2023-05-10 10:10:53,464] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 31: [2023-05-10 10:10:53,464] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 31: [2023-05-10 10:10:53,464] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 31: [2023-05-10 10:10:53,464] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 31: [2023-05-10 10:10:53,464] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 31: [2023-05-10 10:10:53,464] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 31: [2023-05-10 10:10:53,465] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 26: [2023-05-10 10:10:53,467] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 26: [2023-05-10 10:10:53,467] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 31: [2023-05-10 10:10:53,468] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 31: [2023-05-10 10:10:53,469] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 23: [2023-05-10 10:10:53,469] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 23: [2023-05-10 10:10:53,469] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 7: [2023-05-10 10:10:53,471] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 7: [2023-05-10 10:10:53,471] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 7: [2023-05-10 10:10:53,471] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 7: [2023-05-10 10:10:53,471] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 7: [2023-05-10 10:10:53,471] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 15: [2023-05-10 10:10:53,471] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 7: [2023-05-10 10:10:53,471] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 7: [2023-05-10 10:10:53,471] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 15: [2023-05-10 10:10:53,472] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 7: [2023-05-10 10:10:53,471] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 23: [2023-05-10 10:10:53,472] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 23: [2023-05-10 10:10:53,472] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 23: [2023-05-10 10:10:53,472] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 23: [2023-05-10 10:10:53,472] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 23: [2023-05-10 10:10:53,472] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 23: [2023-05-10 10:10:53,473] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 27: [2023-05-10 10:10:53,473] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 27: [2023-05-10 10:10:53,473] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 10: [2023-05-10 10:10:53,473] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 10: [2023-05-10 10:10:53,473] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 10: [2023-05-10 10:10:53,474] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 31: [2023-05-10 10:10:53,473] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 31: [2023-05-10 10:10:53,474] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 31: [2023-05-10 10:10:53,474] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 31: [2023-05-10 10:10:53,474] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 27: [2023-05-10 10:10:53,474] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 14: [2023-05-10 10:10:53,474] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 31: [2023-05-10 10:10:53,474] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 23: [2023-05-10 10:10:53,475] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 23: [2023-05-10 10:10:53,475] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 15: [2023-05-10 10:10:53,475] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 15: [2023-05-10 10:10:53,475] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 31: [2023-05-10 10:10:53,475] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 14: [2023-05-10 10:10:53,476] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 7: [2023-05-10 10:10:53,476] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 7: [2023-05-10 10:10:53,477] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 23: [2023-05-10 10:10:53,477] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 7: [2023-05-10 10:10:53,477] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 15: [2023-05-10 10:10:53,477] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 15: [2023-05-10 10:10:53,477] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 15: [2023-05-10 10:10:53,477] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 15: [2023-05-10 10:10:53,477] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 7: [2023-05-10 10:10:53,477] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 23: [2023-05-10 10:10:53,477] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 27: [2023-05-10 10:10:53,477] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 26: [2023-05-10 10:10:53,478] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 14: [2023-05-10 10:10:53,478] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 14: [2023-05-10 10:10:53,479] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 29: [2023-05-10 10:10:53,478] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 29: [2023-05-10 10:10:53,478] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 29: [2023-05-10 10:10:53,478] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 29: [2023-05-10 10:10:53,479] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 29: [2023-05-10 10:10:53,479] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 29: [2023-05-10 10:10:53,479] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 29: [2023-05-10 10:10:53,479] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 29: [2023-05-10 10:10:53,479] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 27: [2023-05-10 10:10:53,479] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 23: [2023-05-10 10:10:53,479] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 23: [2023-05-10 10:10:53,479] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 7: [2023-05-10 10:10:53,479] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 7: [2023-05-10 10:10:53,480] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 10: [2023-05-10 10:10:53,480] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 23: [2023-05-10 10:10:53,480] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 12: [2023-05-10 10:10:53,480] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 26: [2023-05-10 10:10:53,480] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 26: [2023-05-10 10:10:53,480] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 26: [2023-05-10 10:10:53,480] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 26: [2023-05-10 10:10:53,480] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 23: [2023-05-10 10:10:53,481] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 14: [2023-05-10 10:10:53,481] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 7: [2023-05-10 10:10:53,481] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 7: [2023-05-10 10:10:53,481] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 29: [2023-05-10 10:10:53,482] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 29: [2023-05-10 10:10:53,482] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 12: [2023-05-10 10:10:53,483] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 17: [2023-05-10 10:10:53,483] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 17: [2023-05-10 10:10:53,484] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 17: [2023-05-10 10:10:53,484] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 17: [2023-05-10 10:10:53,484] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 17: [2023-05-10 10:10:53,484] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 17: [2023-05-10 10:10:53,484] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 17: [2023-05-10 10:10:53,484] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 17: [2023-05-10 10:10:53,484] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 26: [2023-05-10 10:10:53,485] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 16: [2023-05-10 10:10:53,485] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 16: [2023-05-10 10:10:53,485] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 16: [2023-05-10 10:10:53,486] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 16: [2023-05-10 10:10:53,486] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 16: [2023-05-10 10:10:53,486] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 16: [2023-05-10 10:10:53,486] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 16: [2023-05-10 10:10:53,486] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 29: [2023-05-10 10:10:53,486] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 10: [2023-05-10 10:10:53,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 10: [2023-05-10 10:10:53,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 10: [2023-05-10 10:10:53,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 16: [2023-05-10 10:10:53,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 10: [2023-05-10 10:10:53,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 17: [2023-05-10 10:10:53,487] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 29: [2023-05-10 10:10:53,488] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 29: [2023-05-10 10:10:53,489] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 19: [2023-05-10 10:10:53,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 19: [2023-05-10 10:10:53,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 29: [2023-05-10 10:10:53,489] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 29: [2023-05-10 10:10:53,489] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 16: [2023-05-10 10:10:53,489] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 29: [2023-05-10 10:10:53,489] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 26: [2023-05-10 10:10:53,491] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 16: [2023-05-10 10:10:53,491] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 19: [2023-05-10 10:10:53,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 19: [2023-05-10 10:10:53,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 19: [2023-05-10 10:10:53,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 19: [2023-05-10 10:10:53,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 19: [2023-05-10 10:10:53,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 19: [2023-05-10 10:10:53,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 19: [2023-05-10 10:10:53,495] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 17: [2023-05-10 10:10:53,495] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 19: [2023-05-10 10:10:53,495] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 16: [2023-05-10 10:10:53,495] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 16: [2023-05-10 10:10:53,495] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 16: [2023-05-10 10:10:53,495] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 17: [2023-05-10 10:10:53,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 17: [2023-05-10 10:10:53,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 16: [2023-05-10 10:10:53,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 17: [2023-05-10 10:10:53,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 16: [2023-05-10 10:10:53,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 12: [2023-05-10 10:10:53,497] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 17: [2023-05-10 10:10:53,497] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 21: [2023-05-10 10:10:53,497] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 21: [2023-05-10 10:10:53,497] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 16: [2023-05-10 10:10:53,497] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 19: [2023-05-10 10:10:53,497] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 17: [2023-05-10 10:10:53,497] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 21: [2023-05-10 10:10:53,498] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 12: [2023-05-10 10:10:53,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 12: [2023-05-10 10:10:53,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 21: [2023-05-10 10:10:53,498] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 21: [2023-05-10 10:10:53,498] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 21: [2023-05-10 10:10:53,498] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 17: [2023-05-10 10:10:53,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 10: [2023-05-10 10:10:53,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 21: [2023-05-10 10:10:53,498] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 21: [2023-05-10 10:10:53,498] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 12: [2023-05-10 10:10:53,499] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 12: [2023-05-10 10:10:53,499] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 19: [2023-05-10 10:10:53,501] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 10: [2023-05-10 10:10:53,501] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 6: [2023-05-10 10:10:53,501] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 6: [2023-05-10 10:10:53,501] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 10: [2023-05-10 10:10:53,502] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 19: [2023-05-10 10:10:53,503] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 19: [2023-05-10 10:10:53,503] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 9: [2023-05-10 10:10:53,504] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 19: [2023-05-10 10:10:53,504] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 19: [2023-05-10 10:10:53,504] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 21: [2023-05-10 10:10:53,504] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 10: [2023-05-10 10:10:53,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 21: [2023-05-10 10:10:53,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 21: [2023-05-10 10:10:53,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 21: [2023-05-10 10:10:53,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 21: [2023-05-10 10:10:53,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 21: [2023-05-10 10:10:53,506] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 21: [2023-05-10 10:10:53,506] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 3: [2023-05-10 10:10:53,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 3: [2023-05-10 10:10:53,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 3: [2023-05-10 10:10:53,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 3: [2023-05-10 10:10:53,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 3: [2023-05-10 10:10:53,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 3: [2023-05-10 10:10:53,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 3: [2023-05-10 10:10:53,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 3: [2023-05-10 10:10:53,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 21: [2023-05-10 10:10:53,507] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 3: [2023-05-10 10:10:53,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 3: [2023-05-10 10:10:53,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 26: [2023-05-10 10:10:53,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 26: [2023-05-10 10:10:53,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 3: [2023-05-10 10:10:53,512] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 9: [2023-05-10 10:10:53,513] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 9: [2023-05-10 10:10:53,513] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 15: [2023-05-10 10:10:53,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 3: [2023-05-10 10:10:53,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 15: [2023-05-10 10:10:53,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 26: [2023-05-10 10:10:53,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 1: [2023-05-10 10:10:53,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 1: [2023-05-10 10:10:53,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 9: [2023-05-10 10:10:53,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 1: [2023-05-10 10:10:53,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 1: [2023-05-10 10:10:53,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 1: [2023-05-10 10:10:53,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 1: [2023-05-10 10:10:53,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 1: [2023-05-10 10:10:53,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 1: [2023-05-10 10:10:53,515] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 3: [2023-05-10 10:10:53,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 18: [2023-05-10 10:10:53,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 18: [2023-05-10 10:10:53,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 18: [2023-05-10 10:10:53,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 18: [2023-05-10 10:10:53,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 18: [2023-05-10 10:10:53,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 9: [2023-05-10 10:10:53,515] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 18: [2023-05-10 10:10:53,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 18: [2023-05-10 10:10:53,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 18: [2023-05-10 10:10:53,515] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 30: [2023-05-10 10:10:53,515] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 3: [2023-05-10 10:10:53,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 15: [2023-05-10 10:10:53,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 30: [2023-05-10 10:10:53,515] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 30: [2023-05-10 10:10:53,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 3: [2023-05-10 10:10:53,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 30: [2023-05-10 10:10:53,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 15: [2023-05-10 10:10:53,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 30: [2023-05-10 10:10:53,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 9: [2023-05-10 10:10:53,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 30: [2023-05-10 10:10:53,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 30: [2023-05-10 10:10:53,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 9: [2023-05-10 10:10:53,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 9: [2023-05-10 10:10:53,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 30: [2023-05-10 10:10:53,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 15: [2023-05-10 10:10:53,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 15: [2023-05-10 10:10:53,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 3: [2023-05-10 10:10:53,517] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 26: [2023-05-10 10:10:53,517] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 10: [2023-05-10 10:10:53,518] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 10: [2023-05-10 10:10:53,518] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 30: [2023-05-10 10:10:53,519] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 26: [2023-05-10 10:10:53,520] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 18: [2023-05-10 10:10:53,519] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 18: [2023-05-10 10:10:53,519] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 18: [2023-05-10 10:10:53,520] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 10: [2023-05-10 10:10:53,520] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 31: [2023-05-10 10:10:53,520] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 10: [2023-05-10 10:10:53,520] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 6: [2023-05-10 10:10:53,520] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 18: [2023-05-10 10:10:53,520] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 31: [2023-05-10 10:10:53,521] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 1: [2023-05-10 10:10:53,521] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 1: [2023-05-10 10:10:53,521] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 1: [2023-05-10 10:10:53,521] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 1: [2023-05-10 10:10:53,522] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 18: [2023-05-10 10:10:53,522] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 18: [2023-05-10 10:10:53,522] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 1: [2023-05-10 10:10:53,523] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 18: [2023-05-10 10:10:53,523] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 1: [2023-05-10 10:10:53,523] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 1: [2023-05-10 10:10:53,523] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 18: [2023-05-10 10:10:53,523] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 6: [2023-05-10 10:10:53,524] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 1: [2023-05-10 10:10:53,524] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 6: [2023-05-10 10:10:53,524] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 30: [2023-05-10 10:10:53,525] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 6: [2023-05-10 10:10:53,526] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 30: [2023-05-10 10:10:53,526] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 29: [2023-05-10 10:10:53,527] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 30: [2023-05-10 10:10:53,527] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 30: [2023-05-10 10:10:53,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 30: [2023-05-10 10:10:53,529] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 30: [2023-05-10 10:10:53,529] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 9: [2023-05-10 10:10:53,529] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 30: [2023-05-10 10:10:53,529] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 7: [2023-05-10 10:10:53,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 23: [2023-05-10 10:10:53,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 23: [2023-05-10 10:10:53,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 23: [2023-05-10 10:10:53,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 6: [2023-05-10 10:10:53,531] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 17: [2023-05-10 10:10:53,537] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 29: [2023-05-10 10:10:53,537] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 6: [2023-05-10 10:10:53,536] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 6: [2023-05-10 10:10:53,536] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 6: [2023-05-10 10:10:53,537] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 23: [2023-05-10 10:10:53,537] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 23: [2023-05-10 10:10:53,538] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 7: [2023-05-10 10:10:53,538] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 7: [2023-05-10 10:10:53,538] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 7: [2023-05-10 10:10:53,538] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 4: [2023-05-10 10:10:53,539] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 4: [2023-05-10 10:10:53,539] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 4: [2023-05-10 10:10:53,540] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 4: [2023-05-10 10:10:53,540] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 4: [2023-05-10 10:10:53,540] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 4: [2023-05-10 10:10:53,540] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 4: [2023-05-10 10:10:53,540] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 4: [2023-05-10 10:10:53,540] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 23: [2023-05-10 10:10:53,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 9: [2023-05-10 10:10:53,541] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 31: [2023-05-10 10:10:53,541] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 31: [2023-05-10 10:10:53,541] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 7: [2023-05-10 10:10:53,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 7: [2023-05-10 10:10:53,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 7: [2023-05-10 10:10:53,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 7: [2023-05-10 10:10:53,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 9: [2023-05-10 10:10:53,542] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 16: [2023-05-10 10:10:53,542] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 31: [2023-05-10 10:10:53,542] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 9: [2023-05-10 10:10:53,544] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 6: [2023-05-10 10:10:53,545] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 4: [2023-05-10 10:10:53,545] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 4: [2023-05-10 10:10:53,545] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 4: [2023-05-10 10:10:53,546] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 4: [2023-05-10 10:10:53,546] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 31: [2023-05-10 10:10:53,546] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 31: [2023-05-10 10:10:53,546] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 31: [2023-05-10 10:10:53,546] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 31: [2023-05-10 10:10:53,546] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 31: [2023-05-10 10:10:53,546] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 4: [2023-05-10 10:10:53,546] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 4: [2023-05-10 10:10:53,546] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 9: [2023-05-10 10:10:53,547] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 4: [2023-05-10 10:10:53,547] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 9: [2023-05-10 10:10:53,547] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 9: [2023-05-10 10:10:53,547] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 9: [2023-05-10 10:10:53,547] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 16: [2023-05-10 10:10:53,547] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 4: [2023-05-10 10:10:53,548] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 29: [2023-05-10 10:10:53,548] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 19: [2023-05-10 10:10:53,550] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 19: [2023-05-10 10:10:53,550] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 19: [2023-05-10 10:10:53,550] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 24: [2023-05-10 10:10:53,552] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 29: [2023-05-10 10:10:53,553] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 24: [2023-05-10 10:10:53,552] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 23: [2023-05-10 10:10:53,553] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 7: [2023-05-10 10:10:53,554] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 0: [2023-05-10 10:10:53,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 24: [2023-05-10 10:10:53,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 24: [2023-05-10 10:10:53,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 24: [2023-05-10 10:10:53,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 24: [2023-05-10 10:10:53,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 0: [2023-05-10 10:10:53,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 0: [2023-05-10 10:10:53,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 0: [2023-05-10 10:10:53,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 0: [2023-05-10 10:10:53,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 0: [2023-05-10 10:10:53,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 0: [2023-05-10 10:10:53,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 0: [2023-05-10 10:10:53,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 24: [2023-05-10 10:10:53,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 24: [2023-05-10 10:10:53,556] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 24: [2023-05-10 10:10:53,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 6: [2023-05-10 10:10:53,556] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 6: [2023-05-10 10:10:53,556] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 28: [2023-05-10 10:10:53,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 28: [2023-05-10 10:10:53,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 23: [2023-05-10 10:10:53,557] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 2: [2023-05-10 10:10:53,557] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 2: [2023-05-10 10:10:53,557] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 24: [2023-05-10 10:10:53,557] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 23: [2023-05-10 10:10:53,557] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 28: [2023-05-10 10:10:53,557] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 28: [2023-05-10 10:10:53,557] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 28: [2023-05-10 10:10:53,557] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 28: [2023-05-10 10:10:53,557] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 17: [2023-05-10 10:10:53,557] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 28: [2023-05-10 10:10:53,557] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 5: [2023-05-10 10:10:53,557] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 5: [2023-05-10 10:10:53,557] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 28: [2023-05-10 10:10:53,557] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 5: [2023-05-10 10:10:53,558] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 29: [2023-05-10 10:10:53,558] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 23: [2023-05-10 10:10:53,558] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 11: [2023-05-10 10:10:53,558] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 11: [2023-05-10 10:10:53,558] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 5: [2023-05-10 10:10:53,558] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 5: [2023-05-10 10:10:53,558] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 5: [2023-05-10 10:10:53,558] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 5: [2023-05-10 10:10:53,558] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 5: [2023-05-10 10:10:53,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 11: [2023-05-10 10:10:53,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 11: [2023-05-10 10:10:53,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 11: [2023-05-10 10:10:53,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 11: [2023-05-10 10:10:53,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 11: [2023-05-10 10:10:53,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 2: [2023-05-10 10:10:53,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 2: [2023-05-10 10:10:53,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 2: [2023-05-10 10:10:53,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 2: [2023-05-10 10:10:53,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 2: [2023-05-10 10:10:53,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 11: [2023-05-10 10:10:53,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 0: [2023-05-10 10:10:53,559] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 2: [2023-05-10 10:10:53,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 21: [2023-05-10 10:10:53,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 21: [2023-05-10 10:10:53,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 0: [2023-05-10 10:10:53,560] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 23: [2023-05-10 10:10:53,560] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 0: [2023-05-10 10:10:53,560] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 29: [2023-05-10 10:10:53,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 0: [2023-05-10 10:10:53,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 28: [2023-05-10 10:10:53,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 28: [2023-05-10 10:10:53,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 21: [2023-05-10 10:10:53,561] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 2: [2023-05-10 10:10:53,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 2: [2023-05-10 10:10:53,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 0: [2023-05-10 10:10:53,562] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 0: [2023-05-10 10:10:53,562] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 24: [2023-05-10 10:10:53,562] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 0: [2023-05-10 10:10:53,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 24: [2023-05-10 10:10:53,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 0: [2023-05-10 10:10:53,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 24: [2023-05-10 10:10:53,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 11: [2023-05-10 10:10:53,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 11: [2023-05-10 10:10:53,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 24: [2023-05-10 10:10:53,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 7: [2023-05-10 10:10:53,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 5: [2023-05-10 10:10:53,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 19: [2023-05-10 10:10:53,563] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 19: [2023-05-10 10:10:53,563] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 5: [2023-05-10 10:10:53,564] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 24: [2023-05-10 10:10:53,564] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 7: [2023-05-10 10:10:53,564] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 24: [2023-05-10 10:10:53,565] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 5: [2023-05-10 10:10:53,565] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 7: [2023-05-10 10:10:53,566] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 3: [2023-05-10 10:10:53,566] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 3: [2023-05-10 10:10:53,566] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 3: [2023-05-10 10:10:53,566] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 29: [2023-05-10 10:10:53,566] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 29: [2023-05-10 10:10:53,566] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 29: [2023-05-10 10:10:53,566] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 29: [2023-05-10 10:10:53,566] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 28: [2023-05-10 10:10:53,566] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 23: [2023-05-10 10:10:53,567] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 23: [2023-05-10 10:10:53,567] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 28: [2023-05-10 10:10:53,567] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 28: [2023-05-10 10:10:53,567] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 6: [2023-05-10 10:10:53,567] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 5: [2023-05-10 10:10:53,567] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 16: [2023-05-10 10:10:53,567] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 28: [2023-05-10 10:10:53,567] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 5: [2023-05-10 10:10:53,568] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 17: [2023-05-10 10:10:53,568] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 16: [2023-05-10 10:10:53,568] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 7: [2023-05-10 10:10:53,568] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 5: [2023-05-10 10:10:53,568] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 23: [2023-05-10 10:10:53,568] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 28: [2023-05-10 10:10:53,569] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 7: [2023-05-10 10:10:53,568] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 11: [2023-05-10 10:10:53,568] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 19: [2023-05-10 10:10:53,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 19: [2023-05-10 10:10:53,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 19: [2023-05-10 10:10:53,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 28: [2023-05-10 10:10:53,569] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 30: [2023-05-10 10:10:53,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 16: [2023-05-10 10:10:53,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 16: [2023-05-10 10:10:53,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 16: [2023-05-10 10:10:53,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 16: [2023-05-10 10:10:53,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 16: [2023-05-10 10:10:53,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 5: [2023-05-10 10:10:53,569] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 2: [2023-05-10 10:10:53,569] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 17: [2023-05-10 10:10:53,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 5: [2023-05-10 10:10:53,569] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 11: [2023-05-10 10:10:53,570] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 6: [2023-05-10 10:10:53,570] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 11: [2023-05-10 10:10:53,570] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 3: [2023-05-10 10:10:53,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 7: [2023-05-10 10:10:53,570] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 6: [2023-05-10 10:10:53,570] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 2: [2023-05-10 10:10:53,570] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 11: [2023-05-10 10:10:53,570] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 2: [2023-05-10 10:10:53,571] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 2: [2023-05-10 10:10:53,571] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 7: [2023-05-10 10:10:53,571] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 2: [2023-05-10 10:10:53,571] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 11: [2023-05-10 10:10:53,571] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 11: [2023-05-10 10:10:53,572] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 2: [2023-05-10 10:10:53,572] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 21: [2023-05-10 10:10:53,572] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 21: [2023-05-10 10:10:53,574] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 21: [2023-05-10 10:10:53,574] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 21: [2023-05-10 10:10:53,574] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 21: [2023-05-10 10:10:53,574] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 16: [2023-05-10 10:10:53,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 18: [2023-05-10 10:10:53,574] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 19: [2023-05-10 10:10:53,575] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 19: [2023-05-10 10:10:53,575] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 17: [2023-05-10 10:10:53,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 17: [2023-05-10 10:10:53,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 17: [2023-05-10 10:10:53,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 17: [2023-05-10 10:10:53,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 17: [2023-05-10 10:10:53,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 3: [2023-05-10 10:10:53,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 1: [2023-05-10 10:10:53,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 1: [2023-05-10 10:10:53,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 31: [2023-05-10 10:10:53,576] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 20: [2023-05-10 10:10:53,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 20: [2023-05-10 10:10:53,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 20: [2023-05-10 10:10:53,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 29: [2023-05-10 10:10:53,577] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 20: [2023-05-10 10:10:53,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 20: [2023-05-10 10:10:53,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 1: [2023-05-10 10:10:53,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 19: [2023-05-10 10:10:53,577] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 20: [2023-05-10 10:10:53,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 20: [2023-05-10 10:10:53,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 20: [2023-05-10 10:10:53,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 1: [2023-05-10 10:10:53,578] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 18: [2023-05-10 10:10:53,578] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 18: [2023-05-10 10:10:53,578] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 18: [2023-05-10 10:10:53,579] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 20: [2023-05-10 10:10:53,580] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 31: [2023-05-10 10:10:53,581] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 31: [2023-05-10 10:10:53,581] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 31: [2023-05-10 10:10:53,581] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 31: [2023-05-10 10:10:53,582] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 31: [2023-05-10 10:10:53,582] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 18: [2023-05-10 10:10:53,583] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 18: [2023-05-10 10:10:53,583] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 18: [2023-05-10 10:10:53,583] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 18: [2023-05-10 10:10:53,583] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 23: [2023-05-10 10:10:53,584] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 21: [2023-05-10 10:10:53,585] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 21: [2023-05-10 10:10:53,585] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 29: [2023-05-10 10:10:53,585] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 1: [2023-05-10 10:10:53,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 23: [2023-05-10 10:10:53,586] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 20: [2023-05-10 10:10:53,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 20: [2023-05-10 10:10:53,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 30: [2023-05-10 10:10:53,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 3: [2023-05-10 10:10:53,588] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 3: [2023-05-10 10:10:53,588] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 3: [2023-05-10 10:10:53,588] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 20: [2023-05-10 10:10:53,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 3: [2023-05-10 10:10:53,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 20: [2023-05-10 10:10:53,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 20: [2023-05-10 10:10:53,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 20: [2023-05-10 10:10:53,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 3: [2023-05-10 10:10:53,590] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 3: [2023-05-10 10:10:53,590] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 20: [2023-05-10 10:10:53,590] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 19: [2023-05-10 10:10:53,591] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 21: [2023-05-10 10:10:53,591] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 1: [2023-05-10 10:10:53,592] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 1: [2023-05-10 10:10:53,592] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 1: [2023-05-10 10:10:53,592] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 18: [2023-05-10 10:10:53,594] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 30: [2023-05-10 10:10:53,595] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 16: [2023-05-10 10:10:53,597] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 17: [2023-05-10 10:10:53,597] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 3: [2023-05-10 10:10:53,597] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 17: [2023-05-10 10:10:53,597] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 29: [2023-05-10 10:10:53,598] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 19: [2023-05-10 10:10:53,598] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 29: [2023-05-10 10:10:53,598] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 19: [2023-05-10 10:10:53,599] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 24: [2023-05-10 10:10:53,599] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 16: [2023-05-10 10:10:53,599] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 19: [2023-05-10 10:10:53,599] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 29: [2023-05-10 10:10:53,599] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 1: [2023-05-10 10:10:53,600] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 1: [2023-05-10 10:10:53,600] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 29: [2023-05-10 10:10:53,600] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 19: [2023-05-10 10:10:53,602] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 1: [2023-05-10 10:10:53,602] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 30: [2023-05-10 10:10:53,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 3: [2023-05-10 10:10:53,603] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 4: [2023-05-10 10:10:53,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 4: [2023-05-10 10:10:53,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 4: [2023-05-10 10:10:53,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 4: [2023-05-10 10:10:53,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 16: [2023-05-10 10:10:53,604] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 16: [2023-05-10 10:10:53,604] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 1: [2023-05-10 10:10:53,604] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 21: [2023-05-10 10:10:53,605] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 30: [2023-05-10 10:10:53,605] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 21: [2023-05-10 10:10:53,605] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 21: [2023-05-10 10:10:53,606] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 21: [2023-05-10 10:10:53,606] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 16: [2023-05-10 10:10:53,606] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 21: [2023-05-10 10:10:53,606] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 18: [2023-05-10 10:10:53,606] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 16: [2023-05-10 10:10:53,607] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 18: [2023-05-10 10:10:53,606] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 18: [2023-05-10 10:10:53,607] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 17: [2023-05-10 10:10:53,608] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 4: [2023-05-10 10:10:53,608] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 4: [2023-05-10 10:10:53,608] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 4: [2023-05-10 10:10:53,608] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 4: [2023-05-10 10:10:53,608] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 17: [2023-05-10 10:10:53,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 17: [2023-05-10 10:10:53,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 30: [2023-05-10 10:10:53,611] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 30: [2023-05-10 10:10:53,611] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 30: [2023-05-10 10:10:53,611] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 0: [2023-05-10 10:10:53,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 2: [2023-05-10 10:10:53,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 2: [2023-05-10 10:10:53,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 24: [2023-05-10 10:10:53,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 28: [2023-05-10 10:10:53,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 28: [2023-05-10 10:10:53,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 17: [2023-05-10 10:10:53,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 17: [2023-05-10 10:10:53,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 18: [2023-05-10 10:10:53,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 3: [2023-05-10 10:10:53,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 18: [2023-05-10 10:10:53,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 1: [2023-05-10 10:10:53,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 11: [2023-05-10 10:10:53,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 11: [2023-05-10 10:10:53,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 18: [2023-05-10 10:10:53,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 18: [2023-05-10 10:10:53,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 3: [2023-05-10 10:10:53,617] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 3: [2023-05-10 10:10:53,617] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 5: [2023-05-10 10:10:53,617] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 5: [2023-05-10 10:10:53,617] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 24: [2023-05-10 10:10:53,620] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 1: [2023-05-10 10:10:53,621] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 1: [2023-05-10 10:10:53,621] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 0: [2023-05-10 10:10:53,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 5: [2023-05-10 10:10:53,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 0: [2023-05-10 10:10:53,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 1: [2023-05-10 10:10:53,624] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 30: [2023-05-10 10:10:53,624] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 30: [2023-05-10 10:10:53,626] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 0: [2023-05-10 10:10:53,627] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 4: [2023-05-10 10:10:53,627] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 4: [2023-05-10 10:10:53,628] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 11: [2023-05-10 10:10:53,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 20: [2023-05-10 10:10:53,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 4: [2023-05-10 10:10:53,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 4: [2023-05-10 10:10:53,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 2: [2023-05-10 10:10:53,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 2: [2023-05-10 10:10:53,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 30: [2023-05-10 10:10:53,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 28: [2023-05-10 10:10:53,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 28: [2023-05-10 10:10:53,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 0: [2023-05-10 10:10:53,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 5: [2023-05-10 10:10:53,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 5: [2023-05-10 10:10:53,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 5: [2023-05-10 10:10:53,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 5: [2023-05-10 10:10:53,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 5: [2023-05-10 10:10:53,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 0: [2023-05-10 10:10:53,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 0: [2023-05-10 10:10:53,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 0: [2023-05-10 10:10:53,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 0: [2023-05-10 10:10:53,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 4: [2023-05-10 10:10:53,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 13: [2023-05-10 10:10:53,636] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 24: [2023-05-10 10:10:53,636] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 13: [2023-05-10 10:10:53,636] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 13: [2023-05-10 10:10:53,636] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 13: [2023-05-10 10:10:53,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 13: [2023-05-10 10:10:53,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 13: [2023-05-10 10:10:53,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 13: [2023-05-10 10:10:53,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 13: [2023-05-10 10:10:53,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 4: [2023-05-10 10:10:53,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 24: [2023-05-10 10:10:53,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 24: [2023-05-10 10:10:53,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 24: [2023-05-10 10:10:53,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 24: [2023-05-10 10:10:53,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 24: [2023-05-10 10:10:53,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 24: [2023-05-10 10:10:53,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 28: [2023-05-10 10:10:53,638] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 28: [2023-05-10 10:10:53,638] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 4: [2023-05-10 10:10:53,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 28: [2023-05-10 10:10:53,638] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 28: [2023-05-10 10:10:53,638] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 28: [2023-05-10 10:10:53,638] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 28: [2023-05-10 10:10:53,638] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 11: [2023-05-10 10:10:53,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 11: [2023-05-10 10:10:53,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 2: [2023-05-10 10:10:53,639] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 2: [2023-05-10 10:10:53,639] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 2: [2023-05-10 10:10:53,639] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 4: [2023-05-10 10:10:53,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 30: [2023-05-10 10:10:53,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 5: [2023-05-10 10:10:53,641] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 5: [2023-05-10 10:10:53,641] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 22: [2023-05-10 10:10:53,641] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 22: [2023-05-10 10:10:53,642] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 2: [2023-05-10 10:10:53,641] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 2: [2023-05-10 10:10:53,641] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 22: [2023-05-10 10:10:53,642] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 22: [2023-05-10 10:10:53,642] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 22: [2023-05-10 10:10:53,642] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 22: [2023-05-10 10:10:53,642] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 22: [2023-05-10 10:10:53,642] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 22: [2023-05-10 10:10:53,642] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 11: [2023-05-10 10:10:53,643] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 11: [2023-05-10 10:10:53,643] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 11: [2023-05-10 10:10:53,643] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 11: [2023-05-10 10:10:53,644] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 11: [2023-05-10 10:10:53,644] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 5: [2023-05-10 10:10:53,645] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 22: [2023-05-10 10:10:53,645] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 0: [2023-05-10 10:10:53,646] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 30: [2023-05-10 10:10:53,647] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 30: [2023-05-10 10:10:53,647] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 30: [2023-05-10 10:10:53,647] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 22: [2023-05-10 10:10:53,648] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 13: [2023-05-10 10:10:53,649] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 13: [2023-05-10 10:10:53,649] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 13: [2023-05-10 10:10:53,649] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 20: [2023-05-10 10:10:53,650] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 13: [2023-05-10 10:10:53,650] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 13: [2023-05-10 10:10:53,650] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 0: [2023-05-10 10:10:53,650] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 13: [2023-05-10 10:10:53,651] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 13: [2023-05-10 10:10:53,652] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 13: [2023-05-10 10:10:53,652] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 22: [2023-05-10 10:10:53,653] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 22: [2023-05-10 10:10:53,654] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 22: [2023-05-10 10:10:53,654] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 22: [2023-05-10 10:10:53,654] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 0: [2023-05-10 10:10:53,654] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 30: [2023-05-10 10:10:53,654] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 22: [2023-05-10 10:10:53,654] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 22: [2023-05-10 10:10:53,655] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt... 2: [2023-05-10 10:10:53,655] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 11: [2023-05-10 10:10:53,656] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 20: [2023-05-10 10:10:53,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 20: [2023-05-10 10:10:53,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 20: [2023-05-10 10:10:53,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 0: [2023-05-10 10:10:53,662] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 0: [2023-05-10 10:10:53,663] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 0: [2023-05-10 10:10:53,665] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 0: [2023-05-10 10:10:53,665] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 2: [2023-05-10 10:10:53,666] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 5: [2023-05-10 10:10:53,667] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 5: [2023-05-10 10:10:53,667] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 5: [2023-05-10 10:10:53,668] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 24: [2023-05-10 10:10:53,668] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 24: [2023-05-10 10:10:53,668] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 5: [2023-05-10 10:10:53,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 5: [2023-05-10 10:10:53,670] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 2: [2023-05-10 10:10:53,670] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 20: [2023-05-10 10:10:53,671] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 20: [2023-05-10 10:10:53,671] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 20: [2023-05-10 10:10:53,671] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 28: [2023-05-10 10:10:53,672] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 11: [2023-05-10 10:10:53,672] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 28: [2023-05-10 10:10:53,672] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 2: [2023-05-10 10:10:53,673] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 28: [2023-05-10 10:10:53,674] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 24: [2023-05-10 10:10:53,675] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 28: [2023-05-10 10:10:53,675] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 8: [2023-05-10 10:10:53,675] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 28: [2023-05-10 10:10:53,675] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 8: [2023-05-10 10:10:53,675] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 8: [2023-05-10 10:10:53,675] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 24: [2023-05-10 10:10:53,675] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 8: [2023-05-10 10:10:53,675] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 8: [2023-05-10 10:10:53,675] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 8: [2023-05-10 10:10:53,675] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 8: [2023-05-10 10:10:53,675] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 2: [2023-05-10 10:10:53,675] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 24: [2023-05-10 10:10:53,675] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 8: [2023-05-10 10:10:53,676] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 24: [2023-05-10 10:10:53,676] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 28: [2023-05-10 10:10:53,676] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 2: [2023-05-10 10:10:53,679] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 11: [2023-05-10 10:10:53,680] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 11: [2023-05-10 10:10:53,681] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 8: [2023-05-10 10:10:53,681] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 8: [2023-05-10 10:10:53,681] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 8: [2023-05-10 10:10:53,682] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 8: [2023-05-10 10:10:53,682] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 8: [2023-05-10 10:10:53,682] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 11: [2023-05-10 10:10:53,682] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 8: [2023-05-10 10:10:53,682] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 11: [2023-05-10 10:10:53,682] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 8: [2023-05-10 10:10:53,682] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 8: [2023-05-10 10:10:53,683] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 2: [2023-05-10 10:10:53,685] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 22: [2023-05-10 10:10:53,691] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 22: [2023-05-10 10:10:53,707] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 22: [2023-05-10 10:10:53,714] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 20: [2023-05-10 10:10:53,671] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 20: [2023-05-10 10:10:53,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 20: [2023-05-10 10:10:53,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 20: [2023-05-10 10:10:53,696] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 20: [2023-05-10 10:10:53,707] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 20: [2023-05-10 10:10:53,709] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 20: [2023-05-10 10:10:53,710] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 20: [2023-05-10 10:10:53,710] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 22: [2023-05-10 10:10:53,723] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 22: [2023-05-10 10:10:53,723] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 22: [2023-05-10 10:10:53,727] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 22: [2023-05-10 10:10:53,727] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 22: [2023-05-10 10:10:53,727] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 22: [2023-05-10 10:10:53,727] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 13: [2023-05-10 10:10:53,729] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 13: [2023-05-10 10:10:53,729] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 13: [2023-05-10 10:10:53,729] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 13: [2023-05-10 10:10:53,729] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 22: [2023-05-10 10:10:53,733] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 8: [2023-05-10 10:10:53,735] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 8: [2023-05-10 10:10:53,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 8: [2023-05-10 10:10:53,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 8: [2023-05-10 10:10:53,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 13: [2023-05-10 10:10:53,745] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 13: [2023-05-10 10:10:53,745] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 13: [2023-05-10 10:10:53,745] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 13: [2023-05-10 10:10:53,745] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_16-model_00-model_states.pt. 22: [2023-05-10 10:10:53,748] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 8: [2023-05-10 10:10:53,750] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 8: [2023-05-10 10:10:53,750] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 8: [2023-05-10 10:10:53,750] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 8: [2023-05-10 10:10:53,750] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 22: [2023-05-10 10:10:53,754] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 13: [2023-05-10 10:10:53,759] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 22: [2023-05-10 10:10:53,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 22: [2023-05-10 10:10:53,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 13: [2023-05-10 10:10:53,759] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 8: [2023-05-10 10:10:53,761] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 13: [2023-05-10 10:10:53,761] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 22: [2023-05-10 10:10:53,763] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 22: [2023-05-10 10:10:53,763] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 14: [2023-05-10 10:10:53,764] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 14: [2023-05-10 10:10:53,764] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 14: [2023-05-10 10:10:53,765] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 14: [2023-05-10 10:10:53,765] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 14: [2023-05-10 10:10:53,765] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 14: [2023-05-10 10:10:53,765] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 14: [2023-05-10 10:10:53,765] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 14: [2023-05-10 10:10:53,765] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 15: [2023-05-10 10:10:53,766] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 15: [2023-05-10 10:10:53,766] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 15: [2023-05-10 10:10:53,766] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 15: [2023-05-10 10:10:53,766] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 15: [2023-05-10 10:10:53,766] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 8: [2023-05-10 10:10:53,766] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 15: [2023-05-10 10:10:53,766] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 15: [2023-05-10 10:10:53,766] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 15: [2023-05-10 10:10:53,766] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 13: [2023-05-10 10:10:53,767] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 8: [2023-05-10 10:10:53,768] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 8: [2023-05-10 10:10:53,768] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 15: [2023-05-10 10:10:53,769] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 15: [2023-05-10 10:10:53,770] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 14: [2023-05-10 10:10:53,769] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 14: [2023-05-10 10:10:53,770] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 14: [2023-05-10 10:10:53,771] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 14: [2023-05-10 10:10:53,771] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 14: [2023-05-10 10:10:53,772] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 14: [2023-05-10 10:10:53,773] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 14: [2023-05-10 10:10:53,774] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 14: [2023-05-10 10:10:53,774] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 8: [2023-05-10 10:10:53,775] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 8: [2023-05-10 10:10:53,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 15: [2023-05-10 10:10:53,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 15: [2023-05-10 10:10:53,777] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 15: [2023-05-10 10:10:53,777] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 8: [2023-05-10 10:10:53,777] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 13: [2023-05-10 10:10:53,777] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 15: [2023-05-10 10:10:53,777] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 15: [2023-05-10 10:10:53,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 8: [2023-05-10 10:10:53,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 15: [2023-05-10 10:10:53,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 13: [2023-05-10 10:10:53,779] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 13: [2023-05-10 10:10:53,780] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 13: [2023-05-10 10:10:53,783] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 12: [2023-05-10 10:10:53,797] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 12: [2023-05-10 10:10:53,797] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 12: [2023-05-10 10:10:53,797] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 12: [2023-05-10 10:10:53,797] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 12: [2023-05-10 10:10:53,797] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 12: [2023-05-10 10:10:53,797] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 12: [2023-05-10 10:10:53,798] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 12: [2023-05-10 10:10:53,798] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 12: [2023-05-10 10:10:53,801] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 12: [2023-05-10 10:10:53,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 27: [2023-05-10 10:10:53,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 27: [2023-05-10 10:10:53,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 27: [2023-05-10 10:10:53,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 27: [2023-05-10 10:10:53,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 27: [2023-05-10 10:10:53,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 27: [2023-05-10 10:10:53,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 27: [2023-05-10 10:10:53,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 27: [2023-05-10 10:10:53,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 12: [2023-05-10 10:10:53,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 12: [2023-05-10 10:10:53,810] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 12: [2023-05-10 10:10:53,810] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 12: [2023-05-10 10:10:53,810] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 12: [2023-05-10 10:10:53,811] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 12: [2023-05-10 10:10:53,811] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 27: [2023-05-10 10:10:53,812] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 15: [2023-05-10 10:10:53,812] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 27: [2023-05-10 10:10:53,814] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 27: [2023-05-10 10:10:53,814] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 27: [2023-05-10 10:10:53,817] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 27: [2023-05-10 10:10:53,817] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 14: [2023-05-10 10:10:53,817] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 27: [2023-05-10 10:10:53,817] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 27: [2023-05-10 10:10:53,818] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 27: [2023-05-10 10:10:53,818] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 26: [2023-05-10 10:10:53,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 26: [2023-05-10 10:10:53,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 26: [2023-05-10 10:10:53,823] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 26: [2023-05-10 10:10:53,823] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 26: [2023-05-10 10:10:53,823] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 26: [2023-05-10 10:10:53,823] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 26: [2023-05-10 10:10:53,823] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 26: [2023-05-10 10:10:53,823] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 15: [2023-05-10 10:10:53,824] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 14: [2023-05-10 10:10:53,827] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 7: [2023-05-10 10:10:53,827] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 7: [2023-05-10 10:10:53,827] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 7: [2023-05-10 10:10:53,827] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 7: [2023-05-10 10:10:53,827] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 7: [2023-05-10 10:10:53,827] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 7: [2023-05-10 10:10:53,827] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 7: [2023-05-10 10:10:53,827] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 26: [2023-05-10 10:10:53,827] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 26: [2023-05-10 10:10:53,827] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 7: [2023-05-10 10:10:53,828] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 26: [2023-05-10 10:10:53,829] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 14: [2023-05-10 10:10:53,831] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 7: [2023-05-10 10:10:53,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 7: [2023-05-10 10:10:53,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 7: [2023-05-10 10:10:53,833] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 26: [2023-05-10 10:10:53,833] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 26: [2023-05-10 10:10:53,833] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 7: [2023-05-10 10:10:53,833] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 15: [2023-05-10 10:10:53,834] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 26: [2023-05-10 10:10:53,834] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 26: [2023-05-10 10:10:53,834] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 26: [2023-05-10 10:10:53,834] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 7: [2023-05-10 10:10:53,835] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 7: [2023-05-10 10:10:53,836] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 7: [2023-05-10 10:10:53,836] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 7: [2023-05-10 10:10:53,837] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 14: [2023-05-10 10:10:53,840] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 14: [2023-05-10 10:10:53,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 14: [2023-05-10 10:10:53,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 14: [2023-05-10 10:10:53,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 14: [2023-05-10 10:10:53,841] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 14: [2023-05-10 10:10:53,841] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 6: [2023-05-10 10:10:53,842] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 6: [2023-05-10 10:10:53,842] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 6: [2023-05-10 10:10:53,842] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 6: [2023-05-10 10:10:53,842] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 6: [2023-05-10 10:10:53,842] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 6: [2023-05-10 10:10:53,842] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 6: [2023-05-10 10:10:53,842] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 6: [2023-05-10 10:10:53,843] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 15: [2023-05-10 10:10:53,843] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 6: [2023-05-10 10:10:53,846] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 6: [2023-05-10 10:10:53,846] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 15: [2023-05-10 10:10:53,848] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 15: [2023-05-10 10:10:53,848] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 6: [2023-05-10 10:10:53,849] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 15: [2023-05-10 10:10:53,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 15: [2023-05-10 10:10:53,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 12: [2023-05-10 10:10:53,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 15: [2023-05-10 10:10:53,852] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 15: [2023-05-10 10:10:53,852] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 6: [2023-05-10 10:10:53,852] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 6: [2023-05-10 10:10:53,853] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 14: [2023-05-10 10:10:53,853] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 6: [2023-05-10 10:10:53,853] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 6: [2023-05-10 10:10:53,854] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 6: [2023-05-10 10:10:53,854] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 14: [2023-05-10 10:10:53,859] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 16: [2023-05-10 10:10:53,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 27: [2023-05-10 10:10:53,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 16: [2023-05-10 10:10:53,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 16: [2023-05-10 10:10:53,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 16: [2023-05-10 10:10:53,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 16: [2023-05-10 10:10:53,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 16: [2023-05-10 10:10:53,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 16: [2023-05-10 10:10:53,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 16: [2023-05-10 10:10:53,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 16: [2023-05-10 10:10:53,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 25: [2023-05-10 10:10:53,869] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 25: [2023-05-10 10:10:53,869] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 25: [2023-05-10 10:10:53,869] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 25: [2023-05-10 10:10:53,869] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 25: [2023-05-10 10:10:53,869] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 31: [2023-05-10 10:10:53,869] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 31: [2023-05-10 10:10:53,869] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 31: [2023-05-10 10:10:53,869] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 31: [2023-05-10 10:10:53,869] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 31: [2023-05-10 10:10:53,869] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 16: [2023-05-10 10:10:53,870] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 25: [2023-05-10 10:10:53,869] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 25: [2023-05-10 10:10:53,869] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 31: [2023-05-10 10:10:53,869] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 31: [2023-05-10 10:10:53,869] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 25: [2023-05-10 10:10:53,870] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 31: [2023-05-10 10:10:53,870] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 27: [2023-05-10 10:10:53,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 27: [2023-05-10 10:10:53,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 14: [2023-05-10 10:10:53,872] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 16: [2023-05-10 10:10:53,872] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 16: [2023-05-10 10:10:53,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 16: [2023-05-10 10:10:53,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 16: [2023-05-10 10:10:53,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 16: [2023-05-10 10:10:53,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 31: [2023-05-10 10:10:53,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 16: [2023-05-10 10:10:53,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 31: [2023-05-10 10:10:53,875] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 12: [2023-05-10 10:10:53,875] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 14: [2023-05-10 10:10:53,875] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 14: [2023-05-10 10:10:53,875] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 25: [2023-05-10 10:10:53,876] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 25: [2023-05-10 10:10:53,876] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 25: [2023-05-10 10:10:53,876] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 25: [2023-05-10 10:10:53,876] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 25: [2023-05-10 10:10:53,877] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 25: [2023-05-10 10:10:53,877] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 25: [2023-05-10 10:10:53,878] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 29: [2023-05-10 10:10:53,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 25: [2023-05-10 10:10:53,878] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 29: [2023-05-10 10:10:53,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 29: [2023-05-10 10:10:53,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 29: [2023-05-10 10:10:53,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 29: [2023-05-10 10:10:53,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 29: [2023-05-10 10:10:53,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 29: [2023-05-10 10:10:53,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 29: [2023-05-10 10:10:53,879] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 31: [2023-05-10 10:10:53,879] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 31: [2023-05-10 10:10:53,879] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 31: [2023-05-10 10:10:53,879] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 31: [2023-05-10 10:10:53,879] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 14: [2023-05-10 10:10:53,880] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 14: [2023-05-10 10:10:53,880] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 15: [2023-05-10 10:10:53,880] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 31: [2023-05-10 10:10:53,880] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 31: [2023-05-10 10:10:53,880] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 26: [2023-05-10 10:10:53,880] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 26: [2023-05-10 10:10:53,880] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 12: [2023-05-10 10:10:53,880] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 12: [2023-05-10 10:10:53,881] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 7: [2023-05-10 10:10:53,882] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 29: [2023-05-10 10:10:53,882] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 15: [2023-05-10 10:10:53,882] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 29: [2023-05-10 10:10:53,883] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 27: [2023-05-10 10:10:53,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 29: [2023-05-10 10:10:53,883] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 30: [2023-05-10 10:10:53,884] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 27: [2023-05-10 10:10:53,884] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 27: [2023-05-10 10:10:53,884] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 27: [2023-05-10 10:10:53,884] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 27: [2023-05-10 10:10:53,884] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 30: [2023-05-10 10:10:53,884] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 30: [2023-05-10 10:10:53,884] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 30: [2023-05-10 10:10:53,885] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 30: [2023-05-10 10:10:53,885] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 30: [2023-05-10 10:10:53,885] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 30: [2023-05-10 10:10:53,885] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 30: [2023-05-10 10:10:53,885] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 15: [2023-05-10 10:10:53,885] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 15: [2023-05-10 10:10:53,885] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 27: [2023-05-10 10:10:53,885] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 15: [2023-05-10 10:10:53,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 15: [2023-05-10 10:10:53,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 29: [2023-05-10 10:10:53,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 29: [2023-05-10 10:10:53,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 29: [2023-05-10 10:10:53,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 26: [2023-05-10 10:10:53,887] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 29: [2023-05-10 10:10:53,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 30: [2023-05-10 10:10:53,888] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 29: [2023-05-10 10:10:53,888] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 12: [2023-05-10 10:10:53,892] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 12: [2023-05-10 10:10:53,892] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 12: [2023-05-10 10:10:53,892] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 12: [2023-05-10 10:10:53,892] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 12: [2023-05-10 10:10:53,892] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 30: [2023-05-10 10:10:53,892] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 7: [2023-05-10 10:10:53,893] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 7: [2023-05-10 10:10:53,893] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 7: [2023-05-10 10:10:53,893] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 7: [2023-05-10 10:10:53,893] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 30: [2023-05-10 10:10:53,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 30: [2023-05-10 10:10:53,896] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 30: [2023-05-10 10:10:53,897] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 27: [2023-05-10 10:10:53,897] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 30: [2023-05-10 10:10:53,897] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 27: [2023-05-10 10:10:53,897] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 30: [2023-05-10 10:10:53,898] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 7: [2023-05-10 10:10:53,898] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 7: [2023-05-10 10:10:53,898] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 7: [2023-05-10 10:10:53,898] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 30: [2023-05-10 10:10:53,900] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 1: [2023-05-10 10:10:53,901] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 1: [2023-05-10 10:10:53,901] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 1: [2023-05-10 10:10:53,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 1: [2023-05-10 10:10:53,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 1: [2023-05-10 10:10:53,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 1: [2023-05-10 10:10:53,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 1: [2023-05-10 10:10:53,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 1: [2023-05-10 10:10:53,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 6: [2023-05-10 10:10:53,897] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 6: [2023-05-10 10:10:53,899] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 26: [2023-05-10 10:10:53,904] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 26: [2023-05-10 10:10:53,904] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 3: [2023-05-10 10:10:53,904] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 3: [2023-05-10 10:10:53,904] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 3: [2023-05-10 10:10:53,904] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 7: [2023-05-10 10:10:53,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 3: [2023-05-10 10:10:53,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 26: [2023-05-10 10:10:53,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 26: [2023-05-10 10:10:53,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 3: [2023-05-10 10:10:53,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 3: [2023-05-10 10:10:53,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 3: [2023-05-10 10:10:53,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 12: [2023-05-10 10:10:53,906] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 3: [2023-05-10 10:10:53,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 1: [2023-05-10 10:10:53,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 1: [2023-05-10 10:10:53,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 1: [2023-05-10 10:10:53,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 1: [2023-05-10 10:10:53,909] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 26: [2023-05-10 10:10:53,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 26: [2023-05-10 10:10:53,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 26: [2023-05-10 10:10:53,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 1: [2023-05-10 10:10:53,909] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 1: [2023-05-10 10:10:53,909] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 1: [2023-05-10 10:10:53,910] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 3: [2023-05-10 10:10:53,909] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 3: [2023-05-10 10:10:53,909] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 12: [2023-05-10 10:10:53,910] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 3: [2023-05-10 10:10:53,910] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 1: [2023-05-10 10:10:53,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 26: [2023-05-10 10:10:53,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 3: [2023-05-10 10:10:53,913] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 16: [2023-05-10 10:10:53,913] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 3: [2023-05-10 10:10:53,914] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 27: [2023-05-10 10:10:53,915] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 3: [2023-05-10 10:10:53,915] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 27: [2023-05-10 10:10:53,915] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 3: [2023-05-10 10:10:53,915] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 27: [2023-05-10 10:10:53,915] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 3: [2023-05-10 10:10:53,917] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 7: [2023-05-10 10:10:53,918] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 27: [2023-05-10 10:10:53,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 27: [2023-05-10 10:10:53,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 6: [2023-05-10 10:10:53,920] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 6: [2023-05-10 10:10:53,920] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 6: [2023-05-10 10:10:53,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 6: [2023-05-10 10:10:53,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 10: [2023-05-10 10:10:53,921] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 10: [2023-05-10 10:10:53,921] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 10: [2023-05-10 10:10:53,921] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 10: [2023-05-10 10:10:53,921] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 10: [2023-05-10 10:10:53,921] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 10: [2023-05-10 10:10:53,921] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 10: [2023-05-10 10:10:53,921] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 10: [2023-05-10 10:10:53,921] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 7: [2023-05-10 10:10:53,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 7: [2023-05-10 10:10:53,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 7: [2023-05-10 10:10:53,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 31: [2023-05-10 10:10:53,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 31: [2023-05-10 10:10:53,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 6: [2023-05-10 10:10:53,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 6: [2023-05-10 10:10:53,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 6: [2023-05-10 10:10:53,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 6: [2023-05-10 10:10:53,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 16: [2023-05-10 10:10:53,927] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 7: [2023-05-10 10:10:53,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 12: [2023-05-10 10:10:53,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 10: [2023-05-10 10:10:53,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 10: [2023-05-10 10:10:53,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 10: [2023-05-10 10:10:53,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 7: [2023-05-10 10:10:53,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 12: [2023-05-10 10:10:53,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 19: [2023-05-10 10:10:53,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 19: [2023-05-10 10:10:53,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 7: [2023-05-10 10:10:53,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 12: [2023-05-10 10:10:53,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 12: [2023-05-10 10:10:53,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 10: [2023-05-10 10:10:53,929] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 10: [2023-05-10 10:10:53,929] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 19: [2023-05-10 10:10:53,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 10: [2023-05-10 10:10:53,929] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 19: [2023-05-10 10:10:53,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 10: [2023-05-10 10:10:53,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 19: [2023-05-10 10:10:53,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 19: [2023-05-10 10:10:53,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 19: [2023-05-10 10:10:53,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 19: [2023-05-10 10:10:53,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 10: [2023-05-10 10:10:53,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 12: [2023-05-10 10:10:53,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 19: [2023-05-10 10:10:53,932] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 25: [2023-05-10 10:10:53,932] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 19: [2023-05-10 10:10:53,933] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 17: [2023-05-10 10:10:53,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 25: [2023-05-10 10:10:53,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 29: [2023-05-10 10:10:53,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 18: [2023-05-10 10:10:53,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 18: [2023-05-10 10:10:53,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 26: [2023-05-10 10:10:53,935] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 18: [2023-05-10 10:10:53,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 18: [2023-05-10 10:10:53,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 17: [2023-05-10 10:10:53,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 17: [2023-05-10 10:10:53,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 18: [2023-05-10 10:10:53,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 18: [2023-05-10 10:10:53,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 18: [2023-05-10 10:10:53,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 29: [2023-05-10 10:10:53,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 17: [2023-05-10 10:10:53,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 17: [2023-05-10 10:10:53,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 17: [2023-05-10 10:10:53,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 17: [2023-05-10 10:10:53,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 18: [2023-05-10 10:10:53,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 17: [2023-05-10 10:10:53,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 19: [2023-05-10 10:10:53,935] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 20: [2023-05-10 10:10:53,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 20: [2023-05-10 10:10:53,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 20: [2023-05-10 10:10:53,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 20: [2023-05-10 10:10:53,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 20: [2023-05-10 10:10:53,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 20: [2023-05-10 10:10:53,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 20: [2023-05-10 10:10:53,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 20: [2023-05-10 10:10:53,936] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 26: [2023-05-10 10:10:53,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 16: [2023-05-10 10:10:53,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 25: [2023-05-10 10:10:53,936] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 25: [2023-05-10 10:10:53,936] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 25: [2023-05-10 10:10:53,936] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 25: [2023-05-10 10:10:53,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 25: [2023-05-10 10:10:53,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 25: [2023-05-10 10:10:53,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 19: [2023-05-10 10:10:53,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 24: [2023-05-10 10:10:53,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 24: [2023-05-10 10:10:53,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 24: [2023-05-10 10:10:53,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 24: [2023-05-10 10:10:53,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 24: [2023-05-10 10:10:53,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 24: [2023-05-10 10:10:53,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 24: [2023-05-10 10:10:53,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 24: [2023-05-10 10:10:53,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 17: [2023-05-10 10:10:53,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 30: [2023-05-10 10:10:53,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 2: [2023-05-10 10:10:53,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 2: [2023-05-10 10:10:53,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 26: [2023-05-10 10:10:53,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 2: [2023-05-10 10:10:53,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 2: [2023-05-10 10:10:53,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 2: [2023-05-10 10:10:53,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 2: [2023-05-10 10:10:53,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 2: [2023-05-10 10:10:53,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 2: [2023-05-10 10:10:53,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 19: [2023-05-10 10:10:53,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 20: [2023-05-10 10:10:53,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 19: [2023-05-10 10:10:53,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 18: [2023-05-10 10:10:53,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 9: [2023-05-10 10:10:53,940] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 9: [2023-05-10 10:10:53,940] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 18: [2023-05-10 10:10:53,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 19: [2023-05-10 10:10:53,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 18: [2023-05-10 10:10:53,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 11: [2023-05-10 10:10:53,941] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 11: [2023-05-10 10:10:53,941] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 19: [2023-05-10 10:10:53,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 18: [2023-05-10 10:10:53,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 18: [2023-05-10 10:10:53,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 18: [2023-05-10 10:10:53,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 9: [2023-05-10 10:10:53,941] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 9: [2023-05-10 10:10:53,941] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 11: [2023-05-10 10:10:53,941] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 11: [2023-05-10 10:10:53,941] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 11: [2023-05-10 10:10:53,941] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 11: [2023-05-10 10:10:53,941] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 11: [2023-05-10 10:10:53,941] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 9: [2023-05-10 10:10:53,941] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 9: [2023-05-10 10:10:53,941] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 9: [2023-05-10 10:10:53,941] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 9: [2023-05-10 10:10:53,941] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 11: [2023-05-10 10:10:53,941] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 24: [2023-05-10 10:10:53,942] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 18: [2023-05-10 10:10:53,942] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 20: [2023-05-10 10:10:53,942] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 18: [2023-05-10 10:10:53,942] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 24: [2023-05-10 10:10:53,942] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 2: [2023-05-10 10:10:53,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 0: [2023-05-10 10:10:53,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 0: [2023-05-10 10:10:53,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 2: [2023-05-10 10:10:53,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 0: [2023-05-10 10:10:53,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 0: [2023-05-10 10:10:53,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 0: [2023-05-10 10:10:53,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 0: [2023-05-10 10:10:53,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 0: [2023-05-10 10:10:53,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 0: [2023-05-10 10:10:53,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 26: [2023-05-10 10:10:53,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 26: [2023-05-10 10:10:53,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 24: [2023-05-10 10:10:53,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 16: [2023-05-10 10:10:53,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 31: [2023-05-10 10:10:53,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 11: [2023-05-10 10:10:53,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 11: [2023-05-10 10:10:53,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 24: [2023-05-10 10:10:53,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 24: [2023-05-10 10:10:53,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 24: [2023-05-10 10:10:53,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 24: [2023-05-10 10:10:53,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 24: [2023-05-10 10:10:53,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 0: [2023-05-10 10:10:53,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 31: [2023-05-10 10:10:53,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 17: [2023-05-10 10:10:53,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 17: [2023-05-10 10:10:53,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 20: [2023-05-10 10:10:53,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 20: [2023-05-10 10:10:53,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 9: [2023-05-10 10:10:53,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 9: [2023-05-10 10:10:53,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 9: [2023-05-10 10:10:53,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 9: [2023-05-10 10:10:53,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 16: [2023-05-10 10:10:53,947] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 16: [2023-05-10 10:10:53,947] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 16: [2023-05-10 10:10:53,947] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 20: [2023-05-10 10:10:53,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 16: [2023-05-10 10:10:53,947] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 16: [2023-05-10 10:10:53,947] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 20: [2023-05-10 10:10:53,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 20: [2023-05-10 10:10:53,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 9: [2023-05-10 10:10:53,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 17: [2023-05-10 10:10:53,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 0: [2023-05-10 10:10:53,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 17: [2023-05-10 10:10:53,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 0: [2023-05-10 10:10:53,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 6: [2023-05-10 10:10:53,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 17: [2023-05-10 10:10:53,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 31: [2023-05-10 10:10:53,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 6: [2023-05-10 10:10:53,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 20: [2023-05-10 10:10:53,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 9: [2023-05-10 10:10:53,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 29: [2023-05-10 10:10:53,949] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 17: [2023-05-10 10:10:53,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 9: [2023-05-10 10:10:53,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 17: [2023-05-10 10:10:53,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 9: [2023-05-10 10:10:53,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 0: [2023-05-10 10:10:53,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 2: [2023-05-10 10:10:53,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 2: [2023-05-10 10:10:53,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 0: [2023-05-10 10:10:53,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 0: [2023-05-10 10:10:53,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 0: [2023-05-10 10:10:53,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 2: [2023-05-10 10:10:53,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 11: [2023-05-10 10:10:53,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 2: [2023-05-10 10:10:53,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 16: [2023-05-10 10:10:53,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 0: [2023-05-10 10:10:53,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 2: [2023-05-10 10:10:53,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 11: [2023-05-10 10:10:53,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 2: [2023-05-10 10:10:53,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 11: [2023-05-10 10:10:53,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 31: [2023-05-10 10:10:53,952] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 31: [2023-05-10 10:10:53,952] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 31: [2023-05-10 10:10:53,952] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 31: [2023-05-10 10:10:53,952] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 11: [2023-05-10 10:10:53,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 31: [2023-05-10 10:10:53,952] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 11: [2023-05-10 10:10:53,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 11: [2023-05-10 10:10:53,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 29: [2023-05-10 10:10:53,955] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 6: [2023-05-10 10:10:53,955] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 29: [2023-05-10 10:10:53,956] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 30: [2023-05-10 10:10:53,956] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 30: [2023-05-10 10:10:53,957] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 21: [2023-05-10 10:10:53,957] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 21: [2023-05-10 10:10:53,957] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 21: [2023-05-10 10:10:53,957] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 21: [2023-05-10 10:10:53,957] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 21: [2023-05-10 10:10:53,957] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 21: [2023-05-10 10:10:53,957] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 21: [2023-05-10 10:10:53,957] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 21: [2023-05-10 10:10:53,958] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 6: [2023-05-10 10:10:53,958] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 6: [2023-05-10 10:10:53,959] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 6: [2023-05-10 10:10:53,959] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 29: [2023-05-10 10:10:53,960] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 29: [2023-05-10 10:10:53,960] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 29: [2023-05-10 10:10:53,960] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 29: [2023-05-10 10:10:53,960] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 29: [2023-05-10 10:10:53,960] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 25: [2023-05-10 10:10:53,960] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 1: [2023-05-10 10:10:53,962] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 1: [2023-05-10 10:10:53,962] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 21: [2023-05-10 10:10:53,963] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 25: [2023-05-10 10:10:53,964] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 21: [2023-05-10 10:10:53,964] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 21: [2023-05-10 10:10:53,964] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 3: [2023-05-10 10:10:53,964] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 3: [2023-05-10 10:10:53,964] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 3: [2023-05-10 10:10:53,964] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 21: [2023-05-10 10:10:53,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 21: [2023-05-10 10:10:53,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 21: [2023-05-10 10:10:53,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 1: [2023-05-10 10:10:53,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 25: [2023-05-10 10:10:53,966] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 21: [2023-05-10 10:10:53,966] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 21: [2023-05-10 10:10:53,967] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 25: [2023-05-10 10:10:53,969] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 25: [2023-05-10 10:10:53,969] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 3: [2023-05-10 10:10:53,969] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 25: [2023-05-10 10:10:53,969] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 25: [2023-05-10 10:10:53,969] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 30: [2023-05-10 10:10:53,970] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 30: [2023-05-10 10:10:53,970] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 25: [2023-05-10 10:10:53,972] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 31: [2023-05-10 10:10:53,972] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 29: [2023-05-10 10:10:53,972] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 1: [2023-05-10 10:10:53,973] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 1: [2023-05-10 10:10:53,973] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 1: [2023-05-10 10:10:53,973] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 1: [2023-05-10 10:10:53,973] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 1: [2023-05-10 10:10:53,973] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 3: [2023-05-10 10:10:53,975] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 16: [2023-05-10 10:10:53,977] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 30: [2023-05-10 10:10:53,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 16: [2023-05-10 10:10:53,979] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 19: [2023-05-10 10:10:53,982] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 10: [2023-05-10 10:10:53,982] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 10: [2023-05-10 10:10:53,982] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 10: [2023-05-10 10:10:53,982] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 16: [2023-05-10 10:10:53,983] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 30: [2023-05-10 10:10:53,983] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 16: [2023-05-10 10:10:53,984] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 16: [2023-05-10 10:10:53,985] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 16: [2023-05-10 10:10:53,985] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 1: [2023-05-10 10:10:53,986] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 1: [2023-05-10 10:10:53,986] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 31: [2023-05-10 10:10:53,986] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 31: [2023-05-10 10:10:53,986] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 31: [2023-05-10 10:10:53,986] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 3: [2023-05-10 10:10:53,987] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 3: [2023-05-10 10:10:53,987] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 18: [2023-05-10 10:10:53,988] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 3: [2023-05-10 10:10:53,988] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 3: [2023-05-10 10:10:53,988] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 17: [2023-05-10 10:10:53,988] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 20: [2023-05-10 10:10:53,988] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 3: [2023-05-10 10:10:53,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 3: [2023-05-10 10:10:53,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 31: [2023-05-10 10:10:53,989] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 31: [2023-05-10 10:10:53,989] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 10: [2023-05-10 10:10:53,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 29: [2023-05-10 10:10:53,990] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 29: [2023-05-10 10:10:53,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 19: [2023-05-10 10:10:53,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 19: [2023-05-10 10:10:53,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 29: [2023-05-10 10:10:53,992] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 29: [2023-05-10 10:10:53,992] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 29: [2023-05-10 10:10:53,992] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 2: [2023-05-10 10:10:53,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 2: [2023-05-10 10:10:53,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 22: [2023-05-10 10:10:53,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 30: [2023-05-10 10:10:53,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 22: [2023-05-10 10:10:53,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 1: [2023-05-10 10:10:53,995] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 24: [2023-05-10 10:10:53,995] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 9: [2023-05-10 10:10:53,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 9: [2023-05-10 10:10:53,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 24: [2023-05-10 10:10:53,995] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 3: [2023-05-10 10:10:53,996] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 30: [2023-05-10 10:10:53,996] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 30: [2023-05-10 10:10:53,996] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 5: [2023-05-10 10:10:53,996] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 5: [2023-05-10 10:10:53,996] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 10: [2023-05-10 10:10:53,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 10: [2023-05-10 10:10:53,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 10: [2023-05-10 10:10:53,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 10: [2023-05-10 10:10:53,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 5: [2023-05-10 10:10:53,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 5: [2023-05-10 10:10:53,998] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 5: [2023-05-10 10:10:53,998] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 5: [2023-05-10 10:10:53,998] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 5: [2023-05-10 10:10:53,998] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 11: [2023-05-10 10:10:53,998] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 11: [2023-05-10 10:10:53,998] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 5: [2023-05-10 10:10:53,998] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 0: [2023-05-10 10:10:53,999] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 22: [2023-05-10 10:10:53,998] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 22: [2023-05-10 10:10:53,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 22: [2023-05-10 10:10:53,999] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 22: [2023-05-10 10:10:53,999] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 22: [2023-05-10 10:10:53,999] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 22: [2023-05-10 10:10:53,999] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 22: [2023-05-10 10:10:53,999] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 1: [2023-05-10 10:10:54,000] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 22: [2023-05-10 10:10:54,000] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 18: [2023-05-10 10:10:54,001] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 18: [2023-05-10 10:10:54,001] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 18: [2023-05-10 10:10:54,001] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 18: [2023-05-10 10:10:54,001] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 1: [2023-05-10 10:10:54,001] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 30: [2023-05-10 10:10:54,001] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 28: [2023-05-10 10:10:54,002] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 28: [2023-05-10 10:10:54,002] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 5: [2023-05-10 10:10:54,002] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 5: [2023-05-10 10:10:54,002] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 5: [2023-05-10 10:10:54,003] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 19: [2023-05-10 10:10:54,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 1: [2023-05-10 10:10:54,003] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 28: [2023-05-10 10:10:54,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 28: [2023-05-10 10:10:54,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 28: [2023-05-10 10:10:54,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 28: [2023-05-10 10:10:54,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 30: [2023-05-10 10:10:54,003] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 28: [2023-05-10 10:10:54,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 28: [2023-05-10 10:10:54,004] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 22: [2023-05-10 10:10:54,004] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 1: [2023-05-10 10:10:54,004] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 19: [2023-05-10 10:10:54,004] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 3: [2023-05-10 10:10:54,004] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 19: [2023-05-10 10:10:54,005] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 19: [2023-05-10 10:10:54,005] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 19: [2023-05-10 10:10:54,005] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 1: [2023-05-10 10:10:54,005] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 20: [2023-05-10 10:10:54,006] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 28: [2023-05-10 10:10:54,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 28: [2023-05-10 10:10:54,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 18: [2023-05-10 10:10:54,006] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 18: [2023-05-10 10:10:54,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 18: [2023-05-10 10:10:54,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 5: [2023-05-10 10:10:54,008] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 30: [2023-05-10 10:10:54,008] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 10: [2023-05-10 10:10:54,008] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 5: [2023-05-10 10:10:54,008] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 22: [2023-05-10 10:10:54,008] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 20: [2023-05-10 10:10:54,009] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 17: [2023-05-10 10:10:54,009] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 22: [2023-05-10 10:10:54,009] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 5: [2023-05-10 10:10:54,009] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 19: [2023-05-10 10:10:54,009] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 22: [2023-05-10 10:10:54,009] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 5: [2023-05-10 10:10:54,009] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 18: [2023-05-10 10:10:54,009] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 5: [2023-05-10 10:10:54,010] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 0: [2023-05-10 10:10:54,010] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 0: [2023-05-10 10:10:54,010] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 24: [2023-05-10 10:10:54,010] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 22: [2023-05-10 10:10:54,010] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 22: [2023-05-10 10:10:54,010] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 10: [2023-05-10 10:10:54,010] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 10: [2023-05-10 10:10:54,011] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 9: [2023-05-10 10:10:54,011] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 9: [2023-05-10 10:10:54,011] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 9: [2023-05-10 10:10:54,012] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 9: [2023-05-10 10:10:54,012] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 9: [2023-05-10 10:10:54,012] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 9: [2023-05-10 10:10:54,012] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 3: [2023-05-10 10:10:54,013] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 28: [2023-05-10 10:10:54,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 2: [2023-05-10 10:10:54,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 28: [2023-05-10 10:10:54,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 2: [2023-05-10 10:10:54,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 17: [2023-05-10 10:10:54,014] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 28: [2023-05-10 10:10:54,015] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 10: [2023-05-10 10:10:54,015] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 28: [2023-05-10 10:10:54,015] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 19: [2023-05-10 10:10:54,016] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 28: [2023-05-10 10:10:54,016] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 19: [2023-05-10 10:10:54,016] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 9: [2023-05-10 10:10:54,016] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 24: [2023-05-10 10:10:54,016] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 28: [2023-05-10 10:10:54,016] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 9: [2023-05-10 10:10:54,016] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 24: [2023-05-10 10:10:54,018] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 13: [2023-05-10 10:10:54,018] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 13: [2023-05-10 10:10:54,018] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 13: [2023-05-10 10:10:54,018] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 13: [2023-05-10 10:10:54,018] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 13: [2023-05-10 10:10:54,018] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 13: [2023-05-10 10:10:54,018] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 13: [2023-05-10 10:10:54,018] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 13: [2023-05-10 10:10:54,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 3: [2023-05-10 10:10:54,019] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 3: [2023-05-10 10:10:54,019] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 11: [2023-05-10 10:10:54,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 11: [2023-05-10 10:10:54,020] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 11: [2023-05-10 10:10:54,020] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 21: [2023-05-10 10:10:54,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 21: [2023-05-10 10:10:54,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 21: [2023-05-10 10:10:54,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 0: [2023-05-10 10:10:54,021] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 0: [2023-05-10 10:10:54,021] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 0: [2023-05-10 10:10:54,021] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 0: [2023-05-10 10:10:54,021] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 0: [2023-05-10 10:10:54,021] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 17: [2023-05-10 10:10:54,021] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 0: [2023-05-10 10:10:54,021] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 2: [2023-05-10 10:10:54,021] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 2: [2023-05-10 10:10:54,021] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 11: [2023-05-10 10:10:54,022] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 11: [2023-05-10 10:10:54,022] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 11: [2023-05-10 10:10:54,022] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 2: [2023-05-10 10:10:54,022] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 2: [2023-05-10 10:10:54,022] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 2: [2023-05-10 10:10:54,022] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 24: [2023-05-10 10:10:54,023] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 24: [2023-05-10 10:10:54,023] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 24: [2023-05-10 10:10:54,023] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 24: [2023-05-10 10:10:54,023] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 24: [2023-05-10 10:10:54,023] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 21: [2023-05-10 10:10:54,023] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 20: [2023-05-10 10:10:54,024] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 11: [2023-05-10 10:10:54,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 30: [2023-05-10 10:10:54,025] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 30: [2023-05-10 10:10:54,025] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 30: [2023-05-10 10:10:54,026] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 2: [2023-05-10 10:10:54,026] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 11: [2023-05-10 10:10:54,026] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 10: [2023-05-10 10:10:54,026] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 13: [2023-05-10 10:10:54,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 10: [2023-05-10 10:10:54,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 18: [2023-05-10 10:10:54,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 10: [2023-05-10 10:10:54,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 10: [2023-05-10 10:10:54,029] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 17: [2023-05-10 10:10:54,030] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 17: [2023-05-10 10:10:54,030] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 17: [2023-05-10 10:10:54,030] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 17: [2023-05-10 10:10:54,030] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 24: [2023-05-10 10:10:54,032] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 13: [2023-05-10 10:10:54,031] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 13: [2023-05-10 10:10:54,031] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 18: [2023-05-10 10:10:54,032] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 20: [2023-05-10 10:10:54,032] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 20: [2023-05-10 10:10:54,032] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 20: [2023-05-10 10:10:54,032] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 20: [2023-05-10 10:10:54,032] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 18: [2023-05-10 10:10:54,032] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 18: [2023-05-10 10:10:54,032] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 13: [2023-05-10 10:10:54,032] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 20: [2023-05-10 10:10:54,032] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 20: [2023-05-10 10:10:54,033] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 0: [2023-05-10 10:10:54,032] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 13: [2023-05-10 10:10:54,033] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 13: [2023-05-10 10:10:54,033] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 17: [2023-05-10 10:10:54,033] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 19: [2023-05-10 10:10:54,034] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 21: [2023-05-10 10:10:54,033] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 13: [2023-05-10 10:10:54,034] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 18: [2023-05-10 10:10:54,034] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 21: [2023-05-10 10:10:54,034] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 21: [2023-05-10 10:10:54,034] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 21: [2023-05-10 10:10:54,034] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 18: [2023-05-10 10:10:54,035] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 18: [2023-05-10 10:10:54,035] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 13: [2023-05-10 10:10:54,035] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt... 0: [2023-05-10 10:10:54,035] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 19: [2023-05-10 10:10:54,036] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 19: [2023-05-10 10:10:54,036] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 19: [2023-05-10 10:10:54,036] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 9: [2023-05-10 10:10:54,039] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 9: [2023-05-10 10:10:54,040] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 17: [2023-05-10 10:10:54,041] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 19: [2023-05-10 10:10:54,041] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 9: [2023-05-10 10:10:54,044] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 21: [2023-05-10 10:10:54,044] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 9: [2023-05-10 10:10:54,044] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 9: [2023-05-10 10:10:54,046] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 9: [2023-05-10 10:10:54,046] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 5: [2023-05-10 10:10:54,046] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 0: [2023-05-10 10:10:54,049] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 21: [2023-05-10 10:10:54,050] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 21: [2023-05-10 10:10:54,050] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 21: [2023-05-10 10:10:54,051] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 24: [2023-05-10 10:10:54,052] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 20: [2023-05-10 10:10:54,052] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 0: [2023-05-10 10:10:54,052] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 0: [2023-05-10 10:10:54,052] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 17: [2023-05-10 10:10:54,052] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 0: [2023-05-10 10:10:54,052] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 22: [2023-05-10 10:10:54,051] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 2: [2023-05-10 10:10:54,052] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 11: [2023-05-10 10:10:54,053] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 24: [2023-05-10 10:10:54,054] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 0: [2023-05-10 10:10:54,055] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 24: [2023-05-10 10:10:54,055] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 11: [2023-05-10 10:10:54,055] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 24: [2023-05-10 10:10:54,056] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 24: [2023-05-10 10:10:54,056] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 2: [2023-05-10 10:10:54,058] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 22: [2023-05-10 10:10:54,058] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 28: [2023-05-10 10:10:54,057] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 28: [2023-05-10 10:10:54,057] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 2: [2023-05-10 10:10:54,058] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 8: [2023-05-10 10:10:54,059] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 8: [2023-05-10 10:10:54,059] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 8: [2023-05-10 10:10:54,059] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 11: [2023-05-10 10:10:54,060] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 8: [2023-05-10 10:10:54,060] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 8: [2023-05-10 10:10:54,060] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 8: [2023-05-10 10:10:54,060] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 8: [2023-05-10 10:10:54,060] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 8: [2023-05-10 10:10:54,060] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 11: [2023-05-10 10:10:54,060] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 11: [2023-05-10 10:10:54,060] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 2: [2023-05-10 10:10:54,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 11: [2023-05-10 10:10:54,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 2: [2023-05-10 10:10:54,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 20: [2023-05-10 10:10:54,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 5: [2023-05-10 10:10:54,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 5: [2023-05-10 10:10:54,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 2: [2023-05-10 10:10:54,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 4: [2023-05-10 10:10:54,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 4: [2023-05-10 10:10:54,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 4: [2023-05-10 10:10:54,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 21: [2023-05-10 10:10:54,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 4: [2023-05-10 10:10:54,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 4: [2023-05-10 10:10:54,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 4: [2023-05-10 10:10:54,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 4: [2023-05-10 10:10:54,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 17: [2023-05-10 10:10:54,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 4: [2023-05-10 10:10:54,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 20: [2023-05-10 10:10:54,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 20: [2023-05-10 10:10:54,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 17: [2023-05-10 10:10:54,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 8: [2023-05-10 10:10:54,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 8: [2023-05-10 10:10:54,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 8: [2023-05-10 10:10:54,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 17: [2023-05-10 10:10:54,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 8: [2023-05-10 10:10:54,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 8: [2023-05-10 10:10:54,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 8: [2023-05-10 10:10:54,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 8: [2023-05-10 10:10:54,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 8: [2023-05-10 10:10:54,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 20: [2023-05-10 10:10:54,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 20: [2023-05-10 10:10:54,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 21: [2023-05-10 10:10:54,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 5: [2023-05-10 10:10:54,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 21: [2023-05-10 10:10:54,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 21: [2023-05-10 10:10:54,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 4: [2023-05-10 10:10:54,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 4: [2023-05-10 10:10:54,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 4: [2023-05-10 10:10:54,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 17: [2023-05-10 10:10:54,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 4: [2023-05-10 10:10:54,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 17: [2023-05-10 10:10:54,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 4: [2023-05-10 10:10:54,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 4: [2023-05-10 10:10:54,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 4: [2023-05-10 10:10:54,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 4: [2023-05-10 10:10:54,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 5: [2023-05-10 10:10:54,075] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 5: [2023-05-10 10:10:54,075] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 5: [2023-05-10 10:10:54,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 5: [2023-05-10 10:10:54,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 5: [2023-05-10 10:10:54,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 22: [2023-05-10 10:10:54,076] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 28: [2023-05-10 10:10:54,078] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 28: [2023-05-10 10:10:54,078] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 22: [2023-05-10 10:10:54,080] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 22: [2023-05-10 10:10:54,081] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 22: [2023-05-10 10:10:54,081] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 22: [2023-05-10 10:10:54,081] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 28: [2023-05-10 10:10:54,081] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 22: [2023-05-10 10:10:54,082] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 22: [2023-05-10 10:10:54,082] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 22: [2023-05-10 10:10:54,082] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 28: [2023-05-10 10:10:54,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 28: [2023-05-10 10:10:54,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 28: [2023-05-10 10:10:54,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 28: [2023-05-10 10:10:54,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 28: [2023-05-10 10:10:54,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 5: [2023-05-10 10:10:54,087] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 5: [2023-05-10 10:10:54,087] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 23: [2023-05-10 10:10:54,091] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 23: [2023-05-10 10:10:54,091] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 23: [2023-05-10 10:10:54,092] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 23: [2023-05-10 10:10:54,092] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 23: [2023-05-10 10:10:54,092] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 23: [2023-05-10 10:10:54,092] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 23: [2023-05-10 10:10:54,092] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 23: [2023-05-10 10:10:54,093] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 23: [2023-05-10 10:10:54,097] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 23: [2023-05-10 10:10:54,097] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 13: [2023-05-10 10:10:54,096] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 23: [2023-05-10 10:10:54,097] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 23: [2023-05-10 10:10:54,099] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 23: [2023-05-10 10:10:54,101] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 23: [2023-05-10 10:10:54,101] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 23: [2023-05-10 10:10:54,101] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 23: [2023-05-10 10:10:54,102] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 5: [2023-05-10 10:10:54,108] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 5: [2023-05-10 10:10:54,108] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 5: [2023-05-10 10:10:54,110] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 13: [2023-05-10 10:10:54,111] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 5: [2023-05-10 10:10:54,111] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 5: [2023-05-10 10:10:54,112] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 13: [2023-05-10 10:10:54,112] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 22: [2023-05-10 10:10:54,112] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 28: [2023-05-10 10:10:54,113] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 22: [2023-05-10 10:10:54,114] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 22: [2023-05-10 10:10:54,115] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 22: [2023-05-10 10:10:54,115] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 8: [2023-05-10 10:10:54,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 28: [2023-05-10 10:10:54,116] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 28: [2023-05-10 10:10:54,117] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 4: [2023-05-10 10:10:54,121] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 28: [2023-05-10 10:10:54,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 22: [2023-05-10 10:10:54,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 22: [2023-05-10 10:10:54,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 28: [2023-05-10 10:10:54,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 8: [2023-05-10 10:10:54,121] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 13: [2023-05-10 10:10:54,121] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 28: [2023-05-10 10:10:54,122] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 8: [2023-05-10 10:10:54,123] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 13: [2023-05-10 10:10:54,125] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 4: [2023-05-10 10:10:54,127] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 4: [2023-05-10 10:10:54,127] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 4: [2023-05-10 10:10:54,128] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 4: [2023-05-10 10:10:54,128] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 15: [2023-05-10 10:10:54,129] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 15: [2023-05-10 10:10:54,129] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 15: [2023-05-10 10:10:54,129] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 15: [2023-05-10 10:10:54,130] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 15: [2023-05-10 10:10:54,130] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 15: [2023-05-10 10:10:54,130] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 15: [2023-05-10 10:10:54,130] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 15: [2023-05-10 10:10:54,130] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 13: [2023-05-10 10:10:54,130] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 13: [2023-05-10 10:10:54,130] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 13: [2023-05-10 10:10:54,130] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 13: [2023-05-10 10:10:54,130] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_17-model_00-model_states.pt. 15: [2023-05-10 10:10:54,133] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 8: [2023-05-10 10:10:54,133] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 8: [2023-05-10 10:10:54,133] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 8: [2023-05-10 10:10:54,133] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 8: [2023-05-10 10:10:54,133] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 8: [2023-05-10 10:10:54,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 4: [2023-05-10 10:10:54,136] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 4: [2023-05-10 10:10:54,136] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 4: [2023-05-10 10:10:54,136] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 15: [2023-05-10 10:10:54,139] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 8: [2023-05-10 10:10:54,140] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 15: [2023-05-10 10:10:54,140] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 15: [2023-05-10 10:10:54,140] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 15: [2023-05-10 10:10:54,141] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 15: [2023-05-10 10:10:54,141] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 15: [2023-05-10 10:10:54,142] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 15: [2023-05-10 10:10:54,142] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 13: [2023-05-10 10:10:54,143] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 13: [2023-05-10 10:10:54,143] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 4: [2023-05-10 10:10:54,143] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 8: [2023-05-10 10:10:54,151] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 8: [2023-05-10 10:10:54,151] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 4: [2023-05-10 10:10:54,151] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 23: [2023-05-10 10:10:54,152] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 23: [2023-05-10 10:10:54,152] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 23: [2023-05-10 10:10:54,152] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 4: [2023-05-10 10:10:54,153] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 13: [2023-05-10 10:10:54,154] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 23: [2023-05-10 10:10:54,155] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 4: [2023-05-10 10:10:54,156] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 4: [2023-05-10 10:10:54,156] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 14: [2023-05-10 10:10:54,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 14: [2023-05-10 10:10:54,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 14: [2023-05-10 10:10:54,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 14: [2023-05-10 10:10:54,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 14: [2023-05-10 10:10:54,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 14: [2023-05-10 10:10:54,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 14: [2023-05-10 10:10:54,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 14: [2023-05-10 10:10:54,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 4: [2023-05-10 10:10:54,162] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 13: [2023-05-10 10:10:54,162] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 23: [2023-05-10 10:10:54,163] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 8: [2023-05-10 10:10:54,163] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 8: [2023-05-10 10:10:54,163] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 8: [2023-05-10 10:10:54,164] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 4: [2023-05-10 10:10:54,164] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 8: [2023-05-10 10:10:54,164] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 13: [2023-05-10 10:10:54,164] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 13: [2023-05-10 10:10:54,165] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 14: [2023-05-10 10:10:54,165] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 13: [2023-05-10 10:10:54,165] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 8: [2023-05-10 10:10:54,166] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 14: [2023-05-10 10:10:54,165] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 14: [2023-05-10 10:10:54,166] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 4: [2023-05-10 10:10:54,166] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 14: [2023-05-10 10:10:54,167] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 14: [2023-05-10 10:10:54,167] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 14: [2023-05-10 10:10:54,168] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 14: [2023-05-10 10:10:54,168] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 14: [2023-05-10 10:10:54,169] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 23: [2023-05-10 10:10:54,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 23: [2023-05-10 10:10:54,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 23: [2023-05-10 10:10:54,171] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 23: [2023-05-10 10:10:54,176] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 23: [2023-05-10 10:10:54,176] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 23: [2023-05-10 10:10:54,177] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 15: [2023-05-10 10:10:54,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 12: [2023-05-10 10:10:54,184] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 12: [2023-05-10 10:10:54,184] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 12: [2023-05-10 10:10:54,184] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 12: [2023-05-10 10:10:54,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 12: [2023-05-10 10:10:54,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 12: [2023-05-10 10:10:54,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 12: [2023-05-10 10:10:54,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 12: [2023-05-10 10:10:54,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 23: [2023-05-10 10:10:54,186] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 27: [2023-05-10 10:10:54,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 27: [2023-05-10 10:10:54,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 27: [2023-05-10 10:10:54,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 27: [2023-05-10 10:10:54,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 27: [2023-05-10 10:10:54,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 27: [2023-05-10 10:10:54,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 27: [2023-05-10 10:10:54,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 27: [2023-05-10 10:10:54,186] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 12: [2023-05-10 10:10:54,189] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 27: [2023-05-10 10:10:54,189] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 27: [2023-05-10 10:10:54,190] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 27: [2023-05-10 10:10:54,191] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 23: [2023-05-10 10:10:54,193] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 27: [2023-05-10 10:10:54,193] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 27: [2023-05-10 10:10:54,193] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 27: [2023-05-10 10:10:54,194] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 27: [2023-05-10 10:10:54,194] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 27: [2023-05-10 10:10:54,195] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 12: [2023-05-10 10:10:54,195] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 12: [2023-05-10 10:10:54,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 12: [2023-05-10 10:10:54,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 23: [2023-05-10 10:10:54,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 12: [2023-05-10 10:10:54,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 12: [2023-05-10 10:10:54,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 23: [2023-05-10 10:10:54,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 12: [2023-05-10 10:10:54,198] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 12: [2023-05-10 10:10:54,198] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 23: [2023-05-10 10:10:54,200] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 15: [2023-05-10 10:10:54,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 15: [2023-05-10 10:10:54,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 15: [2023-05-10 10:10:54,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 15: [2023-05-10 10:10:54,216] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 14: [2023-05-10 10:10:54,219] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 15: [2023-05-10 10:10:54,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 15: [2023-05-10 10:10:54,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 15: [2023-05-10 10:10:54,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 15: [2023-05-10 10:10:54,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 14: [2023-05-10 10:10:54,221] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 14: [2023-05-10 10:10:54,221] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 14: [2023-05-10 10:10:54,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 14: [2023-05-10 10:10:54,235] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 14: [2023-05-10 10:10:54,235] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 14: [2023-05-10 10:10:54,235] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 14: [2023-05-10 10:10:54,236] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 12: [2023-05-10 10:10:54,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 27: [2023-05-10 10:10:54,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 14: [2023-05-10 10:10:54,244] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 16: [2023-05-10 10:10:54,243] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 16: [2023-05-10 10:10:54,243] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 16: [2023-05-10 10:10:54,244] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 16: [2023-05-10 10:10:54,244] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 16: [2023-05-10 10:10:54,244] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 16: [2023-05-10 10:10:54,244] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 16: [2023-05-10 10:10:54,244] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 16: [2023-05-10 10:10:54,245] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 15: [2023-05-10 10:10:54,246] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 14: [2023-05-10 10:10:54,246] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 14: [2023-05-10 10:10:54,246] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 27: [2023-05-10 10:10:54,248] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 27: [2023-05-10 10:10:54,248] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 15: [2023-05-10 10:10:54,249] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 15: [2023-05-10 10:10:54,250] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 15: [2023-05-10 10:10:54,250] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 16: [2023-05-10 10:10:54,250] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 16: [2023-05-10 10:10:54,251] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 16: [2023-05-10 10:10:54,252] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 16: [2023-05-10 10:10:54,253] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 16: [2023-05-10 10:10:54,253] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 16: [2023-05-10 10:10:54,253] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 16: [2023-05-10 10:10:54,253] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 16: [2023-05-10 10:10:54,254] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 26: [2023-05-10 10:10:54,256] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 26: [2023-05-10 10:10:54,256] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 15: [2023-05-10 10:10:54,256] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 15: [2023-05-10 10:10:54,256] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 15: [2023-05-10 10:10:54,256] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 26: [2023-05-10 10:10:54,257] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 26: [2023-05-10 10:10:54,257] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 26: [2023-05-10 10:10:54,257] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 26: [2023-05-10 10:10:54,257] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 26: [2023-05-10 10:10:54,257] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 26: [2023-05-10 10:10:54,257] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 26: [2023-05-10 10:10:54,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 26: [2023-05-10 10:10:54,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 27: [2023-05-10 10:10:54,262] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 7: [2023-05-10 10:10:54,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 7: [2023-05-10 10:10:54,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 7: [2023-05-10 10:10:54,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 7: [2023-05-10 10:10:54,264] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 7: [2023-05-10 10:10:54,264] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 7: [2023-05-10 10:10:54,264] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 7: [2023-05-10 10:10:54,264] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 7: [2023-05-10 10:10:54,264] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 27: [2023-05-10 10:10:54,264] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 27: [2023-05-10 10:10:54,264] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 27: [2023-05-10 10:10:54,264] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 27: [2023-05-10 10:10:54,264] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 27: [2023-05-10 10:10:54,264] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 12: [2023-05-10 10:10:54,266] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 14: [2023-05-10 10:10:54,267] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 14: [2023-05-10 10:10:54,267] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 14: [2023-05-10 10:10:54,267] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 26: [2023-05-10 10:10:54,267] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 26: [2023-05-10 10:10:54,267] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 14: [2023-05-10 10:10:54,268] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 7: [2023-05-10 10:10:54,268] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 7: [2023-05-10 10:10:54,269] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 26: [2023-05-10 10:10:54,269] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 7: [2023-05-10 10:10:54,269] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 26: [2023-05-10 10:10:54,269] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 26: [2023-05-10 10:10:54,270] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 26: [2023-05-10 10:10:54,270] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 7: [2023-05-10 10:10:54,270] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 7: [2023-05-10 10:10:54,270] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 12: [2023-05-10 10:10:54,271] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 12: [2023-05-10 10:10:54,271] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 7: [2023-05-10 10:10:54,271] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 27: [2023-05-10 10:10:54,272] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 7: [2023-05-10 10:10:54,272] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 7: [2023-05-10 10:10:54,272] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 14: [2023-05-10 10:10:54,272] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 27: [2023-05-10 10:10:54,273] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 12: [2023-05-10 10:10:54,277] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 12: [2023-05-10 10:10:54,277] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 12: [2023-05-10 10:10:54,277] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 12: [2023-05-10 10:10:54,277] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 12: [2023-05-10 10:10:54,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 31: [2023-05-10 10:10:54,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 31: [2023-05-10 10:10:54,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 31: [2023-05-10 10:10:54,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 31: [2023-05-10 10:10:54,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 31: [2023-05-10 10:10:54,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 31: [2023-05-10 10:10:54,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 31: [2023-05-10 10:10:54,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 31: [2023-05-10 10:10:54,293] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 27: [2023-05-10 10:10:54,295] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 27: [2023-05-10 10:10:54,296] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 27: [2023-05-10 10:10:54,296] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 31: [2023-05-10 10:10:54,296] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 31: [2023-05-10 10:10:54,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 27: [2023-05-10 10:10:54,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 27: [2023-05-10 10:10:54,299] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 31: [2023-05-10 10:10:54,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 31: [2023-05-10 10:10:54,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 31: [2023-05-10 10:10:54,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 31: [2023-05-10 10:10:54,301] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 31: [2023-05-10 10:10:54,301] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 31: [2023-05-10 10:10:54,302] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 16: [2023-05-10 10:10:54,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 12: [2023-05-10 10:10:54,303] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 16: [2023-05-10 10:10:54,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 26: [2023-05-10 10:10:54,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 30: [2023-05-10 10:10:54,306] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 12: [2023-05-10 10:10:54,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 30: [2023-05-10 10:10:54,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 30: [2023-05-10 10:10:54,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 30: [2023-05-10 10:10:54,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 30: [2023-05-10 10:10:54,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 30: [2023-05-10 10:10:54,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 30: [2023-05-10 10:10:54,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 3: [2023-05-10 10:10:54,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 30: [2023-05-10 10:10:54,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 3: [2023-05-10 10:10:54,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 3: [2023-05-10 10:10:54,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 3: [2023-05-10 10:10:54,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 3: [2023-05-10 10:10:54,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 3: [2023-05-10 10:10:54,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 3: [2023-05-10 10:10:54,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 3: [2023-05-10 10:10:54,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 18: [2023-05-10 10:10:54,311] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 30: [2023-05-10 10:10:54,311] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 18: [2023-05-10 10:10:54,311] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 18: [2023-05-10 10:10:54,311] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 18: [2023-05-10 10:10:54,311] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 18: [2023-05-10 10:10:54,311] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 18: [2023-05-10 10:10:54,311] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 18: [2023-05-10 10:10:54,311] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 18: [2023-05-10 10:10:54,311] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 3: [2023-05-10 10:10:54,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 3: [2023-05-10 10:10:54,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 3: [2023-05-10 10:10:54,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 12: [2023-05-10 10:10:54,314] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 12: [2023-05-10 10:10:54,314] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 26: [2023-05-10 10:10:54,316] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 3: [2023-05-10 10:10:54,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 3: [2023-05-10 10:10:54,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 18: [2023-05-10 10:10:54,316] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 12: [2023-05-10 10:10:54,316] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 3: [2023-05-10 10:10:54,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 12: [2023-05-10 10:10:54,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 12: [2023-05-10 10:10:54,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 3: [2023-05-10 10:10:54,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 18: [2023-05-10 10:10:54,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 3: [2023-05-10 10:10:54,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 18: [2023-05-10 10:10:54,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 18: [2023-05-10 10:10:54,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 18: [2023-05-10 10:10:54,318] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 18: [2023-05-10 10:10:54,318] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 30: [2023-05-10 10:10:54,319] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 30: [2023-05-10 10:10:54,319] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 30: [2023-05-10 10:10:54,320] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 30: [2023-05-10 10:10:54,320] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 30: [2023-05-10 10:10:54,320] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 18: [2023-05-10 10:10:54,320] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 18: [2023-05-10 10:10:54,320] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 30: [2023-05-10 10:10:54,321] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 30: [2023-05-10 10:10:54,322] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 16: [2023-05-10 10:10:54,322] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 16: [2023-05-10 10:10:54,323] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 7: [2023-05-10 10:10:54,325] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 16: [2023-05-10 10:10:54,326] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 16: [2023-05-10 10:10:54,326] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 16: [2023-05-10 10:10:54,326] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 16: [2023-05-10 10:10:54,326] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 16: [2023-05-10 10:10:54,326] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 6: [2023-05-10 10:10:54,327] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 7: [2023-05-10 10:10:54,326] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 7: [2023-05-10 10:10:54,326] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 6: [2023-05-10 10:10:54,328] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 25: [2023-05-10 10:10:54,328] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 25: [2023-05-10 10:10:54,328] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 6: [2023-05-10 10:10:54,329] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 6: [2023-05-10 10:10:54,329] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 26: [2023-05-10 10:10:54,330] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 6: [2023-05-10 10:10:54,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 6: [2023-05-10 10:10:54,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 6: [2023-05-10 10:10:54,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 16: [2023-05-10 10:10:54,330] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 6: [2023-05-10 10:10:54,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 25: [2023-05-10 10:10:54,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 25: [2023-05-10 10:10:54,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 25: [2023-05-10 10:10:54,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 7: [2023-05-10 10:10:54,329] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 7: [2023-05-10 10:10:54,329] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 6: [2023-05-10 10:10:54,330] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 25: [2023-05-10 10:10:54,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 25: [2023-05-10 10:10:54,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 7: [2023-05-10 10:10:54,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 7: [2023-05-10 10:10:54,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 7: [2023-05-10 10:10:54,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 25: [2023-05-10 10:10:54,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 6: [2023-05-10 10:10:54,333] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 25: [2023-05-10 10:10:54,335] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 25: [2023-05-10 10:10:54,335] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 19: [2023-05-10 10:10:54,336] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 19: [2023-05-10 10:10:54,336] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 25: [2023-05-10 10:10:54,336] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 25: [2023-05-10 10:10:54,337] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 25: [2023-05-10 10:10:54,337] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 6: [2023-05-10 10:10:54,337] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 25: [2023-05-10 10:10:54,337] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 25: [2023-05-10 10:10:54,338] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 26: [2023-05-10 10:10:54,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 26: [2023-05-10 10:10:54,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 26: [2023-05-10 10:10:54,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 25: [2023-05-10 10:10:54,338] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 6: [2023-05-10 10:10:54,339] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 6: [2023-05-10 10:10:54,340] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 19: [2023-05-10 10:10:54,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 19: [2023-05-10 10:10:54,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 19: [2023-05-10 10:10:54,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 19: [2023-05-10 10:10:54,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 19: [2023-05-10 10:10:54,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 19: [2023-05-10 10:10:54,340] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 19: [2023-05-10 10:10:54,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 26: [2023-05-10 10:10:54,340] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 26: [2023-05-10 10:10:54,341] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 26: [2023-05-10 10:10:54,341] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 6: [2023-05-10 10:10:54,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 6: [2023-05-10 10:10:54,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 26: [2023-05-10 10:10:54,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 6: [2023-05-10 10:10:54,342] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 19: [2023-05-10 10:10:54,342] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 9: [2023-05-10 10:10:54,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 9: [2023-05-10 10:10:54,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 9: [2023-05-10 10:10:54,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 9: [2023-05-10 10:10:54,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 9: [2023-05-10 10:10:54,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 9: [2023-05-10 10:10:54,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 9: [2023-05-10 10:10:54,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 9: [2023-05-10 10:10:54,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 10: [2023-05-10 10:10:54,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 10: [2023-05-10 10:10:54,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 10: [2023-05-10 10:10:54,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 10: [2023-05-10 10:10:54,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 10: [2023-05-10 10:10:54,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 10: [2023-05-10 10:10:54,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 10: [2023-05-10 10:10:54,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 10: [2023-05-10 10:10:54,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 19: [2023-05-10 10:10:54,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 31: [2023-05-10 10:10:54,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 19: [2023-05-10 10:10:54,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 0: [2023-05-10 10:10:54,348] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 9: [2023-05-10 10:10:54,347] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 0: [2023-05-10 10:10:54,348] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 0: [2023-05-10 10:10:54,348] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 31: [2023-05-10 10:10:54,348] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 0: [2023-05-10 10:10:54,348] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 0: [2023-05-10 10:10:54,348] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 0: [2023-05-10 10:10:54,348] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 0: [2023-05-10 10:10:54,348] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 0: [2023-05-10 10:10:54,348] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 19: [2023-05-10 10:10:54,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 10: [2023-05-10 10:10:54,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 10: [2023-05-10 10:10:54,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 10: [2023-05-10 10:10:54,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 19: [2023-05-10 10:10:54,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 19: [2023-05-10 10:10:54,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 7: [2023-05-10 10:10:54,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 19: [2023-05-10 10:10:54,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 10: [2023-05-10 10:10:54,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 9: [2023-05-10 10:10:54,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 9: [2023-05-10 10:10:54,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 9: [2023-05-10 10:10:54,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 9: [2023-05-10 10:10:54,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 10: [2023-05-10 10:10:54,351] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 10: [2023-05-10 10:10:54,351] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 16: [2023-05-10 10:10:54,351] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 9: [2023-05-10 10:10:54,351] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 9: [2023-05-10 10:10:54,351] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 9: [2023-05-10 10:10:54,352] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 0: [2023-05-10 10:10:54,352] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 10: [2023-05-10 10:10:54,352] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 10: [2023-05-10 10:10:54,352] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 0: [2023-05-10 10:10:54,353] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 0: [2023-05-10 10:10:54,353] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 7: [2023-05-10 10:10:54,354] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 0: [2023-05-10 10:10:54,355] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 0: [2023-05-10 10:10:54,355] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 0: [2023-05-10 10:10:54,355] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 0: [2023-05-10 10:10:54,356] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 16: [2023-05-10 10:10:54,356] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 0: [2023-05-10 10:10:54,356] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 16: [2023-05-10 10:10:54,360] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 7: [2023-05-10 10:10:54,354] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 16: [2023-05-10 10:10:54,361] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 7: [2023-05-10 10:10:54,356] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 7: [2023-05-10 10:10:54,358] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 7: [2023-05-10 10:10:54,360] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 7: [2023-05-10 10:10:54,360] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 7: [2023-05-10 10:10:54,361] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 30: [2023-05-10 10:10:54,361] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 16: [2023-05-10 10:10:54,362] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 16: [2023-05-10 10:10:54,362] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 26: [2023-05-10 10:10:54,364] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 31: [2023-05-10 10:10:54,368] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 3: [2023-05-10 10:10:54,368] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 3: [2023-05-10 10:10:54,368] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 3: [2023-05-10 10:10:54,368] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 31: [2023-05-10 10:10:54,368] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 31: [2023-05-10 10:10:54,368] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 26: [2023-05-10 10:10:54,370] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 31: [2023-05-10 10:10:54,372] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 31: [2023-05-10 10:10:54,372] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 31: [2023-05-10 10:10:54,372] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 31: [2023-05-10 10:10:54,372] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 31: [2023-05-10 10:10:54,372] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 26: [2023-05-10 10:10:54,372] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 6: [2023-05-10 10:10:54,372] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 17: [2023-05-10 10:10:54,373] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 17: [2023-05-10 10:10:54,374] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 17: [2023-05-10 10:10:54,374] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 3: [2023-05-10 10:10:54,374] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 2: [2023-05-10 10:10:54,374] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 2: [2023-05-10 10:10:54,374] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 17: [2023-05-10 10:10:54,374] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 17: [2023-05-10 10:10:54,374] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 17: [2023-05-10 10:10:54,374] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 17: [2023-05-10 10:10:54,374] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 17: [2023-05-10 10:10:54,374] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 18: [2023-05-10 10:10:54,374] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 18: [2023-05-10 10:10:54,374] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 18: [2023-05-10 10:10:54,374] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 18: [2023-05-10 10:10:54,374] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 17: [2023-05-10 10:10:54,376] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 3: [2023-05-10 10:10:54,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 26: [2023-05-10 10:10:54,378] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 2: [2023-05-10 10:10:54,379] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 2: [2023-05-10 10:10:54,379] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 2: [2023-05-10 10:10:54,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 2: [2023-05-10 10:10:54,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 2: [2023-05-10 10:10:54,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 2: [2023-05-10 10:10:54,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 2: [2023-05-10 10:10:54,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 2: [2023-05-10 10:10:54,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 30: [2023-05-10 10:10:54,380] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 18: [2023-05-10 10:10:54,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 18: [2023-05-10 10:10:54,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 18: [2023-05-10 10:10:54,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 18: [2023-05-10 10:10:54,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 26: [2023-05-10 10:10:54,380] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 26: [2023-05-10 10:10:54,381] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 17: [2023-05-10 10:10:54,385] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 17: [2023-05-10 10:10:54,386] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 17: [2023-05-10 10:10:54,386] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 17: [2023-05-10 10:10:54,386] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 17: [2023-05-10 10:10:54,387] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 3: [2023-05-10 10:10:54,387] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 3: [2023-05-10 10:10:54,387] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 3: [2023-05-10 10:10:54,387] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 17: [2023-05-10 10:10:54,387] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 19: [2023-05-10 10:10:54,387] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 17: [2023-05-10 10:10:54,388] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 6: [2023-05-10 10:10:54,388] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 2: [2023-05-10 10:10:54,390] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 2: [2023-05-10 10:10:54,390] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 2: [2023-05-10 10:10:54,391] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 2: [2023-05-10 10:10:54,391] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 2: [2023-05-10 10:10:54,392] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 2: [2023-05-10 10:10:54,392] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 1: [2023-05-10 10:10:54,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 1: [2023-05-10 10:10:54,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 1: [2023-05-10 10:10:54,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 1: [2023-05-10 10:10:54,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 1: [2023-05-10 10:10:54,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 1: [2023-05-10 10:10:54,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 1: [2023-05-10 10:10:54,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 1: [2023-05-10 10:10:54,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 6: [2023-05-10 10:10:54,392] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 30: [2023-05-10 10:10:54,393] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 25: [2023-05-10 10:10:54,393] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 25: [2023-05-10 10:10:54,393] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 25: [2023-05-10 10:10:54,393] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 3: [2023-05-10 10:10:54,395] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 3: [2023-05-10 10:10:54,395] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 30: [2023-05-10 10:10:54,396] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 1: [2023-05-10 10:10:54,397] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 30: [2023-05-10 10:10:54,397] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 25: [2023-05-10 10:10:54,397] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 25: [2023-05-10 10:10:54,398] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 25: [2023-05-10 10:10:54,398] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 25: [2023-05-10 10:10:54,398] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 3: [2023-05-10 10:10:54,398] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 1: [2023-05-10 10:10:54,398] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 25: [2023-05-10 10:10:54,398] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 1: [2023-05-10 10:10:54,398] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 31: [2023-05-10 10:10:54,398] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 9: [2023-05-10 10:10:54,399] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 18: [2023-05-10 10:10:54,399] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 1: [2023-05-10 10:10:54,400] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 1: [2023-05-10 10:10:54,400] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 24: [2023-05-10 10:10:54,400] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 18: [2023-05-10 10:10:54,400] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 24: [2023-05-10 10:10:54,400] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 1: [2023-05-10 10:10:54,400] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 24: [2023-05-10 10:10:54,401] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 24: [2023-05-10 10:10:54,401] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 24: [2023-05-10 10:10:54,401] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 24: [2023-05-10 10:10:54,401] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 24: [2023-05-10 10:10:54,401] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 24: [2023-05-10 10:10:54,401] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 11: [2023-05-10 10:10:54,401] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 11: [2023-05-10 10:10:54,401] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 1: [2023-05-10 10:10:54,401] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 18: [2023-05-10 10:10:54,401] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 18: [2023-05-10 10:10:54,401] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 11: [2023-05-10 10:10:54,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 11: [2023-05-10 10:10:54,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 11: [2023-05-10 10:10:54,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 30: [2023-05-10 10:10:54,401] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 30: [2023-05-10 10:10:54,401] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 30: [2023-05-10 10:10:54,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 30: [2023-05-10 10:10:54,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 11: [2023-05-10 10:10:54,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 11: [2023-05-10 10:10:54,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 1: [2023-05-10 10:10:54,402] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 11: [2023-05-10 10:10:54,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 6: [2023-05-10 10:10:54,401] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 19: [2023-05-10 10:10:54,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 19: [2023-05-10 10:10:54,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 3: [2023-05-10 10:10:54,403] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 24: [2023-05-10 10:10:54,404] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 19: [2023-05-10 10:10:54,404] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 10: [2023-05-10 10:10:54,404] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 10: [2023-05-10 10:10:54,404] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 0: [2023-05-10 10:10:54,404] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 24: [2023-05-10 10:10:54,405] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 31: [2023-05-10 10:10:54,405] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 18: [2023-05-10 10:10:54,405] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 11: [2023-05-10 10:10:54,405] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 11: [2023-05-10 10:10:54,406] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 31: [2023-05-10 10:10:54,406] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 18: [2023-05-10 10:10:54,407] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 9: [2023-05-10 10:10:54,407] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 9: [2023-05-10 10:10:54,407] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 19: [2023-05-10 10:10:54,408] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 31: [2023-05-10 10:10:54,408] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 31: [2023-05-10 10:10:54,408] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 10: [2023-05-10 10:10:54,407] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 31: [2023-05-10 10:10:54,408] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 24: [2023-05-10 10:10:54,408] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 9: [2023-05-10 10:10:54,408] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 9: [2023-05-10 10:10:54,408] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 18: [2023-05-10 10:10:54,408] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 24: [2023-05-10 10:10:54,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 24: [2023-05-10 10:10:54,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 18: [2023-05-10 10:10:54,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 24: [2023-05-10 10:10:54,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 6: [2023-05-10 10:10:54,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 24: [2023-05-10 10:10:54,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 24: [2023-05-10 10:10:54,410] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 10: [2023-05-10 10:10:54,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 6: [2023-05-10 10:10:54,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 6: [2023-05-10 10:10:54,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 3: [2023-05-10 10:10:54,410] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 22: [2023-05-10 10:10:54,411] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 22: [2023-05-10 10:10:54,411] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 11: [2023-05-10 10:10:54,412] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 22: [2023-05-10 10:10:54,412] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 22: [2023-05-10 10:10:54,412] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 22: [2023-05-10 10:10:54,412] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 22: [2023-05-10 10:10:54,412] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 22: [2023-05-10 10:10:54,412] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 22: [2023-05-10 10:10:54,412] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 11: [2023-05-10 10:10:54,413] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 9: [2023-05-10 10:10:54,413] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 11: [2023-05-10 10:10:54,413] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 3: [2023-05-10 10:10:54,413] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 9: [2023-05-10 10:10:54,413] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 9: [2023-05-10 10:10:54,413] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 11: [2023-05-10 10:10:54,414] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 11: [2023-05-10 10:10:54,414] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 0: [2023-05-10 10:10:54,414] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 0: [2023-05-10 10:10:54,414] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 11: [2023-05-10 10:10:54,414] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 6: [2023-05-10 10:10:54,415] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 6: [2023-05-10 10:10:54,415] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 6: [2023-05-10 10:10:54,415] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 22: [2023-05-10 10:10:54,415] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 3: [2023-05-10 10:10:54,416] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 22: [2023-05-10 10:10:54,417] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 19: [2023-05-10 10:10:54,417] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 3: [2023-05-10 10:10:54,418] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 17: [2023-05-10 10:10:54,418] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 19: [2023-05-10 10:10:54,420] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 10: [2023-05-10 10:10:54,420] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 10: [2023-05-10 10:10:54,420] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 10: [2023-05-10 10:10:54,420] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 10: [2023-05-10 10:10:54,420] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 22: [2023-05-10 10:10:54,420] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 22: [2023-05-10 10:10:54,420] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 19: [2023-05-10 10:10:54,421] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 19: [2023-05-10 10:10:54,421] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 5: [2023-05-10 10:10:54,421] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 5: [2023-05-10 10:10:54,421] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 22: [2023-05-10 10:10:54,422] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 22: [2023-05-10 10:10:54,422] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 5: [2023-05-10 10:10:54,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 22: [2023-05-10 10:10:54,423] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 30: [2023-05-10 10:10:54,423] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 30: [2023-05-10 10:10:54,423] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 22: [2023-05-10 10:10:54,423] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 25: [2023-05-10 10:10:54,423] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 25: [2023-05-10 10:10:54,423] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 25: [2023-05-10 10:10:54,423] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 5: [2023-05-10 10:10:54,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 5: [2023-05-10 10:10:54,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 5: [2023-05-10 10:10:54,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 5: [2023-05-10 10:10:54,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 5: [2023-05-10 10:10:54,424] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 0: [2023-05-10 10:10:54,424] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 0: [2023-05-10 10:10:54,424] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 0: [2023-05-10 10:10:54,424] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 0: [2023-05-10 10:10:54,424] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 0: [2023-05-10 10:10:54,424] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 9: [2023-05-10 10:10:54,424] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 28: [2023-05-10 10:10:54,425] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 28: [2023-05-10 10:10:54,425] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 25: [2023-05-10 10:10:54,426] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 25: [2023-05-10 10:10:54,426] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 6: [2023-05-10 10:10:54,427] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 5: [2023-05-10 10:10:54,426] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 5: [2023-05-10 10:10:54,427] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 19: [2023-05-10 10:10:54,427] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 19: [2023-05-10 10:10:54,427] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 0: [2023-05-10 10:10:54,428] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 10: [2023-05-10 10:10:54,428] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 28: [2023-05-10 10:10:54,427] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 28: [2023-05-10 10:10:54,427] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 28: [2023-05-10 10:10:54,427] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 28: [2023-05-10 10:10:54,427] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 28: [2023-05-10 10:10:54,427] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 28: [2023-05-10 10:10:54,428] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 19: [2023-05-10 10:10:54,428] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 21: [2023-05-10 10:10:54,428] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 21: [2023-05-10 10:10:54,428] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 5: [2023-05-10 10:10:54,429] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 21: [2023-05-10 10:10:54,429] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 28: [2023-05-10 10:10:54,429] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 21: [2023-05-10 10:10:54,429] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 21: [2023-05-10 10:10:54,429] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 21: [2023-05-10 10:10:54,429] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 21: [2023-05-10 10:10:54,429] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 28: [2023-05-10 10:10:54,429] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 21: [2023-05-10 10:10:54,430] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 2: [2023-05-10 10:10:54,430] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 2: [2023-05-10 10:10:54,430] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 13: [2023-05-10 10:10:54,430] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 13: [2023-05-10 10:10:54,431] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 13: [2023-05-10 10:10:54,431] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 13: [2023-05-10 10:10:54,431] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 13: [2023-05-10 10:10:54,431] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 13: [2023-05-10 10:10:54,431] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 13: [2023-05-10 10:10:54,431] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 13: [2023-05-10 10:10:54,431] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 25: [2023-05-10 10:10:54,432] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 25: [2023-05-10 10:10:54,432] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 25: [2023-05-10 10:10:54,432] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 5: [2023-05-10 10:10:54,432] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 5: [2023-05-10 10:10:54,433] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 5: [2023-05-10 10:10:54,433] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 10: [2023-05-10 10:10:54,433] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 30: [2023-05-10 10:10:54,434] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 5: [2023-05-10 10:10:54,434] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 30: [2023-05-10 10:10:54,435] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 5: [2023-05-10 10:10:54,435] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 10: [2023-05-10 10:10:54,435] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 21: [2023-05-10 10:10:54,435] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 21: [2023-05-10 10:10:54,435] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 10: [2023-05-10 10:10:54,436] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 21: [2023-05-10 10:10:54,436] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 30: [2023-05-10 10:10:54,436] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 21: [2023-05-10 10:10:54,436] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 21: [2023-05-10 10:10:54,436] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 21: [2023-05-10 10:10:54,436] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 9: [2023-05-10 10:10:54,436] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 21: [2023-05-10 10:10:54,437] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 28: [2023-05-10 10:10:54,437] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 9: [2023-05-10 10:10:54,437] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 28: [2023-05-10 10:10:54,438] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 21: [2023-05-10 10:10:54,438] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 28: [2023-05-10 10:10:54,438] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 17: [2023-05-10 10:10:54,438] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 28: [2023-05-10 10:10:54,438] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 28: [2023-05-10 10:10:54,438] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 28: [2023-05-10 10:10:54,439] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 20: [2023-05-10 10:10:54,439] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 20: [2023-05-10 10:10:54,439] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 30: [2023-05-10 10:10:54,439] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 30: [2023-05-10 10:10:54,440] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 20: [2023-05-10 10:10:54,439] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 20: [2023-05-10 10:10:54,440] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 20: [2023-05-10 10:10:54,440] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 20: [2023-05-10 10:10:54,440] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 20: [2023-05-10 10:10:54,440] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 0: [2023-05-10 10:10:54,440] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 20: [2023-05-10 10:10:54,440] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 0: [2023-05-10 10:10:54,441] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 9: [2023-05-10 10:10:54,442] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 20: [2023-05-10 10:10:54,442] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 9: [2023-05-10 10:10:54,442] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 9: [2023-05-10 10:10:54,442] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 13: [2023-05-10 10:10:54,443] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 6: [2023-05-10 10:10:54,443] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 6: [2023-05-10 10:10:54,444] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 13: [2023-05-10 10:10:54,444] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 9: [2023-05-10 10:10:54,444] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 13: [2023-05-10 10:10:54,444] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 9: [2023-05-10 10:10:54,444] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 13: [2023-05-10 10:10:54,445] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 13: [2023-05-10 10:10:54,445] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 23: [2023-05-10 10:10:54,445] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 23: [2023-05-10 10:10:54,445] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 13: [2023-05-10 10:10:54,446] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 13: [2023-05-10 10:10:54,446] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 23: [2023-05-10 10:10:54,446] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 13: [2023-05-10 10:10:54,447] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt... 6: [2023-05-10 10:10:54,447] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 20: [2023-05-10 10:10:54,448] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 6: [2023-05-10 10:10:54,448] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 4: [2023-05-10 10:10:54,448] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 4: [2023-05-10 10:10:54,448] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 4: [2023-05-10 10:10:54,448] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 4: [2023-05-10 10:10:54,448] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 4: [2023-05-10 10:10:54,448] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 4: [2023-05-10 10:10:54,448] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 4: [2023-05-10 10:10:54,448] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 23: [2023-05-10 10:10:54,448] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 23: [2023-05-10 10:10:54,448] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 23: [2023-05-10 10:10:54,448] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 23: [2023-05-10 10:10:54,448] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 20: [2023-05-10 10:10:54,449] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 4: [2023-05-10 10:10:54,449] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 23: [2023-05-10 10:10:54,449] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 19: [2023-05-10 10:10:54,449] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 10: [2023-05-10 10:10:54,449] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 19: [2023-05-10 10:10:54,450] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 6: [2023-05-10 10:10:54,450] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 20: [2023-05-10 10:10:54,450] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 23: [2023-05-10 10:10:54,451] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 20: [2023-05-10 10:10:54,451] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 23: [2023-05-10 10:10:54,451] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 20: [2023-05-10 10:10:54,451] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 19: [2023-05-10 10:10:54,451] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 20: [2023-05-10 10:10:54,451] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 23: [2023-05-10 10:10:54,451] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 19: [2023-05-10 10:10:54,452] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 10: [2023-05-10 10:10:54,452] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 10: [2023-05-10 10:10:54,452] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 4: [2023-05-10 10:10:54,452] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 1: [2023-05-10 10:10:54,453] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 1: [2023-05-10 10:10:54,453] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 1: [2023-05-10 10:10:54,453] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 0: [2023-05-10 10:10:54,453] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 2: [2023-05-10 10:10:54,453] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 2: [2023-05-10 10:10:54,453] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 4: [2023-05-10 10:10:54,453] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 20: [2023-05-10 10:10:54,453] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 4: [2023-05-10 10:10:54,454] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 10: [2023-05-10 10:10:54,454] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 0: [2023-05-10 10:10:54,454] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 4: [2023-05-10 10:10:54,454] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 0: [2023-05-10 10:10:54,454] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 8: [2023-05-10 10:10:54,454] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 8: [2023-05-10 10:10:54,455] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 8: [2023-05-10 10:10:54,455] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 8: [2023-05-10 10:10:54,455] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 8: [2023-05-10 10:10:54,455] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 8: [2023-05-10 10:10:54,455] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 8: [2023-05-10 10:10:54,455] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 23: [2023-05-10 10:10:54,455] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 8: [2023-05-10 10:10:54,455] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 24: [2023-05-10 10:10:54,456] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 24: [2023-05-10 10:10:54,456] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 4: [2023-05-10 10:10:54,456] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 23: [2023-05-10 10:10:54,456] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 4: [2023-05-10 10:10:54,456] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 4: [2023-05-10 10:10:54,457] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 4: [2023-05-10 10:10:54,457] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 0: [2023-05-10 10:10:54,457] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 0: [2023-05-10 10:10:54,458] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 23: [2023-05-10 10:10:54,457] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 17: [2023-05-10 10:10:54,457] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 17: [2023-05-10 10:10:54,457] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 23: [2023-05-10 10:10:54,458] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 1: [2023-05-10 10:10:54,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 23: [2023-05-10 10:10:54,458] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 11: [2023-05-10 10:10:54,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 11: [2023-05-10 10:10:54,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 8: [2023-05-10 10:10:54,460] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 2: [2023-05-10 10:10:54,460] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 2: [2023-05-10 10:10:54,460] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 2: [2023-05-10 10:10:54,460] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 8: [2023-05-10 10:10:54,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 8: [2023-05-10 10:10:54,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 8: [2023-05-10 10:10:54,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 8: [2023-05-10 10:10:54,462] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 2: [2023-05-10 10:10:54,462] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 2: [2023-05-10 10:10:54,462] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 8: [2023-05-10 10:10:54,462] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 8: [2023-05-10 10:10:54,462] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 8: [2023-05-10 10:10:54,462] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 22: [2023-05-10 10:10:54,463] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 1: [2023-05-10 10:10:54,465] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 2: [2023-05-10 10:10:54,468] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 17: [2023-05-10 10:10:54,470] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 17: [2023-05-10 10:10:54,470] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 17: [2023-05-10 10:10:54,470] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 17: [2023-05-10 10:10:54,470] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 17: [2023-05-10 10:10:54,470] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 1: [2023-05-10 10:10:54,471] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 1: [2023-05-10 10:10:54,472] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 1: [2023-05-10 10:10:54,472] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 22: [2023-05-10 10:10:54,472] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 1: [2023-05-10 10:10:54,477] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 24: [2023-05-10 10:10:54,478] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 1: [2023-05-10 10:10:54,478] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 1: [2023-05-10 10:10:54,479] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 24: [2023-05-10 10:10:54,479] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 28: [2023-05-10 10:10:54,480] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 28: [2023-05-10 10:10:54,480] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 5: [2023-05-10 10:10:54,480] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 5: [2023-05-10 10:10:54,480] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 17: [2023-05-10 10:10:54,481] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 24: [2023-05-10 10:10:54,481] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 24: [2023-05-10 10:10:54,481] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 24: [2023-05-10 10:10:54,481] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 24: [2023-05-10 10:10:54,481] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 24: [2023-05-10 10:10:54,481] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 24: [2023-05-10 10:10:54,481] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 11: [2023-05-10 10:10:54,482] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 11: [2023-05-10 10:10:54,482] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 1: [2023-05-10 10:10:54,483] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 11: [2023-05-10 10:10:54,483] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 11: [2023-05-10 10:10:54,483] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 11: [2023-05-10 10:10:54,484] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 17: [2023-05-10 10:10:54,484] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 11: [2023-05-10 10:10:54,484] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 11: [2023-05-10 10:10:54,484] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 11: [2023-05-10 10:10:54,484] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 5: [2023-05-10 10:10:54,484] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 20: [2023-05-10 10:10:54,484] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 22: [2023-05-10 10:10:54,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 22: [2023-05-10 10:10:54,487] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 1: [2023-05-10 10:10:54,490] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 21: [2023-05-10 10:10:54,491] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 21: [2023-05-10 10:10:54,491] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 21: [2023-05-10 10:10:54,492] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 2: [2023-05-10 10:10:54,493] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 21: [2023-05-10 10:10:54,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 22: [2023-05-10 10:10:54,496] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 22: [2023-05-10 10:10:54,496] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 2: [2023-05-10 10:10:54,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 2: [2023-05-10 10:10:54,497] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 1: [2023-05-10 10:10:54,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 22: [2023-05-10 10:10:54,498] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 22: [2023-05-10 10:10:54,498] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 22: [2023-05-10 10:10:54,498] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 22: [2023-05-10 10:10:54,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 5: [2023-05-10 10:10:54,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 5: [2023-05-10 10:10:54,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 1: [2023-05-10 10:10:54,500] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 5: [2023-05-10 10:10:54,500] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 5: [2023-05-10 10:10:54,500] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 5: [2023-05-10 10:10:54,500] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 2: [2023-05-10 10:10:54,500] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 1: [2023-05-10 10:10:54,500] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 2: [2023-05-10 10:10:54,500] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 28: [2023-05-10 10:10:54,501] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 28: [2023-05-10 10:10:54,501] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 2: [2023-05-10 10:10:54,502] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 21: [2023-05-10 10:10:54,502] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 17: [2023-05-10 10:10:54,504] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 17: [2023-05-10 10:10:54,504] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 28: [2023-05-10 10:10:54,504] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 28: [2023-05-10 10:10:54,504] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 21: [2023-05-10 10:10:54,505] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 20: [2023-05-10 10:10:54,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 5: [2023-05-10 10:10:54,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 5: [2023-05-10 10:10:54,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 4: [2023-05-10 10:10:54,505] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 28: [2023-05-10 10:10:54,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 28: [2023-05-10 10:10:54,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 23: [2023-05-10 10:10:54,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 23: [2023-05-10 10:10:54,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 23: [2023-05-10 10:10:54,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 5: [2023-05-10 10:10:54,507] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 28: [2023-05-10 10:10:54,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 28: [2023-05-10 10:10:54,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 17: [2023-05-10 10:10:54,507] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 21: [2023-05-10 10:10:54,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 21: [2023-05-10 10:10:54,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 24: [2023-05-10 10:10:54,509] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 22: [2023-05-10 10:10:54,510] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 4: [2023-05-10 10:10:54,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 4: [2023-05-10 10:10:54,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 4: [2023-05-10 10:10:54,511] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 17: [2023-05-10 10:10:54,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 17: [2023-05-10 10:10:54,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 24: [2023-05-10 10:10:54,512] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 20: [2023-05-10 10:10:54,513] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 21: [2023-05-10 10:10:54,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 23: [2023-05-10 10:10:54,515] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 23: [2023-05-10 10:10:54,515] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 24: [2023-05-10 10:10:54,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 4: [2023-05-10 10:10:54,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 4: [2023-05-10 10:10:54,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 4: [2023-05-10 10:10:54,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 11: [2023-05-10 10:10:54,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 4: [2023-05-10 10:10:54,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 8: [2023-05-10 10:10:54,517] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 11: [2023-05-10 10:10:54,519] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 21: [2023-05-10 10:10:54,518] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 11: [2023-05-10 10:10:54,519] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 24: [2023-05-10 10:10:54,519] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 11: [2023-05-10 10:10:54,519] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 24: [2023-05-10 10:10:54,519] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 24: [2023-05-10 10:10:54,519] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 27: [2023-05-10 10:10:54,521] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 8: [2023-05-10 10:10:54,520] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 8: [2023-05-10 10:10:54,520] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 27: [2023-05-10 10:10:54,521] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 27: [2023-05-10 10:10:54,521] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 27: [2023-05-10 10:10:54,521] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 27: [2023-05-10 10:10:54,521] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 27: [2023-05-10 10:10:54,521] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 27: [2023-05-10 10:10:54,521] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 27: [2023-05-10 10:10:54,521] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 8: [2023-05-10 10:10:54,521] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 11: [2023-05-10 10:10:54,522] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 13: [2023-05-10 10:10:54,522] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 13: [2023-05-10 10:10:54,522] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 13: [2023-05-10 10:10:54,522] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 20: [2023-05-10 10:10:54,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 11: [2023-05-10 10:10:54,524] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 21: [2023-05-10 10:10:54,525] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 21: [2023-05-10 10:10:54,525] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 27: [2023-05-10 10:10:54,525] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 23: [2023-05-10 10:10:54,526] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 20: [2023-05-10 10:10:54,526] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 27: [2023-05-10 10:10:54,526] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 22: [2023-05-10 10:10:54,526] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 27: [2023-05-10 10:10:54,527] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 22: [2023-05-10 10:10:54,527] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 23: [2023-05-10 10:10:54,528] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 22: [2023-05-10 10:10:54,529] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 5: [2023-05-10 10:10:54,529] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 5: [2023-05-10 10:10:54,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 5: [2023-05-10 10:10:54,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 4: [2023-05-10 10:10:54,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 23: [2023-05-10 10:10:54,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 23: [2023-05-10 10:10:54,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 27: [2023-05-10 10:10:54,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 27: [2023-05-10 10:10:54,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 27: [2023-05-10 10:10:54,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 27: [2023-05-10 10:10:54,531] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 27: [2023-05-10 10:10:54,531] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 8: [2023-05-10 10:10:54,532] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 8: [2023-05-10 10:10:54,532] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 8: [2023-05-10 10:10:54,532] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 8: [2023-05-10 10:10:54,532] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 5: [2023-05-10 10:10:54,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 5: [2023-05-10 10:10:54,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 21: [2023-05-10 10:10:54,533] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 23: [2023-05-10 10:10:54,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 13: [2023-05-10 10:10:54,534] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 13: [2023-05-10 10:10:54,534] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 13: [2023-05-10 10:10:54,534] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 13: [2023-05-10 10:10:54,534] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 13: [2023-05-10 10:10:54,534] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_18-model_00-model_states.pt. 22: [2023-05-10 10:10:54,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 22: [2023-05-10 10:10:54,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 20: [2023-05-10 10:10:54,535] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 20: [2023-05-10 10:10:54,535] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 20: [2023-05-10 10:10:54,535] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 20: [2023-05-10 10:10:54,535] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 23: [2023-05-10 10:10:54,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 20: [2023-05-10 10:10:54,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 21: [2023-05-10 10:10:54,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 21: [2023-05-10 10:10:54,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 4: [2023-05-10 10:10:54,538] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 21: [2023-05-10 10:10:54,538] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 4: [2023-05-10 10:10:54,539] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 4: [2023-05-10 10:10:54,539] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 28: [2023-05-10 10:10:54,540] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 28: [2023-05-10 10:10:54,542] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 28: [2023-05-10 10:10:54,542] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 8: [2023-05-10 10:10:54,542] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 29: [2023-05-10 10:10:54,543] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 29: [2023-05-10 10:10:54,543] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 4: [2023-05-10 10:10:54,543] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 28: [2023-05-10 10:10:54,543] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 4: [2023-05-10 10:10:54,544] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 4: [2023-05-10 10:10:54,544] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 28: [2023-05-10 10:10:54,544] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 29: [2023-05-10 10:10:54,544] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 28: [2023-05-10 10:10:54,545] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 29: [2023-05-10 10:10:54,545] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 29: [2023-05-10 10:10:54,545] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 29: [2023-05-10 10:10:54,545] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 29: [2023-05-10 10:10:54,545] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 29: [2023-05-10 10:10:54,545] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 23: [2023-05-10 10:10:54,545] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 29: [2023-05-10 10:10:54,546] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 23: [2023-05-10 10:10:54,546] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 29: [2023-05-10 10:10:54,547] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 4: [2023-05-10 10:10:54,548] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 8: [2023-05-10 10:10:54,549] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 8: [2023-05-10 10:10:54,549] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 8: [2023-05-10 10:10:54,550] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 13: [2023-05-10 10:10:54,550] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 13: [2023-05-10 10:10:54,551] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 13: [2023-05-10 10:10:54,553] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 29: [2023-05-10 10:10:54,554] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 29: [2023-05-10 10:10:54,555] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 29: [2023-05-10 10:10:54,555] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 20: [2023-05-10 10:10:54,555] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 23: [2023-05-10 10:10:54,555] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 29: [2023-05-10 10:10:54,555] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 29: [2023-05-10 10:10:54,556] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 29: [2023-05-10 10:10:54,556] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 23: [2023-05-10 10:10:54,557] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 20: [2023-05-10 10:10:54,558] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 23: [2023-05-10 10:10:54,559] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 8: [2023-05-10 10:10:54,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 8: [2023-05-10 10:10:54,565] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 8: [2023-05-10 10:10:54,565] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 13: [2023-05-10 10:10:54,565] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 8: [2023-05-10 10:10:54,566] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 20: [2023-05-10 10:10:54,568] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 20: [2023-05-10 10:10:54,569] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 13: [2023-05-10 10:10:54,570] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 20: [2023-05-10 10:10:54,571] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 20: [2023-05-10 10:10:54,571] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 13: [2023-05-10 10:10:54,571] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 13: [2023-05-10 10:10:54,572] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 13: [2023-05-10 10:10:54,573] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 27: [2023-05-10 10:10:54,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 27: [2023-05-10 10:10:54,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 27: [2023-05-10 10:10:54,583] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 29: [2023-05-10 10:10:54,589] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 27: [2023-05-10 10:10:54,596] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 27: [2023-05-10 10:10:54,597] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 27: [2023-05-10 10:10:54,598] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 27: [2023-05-10 10:10:54,598] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 27: [2023-05-10 10:10:54,598] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 27: [2023-05-10 10:10:54,598] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 29: [2023-05-10 10:10:54,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 27: [2023-05-10 10:10:54,603] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 27: [2023-05-10 10:10:54,606] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 29: [2023-05-10 10:10:54,611] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 12: [2023-05-10 10:10:54,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 12: [2023-05-10 10:10:54,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 29: [2023-05-10 10:10:54,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 12: [2023-05-10 10:10:54,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 12: [2023-05-10 10:10:54,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 12: [2023-05-10 10:10:54,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 12: [2023-05-10 10:10:54,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 12: [2023-05-10 10:10:54,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 12: [2023-05-10 10:10:54,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 29: [2023-05-10 10:10:54,625] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 12: [2023-05-10 10:10:54,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 29: [2023-05-10 10:10:54,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 27: [2023-05-10 10:10:54,627] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 29: [2023-05-10 10:10:54,627] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 27: [2023-05-10 10:10:54,628] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 27: [2023-05-10 10:10:54,628] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 27: [2023-05-10 10:10:54,629] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 27: [2023-05-10 10:10:54,630] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 12: [2023-05-10 10:10:54,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 29: [2023-05-10 10:10:54,633] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 29: [2023-05-10 10:10:54,633] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 29: [2023-05-10 10:10:54,633] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 12: [2023-05-10 10:10:54,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 12: [2023-05-10 10:10:54,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 12: [2023-05-10 10:10:54,634] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 12: [2023-05-10 10:10:54,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 12: [2023-05-10 10:10:54,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 12: [2023-05-10 10:10:54,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 16: [2023-05-10 10:10:54,636] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 16: [2023-05-10 10:10:54,636] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 16: [2023-05-10 10:10:54,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 16: [2023-05-10 10:10:54,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 16: [2023-05-10 10:10:54,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 16: [2023-05-10 10:10:54,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 16: [2023-05-10 10:10:54,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 16: [2023-05-10 10:10:54,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 7: [2023-05-10 10:10:54,638] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 7: [2023-05-10 10:10:54,638] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 7: [2023-05-10 10:10:54,638] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 7: [2023-05-10 10:10:54,638] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 7: [2023-05-10 10:10:54,638] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 7: [2023-05-10 10:10:54,638] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 7: [2023-05-10 10:10:54,638] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 7: [2023-05-10 10:10:54,638] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 7: [2023-05-10 10:10:54,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 16: [2023-05-10 10:10:54,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 7: [2023-05-10 10:10:54,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 16: [2023-05-10 10:10:54,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 7: [2023-05-10 10:10:54,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 7: [2023-05-10 10:10:54,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 7: [2023-05-10 10:10:54,645] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 15: [2023-05-10 10:10:54,645] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 7: [2023-05-10 10:10:54,645] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 15: [2023-05-10 10:10:54,645] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 15: [2023-05-10 10:10:54,646] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 29: [2023-05-10 10:10:54,646] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 15: [2023-05-10 10:10:54,646] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 15: [2023-05-10 10:10:54,646] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 15: [2023-05-10 10:10:54,646] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 15: [2023-05-10 10:10:54,646] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 15: [2023-05-10 10:10:54,646] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 16: [2023-05-10 10:10:54,646] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 16: [2023-05-10 10:10:54,646] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 16: [2023-05-10 10:10:54,646] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 16: [2023-05-10 10:10:54,646] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 7: [2023-05-10 10:10:54,647] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 7: [2023-05-10 10:10:54,647] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 16: [2023-05-10 10:10:54,647] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 16: [2023-05-10 10:10:54,648] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 15: [2023-05-10 10:10:54,649] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 29: [2023-05-10 10:10:54,653] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 15: [2023-05-10 10:10:54,656] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 15: [2023-05-10 10:10:54,657] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 15: [2023-05-10 10:10:54,658] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 15: [2023-05-10 10:10:54,658] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 29: [2023-05-10 10:10:54,658] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 15: [2023-05-10 10:10:54,658] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 15: [2023-05-10 10:10:54,659] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 15: [2023-05-10 10:10:54,659] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 29: [2023-05-10 10:10:54,666] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 29: [2023-05-10 10:10:54,666] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 29: [2023-05-10 10:10:54,670] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 12: [2023-05-10 10:10:54,678] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 14: [2023-05-10 10:10:54,684] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 14: [2023-05-10 10:10:54,684] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 14: [2023-05-10 10:10:54,684] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 14: [2023-05-10 10:10:54,685] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 14: [2023-05-10 10:10:54,685] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 14: [2023-05-10 10:10:54,685] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 14: [2023-05-10 10:10:54,685] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 14: [2023-05-10 10:10:54,685] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 14: [2023-05-10 10:10:54,689] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 14: [2023-05-10 10:10:54,690] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 14: [2023-05-10 10:10:54,692] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 14: [2023-05-10 10:10:54,692] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 14: [2023-05-10 10:10:54,693] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 7: [2023-05-10 10:10:54,695] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 14: [2023-05-10 10:10:54,695] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 14: [2023-05-10 10:10:54,695] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 14: [2023-05-10 10:10:54,696] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 16: [2023-05-10 10:10:54,696] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 7: [2023-05-10 10:10:54,697] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 15: [2023-05-10 10:10:54,698] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 12: [2023-05-10 10:10:54,701] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 16: [2023-05-10 10:10:54,700] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 7: [2023-05-10 10:10:54,706] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 7: [2023-05-10 10:10:54,706] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 7: [2023-05-10 10:10:54,706] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 7: [2023-05-10 10:10:54,706] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 12: [2023-05-10 10:10:54,706] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 12: [2023-05-10 10:10:54,706] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 12: [2023-05-10 10:10:54,709] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 7: [2023-05-10 10:10:54,709] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 7: [2023-05-10 10:10:54,709] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 16: [2023-05-10 10:10:54,717] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 7: [2023-05-10 10:10:54,719] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 6: [2023-05-10 10:10:54,719] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 6: [2023-05-10 10:10:54,719] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 7: [2023-05-10 10:10:54,720] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 6: [2023-05-10 10:10:54,720] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 16: [2023-05-10 10:10:54,720] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 16: [2023-05-10 10:10:54,720] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 16: [2023-05-10 10:10:54,720] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 16: [2023-05-10 10:10:54,720] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 16: [2023-05-10 10:10:54,720] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 16: [2023-05-10 10:10:54,720] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 6: [2023-05-10 10:10:54,720] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 15: [2023-05-10 10:10:54,720] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 12: [2023-05-10 10:10:54,720] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 12: [2023-05-10 10:10:54,720] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 12: [2023-05-10 10:10:54,720] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 6: [2023-05-10 10:10:54,720] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 6: [2023-05-10 10:10:54,720] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 6: [2023-05-10 10:10:54,720] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 12: [2023-05-10 10:10:54,720] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 6: [2023-05-10 10:10:54,720] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 6: [2023-05-10 10:10:54,723] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 6: [2023-05-10 10:10:54,724] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 16: [2023-05-10 10:10:54,727] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 15: [2023-05-10 10:10:54,728] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 15: [2023-05-10 10:10:54,728] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 6: [2023-05-10 10:10:54,729] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 6: [2023-05-10 10:10:54,729] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 6: [2023-05-10 10:10:54,730] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 6: [2023-05-10 10:10:54,730] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 31: [2023-05-10 10:10:54,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 31: [2023-05-10 10:10:54,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 31: [2023-05-10 10:10:54,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 31: [2023-05-10 10:10:54,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 31: [2023-05-10 10:10:54,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 31: [2023-05-10 10:10:54,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 31: [2023-05-10 10:10:54,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 6: [2023-05-10 10:10:54,730] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 7: [2023-05-10 10:10:54,730] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 31: [2023-05-10 10:10:54,731] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 6: [2023-05-10 10:10:54,731] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 26: [2023-05-10 10:10:54,732] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 26: [2023-05-10 10:10:54,732] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 26: [2023-05-10 10:10:54,732] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 26: [2023-05-10 10:10:54,732] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 26: [2023-05-10 10:10:54,732] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 26: [2023-05-10 10:10:54,732] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 26: [2023-05-10 10:10:54,732] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 26: [2023-05-10 10:10:54,733] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 31: [2023-05-10 10:10:54,734] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 12: [2023-05-10 10:10:54,734] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 31: [2023-05-10 10:10:54,735] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 14: [2023-05-10 10:10:54,735] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 26: [2023-05-10 10:10:54,737] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 26: [2023-05-10 10:10:54,737] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 15: [2023-05-10 10:10:54,736] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 15: [2023-05-10 10:10:54,736] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 15: [2023-05-10 10:10:54,736] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 15: [2023-05-10 10:10:54,736] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 15: [2023-05-10 10:10:54,736] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 12: [2023-05-10 10:10:54,738] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 7: [2023-05-10 10:10:54,734] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 7: [2023-05-10 10:10:54,734] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 7: [2023-05-10 10:10:54,735] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 7: [2023-05-10 10:10:54,737] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 7: [2023-05-10 10:10:54,738] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 31: [2023-05-10 10:10:54,738] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 31: [2023-05-10 10:10:54,739] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 12: [2023-05-10 10:10:54,739] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 31: [2023-05-10 10:10:54,739] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 31: [2023-05-10 10:10:54,740] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 31: [2023-05-10 10:10:54,740] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 31: [2023-05-10 10:10:54,741] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 26: [2023-05-10 10:10:54,741] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 26: [2023-05-10 10:10:54,741] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 26: [2023-05-10 10:10:54,742] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 26: [2023-05-10 10:10:54,743] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 26: [2023-05-10 10:10:54,743] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 16: [2023-05-10 10:10:54,744] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 26: [2023-05-10 10:10:54,745] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 16: [2023-05-10 10:10:54,749] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 14: [2023-05-10 10:10:54,749] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 14: [2023-05-10 10:10:54,749] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 12: [2023-05-10 10:10:54,753] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 16: [2023-05-10 10:10:54,754] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 16: [2023-05-10 10:10:54,755] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 15: [2023-05-10 10:10:54,756] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 12: [2023-05-10 10:10:54,756] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 16: [2023-05-10 10:10:54,756] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 12: [2023-05-10 10:10:54,756] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 12: [2023-05-10 10:10:54,756] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 18: [2023-05-10 10:10:54,755] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 18: [2023-05-10 10:10:54,755] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 18: [2023-05-10 10:10:54,755] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 18: [2023-05-10 10:10:54,755] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 18: [2023-05-10 10:10:54,755] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 18: [2023-05-10 10:10:54,755] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 18: [2023-05-10 10:10:54,755] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 18: [2023-05-10 10:10:54,756] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 16: [2023-05-10 10:10:54,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 15: [2023-05-10 10:10:54,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 14: [2023-05-10 10:10:54,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 14: [2023-05-10 10:10:54,760] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 14: [2023-05-10 10:10:54,760] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 14: [2023-05-10 10:10:54,760] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 14: [2023-05-10 10:10:54,761] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 18: [2023-05-10 10:10:54,761] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 18: [2023-05-10 10:10:54,761] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 18: [2023-05-10 10:10:54,761] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 18: [2023-05-10 10:10:54,762] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 18: [2023-05-10 10:10:54,762] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 18: [2023-05-10 10:10:54,764] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 18: [2023-05-10 10:10:54,764] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 18: [2023-05-10 10:10:54,765] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 14: [2023-05-10 10:10:54,768] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 15: [2023-05-10 10:10:54,770] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 15: [2023-05-10 10:10:54,771] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 15: [2023-05-10 10:10:54,772] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 15: [2023-05-10 10:10:54,774] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 15: [2023-05-10 10:10:54,775] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 6: [2023-05-10 10:10:54,775] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 6: [2023-05-10 10:10:54,775] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 14: [2023-05-10 10:10:54,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 14: [2023-05-10 10:10:54,780] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 31: [2023-05-10 10:10:54,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 31: [2023-05-10 10:10:54,786] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 6: [2023-05-10 10:10:54,789] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 26: [2023-05-10 10:10:54,789] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 26: [2023-05-10 10:10:54,789] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 24: [2023-05-10 10:10:54,791] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 24: [2023-05-10 10:10:54,791] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 24: [2023-05-10 10:10:54,791] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 24: [2023-05-10 10:10:54,791] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 24: [2023-05-10 10:10:54,791] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 24: [2023-05-10 10:10:54,791] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 24: [2023-05-10 10:10:54,791] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 24: [2023-05-10 10:10:54,791] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 14: [2023-05-10 10:10:54,793] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 14: [2023-05-10 10:10:54,794] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 6: [2023-05-10 10:10:54,795] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 14: [2023-05-10 10:10:54,795] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 24: [2023-05-10 10:10:54,795] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 14: [2023-05-10 10:10:54,795] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 24: [2023-05-10 10:10:54,795] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 6: [2023-05-10 10:10:54,796] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 14: [2023-05-10 10:10:54,796] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 6: [2023-05-10 10:10:54,797] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 3: [2023-05-10 10:10:54,798] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 3: [2023-05-10 10:10:54,798] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 24: [2023-05-10 10:10:54,799] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 3: [2023-05-10 10:10:54,799] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 3: [2023-05-10 10:10:54,799] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 24: [2023-05-10 10:10:54,799] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 3: [2023-05-10 10:10:54,799] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 3: [2023-05-10 10:10:54,799] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 3: [2023-05-10 10:10:54,799] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 24: [2023-05-10 10:10:54,799] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 3: [2023-05-10 10:10:54,799] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 24: [2023-05-10 10:10:54,799] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 24: [2023-05-10 10:10:54,800] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 24: [2023-05-10 10:10:54,801] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 6: [2023-05-10 10:10:54,801] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 26: [2023-05-10 10:10:54,801] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 26: [2023-05-10 10:10:54,801] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 3: [2023-05-10 10:10:54,803] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 3: [2023-05-10 10:10:54,803] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 3: [2023-05-10 10:10:54,804] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 30: [2023-05-10 10:10:54,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 9: [2023-05-10 10:10:54,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 9: [2023-05-10 10:10:54,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 30: [2023-05-10 10:10:54,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 30: [2023-05-10 10:10:54,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 30: [2023-05-10 10:10:54,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 30: [2023-05-10 10:10:54,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 30: [2023-05-10 10:10:54,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 30: [2023-05-10 10:10:54,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 30: [2023-05-10 10:10:54,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 25: [2023-05-10 10:10:54,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 25: [2023-05-10 10:10:54,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 9: [2023-05-10 10:10:54,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 9: [2023-05-10 10:10:54,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 25: [2023-05-10 10:10:54,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 25: [2023-05-10 10:10:54,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 31: [2023-05-10 10:10:54,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 31: [2023-05-10 10:10:54,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 9: [2023-05-10 10:10:54,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 9: [2023-05-10 10:10:54,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 9: [2023-05-10 10:10:54,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 25: [2023-05-10 10:10:54,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 9: [2023-05-10 10:10:54,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 25: [2023-05-10 10:10:54,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 25: [2023-05-10 10:10:54,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 25: [2023-05-10 10:10:54,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 10: [2023-05-10 10:10:54,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 3: [2023-05-10 10:10:54,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 3: [2023-05-10 10:10:54,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 10: [2023-05-10 10:10:54,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 22: [2023-05-10 10:10:54,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 10: [2023-05-10 10:10:54,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 10: [2023-05-10 10:10:54,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 10: [2023-05-10 10:10:54,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 10: [2023-05-10 10:10:54,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 22: [2023-05-10 10:10:54,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 10: [2023-05-10 10:10:54,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 10: [2023-05-10 10:10:54,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 22: [2023-05-10 10:10:54,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 22: [2023-05-10 10:10:54,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 22: [2023-05-10 10:10:54,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 22: [2023-05-10 10:10:54,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 22: [2023-05-10 10:10:54,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 11: [2023-05-10 10:10:54,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 11: [2023-05-10 10:10:54,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 22: [2023-05-10 10:10:54,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 3: [2023-05-10 10:10:54,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 11: [2023-05-10 10:10:54,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 11: [2023-05-10 10:10:54,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 11: [2023-05-10 10:10:54,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 11: [2023-05-10 10:10:54,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 11: [2023-05-10 10:10:54,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 11: [2023-05-10 10:10:54,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 18: [2023-05-10 10:10:54,808] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 3: [2023-05-10 10:10:54,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 30: [2023-05-10 10:10:54,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 3: [2023-05-10 10:10:54,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 31: [2023-05-10 10:10:54,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 31: [2023-05-10 10:10:54,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 31: [2023-05-10 10:10:54,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 31: [2023-05-10 10:10:54,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 31: [2023-05-10 10:10:54,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 6: [2023-05-10 10:10:54,811] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 22: [2023-05-10 10:10:54,811] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 9: [2023-05-10 10:10:54,812] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 9: [2023-05-10 10:10:54,812] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 10: [2023-05-10 10:10:54,812] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 25: [2023-05-10 10:10:54,812] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 9: [2023-05-10 10:10:54,813] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 11: [2023-05-10 10:10:54,812] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 11: [2023-05-10 10:10:54,812] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 22: [2023-05-10 10:10:54,813] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 26: [2023-05-10 10:10:54,813] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 9: [2023-05-10 10:10:54,813] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 25: [2023-05-10 10:10:54,813] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 30: [2023-05-10 10:10:54,813] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 25: [2023-05-10 10:10:54,813] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 10: [2023-05-10 10:10:54,813] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 25: [2023-05-10 10:10:54,813] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 6: [2023-05-10 10:10:54,813] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 6: [2023-05-10 10:10:54,813] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 10: [2023-05-10 10:10:54,814] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 9: [2023-05-10 10:10:54,814] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 9: [2023-05-10 10:10:54,814] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 6: [2023-05-10 10:10:54,814] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 25: [2023-05-10 10:10:54,814] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 31: [2023-05-10 10:10:54,814] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 25: [2023-05-10 10:10:54,814] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 9: [2023-05-10 10:10:54,815] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 25: [2023-05-10 10:10:54,814] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 26: [2023-05-10 10:10:54,815] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 9: [2023-05-10 10:10:54,815] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 25: [2023-05-10 10:10:54,815] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 10: [2023-05-10 10:10:54,815] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 10: [2023-05-10 10:10:54,816] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 22: [2023-05-10 10:10:54,815] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 30: [2023-05-10 10:10:54,816] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 30: [2023-05-10 10:10:54,816] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 10: [2023-05-10 10:10:54,816] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 10: [2023-05-10 10:10:54,817] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 11: [2023-05-10 10:10:54,817] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 30: [2023-05-10 10:10:54,817] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 10: [2023-05-10 10:10:54,818] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 30: [2023-05-10 10:10:54,818] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 22: [2023-05-10 10:10:54,817] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 22: [2023-05-10 10:10:54,818] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 11: [2023-05-10 10:10:54,818] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 22: [2023-05-10 10:10:54,819] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 11: [2023-05-10 10:10:54,819] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 26: [2023-05-10 10:10:54,819] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 26: [2023-05-10 10:10:54,819] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 26: [2023-05-10 10:10:54,819] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 22: [2023-05-10 10:10:54,819] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 11: [2023-05-10 10:10:54,819] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 22: [2023-05-10 10:10:54,820] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 11: [2023-05-10 10:10:54,820] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 18: [2023-05-10 10:10:54,820] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 18: [2023-05-10 10:10:54,820] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 18: [2023-05-10 10:10:54,820] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 18: [2023-05-10 10:10:54,820] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 30: [2023-05-10 10:10:54,820] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 11: [2023-05-10 10:10:54,821] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 19: [2023-05-10 10:10:54,821] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 19: [2023-05-10 10:10:54,821] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 30: [2023-05-10 10:10:54,820] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 19: [2023-05-10 10:10:54,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 19: [2023-05-10 10:10:54,823] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 19: [2023-05-10 10:10:54,823] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 19: [2023-05-10 10:10:54,823] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 19: [2023-05-10 10:10:54,823] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 19: [2023-05-10 10:10:54,823] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 19: [2023-05-10 10:10:54,825] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 26: [2023-05-10 10:10:54,825] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 17: [2023-05-10 10:10:54,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 17: [2023-05-10 10:10:54,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 17: [2023-05-10 10:10:54,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 17: [2023-05-10 10:10:54,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 19: [2023-05-10 10:10:54,826] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 17: [2023-05-10 10:10:54,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 17: [2023-05-10 10:10:54,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 17: [2023-05-10 10:10:54,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 17: [2023-05-10 10:10:54,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 26: [2023-05-10 10:10:54,827] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 26: [2023-05-10 10:10:54,827] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 19: [2023-05-10 10:10:54,827] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 18: [2023-05-10 10:10:54,828] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 6: [2023-05-10 10:10:54,829] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 17: [2023-05-10 10:10:54,829] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 18: [2023-05-10 10:10:54,830] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 18: [2023-05-10 10:10:54,830] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 18: [2023-05-10 10:10:54,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 6: [2023-05-10 10:10:54,831] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 19: [2023-05-10 10:10:54,831] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 19: [2023-05-10 10:10:54,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 19: [2023-05-10 10:10:54,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 19: [2023-05-10 10:10:54,833] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 19: [2023-05-10 10:10:54,833] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 17: [2023-05-10 10:10:54,837] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 17: [2023-05-10 10:10:54,838] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 17: [2023-05-10 10:10:54,838] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 6: [2023-05-10 10:10:54,838] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 17: [2023-05-10 10:10:54,839] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 17: [2023-05-10 10:10:54,839] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 17: [2023-05-10 10:10:54,839] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 17: [2023-05-10 10:10:54,840] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 6: [2023-05-10 10:10:54,842] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 31: [2023-05-10 10:10:54,843] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 31: [2023-05-10 10:10:54,843] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 31: [2023-05-10 10:10:54,843] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 18: [2023-05-10 10:10:54,843] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 6: [2023-05-10 10:10:54,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 31: [2023-05-10 10:10:54,845] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 31: [2023-05-10 10:10:54,846] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 28: [2023-05-10 10:10:54,847] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 28: [2023-05-10 10:10:54,847] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 24: [2023-05-10 10:10:54,847] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 24: [2023-05-10 10:10:54,847] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 31: [2023-05-10 10:10:54,848] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 18: [2023-05-10 10:10:54,850] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 26: [2023-05-10 10:10:54,850] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 26: [2023-05-10 10:10:54,850] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 28: [2023-05-10 10:10:54,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 28: [2023-05-10 10:10:54,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 28: [2023-05-10 10:10:54,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 28: [2023-05-10 10:10:54,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 28: [2023-05-10 10:10:54,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 28: [2023-05-10 10:10:54,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 18: [2023-05-10 10:10:54,850] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 28: [2023-05-10 10:10:54,851] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 18: [2023-05-10 10:10:54,850] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 28: [2023-05-10 10:10:54,851] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 26: [2023-05-10 10:10:54,853] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 22: [2023-05-10 10:10:54,855] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 18: [2023-05-10 10:10:54,856] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 3: [2023-05-10 10:10:54,858] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 3: [2023-05-10 10:10:54,858] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 3: [2023-05-10 10:10:54,858] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 26: [2023-05-10 10:10:54,858] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 10: [2023-05-10 10:10:54,857] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 18: [2023-05-10 10:10:54,859] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 18: [2023-05-10 10:10:54,859] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 30: [2023-05-10 10:10:54,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 28: [2023-05-10 10:10:54,860] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 28: [2023-05-10 10:10:54,861] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 28: [2023-05-10 10:10:54,861] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 28: [2023-05-10 10:10:54,861] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 28: [2023-05-10 10:10:54,861] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 28: [2023-05-10 10:10:54,863] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 9: [2023-05-10 10:10:54,862] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 3: [2023-05-10 10:10:54,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 11: [2023-05-10 10:10:54,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 11: [2023-05-10 10:10:54,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 1: [2023-05-10 10:10:54,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 1: [2023-05-10 10:10:54,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 1: [2023-05-10 10:10:54,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 1: [2023-05-10 10:10:54,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 1: [2023-05-10 10:10:54,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 1: [2023-05-10 10:10:54,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 1: [2023-05-10 10:10:54,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 1: [2023-05-10 10:10:54,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 10: [2023-05-10 10:10:54,870] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 10: [2023-05-10 10:10:54,870] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 22: [2023-05-10 10:10:54,870] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 3: [2023-05-10 10:10:54,870] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 3: [2023-05-10 10:10:54,870] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 25: [2023-05-10 10:10:54,870] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 25: [2023-05-10 10:10:54,870] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 25: [2023-05-10 10:10:54,871] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 19: [2023-05-10 10:10:54,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 24: [2023-05-10 10:10:54,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 24: [2023-05-10 10:10:54,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 24: [2023-05-10 10:10:54,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 24: [2023-05-10 10:10:54,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 24: [2023-05-10 10:10:54,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 24: [2023-05-10 10:10:54,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 24: [2023-05-10 10:10:54,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 24: [2023-05-10 10:10:54,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 9: [2023-05-10 10:10:54,874] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 9: [2023-05-10 10:10:54,874] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 1: [2023-05-10 10:10:54,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 1: [2023-05-10 10:10:54,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 25: [2023-05-10 10:10:54,874] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 25: [2023-05-10 10:10:54,874] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 25: [2023-05-10 10:10:54,874] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 25: [2023-05-10 10:10:54,874] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 1: [2023-05-10 10:10:54,875] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 25: [2023-05-10 10:10:54,875] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 1: [2023-05-10 10:10:54,876] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 9: [2023-05-10 10:10:54,876] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 30: [2023-05-10 10:10:54,876] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 1: [2023-05-10 10:10:54,876] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 9: [2023-05-10 10:10:54,876] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 9: [2023-05-10 10:10:54,876] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 9: [2023-05-10 10:10:54,876] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 22: [2023-05-10 10:10:54,877] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 1: [2023-05-10 10:10:54,877] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 1: [2023-05-10 10:10:54,878] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 9: [2023-05-10 10:10:54,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 10: [2023-05-10 10:10:54,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 1: [2023-05-10 10:10:54,878] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 30: [2023-05-10 10:10:54,878] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 11: [2023-05-10 10:10:54,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 17: [2023-05-10 10:10:54,879] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 10: [2023-05-10 10:10:54,880] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 22: [2023-05-10 10:10:54,880] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 3: [2023-05-10 10:10:54,882] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 3: [2023-05-10 10:10:54,883] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 3: [2023-05-10 10:10:54,883] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 19: [2023-05-10 10:10:54,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 19: [2023-05-10 10:10:54,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 9: [2023-05-10 10:10:54,884] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 10: [2023-05-10 10:10:54,884] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 10: [2023-05-10 10:10:54,884] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 10: [2023-05-10 10:10:54,884] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 10: [2023-05-10 10:10:54,884] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 3: [2023-05-10 10:10:54,886] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 3: [2023-05-10 10:10:54,886] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 11: [2023-05-10 10:10:54,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 11: [2023-05-10 10:10:54,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 30: [2023-05-10 10:10:54,891] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 30: [2023-05-10 10:10:54,891] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 22: [2023-05-10 10:10:54,892] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 22: [2023-05-10 10:10:54,892] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 11: [2023-05-10 10:10:54,892] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 11: [2023-05-10 10:10:54,892] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 11: [2023-05-10 10:10:54,892] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 11: [2023-05-10 10:10:54,893] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 19: [2023-05-10 10:10:54,894] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 19: [2023-05-10 10:10:54,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 10: [2023-05-10 10:10:54,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 11: [2023-05-10 10:10:54,895] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 3: [2023-05-10 10:10:54,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 19: [2023-05-10 10:10:54,895] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 22: [2023-05-10 10:10:54,896] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 22: [2023-05-10 10:10:54,896] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 22: [2023-05-10 10:10:54,896] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 19: [2023-05-10 10:10:54,897] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 19: [2023-05-10 10:10:54,897] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 19: [2023-05-10 10:10:54,897] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 17: [2023-05-10 10:10:54,899] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 22: [2023-05-10 10:10:54,899] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 25: [2023-05-10 10:10:54,899] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 25: [2023-05-10 10:10:54,899] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 25: [2023-05-10 10:10:54,900] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 10: [2023-05-10 10:10:54,900] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 24: [2023-05-10 10:10:54,900] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 9: [2023-05-10 10:10:54,901] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 25: [2023-05-10 10:10:54,901] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 3: [2023-05-10 10:10:54,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 3: [2023-05-10 10:10:54,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 11: [2023-05-10 10:10:54,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 10: [2023-05-10 10:10:54,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 28: [2023-05-10 10:10:54,903] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 28: [2023-05-10 10:10:54,903] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 9: [2023-05-10 10:10:54,903] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 9: [2023-05-10 10:10:54,903] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 30: [2023-05-10 10:10:54,903] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 25: [2023-05-10 10:10:54,904] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 24: [2023-05-10 10:10:54,904] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 22: [2023-05-10 10:10:54,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 9: [2023-05-10 10:10:54,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 30: [2023-05-10 10:10:54,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 25: [2023-05-10 10:10:54,906] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 24: [2023-05-10 10:10:54,906] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 25: [2023-05-10 10:10:54,906] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 25: [2023-05-10 10:10:54,906] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 2: [2023-05-10 10:10:54,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 2: [2023-05-10 10:10:54,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 19: [2023-05-10 10:10:54,907] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 19: [2023-05-10 10:10:54,907] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 24: [2023-05-10 10:10:54,907] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 9: [2023-05-10 10:10:54,907] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 2: [2023-05-10 10:10:54,907] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 2: [2023-05-10 10:10:54,907] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 9: [2023-05-10 10:10:54,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 2: [2023-05-10 10:10:54,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 2: [2023-05-10 10:10:54,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 2: [2023-05-10 10:10:54,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 24: [2023-05-10 10:10:54,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 2: [2023-05-10 10:10:54,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 9: [2023-05-10 10:10:54,909] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 2: [2023-05-10 10:10:54,910] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 24: [2023-05-10 10:10:54,910] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 2: [2023-05-10 10:10:54,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 17: [2023-05-10 10:10:54,911] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 17: [2023-05-10 10:10:54,911] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 30: [2023-05-10 10:10:54,912] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 30: [2023-05-10 10:10:54,912] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 3: [2023-05-10 10:10:54,913] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 0: [2023-05-10 10:10:54,914] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 0: [2023-05-10 10:10:54,914] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 0: [2023-05-10 10:10:54,914] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 0: [2023-05-10 10:10:54,914] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 0: [2023-05-10 10:10:54,914] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 0: [2023-05-10 10:10:54,914] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 0: [2023-05-10 10:10:54,914] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 0: [2023-05-10 10:10:54,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 3: [2023-05-10 10:10:54,915] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 10: [2023-05-10 10:10:54,915] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 10: [2023-05-10 10:10:54,915] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 10: [2023-05-10 10:10:54,917] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 0: [2023-05-10 10:10:54,918] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 10: [2023-05-10 10:10:54,917] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 2: [2023-05-10 10:10:54,917] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 2: [2023-05-10 10:10:54,918] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 2: [2023-05-10 10:10:54,918] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 30: [2023-05-10 10:10:54,918] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 17: [2023-05-10 10:10:54,918] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 17: [2023-05-10 10:10:54,918] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 17: [2023-05-10 10:10:54,918] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 17: [2023-05-10 10:10:54,918] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 17: [2023-05-10 10:10:54,918] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 2: [2023-05-10 10:10:54,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 2: [2023-05-10 10:10:54,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 0: [2023-05-10 10:10:54,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 2: [2023-05-10 10:10:54,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 0: [2023-05-10 10:10:54,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 0: [2023-05-10 10:10:54,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 0: [2023-05-10 10:10:54,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 30: [2023-05-10 10:10:54,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 0: [2023-05-10 10:10:54,922] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 22: [2023-05-10 10:10:54,922] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 0: [2023-05-10 10:10:54,922] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 0: [2023-05-10 10:10:54,922] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 22: [2023-05-10 10:10:54,923] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 19: [2023-05-10 10:10:54,923] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 22: [2023-05-10 10:10:54,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 28: [2023-05-10 10:10:54,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 29: [2023-05-10 10:10:54,923] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 29: [2023-05-10 10:10:54,923] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 29: [2023-05-10 10:10:54,923] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 29: [2023-05-10 10:10:54,924] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 29: [2023-05-10 10:10:54,924] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 30: [2023-05-10 10:10:54,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 21: [2023-05-10 10:10:54,924] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 21: [2023-05-10 10:10:54,924] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 21: [2023-05-10 10:10:54,924] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 21: [2023-05-10 10:10:54,924] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 21: [2023-05-10 10:10:54,924] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 29: [2023-05-10 10:10:54,924] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 29: [2023-05-10 10:10:54,924] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 21: [2023-05-10 10:10:54,924] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 21: [2023-05-10 10:10:54,924] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 29: [2023-05-10 10:10:54,924] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 21: [2023-05-10 10:10:54,924] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 11: [2023-05-10 10:10:54,925] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 22: [2023-05-10 10:10:54,925] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 19: [2023-05-10 10:10:54,925] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 11: [2023-05-10 10:10:54,925] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 28: [2023-05-10 10:10:54,926] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 11: [2023-05-10 10:10:54,926] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 29: [2023-05-10 10:10:54,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 1: [2023-05-10 10:10:54,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 1: [2023-05-10 10:10:54,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 29: [2023-05-10 10:10:54,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 22: [2023-05-10 10:10:54,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 28: [2023-05-10 10:10:54,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 28: [2023-05-10 10:10:54,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 11: [2023-05-10 10:10:54,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 23: [2023-05-10 10:10:54,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 23: [2023-05-10 10:10:54,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 23: [2023-05-10 10:10:54,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 23: [2023-05-10 10:10:54,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 23: [2023-05-10 10:10:54,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 23: [2023-05-10 10:10:54,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 23: [2023-05-10 10:10:54,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 19: [2023-05-10 10:10:54,929] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 19: [2023-05-10 10:10:54,929] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 23: [2023-05-10 10:10:54,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 29: [2023-05-10 10:10:54,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 21: [2023-05-10 10:10:54,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 21: [2023-05-10 10:10:54,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 21: [2023-05-10 10:10:54,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 11: [2023-05-10 10:10:54,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 29: [2023-05-10 10:10:54,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 29: [2023-05-10 10:10:54,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 21: [2023-05-10 10:10:54,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 19: [2023-05-10 10:10:54,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 21: [2023-05-10 10:10:54,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 21: [2023-05-10 10:10:54,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 28: [2023-05-10 10:10:54,931] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 28: [2023-05-10 10:10:54,931] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 29: [2023-05-10 10:10:54,932] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 21: [2023-05-10 10:10:54,932] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 28: [2023-05-10 10:10:54,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 29: [2023-05-10 10:10:54,933] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 1: [2023-05-10 10:10:54,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 29: [2023-05-10 10:10:54,933] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 21: [2023-05-10 10:10:54,933] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 28: [2023-05-10 10:10:54,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 23: [2023-05-10 10:10:54,934] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 23: [2023-05-10 10:10:54,934] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 23: [2023-05-10 10:10:54,934] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 23: [2023-05-10 10:10:54,934] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 23: [2023-05-10 10:10:54,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 23: [2023-05-10 10:10:54,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 13: [2023-05-10 10:10:54,936] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 13: [2023-05-10 10:10:54,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 13: [2023-05-10 10:10:54,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 13: [2023-05-10 10:10:54,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 13: [2023-05-10 10:10:54,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 23: [2023-05-10 10:10:54,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 13: [2023-05-10 10:10:54,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 13: [2023-05-10 10:10:54,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 13: [2023-05-10 10:10:54,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 30: [2023-05-10 10:10:54,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 23: [2023-05-10 10:10:54,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 8: [2023-05-10 10:10:54,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 8: [2023-05-10 10:10:54,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 8: [2023-05-10 10:10:54,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 8: [2023-05-10 10:10:54,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 8: [2023-05-10 10:10:54,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 8: [2023-05-10 10:10:54,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 8: [2023-05-10 10:10:54,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 8: [2023-05-10 10:10:54,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 1: [2023-05-10 10:10:54,941] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 17: [2023-05-10 10:10:54,942] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 17: [2023-05-10 10:10:54,942] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 30: [2023-05-10 10:10:54,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 1: [2023-05-10 10:10:54,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 1: [2023-05-10 10:10:54,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 1: [2023-05-10 10:10:54,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 1: [2023-05-10 10:10:54,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 8: [2023-05-10 10:10:54,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 8: [2023-05-10 10:10:54,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 8: [2023-05-10 10:10:54,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 13: [2023-05-10 10:10:54,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 13: [2023-05-10 10:10:54,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 8: [2023-05-10 10:10:54,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 8: [2023-05-10 10:10:54,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 8: [2023-05-10 10:10:54,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 8: [2023-05-10 10:10:54,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 8: [2023-05-10 10:10:54,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 30: [2023-05-10 10:10:54,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 30: [2023-05-10 10:10:54,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 13: [2023-05-10 10:10:54,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 13: [2023-05-10 10:10:54,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 13: [2023-05-10 10:10:54,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 13: [2023-05-10 10:10:54,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 1: [2023-05-10 10:10:54,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 1: [2023-05-10 10:10:54,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 17: [2023-05-10 10:10:54,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 13: [2023-05-10 10:10:54,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 13: [2023-05-10 10:10:54,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt... 5: [2023-05-10 10:10:54,955] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 5: [2023-05-10 10:10:54,955] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 5: [2023-05-10 10:10:54,955] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 5: [2023-05-10 10:10:54,956] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 5: [2023-05-10 10:10:54,956] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 5: [2023-05-10 10:10:54,956] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 5: [2023-05-10 10:10:54,956] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 5: [2023-05-10 10:10:54,956] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 17: [2023-05-10 10:10:54,956] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 17: [2023-05-10 10:10:54,956] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 4: [2023-05-10 10:10:54,956] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 4: [2023-05-10 10:10:54,956] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 4: [2023-05-10 10:10:54,956] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 4: [2023-05-10 10:10:54,956] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 4: [2023-05-10 10:10:54,956] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 17: [2023-05-10 10:10:54,957] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 4: [2023-05-10 10:10:54,956] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 4: [2023-05-10 10:10:54,956] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 4: [2023-05-10 10:10:54,957] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 28: [2023-05-10 10:10:54,959] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 17: [2023-05-10 10:10:54,959] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 1: [2023-05-10 10:10:54,961] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 28: [2023-05-10 10:10:54,961] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 4: [2023-05-10 10:10:54,961] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 4: [2023-05-10 10:10:54,961] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 4: [2023-05-10 10:10:54,962] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 2: [2023-05-10 10:10:54,961] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 4: [2023-05-10 10:10:54,962] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 5: [2023-05-10 10:10:54,962] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 5: [2023-05-10 10:10:54,963] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 4: [2023-05-10 10:10:54,963] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 5: [2023-05-10 10:10:54,963] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 4: [2023-05-10 10:10:54,963] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 2: [2023-05-10 10:10:54,963] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 28: [2023-05-10 10:10:54,963] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 4: [2023-05-10 10:10:54,964] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 5: [2023-05-10 10:10:54,964] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 12: [2023-05-10 10:10:54,964] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 12: [2023-05-10 10:10:54,964] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 4: [2023-05-10 10:10:54,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 5: [2023-05-10 10:10:54,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 5: [2023-05-10 10:10:54,966] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 28: [2023-05-10 10:10:54,966] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 5: [2023-05-10 10:10:54,966] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 28: [2023-05-10 10:10:54,966] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 5: [2023-05-10 10:10:54,966] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 12: [2023-05-10 10:10:54,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 12: [2023-05-10 10:10:54,967] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 12: [2023-05-10 10:10:54,967] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 12: [2023-05-10 10:10:54,967] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 12: [2023-05-10 10:10:54,967] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 12: [2023-05-10 10:10:54,968] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 1: [2023-05-10 10:10:54,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 28: [2023-05-10 10:10:54,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 12: [2023-05-10 10:10:54,969] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 0: [2023-05-10 10:10:54,969] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 1: [2023-05-10 10:10:54,970] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 29: [2023-05-10 10:10:54,971] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 1: [2023-05-10 10:10:54,972] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 1: [2023-05-10 10:10:54,973] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 1: [2023-05-10 10:10:54,973] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 12: [2023-05-10 10:10:54,975] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 12: [2023-05-10 10:10:54,978] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 12: [2023-05-10 10:10:54,978] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 12: [2023-05-10 10:10:54,978] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 12: [2023-05-10 10:10:54,979] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 12: [2023-05-10 10:10:54,979] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 12: [2023-05-10 10:10:54,979] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 15: [2023-05-10 10:10:54,980] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 15: [2023-05-10 10:10:54,980] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 15: [2023-05-10 10:10:54,980] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 15: [2023-05-10 10:10:54,980] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 15: [2023-05-10 10:10:54,980] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 15: [2023-05-10 10:10:54,980] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 15: [2023-05-10 10:10:54,980] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 15: [2023-05-10 10:10:54,981] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 0: [2023-05-10 10:10:54,981] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 0: [2023-05-10 10:10:54,981] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 2: [2023-05-10 10:10:54,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 29: [2023-05-10 10:10:54,983] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 2: [2023-05-10 10:10:54,983] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 15: [2023-05-10 10:10:54,984] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 23: [2023-05-10 10:10:54,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 29: [2023-05-10 10:10:54,988] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 21: [2023-05-10 10:10:54,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 21: [2023-05-10 10:10:54,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 21: [2023-05-10 10:10:54,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 21: [2023-05-10 10:10:54,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 21: [2023-05-10 10:10:54,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 2: [2023-05-10 10:10:54,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 2: [2023-05-10 10:10:54,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 27: [2023-05-10 10:10:54,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 27: [2023-05-10 10:10:54,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 27: [2023-05-10 10:10:54,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 27: [2023-05-10 10:10:54,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 27: [2023-05-10 10:10:54,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 27: [2023-05-10 10:10:54,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 27: [2023-05-10 10:10:54,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 27: [2023-05-10 10:10:54,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 2: [2023-05-10 10:10:54,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 23: [2023-05-10 10:10:54,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 23: [2023-05-10 10:10:54,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 23: [2023-05-10 10:10:54,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 2: [2023-05-10 10:10:54,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 2: [2023-05-10 10:10:54,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 2: [2023-05-10 10:10:54,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 15: [2023-05-10 10:10:54,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 0: [2023-05-10 10:10:54,992] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 0: [2023-05-10 10:10:54,992] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 0: [2023-05-10 10:10:54,992] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 0: [2023-05-10 10:10:54,992] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 0: [2023-05-10 10:10:54,992] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 15: [2023-05-10 10:10:54,992] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 0: [2023-05-10 10:10:54,992] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 15: [2023-05-10 10:10:54,992] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 15: [2023-05-10 10:10:54,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 15: [2023-05-10 10:10:54,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 27: [2023-05-10 10:10:54,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 27: [2023-05-10 10:10:54,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 15: [2023-05-10 10:10:54,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 15: [2023-05-10 10:10:54,994] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 27: [2023-05-10 10:10:54,994] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 16: [2023-05-10 10:10:54,995] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 16: [2023-05-10 10:10:54,995] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 16: [2023-05-10 10:10:54,995] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 16: [2023-05-10 10:10:54,995] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 16: [2023-05-10 10:10:54,995] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 16: [2023-05-10 10:10:54,995] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 16: [2023-05-10 10:10:54,995] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 29: [2023-05-10 10:10:54,996] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 16: [2023-05-10 10:10:54,996] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 23: [2023-05-10 10:10:54,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 27: [2023-05-10 10:10:54,997] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 27: [2023-05-10 10:10:54,997] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 27: [2023-05-10 10:10:54,997] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 27: [2023-05-10 10:10:54,998] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 8: [2023-05-10 10:10:54,999] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 27: [2023-05-10 10:10:54,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 16: [2023-05-10 10:10:55,001] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 16: [2023-05-10 10:10:55,002] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 8: [2023-05-10 10:10:55,002] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 8: [2023-05-10 10:10:55,002] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 21: [2023-05-10 10:10:55,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 21: [2023-05-10 10:10:55,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 16: [2023-05-10 10:10:55,003] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 16: [2023-05-10 10:10:55,004] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 29: [2023-05-10 10:10:55,004] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 16: [2023-05-10 10:10:55,004] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 16: [2023-05-10 10:10:55,005] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 16: [2023-05-10 10:10:55,005] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 8: [2023-05-10 10:10:55,006] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 0: [2023-05-10 10:10:55,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 16: [2023-05-10 10:10:55,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 29: [2023-05-10 10:10:55,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 0: [2023-05-10 10:10:55,007] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 4: [2023-05-10 10:10:55,011] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 29: [2023-05-10 10:10:55,012] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 29: [2023-05-10 10:10:55,012] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 29: [2023-05-10 10:10:55,012] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 29: [2023-05-10 10:10:55,012] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 29: [2023-05-10 10:10:55,012] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 21: [2023-05-10 10:10:55,012] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 23: [2023-05-10 10:10:55,012] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 23: [2023-05-10 10:10:55,013] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 23: [2023-05-10 10:10:55,013] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 23: [2023-05-10 10:10:55,013] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 4: [2023-05-10 10:10:55,013] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 21: [2023-05-10 10:10:55,013] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 23: [2023-05-10 10:10:55,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 21: [2023-05-10 10:10:55,015] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 23: [2023-05-10 10:10:55,015] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 13: [2023-05-10 10:10:55,014] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 13: [2023-05-10 10:10:55,014] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 5: [2023-05-10 10:10:55,015] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 23: [2023-05-10 10:10:55,015] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 8: [2023-05-10 10:10:55,016] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 8: [2023-05-10 10:10:55,016] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 8: [2023-05-10 10:10:55,016] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 8: [2023-05-10 10:10:55,016] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 21: [2023-05-10 10:10:55,016] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 21: [2023-05-10 10:10:55,016] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 21: [2023-05-10 10:10:55,017] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 5: [2023-05-10 10:10:55,018] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 5: [2023-05-10 10:10:55,018] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 0: [2023-05-10 10:10:55,021] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 0: [2023-05-10 10:10:55,022] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 12: [2023-05-10 10:10:55,021] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 0: [2023-05-10 10:10:55,022] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 4: [2023-05-10 10:10:55,023] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 4: [2023-05-10 10:10:55,023] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 4: [2023-05-10 10:10:55,023] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 0: [2023-05-10 10:10:55,024] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 0: [2023-05-10 10:10:55,024] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 8: [2023-05-10 10:10:55,024] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 8: [2023-05-10 10:10:55,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 4: [2023-05-10 10:10:55,028] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 4: [2023-05-10 10:10:55,028] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 4: [2023-05-10 10:10:55,028] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 13: [2023-05-10 10:10:55,030] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 23: [2023-05-10 10:10:55,031] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 5: [2023-05-10 10:10:55,031] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 5: [2023-05-10 10:10:55,031] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 8: [2023-05-10 10:10:55,033] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 5: [2023-05-10 10:10:55,033] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 5: [2023-05-10 10:10:55,033] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 5: [2023-05-10 10:10:55,033] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 21: [2023-05-10 10:10:55,033] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 15: [2023-05-10 10:10:55,033] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 21: [2023-05-10 10:10:55,034] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 8: [2023-05-10 10:10:55,034] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 2: [2023-05-10 10:10:55,020] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 2: [2023-05-10 10:10:55,023] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 2: [2023-05-10 10:10:55,024] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 2: [2023-05-10 10:10:55,026] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 2: [2023-05-10 10:10:55,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 2: [2023-05-10 10:10:55,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 13: [2023-05-10 10:10:55,036] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 21: [2023-05-10 10:10:55,037] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 4: [2023-05-10 10:10:55,038] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 4: [2023-05-10 10:10:55,038] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 23: [2023-05-10 10:10:55,039] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 23: [2023-05-10 10:10:55,041] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 23: [2023-05-10 10:10:55,041] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 5: [2023-05-10 10:10:55,042] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 13: [2023-05-10 10:10:55,042] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 29: [2023-05-10 10:10:55,043] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 8: [2023-05-10 10:10:55,044] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 29: [2023-05-10 10:10:55,044] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 29: [2023-05-10 10:10:55,044] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 29: [2023-05-10 10:10:55,044] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 12: [2023-05-10 10:10:55,044] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 8: [2023-05-10 10:10:55,045] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 5: [2023-05-10 10:10:55,045] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 27: [2023-05-10 10:10:55,045] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 5: [2023-05-10 10:10:55,045] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 13: [2023-05-10 10:10:55,045] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 4: [2023-05-10 10:10:55,046] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 29: [2023-05-10 10:10:55,047] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 8: [2023-05-10 10:10:55,047] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 8: [2023-05-10 10:10:55,047] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 27: [2023-05-10 10:10:55,047] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 27: [2023-05-10 10:10:55,048] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 13: [2023-05-10 10:10:55,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 13: [2023-05-10 10:10:55,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 13: [2023-05-10 10:10:55,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 13: [2023-05-10 10:10:55,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_19-model_00-model_states.pt. 16: [2023-05-10 10:10:55,050] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 4: [2023-05-10 10:10:55,052] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 4: [2023-05-10 10:10:55,052] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 12: [2023-05-10 10:10:55,053] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 12: [2023-05-10 10:10:55,053] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 15: [2023-05-10 10:10:55,054] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 7: [2023-05-10 10:10:55,054] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 7: [2023-05-10 10:10:55,054] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 7: [2023-05-10 10:10:55,054] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 7: [2023-05-10 10:10:55,054] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 7: [2023-05-10 10:10:55,054] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 7: [2023-05-10 10:10:55,054] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 7: [2023-05-10 10:10:55,055] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 7: [2023-05-10 10:10:55,055] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 31: [2023-05-10 10:10:55,055] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 31: [2023-05-10 10:10:55,055] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 16: [2023-05-10 10:10:55,055] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 31: [2023-05-10 10:10:55,055] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 31: [2023-05-10 10:10:55,055] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 31: [2023-05-10 10:10:55,055] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 31: [2023-05-10 10:10:55,055] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 31: [2023-05-10 10:10:55,055] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 31: [2023-05-10 10:10:55,056] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 4: [2023-05-10 10:10:55,057] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 4: [2023-05-10 10:10:55,057] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 12: [2023-05-10 10:10:55,059] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 4: [2023-05-10 10:10:55,059] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 31: [2023-05-10 10:10:55,059] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 12: [2023-05-10 10:10:55,060] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 12: [2023-05-10 10:10:55,060] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 12: [2023-05-10 10:10:55,060] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 12: [2023-05-10 10:10:55,060] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 31: [2023-05-10 10:10:55,060] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 5: [2023-05-10 10:10:55,060] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 7: [2023-05-10 10:10:55,060] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 7: [2023-05-10 10:10:55,060] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 7: [2023-05-10 10:10:55,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 7: [2023-05-10 10:10:55,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 13: [2023-05-10 10:10:55,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 5: [2023-05-10 10:10:55,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 7: [2023-05-10 10:10:55,063] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 15: [2023-05-10 10:10:55,063] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 15: [2023-05-10 10:10:55,063] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 7: [2023-05-10 10:10:55,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 7: [2023-05-10 10:10:55,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 7: [2023-05-10 10:10:55,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 31: [2023-05-10 10:10:55,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 31: [2023-05-10 10:10:55,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 31: [2023-05-10 10:10:55,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 31: [2023-05-10 10:10:55,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 31: [2023-05-10 10:10:55,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 31: [2023-05-10 10:10:55,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 5: [2023-05-10 10:10:55,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 27: [2023-05-10 10:10:55,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 5: [2023-05-10 10:10:55,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 27: [2023-05-10 10:10:55,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 27: [2023-05-10 10:10:55,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 27: [2023-05-10 10:10:55,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 27: [2023-05-10 10:10:55,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 27: [2023-05-10 10:10:55,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 5: [2023-05-10 10:10:55,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 13: [2023-05-10 10:10:55,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 15: [2023-05-10 10:10:55,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 15: [2023-05-10 10:10:55,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 15: [2023-05-10 10:10:55,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 15: [2023-05-10 10:10:55,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 15: [2023-05-10 10:10:55,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 27: [2023-05-10 10:10:55,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 27: [2023-05-10 10:10:55,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 16: [2023-05-10 10:10:55,075] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 16: [2023-05-10 10:10:55,075] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 14: [2023-05-10 10:10:55,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 14: [2023-05-10 10:10:55,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 14: [2023-05-10 10:10:55,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 14: [2023-05-10 10:10:55,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 14: [2023-05-10 10:10:55,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 14: [2023-05-10 10:10:55,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 14: [2023-05-10 10:10:55,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 14: [2023-05-10 10:10:55,077] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 16: [2023-05-10 10:10:55,077] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 16: [2023-05-10 10:10:55,077] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 16: [2023-05-10 10:10:55,077] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 16: [2023-05-10 10:10:55,077] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 16: [2023-05-10 10:10:55,078] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 16: [2023-05-10 10:10:55,080] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 13: [2023-05-10 10:10:55,081] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 14: [2023-05-10 10:10:55,082] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 14: [2023-05-10 10:10:55,083] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 14: [2023-05-10 10:10:55,083] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 24: [2023-05-10 10:10:55,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 20: [2023-05-10 10:10:55,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 20: [2023-05-10 10:10:55,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 20: [2023-05-10 10:10:55,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 13: [2023-05-10 10:10:55,085] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 13: [2023-05-10 10:10:55,085] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 20: [2023-05-10 10:10:55,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 20: [2023-05-10 10:10:55,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 20: [2023-05-10 10:10:55,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 20: [2023-05-10 10:10:55,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 24: [2023-05-10 10:10:55,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 13: [2023-05-10 10:10:55,085] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 20: [2023-05-10 10:10:55,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 14: [2023-05-10 10:10:55,084] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 14: [2023-05-10 10:10:55,085] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 14: [2023-05-10 10:10:55,086] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 12: [2023-05-10 10:10:55,086] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 12: [2023-05-10 10:10:55,086] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 14: [2023-05-10 10:10:55,086] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 14: [2023-05-10 10:10:55,087] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 20: [2023-05-10 10:10:55,088] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 24: [2023-05-10 10:10:55,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 24: [2023-05-10 10:10:55,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 24: [2023-05-10 10:10:55,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 24: [2023-05-10 10:10:55,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 24: [2023-05-10 10:10:55,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 24: [2023-05-10 10:10:55,089] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 24: [2023-05-10 10:10:55,089] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 24: [2023-05-10 10:10:55,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 20: [2023-05-10 10:10:55,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 12: [2023-05-10 10:10:55,093] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 15: [2023-05-10 10:10:55,094] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 24: [2023-05-10 10:10:55,096] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 20: [2023-05-10 10:10:55,096] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 20: [2023-05-10 10:10:55,096] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 20: [2023-05-10 10:10:55,096] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 24: [2023-05-10 10:10:55,096] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 20: [2023-05-10 10:10:55,097] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 24: [2023-05-10 10:10:55,097] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 24: [2023-05-10 10:10:55,097] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 20: [2023-05-10 10:10:55,097] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 24: [2023-05-10 10:10:55,097] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 24: [2023-05-10 10:10:55,098] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 20: [2023-05-10 10:10:55,098] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 12: [2023-05-10 10:10:55,099] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 12: [2023-05-10 10:10:55,101] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 15: [2023-05-10 10:10:55,101] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 12: [2023-05-10 10:10:55,101] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 12: [2023-05-10 10:10:55,101] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 27: [2023-05-10 10:10:55,102] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 27: [2023-05-10 10:10:55,102] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 27: [2023-05-10 10:10:55,105] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 27: [2023-05-10 10:10:55,105] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 16: [2023-05-10 10:10:55,105] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 15: [2023-05-10 10:10:55,106] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 27: [2023-05-10 10:10:55,107] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 15: [2023-05-10 10:10:55,108] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 6: [2023-05-10 10:10:55,108] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 6: [2023-05-10 10:10:55,108] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 15: [2023-05-10 10:10:55,108] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 6: [2023-05-10 10:10:55,108] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 16: [2023-05-10 10:10:55,109] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 6: [2023-05-10 10:10:55,109] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 6: [2023-05-10 10:10:55,109] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 6: [2023-05-10 10:10:55,109] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 6: [2023-05-10 10:10:55,109] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 15: [2023-05-10 10:10:55,109] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 6: [2023-05-10 10:10:55,109] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 15: [2023-05-10 10:10:55,109] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 31: [2023-05-10 10:10:55,111] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 16: [2023-05-10 10:10:55,112] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 16: [2023-05-10 10:10:55,112] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 16: [2023-05-10 10:10:55,112] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 31: [2023-05-10 10:10:55,112] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 6: [2023-05-10 10:10:55,112] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 6: [2023-05-10 10:10:55,112] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 7: [2023-05-10 10:10:55,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 16: [2023-05-10 10:10:55,114] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 6: [2023-05-10 10:10:55,119] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 6: [2023-05-10 10:10:55,119] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 6: [2023-05-10 10:10:55,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 7: [2023-05-10 10:10:55,120] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 6: [2023-05-10 10:10:55,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 6: [2023-05-10 10:10:55,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 6: [2023-05-10 10:10:55,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 7: [2023-05-10 10:10:55,123] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 7: [2023-05-10 10:10:55,123] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 7: [2023-05-10 10:10:55,124] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 7: [2023-05-10 10:10:55,124] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 7: [2023-05-10 10:10:55,124] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 7: [2023-05-10 10:10:55,124] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 18: [2023-05-10 10:10:55,127] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 18: [2023-05-10 10:10:55,127] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 18: [2023-05-10 10:10:55,127] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 18: [2023-05-10 10:10:55,127] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 18: [2023-05-10 10:10:55,127] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 18: [2023-05-10 10:10:55,127] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 18: [2023-05-10 10:10:55,127] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 18: [2023-05-10 10:10:55,127] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 18: [2023-05-10 10:10:55,131] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 18: [2023-05-10 10:10:55,131] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 31: [2023-05-10 10:10:55,131] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 18: [2023-05-10 10:10:55,132] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 18: [2023-05-10 10:10:55,133] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 18: [2023-05-10 10:10:55,133] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 18: [2023-05-10 10:10:55,133] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 26: [2023-05-10 10:10:55,133] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 26: [2023-05-10 10:10:55,133] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 9: [2023-05-10 10:10:55,133] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 9: [2023-05-10 10:10:55,133] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 22: [2023-05-10 10:10:55,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 26: [2023-05-10 10:10:55,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 22: [2023-05-10 10:10:55,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 26: [2023-05-10 10:10:55,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 9: [2023-05-10 10:10:55,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 9: [2023-05-10 10:10:55,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 26: [2023-05-10 10:10:55,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 9: [2023-05-10 10:10:55,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 9: [2023-05-10 10:10:55,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 9: [2023-05-10 10:10:55,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 26: [2023-05-10 10:10:55,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 26: [2023-05-10 10:10:55,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 22: [2023-05-10 10:10:55,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 22: [2023-05-10 10:10:55,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 22: [2023-05-10 10:10:55,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 22: [2023-05-10 10:10:55,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 9: [2023-05-10 10:10:55,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 31: [2023-05-10 10:10:55,134] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 22: [2023-05-10 10:10:55,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 31: [2023-05-10 10:10:55,134] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 18: [2023-05-10 10:10:55,134] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 26: [2023-05-10 10:10:55,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 22: [2023-05-10 10:10:55,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 18: [2023-05-10 10:10:55,135] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 7: [2023-05-10 10:10:55,136] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 14: [2023-05-10 10:10:55,137] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 14: [2023-05-10 10:10:55,137] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 31: [2023-05-10 10:10:55,138] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 31: [2023-05-10 10:10:55,138] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 31: [2023-05-10 10:10:55,138] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 31: [2023-05-10 10:10:55,138] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 31: [2023-05-10 10:10:55,138] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 14: [2023-05-10 10:10:55,138] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 20: [2023-05-10 10:10:55,138] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 26: [2023-05-10 10:10:55,138] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 26: [2023-05-10 10:10:55,138] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 22: [2023-05-10 10:10:55,140] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 22: [2023-05-10 10:10:55,141] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 3: [2023-05-10 10:10:55,141] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 3: [2023-05-10 10:10:55,141] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 9: [2023-05-10 10:10:55,141] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 9: [2023-05-10 10:10:55,142] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 24: [2023-05-10 10:10:55,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 24: [2023-05-10 10:10:55,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 9: [2023-05-10 10:10:55,142] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 22: [2023-05-10 10:10:55,142] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 3: [2023-05-10 10:10:55,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 9: [2023-05-10 10:10:55,142] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 3: [2023-05-10 10:10:55,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 3: [2023-05-10 10:10:55,143] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 3: [2023-05-10 10:10:55,143] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 3: [2023-05-10 10:10:55,143] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 9: [2023-05-10 10:10:55,143] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 9: [2023-05-10 10:10:55,143] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 3: [2023-05-10 10:10:55,143] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 9: [2023-05-10 10:10:55,143] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 26: [2023-05-10 10:10:55,143] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 26: [2023-05-10 10:10:55,143] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 26: [2023-05-10 10:10:55,144] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 9: [2023-05-10 10:10:55,144] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 26: [2023-05-10 10:10:55,145] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 22: [2023-05-10 10:10:55,144] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 22: [2023-05-10 10:10:55,145] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 26: [2023-05-10 10:10:55,145] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 26: [2023-05-10 10:10:55,145] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 22: [2023-05-10 10:10:55,146] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 3: [2023-05-10 10:10:55,146] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 3: [2023-05-10 10:10:55,146] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 22: [2023-05-10 10:10:55,147] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 22: [2023-05-10 10:10:55,147] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 3: [2023-05-10 10:10:55,147] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 19: [2023-05-10 10:10:55,149] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 19: [2023-05-10 10:10:55,149] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 7: [2023-05-10 10:10:55,151] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 3: [2023-05-10 10:10:55,151] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 3: [2023-05-10 10:10:55,151] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 14: [2023-05-10 10:10:55,151] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 14: [2023-05-10 10:10:55,151] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 14: [2023-05-10 10:10:55,151] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 14: [2023-05-10 10:10:55,151] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 3: [2023-05-10 10:10:55,152] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 20: [2023-05-10 10:10:55,152] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 7: [2023-05-10 10:10:55,151] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 7: [2023-05-10 10:10:55,151] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 14: [2023-05-10 10:10:55,152] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 3: [2023-05-10 10:10:55,152] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 3: [2023-05-10 10:10:55,153] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 19: [2023-05-10 10:10:55,152] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 19: [2023-05-10 10:10:55,152] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 19: [2023-05-10 10:10:55,152] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 19: [2023-05-10 10:10:55,152] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 19: [2023-05-10 10:10:55,152] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 19: [2023-05-10 10:10:55,153] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 19: [2023-05-10 10:10:55,154] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 7: [2023-05-10 10:10:55,154] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 7: [2023-05-10 10:10:55,155] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 19: [2023-05-10 10:10:55,155] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 19: [2023-05-10 10:10:55,156] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 7: [2023-05-10 10:10:55,156] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 7: [2023-05-10 10:10:55,156] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 20: [2023-05-10 10:10:55,157] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 31: [2023-05-10 10:10:55,157] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 19: [2023-05-10 10:10:55,159] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 14: [2023-05-10 10:10:55,160] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 14: [2023-05-10 10:10:55,161] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 19: [2023-05-10 10:10:55,162] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 19: [2023-05-10 10:10:55,162] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 19: [2023-05-10 10:10:55,163] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 19: [2023-05-10 10:10:55,163] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 24: [2023-05-10 10:10:55,165] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 14: [2023-05-10 10:10:55,165] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 24: [2023-05-10 10:10:55,166] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 6: [2023-05-10 10:10:55,164] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 6: [2023-05-10 10:10:55,164] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 24: [2023-05-10 10:10:55,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 24: [2023-05-10 10:10:55,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 24: [2023-05-10 10:10:55,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 24: [2023-05-10 10:10:55,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 24: [2023-05-10 10:10:55,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 24: [2023-05-10 10:10:55,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 31: [2023-05-10 10:10:55,172] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 31: [2023-05-10 10:10:55,172] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 31: [2023-05-10 10:10:55,172] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 31: [2023-05-10 10:10:55,173] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 20: [2023-05-10 10:10:55,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 31: [2023-05-10 10:10:55,173] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 20: [2023-05-10 10:10:55,175] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 14: [2023-05-10 10:10:55,180] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 14: [2023-05-10 10:10:55,180] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 14: [2023-05-10 10:10:55,181] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 20: [2023-05-10 10:10:55,181] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 20: [2023-05-10 10:10:55,181] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 20: [2023-05-10 10:10:55,181] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 20: [2023-05-10 10:10:55,181] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 20: [2023-05-10 10:10:55,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 10: [2023-05-10 10:10:55,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 10: [2023-05-10 10:10:55,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 10: [2023-05-10 10:10:55,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 10: [2023-05-10 10:10:55,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 10: [2023-05-10 10:10:55,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 10: [2023-05-10 10:10:55,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 10: [2023-05-10 10:10:55,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 10: [2023-05-10 10:10:55,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 14: [2023-05-10 10:10:55,183] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 18: [2023-05-10 10:10:55,183] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 18: [2023-05-10 10:10:55,183] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 30: [2023-05-10 10:10:55,184] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 30: [2023-05-10 10:10:55,184] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 30: [2023-05-10 10:10:55,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 30: [2023-05-10 10:10:55,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 30: [2023-05-10 10:10:55,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 30: [2023-05-10 10:10:55,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 30: [2023-05-10 10:10:55,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 30: [2023-05-10 10:10:55,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 6: [2023-05-10 10:10:55,185] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 6: [2023-05-10 10:10:55,185] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 14: [2023-05-10 10:10:55,186] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 6: [2023-05-10 10:10:55,186] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 6: [2023-05-10 10:10:55,187] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 6: [2023-05-10 10:10:55,188] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 18: [2023-05-10 10:10:55,188] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 18: [2023-05-10 10:10:55,188] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 10: [2023-05-10 10:10:55,188] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 10: [2023-05-10 10:10:55,189] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 10: [2023-05-10 10:10:55,189] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 10: [2023-05-10 10:10:55,190] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 10: [2023-05-10 10:10:55,190] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 26: [2023-05-10 10:10:55,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 26: [2023-05-10 10:10:55,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 10: [2023-05-10 10:10:55,191] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 10: [2023-05-10 10:10:55,191] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 10: [2023-05-10 10:10:55,192] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 6: [2023-05-10 10:10:55,194] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 6: [2023-05-10 10:10:55,194] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 6: [2023-05-10 10:10:55,195] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 30: [2023-05-10 10:10:55,195] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 30: [2023-05-10 10:10:55,195] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 22: [2023-05-10 10:10:55,195] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 30: [2023-05-10 10:10:55,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 22: [2023-05-10 10:10:55,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 30: [2023-05-10 10:10:55,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 25: [2023-05-10 10:10:55,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 25: [2023-05-10 10:10:55,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 25: [2023-05-10 10:10:55,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 25: [2023-05-10 10:10:55,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 25: [2023-05-10 10:10:55,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 25: [2023-05-10 10:10:55,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 25: [2023-05-10 10:10:55,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 25: [2023-05-10 10:10:55,197] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 11: [2023-05-10 10:10:55,198] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 11: [2023-05-10 10:10:55,198] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 0: [2023-05-10 10:10:55,198] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 0: [2023-05-10 10:10:55,198] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 0: [2023-05-10 10:10:55,198] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 30: [2023-05-10 10:10:55,198] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 0: [2023-05-10 10:10:55,198] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 0: [2023-05-10 10:10:55,198] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 0: [2023-05-10 10:10:55,198] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 0: [2023-05-10 10:10:55,198] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 11: [2023-05-10 10:10:55,198] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 11: [2023-05-10 10:10:55,198] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 11: [2023-05-10 10:10:55,198] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 30: [2023-05-10 10:10:55,198] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 11: [2023-05-10 10:10:55,198] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 11: [2023-05-10 10:10:55,198] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 0: [2023-05-10 10:10:55,198] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 18: [2023-05-10 10:10:55,198] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 11: [2023-05-10 10:10:55,198] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 24: [2023-05-10 10:10:55,199] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 9: [2023-05-10 10:10:55,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 9: [2023-05-10 10:10:55,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 30: [2023-05-10 10:10:55,199] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 9: [2023-05-10 10:10:55,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 9: [2023-05-10 10:10:55,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 30: [2023-05-10 10:10:55,200] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 24: [2023-05-10 10:10:55,200] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 20: [2023-05-10 10:10:55,200] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 18: [2023-05-10 10:10:55,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 9: [2023-05-10 10:10:55,201] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 9: [2023-05-10 10:10:55,201] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 9: [2023-05-10 10:10:55,201] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 9: [2023-05-10 10:10:55,201] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 18: [2023-05-10 10:10:55,201] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 18: [2023-05-10 10:10:55,201] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 3: [2023-05-10 10:10:55,201] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 3: [2023-05-10 10:10:55,201] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 3: [2023-05-10 10:10:55,201] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 24: [2023-05-10 10:10:55,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 25: [2023-05-10 10:10:55,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 11: [2023-05-10 10:10:55,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 11: [2023-05-10 10:10:55,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 25: [2023-05-10 10:10:55,203] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 24: [2023-05-10 10:10:55,203] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 0: [2023-05-10 10:10:55,203] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 25: [2023-05-10 10:10:55,203] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 0: [2023-05-10 10:10:55,203] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 0: [2023-05-10 10:10:55,203] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 25: [2023-05-10 10:10:55,203] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 24: [2023-05-10 10:10:55,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 25: [2023-05-10 10:10:55,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 25: [2023-05-10 10:10:55,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 22: [2023-05-10 10:10:55,204] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 0: [2023-05-10 10:10:55,205] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 0: [2023-05-10 10:10:55,205] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 25: [2023-05-10 10:10:55,205] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 24: [2023-05-10 10:10:55,205] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 0: [2023-05-10 10:10:55,205] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 25: [2023-05-10 10:10:55,205] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 0: [2023-05-10 10:10:55,206] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 0: [2023-05-10 10:10:55,206] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 11: [2023-05-10 10:10:55,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 11: [2023-05-10 10:10:55,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 11: [2023-05-10 10:10:55,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 18: [2023-05-10 10:10:55,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 18: [2023-05-10 10:10:55,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 20: [2023-05-10 10:10:55,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 11: [2023-05-10 10:10:55,210] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 11: [2023-05-10 10:10:55,210] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 11: [2023-05-10 10:10:55,210] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 3: [2023-05-10 10:10:55,211] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 19: [2023-05-10 10:10:55,211] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 19: [2023-05-10 10:10:55,211] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 19: [2023-05-10 10:10:55,211] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 3: [2023-05-10 10:10:55,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 26: [2023-05-10 10:10:55,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 26: [2023-05-10 10:10:55,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 20: [2023-05-10 10:10:55,213] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 26: [2023-05-10 10:10:55,214] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 26: [2023-05-10 10:10:55,215] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 19: [2023-05-10 10:10:55,214] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 20: [2023-05-10 10:10:55,215] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 26: [2023-05-10 10:10:55,215] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 26: [2023-05-10 10:10:55,215] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 20: [2023-05-10 10:10:55,215] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 20: [2023-05-10 10:10:55,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 6: [2023-05-10 10:10:55,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 26: [2023-05-10 10:10:55,216] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 18: [2023-05-10 10:10:55,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 26: [2023-05-10 10:10:55,216] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 3: [2023-05-10 10:10:55,217] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 6: [2023-05-10 10:10:55,218] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 22: [2023-05-10 10:10:55,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 6: [2023-05-10 10:10:55,218] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 18: [2023-05-10 10:10:55,219] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 22: [2023-05-10 10:10:55,219] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 22: [2023-05-10 10:10:55,219] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 22: [2023-05-10 10:10:55,221] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 22: [2023-05-10 10:10:55,222] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 22: [2023-05-10 10:10:55,222] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 3: [2023-05-10 10:10:55,222] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 3: [2023-05-10 10:10:55,222] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 3: [2023-05-10 10:10:55,224] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 6: [2023-05-10 10:10:55,224] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 22: [2023-05-10 10:10:55,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 6: [2023-05-10 10:10:55,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 3: [2023-05-10 10:10:55,226] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 18: [2023-05-10 10:10:55,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 3: [2023-05-10 10:10:55,226] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 18: [2023-05-10 10:10:55,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 9: [2023-05-10 10:10:55,226] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 18: [2023-05-10 10:10:55,226] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 6: [2023-05-10 10:10:55,227] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 9: [2023-05-10 10:10:55,228] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 18: [2023-05-10 10:10:55,228] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 9: [2023-05-10 10:10:55,228] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 9: [2023-05-10 10:10:55,231] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 19: [2023-05-10 10:10:55,231] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 9: [2023-05-10 10:10:55,231] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 9: [2023-05-10 10:10:55,231] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 9: [2023-05-10 10:10:55,232] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 22: [2023-05-10 10:10:55,233] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 9: [2023-05-10 10:10:55,233] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 19: [2023-05-10 10:10:55,233] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 19: [2023-05-10 10:10:55,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 19: [2023-05-10 10:10:55,234] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 19: [2023-05-10 10:10:55,235] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 19: [2023-05-10 10:10:55,235] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 19: [2023-05-10 10:10:55,236] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 3: [2023-05-10 10:10:55,236] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 26: [2023-05-10 10:10:55,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 26: [2023-05-10 10:10:55,239] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 19: [2023-05-10 10:10:55,239] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 3: [2023-05-10 10:10:55,241] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 10: [2023-05-10 10:10:55,244] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 10: [2023-05-10 10:10:55,244] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 10: [2023-05-10 10:10:55,245] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 3: [2023-05-10 10:10:55,248] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 26: [2023-05-10 10:10:55,248] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 26: [2023-05-10 10:10:55,248] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 26: [2023-05-10 10:10:55,250] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 10: [2023-05-10 10:10:55,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 3: [2023-05-10 10:10:55,251] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 25: [2023-05-10 10:10:55,250] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 22: [2023-05-10 10:10:55,253] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 22: [2023-05-10 10:10:55,253] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 26: [2023-05-10 10:10:55,253] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 11: [2023-05-10 10:10:55,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 11: [2023-05-10 10:10:55,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 3: [2023-05-10 10:10:55,256] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 22: [2023-05-10 10:10:55,257] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 10: [2023-05-10 10:10:55,258] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 10: [2023-05-10 10:10:55,258] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 10: [2023-05-10 10:10:55,258] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 10: [2023-05-10 10:10:55,258] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 2: [2023-05-10 10:10:55,259] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 2: [2023-05-10 10:10:55,259] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 22: [2023-05-10 10:10:55,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 22: [2023-05-10 10:10:55,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 17: [2023-05-10 10:10:55,259] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 17: [2023-05-10 10:10:55,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 17: [2023-05-10 10:10:55,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 17: [2023-05-10 10:10:55,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 17: [2023-05-10 10:10:55,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 17: [2023-05-10 10:10:55,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 17: [2023-05-10 10:10:55,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 17: [2023-05-10 10:10:55,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 28: [2023-05-10 10:10:55,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 28: [2023-05-10 10:10:55,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 28: [2023-05-10 10:10:55,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 28: [2023-05-10 10:10:55,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 28: [2023-05-10 10:10:55,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 28: [2023-05-10 10:10:55,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 28: [2023-05-10 10:10:55,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 28: [2023-05-10 10:10:55,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 0: [2023-05-10 10:10:55,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 0: [2023-05-10 10:10:55,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 0: [2023-05-10 10:10:55,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 19: [2023-05-10 10:10:55,263] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 2: [2023-05-10 10:10:55,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 2: [2023-05-10 10:10:55,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 2: [2023-05-10 10:10:55,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 2: [2023-05-10 10:10:55,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 2: [2023-05-10 10:10:55,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 2: [2023-05-10 10:10:55,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 2: [2023-05-10 10:10:55,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 19: [2023-05-10 10:10:55,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 19: [2023-05-10 10:10:55,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 2: [2023-05-10 10:10:55,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 17: [2023-05-10 10:10:55,263] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 28: [2023-05-10 10:10:55,265] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 28: [2023-05-10 10:10:55,265] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 19: [2023-05-10 10:10:55,265] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 25: [2023-05-10 10:10:55,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 25: [2023-05-10 10:10:55,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 25: [2023-05-10 10:10:55,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 25: [2023-05-10 10:10:55,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 25: [2023-05-10 10:10:55,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 25: [2023-05-10 10:10:55,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 25: [2023-05-10 10:10:55,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 13: [2023-05-10 10:10:55,267] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 13: [2023-05-10 10:10:55,268] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 13: [2023-05-10 10:10:55,268] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 13: [2023-05-10 10:10:55,268] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 13: [2023-05-10 10:10:55,268] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 13: [2023-05-10 10:10:55,268] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 13: [2023-05-10 10:10:55,268] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 13: [2023-05-10 10:10:55,268] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 28: [2023-05-10 10:10:55,271] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 28: [2023-05-10 10:10:55,271] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 17: [2023-05-10 10:10:55,271] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 28: [2023-05-10 10:10:55,272] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 17: [2023-05-10 10:10:55,272] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 28: [2023-05-10 10:10:55,272] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 10: [2023-05-10 10:10:55,272] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 25: [2023-05-10 10:10:55,272] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 28: [2023-05-10 10:10:55,273] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 2: [2023-05-10 10:10:55,273] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 17: [2023-05-10 10:10:55,273] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 28: [2023-05-10 10:10:55,273] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 17: [2023-05-10 10:10:55,273] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 17: [2023-05-10 10:10:55,273] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 10: [2023-05-10 10:10:55,273] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 2: [2023-05-10 10:10:55,273] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 17: [2023-05-10 10:10:55,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 30: [2023-05-10 10:10:55,274] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 30: [2023-05-10 10:10:55,274] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 2: [2023-05-10 10:10:55,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 10: [2023-05-10 10:10:55,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 30: [2023-05-10 10:10:55,274] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 2: [2023-05-10 10:10:55,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 0: [2023-05-10 10:10:55,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 0: [2023-05-10 10:10:55,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 0: [2023-05-10 10:10:55,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 0: [2023-05-10 10:10:55,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 0: [2023-05-10 10:10:55,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 17: [2023-05-10 10:10:55,275] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 2: [2023-05-10 10:10:55,275] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 2: [2023-05-10 10:10:55,275] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 11: [2023-05-10 10:10:55,277] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 11: [2023-05-10 10:10:55,277] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 11: [2023-05-10 10:10:55,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 11: [2023-05-10 10:10:55,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 10: [2023-05-10 10:10:55,278] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 11: [2023-05-10 10:10:55,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 13: [2023-05-10 10:10:55,279] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 13: [2023-05-10 10:10:55,280] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 11: [2023-05-10 10:10:55,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 11: [2023-05-10 10:10:55,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 11: [2023-05-10 10:10:55,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 13: [2023-05-10 10:10:55,281] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 13: [2023-05-10 10:10:55,281] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 13: [2023-05-10 10:10:55,282] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 13: [2023-05-10 10:10:55,282] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 13: [2023-05-10 10:10:55,282] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 13: [2023-05-10 10:10:55,283] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt... 0: [2023-05-10 10:10:55,287] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 0: [2023-05-10 10:10:55,288] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 0: [2023-05-10 10:10:55,288] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 30: [2023-05-10 10:10:55,289] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 30: [2023-05-10 10:10:55,289] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 30: [2023-05-10 10:10:55,289] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 30: [2023-05-10 10:10:55,289] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 10: [2023-05-10 10:10:55,289] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 10: [2023-05-10 10:10:55,290] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 10: [2023-05-10 10:10:55,290] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 10: [2023-05-10 10:10:55,290] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 25: [2023-05-10 10:10:55,293] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 30: [2023-05-10 10:10:55,293] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 25: [2023-05-10 10:10:55,295] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 25: [2023-05-10 10:10:55,296] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 25: [2023-05-10 10:10:55,296] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 25: [2023-05-10 10:10:55,296] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 25: [2023-05-10 10:10:55,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 25: [2023-05-10 10:10:55,298] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 30: [2023-05-10 10:10:55,302] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 30: [2023-05-10 10:10:55,303] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 0: [2023-05-10 10:10:55,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 0: [2023-05-10 10:10:55,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 0: [2023-05-10 10:10:55,306] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 0: [2023-05-10 10:10:55,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 0: [2023-05-10 10:10:55,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 11: [2023-05-10 10:10:55,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 11: [2023-05-10 10:10:55,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 11: [2023-05-10 10:10:55,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 17: [2023-05-10 10:10:55,312] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 11: [2023-05-10 10:10:55,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 11: [2023-05-10 10:10:55,314] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 2: [2023-05-10 10:10:55,315] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 2: [2023-05-10 10:10:55,315] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 28: [2023-05-10 10:10:55,315] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 28: [2023-05-10 10:10:55,315] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 11: [2023-05-10 10:10:55,316] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 30: [2023-05-10 10:10:55,318] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 30: [2023-05-10 10:10:55,325] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 30: [2023-05-10 10:10:55,325] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 30: [2023-05-10 10:10:55,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 30: [2023-05-10 10:10:55,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 30: [2023-05-10 10:10:55,328] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 17: [2023-05-10 10:10:55,334] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 2: [2023-05-10 10:10:55,336] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 28: [2023-05-10 10:10:55,336] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 28: [2023-05-10 10:10:55,336] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 2: [2023-05-10 10:10:55,336] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 2: [2023-05-10 10:10:55,338] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 28: [2023-05-10 10:10:55,341] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 28: [2023-05-10 10:10:55,341] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 28: [2023-05-10 10:10:55,341] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 28: [2023-05-10 10:10:55,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 28: [2023-05-10 10:10:55,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 28: [2023-05-10 10:10:55,344] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 17: [2023-05-10 10:10:55,345] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 17: [2023-05-10 10:10:55,345] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 2: [2023-05-10 10:10:55,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 2: [2023-05-10 10:10:55,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 2: [2023-05-10 10:10:55,349] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 2: [2023-05-10 10:10:55,349] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 17: [2023-05-10 10:10:55,352] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 17: [2023-05-10 10:10:55,352] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 17: [2023-05-10 10:10:55,352] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 17: [2023-05-10 10:10:55,352] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 17: [2023-05-10 10:10:55,352] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 2: [2023-05-10 10:10:55,354] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 13: [2023-05-10 10:10:55,356] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 13: [2023-05-10 10:10:55,356] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 13: [2023-05-10 10:10:55,356] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 2: [2023-05-10 10:10:55,360] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 13: [2023-05-10 10:10:55,373] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 13: [2023-05-10 10:10:55,373] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 13: [2023-05-10 10:10:55,373] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 13: [2023-05-10 10:10:55,373] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 13: [2023-05-10 10:10:55,373] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_20-model_00-model_states.pt. 17: [2023-05-10 10:10:55,373] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 28: [2023-05-10 10:10:55,374] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 17: [2023-05-10 10:10:55,374] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 2: [2023-05-10 10:10:55,374] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 2: [2023-05-10 10:10:55,376] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 28: [2023-05-10 10:10:55,377] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 28: [2023-05-10 10:10:55,378] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 28: [2023-05-10 10:10:55,378] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 28: [2023-05-10 10:10:55,379] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 28: [2023-05-10 10:10:55,380] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 2: [2023-05-10 10:10:55,381] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 2: [2023-05-10 10:10:55,381] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 13: [2023-05-10 10:10:55,382] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 2: [2023-05-10 10:10:55,383] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 13: [2023-05-10 10:10:55,383] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 13: [2023-05-10 10:10:55,384] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 17: [2023-05-10 10:10:55,384] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 17: [2023-05-10 10:10:55,384] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 17: [2023-05-10 10:10:55,385] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 17: [2023-05-10 10:10:55,387] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 17: [2023-05-10 10:10:55,388] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 4: [2023-05-10 10:10:55,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 4: [2023-05-10 10:10:55,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 4: [2023-05-10 10:10:55,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 4: [2023-05-10 10:10:55,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 4: [2023-05-10 10:10:55,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 4: [2023-05-10 10:10:55,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 4: [2023-05-10 10:10:55,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 4: [2023-05-10 10:10:55,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 4: [2023-05-10 10:10:55,397] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 4: [2023-05-10 10:10:55,398] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 4: [2023-05-10 10:10:55,398] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 4: [2023-05-10 10:10:55,399] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 4: [2023-05-10 10:10:55,399] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 4: [2023-05-10 10:10:55,399] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 4: [2023-05-10 10:10:55,400] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 4: [2023-05-10 10:10:55,400] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 13: [2023-05-10 10:10:55,401] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 13: [2023-05-10 10:10:55,405] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 13: [2023-05-10 10:10:55,408] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 13: [2023-05-10 10:10:55,410] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 13: [2023-05-10 10:10:55,410] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 12: [2023-05-10 10:10:55,428] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 12: [2023-05-10 10:10:55,428] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 12: [2023-05-10 10:10:55,428] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 12: [2023-05-10 10:10:55,429] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 12: [2023-05-10 10:10:55,429] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 12: [2023-05-10 10:10:55,429] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 12: [2023-05-10 10:10:55,429] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 12: [2023-05-10 10:10:55,429] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 12: [2023-05-10 10:10:55,433] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 29: [2023-05-10 10:10:55,438] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 29: [2023-05-10 10:10:55,438] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 12: [2023-05-10 10:10:55,439] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 12: [2023-05-10 10:10:55,440] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 12: [2023-05-10 10:10:55,440] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 29: [2023-05-10 10:10:55,440] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 12: [2023-05-10 10:10:55,441] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 29: [2023-05-10 10:10:55,441] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 29: [2023-05-10 10:10:55,441] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 29: [2023-05-10 10:10:55,441] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 29: [2023-05-10 10:10:55,441] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 29: [2023-05-10 10:10:55,441] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 16: [2023-05-10 10:10:55,441] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 16: [2023-05-10 10:10:55,441] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 16: [2023-05-10 10:10:55,441] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 16: [2023-05-10 10:10:55,441] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 16: [2023-05-10 10:10:55,441] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 16: [2023-05-10 10:10:55,441] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 16: [2023-05-10 10:10:55,441] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 29: [2023-05-10 10:10:55,441] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 12: [2023-05-10 10:10:55,441] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 16: [2023-05-10 10:10:55,442] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 12: [2023-05-10 10:10:55,442] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 12: [2023-05-10 10:10:55,442] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 29: [2023-05-10 10:10:55,443] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 16: [2023-05-10 10:10:55,446] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 16: [2023-05-10 10:10:55,447] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 16: [2023-05-10 10:10:55,449] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 16: [2023-05-10 10:10:55,449] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 29: [2023-05-10 10:10:55,449] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 29: [2023-05-10 10:10:55,450] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 16: [2023-05-10 10:10:55,450] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 16: [2023-05-10 10:10:55,450] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 29: [2023-05-10 10:10:55,450] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 16: [2023-05-10 10:10:55,450] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 29: [2023-05-10 10:10:55,450] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 29: [2023-05-10 10:10:55,451] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 29: [2023-05-10 10:10:55,451] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 16: [2023-05-10 10:10:55,452] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 4: [2023-05-10 10:10:55,455] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 4: [2023-05-10 10:10:55,455] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 4: [2023-05-10 10:10:55,455] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 4: [2023-05-10 10:10:55,455] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 4: [2023-05-10 10:10:55,460] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 4: [2023-05-10 10:10:55,460] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 4: [2023-05-10 10:10:55,460] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 4: [2023-05-10 10:10:55,460] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 4: [2023-05-10 10:10:55,482] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 4: [2023-05-10 10:10:55,482] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 29: [2023-05-10 10:10:55,484] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 4: [2023-05-10 10:10:55,485] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 12: [2023-05-10 10:10:55,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 4: [2023-05-10 10:10:55,487] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 4: [2023-05-10 10:10:55,488] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 4: [2023-05-10 10:10:55,494] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 4: [2023-05-10 10:10:55,494] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 4: [2023-05-10 10:10:55,497] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 16: [2023-05-10 10:10:55,498] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 29: [2023-05-10 10:10:55,500] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 16: [2023-05-10 10:10:55,505] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 15: [2023-05-10 10:10:55,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 29: [2023-05-10 10:10:55,507] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 15: [2023-05-10 10:10:55,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 15: [2023-05-10 10:10:55,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 15: [2023-05-10 10:10:55,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 15: [2023-05-10 10:10:55,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 15: [2023-05-10 10:10:55,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 15: [2023-05-10 10:10:55,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 15: [2023-05-10 10:10:55,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 8: [2023-05-10 10:10:55,508] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 8: [2023-05-10 10:10:55,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 8: [2023-05-10 10:10:55,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 8: [2023-05-10 10:10:55,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 8: [2023-05-10 10:10:55,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 8: [2023-05-10 10:10:55,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 8: [2023-05-10 10:10:55,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 8: [2023-05-10 10:10:55,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 12: [2023-05-10 10:10:55,510] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 15: [2023-05-10 10:10:55,510] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 8: [2023-05-10 10:10:55,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 8: [2023-05-10 10:10:55,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 8: [2023-05-10 10:10:55,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 12: [2023-05-10 10:10:55,515] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 8: [2023-05-10 10:10:55,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 29: [2023-05-10 10:10:55,515] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 12: [2023-05-10 10:10:55,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 8: [2023-05-10 10:10:55,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 8: [2023-05-10 10:10:55,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 29: [2023-05-10 10:10:55,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 8: [2023-05-10 10:10:55,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 8: [2023-05-10 10:10:55,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 12: [2023-05-10 10:10:55,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 15: [2023-05-10 10:10:55,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 15: [2023-05-10 10:10:55,517] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 15: [2023-05-10 10:10:55,518] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 12: [2023-05-10 10:10:55,519] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 15: [2023-05-10 10:10:55,519] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 15: [2023-05-10 10:10:55,519] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 15: [2023-05-10 10:10:55,520] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 15: [2023-05-10 10:10:55,520] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 16: [2023-05-10 10:10:55,520] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 29: [2023-05-10 10:10:55,521] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 16: [2023-05-10 10:10:55,523] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 16: [2023-05-10 10:10:55,523] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 16: [2023-05-10 10:10:55,523] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 16: [2023-05-10 10:10:55,523] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 16: [2023-05-10 10:10:55,523] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 16: [2023-05-10 10:10:55,523] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 23: [2023-05-10 10:10:55,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 23: [2023-05-10 10:10:55,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 23: [2023-05-10 10:10:55,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 1: [2023-05-10 10:10:55,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 1: [2023-05-10 10:10:55,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 1: [2023-05-10 10:10:55,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 23: [2023-05-10 10:10:55,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 23: [2023-05-10 10:10:55,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 23: [2023-05-10 10:10:55,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 23: [2023-05-10 10:10:55,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 1: [2023-05-10 10:10:55,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 1: [2023-05-10 10:10:55,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 1: [2023-05-10 10:10:55,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 1: [2023-05-10 10:10:55,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 23: [2023-05-10 10:10:55,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 1: [2023-05-10 10:10:55,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 27: [2023-05-10 10:10:55,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 27: [2023-05-10 10:10:55,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 27: [2023-05-10 10:10:55,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 27: [2023-05-10 10:10:55,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 27: [2023-05-10 10:10:55,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 27: [2023-05-10 10:10:55,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 27: [2023-05-10 10:10:55,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 29: [2023-05-10 10:10:55,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 29: [2023-05-10 10:10:55,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 29: [2023-05-10 10:10:55,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 27: [2023-05-10 10:10:55,526] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 21: [2023-05-10 10:10:55,528] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 21: [2023-05-10 10:10:55,528] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 31: [2023-05-10 10:10:55,528] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 31: [2023-05-10 10:10:55,528] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 31: [2023-05-10 10:10:55,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 31: [2023-05-10 10:10:55,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 31: [2023-05-10 10:10:55,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 31: [2023-05-10 10:10:55,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 31: [2023-05-10 10:10:55,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 21: [2023-05-10 10:10:55,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 21: [2023-05-10 10:10:55,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 16: [2023-05-10 10:10:55,529] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 21: [2023-05-10 10:10:55,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 21: [2023-05-10 10:10:55,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 21: [2023-05-10 10:10:55,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 31: [2023-05-10 10:10:55,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 21: [2023-05-10 10:10:55,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 23: [2023-05-10 10:10:55,529] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 23: [2023-05-10 10:10:55,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 23: [2023-05-10 10:10:55,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 27: [2023-05-10 10:10:55,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 1: [2023-05-10 10:10:55,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 27: [2023-05-10 10:10:55,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 1: [2023-05-10 10:10:55,531] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 1: [2023-05-10 10:10:55,531] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 27: [2023-05-10 10:10:55,531] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 1: [2023-05-10 10:10:55,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 27: [2023-05-10 10:10:55,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 23: [2023-05-10 10:10:55,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 31: [2023-05-10 10:10:55,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 1: [2023-05-10 10:10:55,533] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 27: [2023-05-10 10:10:55,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 23: [2023-05-10 10:10:55,533] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 1: [2023-05-10 10:10:55,533] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 31: [2023-05-10 10:10:55,533] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 23: [2023-05-10 10:10:55,533] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 27: [2023-05-10 10:10:55,533] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 12: [2023-05-10 10:10:55,533] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 12: [2023-05-10 10:10:55,533] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 12: [2023-05-10 10:10:55,533] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 21: [2023-05-10 10:10:55,533] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 21: [2023-05-10 10:10:55,533] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 27: [2023-05-10 10:10:55,533] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 1: [2023-05-10 10:10:55,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 23: [2023-05-10 10:10:55,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 27: [2023-05-10 10:10:55,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 1: [2023-05-10 10:10:55,535] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 23: [2023-05-10 10:10:55,535] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 21: [2023-05-10 10:10:55,535] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 21: [2023-05-10 10:10:55,535] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 21: [2023-05-10 10:10:55,535] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 21: [2023-05-10 10:10:55,536] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 21: [2023-05-10 10:10:55,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 21: [2023-05-10 10:10:55,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 31: [2023-05-10 10:10:55,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 31: [2023-05-10 10:10:55,538] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 31: [2023-05-10 10:10:55,538] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 31: [2023-05-10 10:10:55,538] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 31: [2023-05-10 10:10:55,539] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 31: [2023-05-10 10:10:55,539] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 7: [2023-05-10 10:10:55,539] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 7: [2023-05-10 10:10:55,539] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 7: [2023-05-10 10:10:55,539] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 7: [2023-05-10 10:10:55,539] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 7: [2023-05-10 10:10:55,540] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 7: [2023-05-10 10:10:55,540] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 7: [2023-05-10 10:10:55,540] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 29: [2023-05-10 10:10:55,540] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 7: [2023-05-10 10:10:55,540] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 29: [2023-05-10 10:10:55,540] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 12: [2023-05-10 10:10:55,544] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 29: [2023-05-10 10:10:55,544] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 7: [2023-05-10 10:10:55,544] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 7: [2023-05-10 10:10:55,545] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 7: [2023-05-10 10:10:55,545] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 12: [2023-05-10 10:10:55,545] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 7: [2023-05-10 10:10:55,545] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 7: [2023-05-10 10:10:55,546] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 7: [2023-05-10 10:10:55,546] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 7: [2023-05-10 10:10:55,548] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 7: [2023-05-10 10:10:55,548] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 12: [2023-05-10 10:10:55,550] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 16: [2023-05-10 10:10:55,551] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 12: [2023-05-10 10:10:55,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 16: [2023-05-10 10:10:55,554] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 29: [2023-05-10 10:10:55,557] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 29: [2023-05-10 10:10:55,557] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 29: [2023-05-10 10:10:55,558] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 16: [2023-05-10 10:10:55,559] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 15: [2023-05-10 10:10:55,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 16: [2023-05-10 10:10:55,559] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 16: [2023-05-10 10:10:55,560] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 16: [2023-05-10 10:10:55,560] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 8: [2023-05-10 10:10:55,562] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 12: [2023-05-10 10:10:55,565] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 12: [2023-05-10 10:10:55,566] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 29: [2023-05-10 10:10:55,566] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 12: [2023-05-10 10:10:55,566] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 8: [2023-05-10 10:10:55,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 8: [2023-05-10 10:10:55,573] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 21: [2023-05-10 10:10:55,579] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 27: [2023-05-10 10:10:55,581] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 15: [2023-05-10 10:10:55,582] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 8: [2023-05-10 10:10:55,582] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 8: [2023-05-10 10:10:55,582] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 8: [2023-05-10 10:10:55,582] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 27: [2023-05-10 10:10:55,583] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 23: [2023-05-10 10:10:55,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 23: [2023-05-10 10:10:55,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 23: [2023-05-10 10:10:55,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 31: [2023-05-10 10:10:55,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 31: [2023-05-10 10:10:55,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 1: [2023-05-10 10:10:55,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 1: [2023-05-10 10:10:55,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 27: [2023-05-10 10:10:55,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 8: [2023-05-10 10:10:55,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 8: [2023-05-10 10:10:55,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 15: [2023-05-10 10:10:55,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 8: [2023-05-10 10:10:55,586] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 1: [2023-05-10 10:10:55,589] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 15: [2023-05-10 10:10:55,589] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 15: [2023-05-10 10:10:55,589] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 1: [2023-05-10 10:10:55,589] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 21: [2023-05-10 10:10:55,591] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 18: [2023-05-10 10:10:55,593] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 18: [2023-05-10 10:10:55,593] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 18: [2023-05-10 10:10:55,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 18: [2023-05-10 10:10:55,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 18: [2023-05-10 10:10:55,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 18: [2023-05-10 10:10:55,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 18: [2023-05-10 10:10:55,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 24: [2023-05-10 10:10:55,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 24: [2023-05-10 10:10:55,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 18: [2023-05-10 10:10:55,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 24: [2023-05-10 10:10:55,595] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 24: [2023-05-10 10:10:55,595] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 24: [2023-05-10 10:10:55,595] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 24: [2023-05-10 10:10:55,595] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 24: [2023-05-10 10:10:55,595] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 24: [2023-05-10 10:10:55,595] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 7: [2023-05-10 10:10:55,596] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 7: [2023-05-10 10:10:55,597] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 7: [2023-05-10 10:10:55,597] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 5: [2023-05-10 10:10:55,598] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 5: [2023-05-10 10:10:55,598] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 23: [2023-05-10 10:10:55,598] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 8: [2023-05-10 10:10:55,598] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 24: [2023-05-10 10:10:55,599] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 5: [2023-05-10 10:10:55,599] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 18: [2023-05-10 10:10:55,598] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 18: [2023-05-10 10:10:55,599] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 6: [2023-05-10 10:10:55,599] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 6: [2023-05-10 10:10:55,599] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 24: [2023-05-10 10:10:55,599] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 6: [2023-05-10 10:10:55,599] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 6: [2023-05-10 10:10:55,599] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 21: [2023-05-10 10:10:55,599] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 5: [2023-05-10 10:10:55,599] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 6: [2023-05-10 10:10:55,599] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 6: [2023-05-10 10:10:55,599] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 6: [2023-05-10 10:10:55,599] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 1: [2023-05-10 10:10:55,599] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 5: [2023-05-10 10:10:55,599] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 11: [2023-05-10 10:10:55,599] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 11: [2023-05-10 10:10:55,599] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 18: [2023-05-10 10:10:55,599] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 5: [2023-05-10 10:10:55,599] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 5: [2023-05-10 10:10:55,599] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 6: [2023-05-10 10:10:55,599] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 5: [2023-05-10 10:10:55,600] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 18: [2023-05-10 10:10:55,600] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 18: [2023-05-10 10:10:55,600] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 27: [2023-05-10 10:10:55,600] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 27: [2023-05-10 10:10:55,600] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 27: [2023-05-10 10:10:55,600] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 27: [2023-05-10 10:10:55,600] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 23: [2023-05-10 10:10:55,600] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 23: [2023-05-10 10:10:55,600] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 23: [2023-05-10 10:10:55,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 27: [2023-05-10 10:10:55,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 1: [2023-05-10 10:10:55,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 1: [2023-05-10 10:10:55,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 23: [2023-05-10 10:10:55,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 21: [2023-05-10 10:10:55,601] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 1: [2023-05-10 10:10:55,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 11: [2023-05-10 10:10:55,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 11: [2023-05-10 10:10:55,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 11: [2023-05-10 10:10:55,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 11: [2023-05-10 10:10:55,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 11: [2023-05-10 10:10:55,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 8: [2023-05-10 10:10:55,601] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 11: [2023-05-10 10:10:55,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 15: [2023-05-10 10:10:55,602] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 15: [2023-05-10 10:10:55,602] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 15: [2023-05-10 10:10:55,602] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 21: [2023-05-10 10:10:55,602] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 18: [2023-05-10 10:10:55,601] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 21: [2023-05-10 10:10:55,602] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 21: [2023-05-10 10:10:55,602] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 21: [2023-05-10 10:10:55,602] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 24: [2023-05-10 10:10:55,602] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 21: [2023-05-10 10:10:55,602] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 27: [2023-05-10 10:10:55,602] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 15: [2023-05-10 10:10:55,602] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 24: [2023-05-10 10:10:55,602] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 24: [2023-05-10 10:10:55,603] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 24: [2023-05-10 10:10:55,603] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 18: [2023-05-10 10:10:55,603] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 24: [2023-05-10 10:10:55,603] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 18: [2023-05-10 10:10:55,603] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 6: [2023-05-10 10:10:55,603] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 6: [2023-05-10 10:10:55,603] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 24: [2023-05-10 10:10:55,604] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 5: [2023-05-10 10:10:55,604] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 5: [2023-05-10 10:10:55,604] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 11: [2023-05-10 10:10:55,604] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 11: [2023-05-10 10:10:55,604] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 31: [2023-05-10 10:10:55,605] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 31: [2023-05-10 10:10:55,605] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 5: [2023-05-10 10:10:55,605] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 23: [2023-05-10 10:10:55,607] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 23: [2023-05-10 10:10:55,607] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 27: [2023-05-10 10:10:55,608] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 23: [2023-05-10 10:10:55,608] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 7: [2023-05-10 10:10:55,608] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 7: [2023-05-10 10:10:55,608] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 7: [2023-05-10 10:10:55,608] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 1: [2023-05-10 10:10:55,608] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 5: [2023-05-10 10:10:55,608] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 1: [2023-05-10 10:10:55,609] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 6: [2023-05-10 10:10:55,609] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 31: [2023-05-10 10:10:55,609] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 6: [2023-05-10 10:10:55,609] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 5: [2023-05-10 10:10:55,609] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 5: [2023-05-10 10:10:55,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 5: [2023-05-10 10:10:55,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 6: [2023-05-10 10:10:55,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 5: [2023-05-10 10:10:55,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 31: [2023-05-10 10:10:55,611] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 31: [2023-05-10 10:10:55,611] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 31: [2023-05-10 10:10:55,611] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 31: [2023-05-10 10:10:55,611] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 31: [2023-05-10 10:10:55,611] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 6: [2023-05-10 10:10:55,611] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 8: [2023-05-10 10:10:55,611] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 27: [2023-05-10 10:10:55,611] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 6: [2023-05-10 10:10:55,611] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 8: [2023-05-10 10:10:55,611] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 8: [2023-05-10 10:10:55,611] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 6: [2023-05-10 10:10:55,611] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 8: [2023-05-10 10:10:55,612] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 11: [2023-05-10 10:10:55,611] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 11: [2023-05-10 10:10:55,612] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 11: [2023-05-10 10:10:55,612] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 11: [2023-05-10 10:10:55,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 7: [2023-05-10 10:10:55,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 7: [2023-05-10 10:10:55,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 11: [2023-05-10 10:10:55,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 8: [2023-05-10 10:10:55,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 11: [2023-05-10 10:10:55,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 21: [2023-05-10 10:10:55,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 1: [2023-05-10 10:10:55,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 15: [2023-05-10 10:10:55,617] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 1: [2023-05-10 10:10:55,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 7: [2023-05-10 10:10:55,621] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 7: [2023-05-10 10:10:55,621] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 7: [2023-05-10 10:10:55,621] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 15: [2023-05-10 10:10:55,622] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 15: [2023-05-10 10:10:55,623] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 14: [2023-05-10 10:10:55,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 14: [2023-05-10 10:10:55,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 14: [2023-05-10 10:10:55,624] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 14: [2023-05-10 10:10:55,624] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 14: [2023-05-10 10:10:55,624] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 14: [2023-05-10 10:10:55,624] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 14: [2023-05-10 10:10:55,624] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 14: [2023-05-10 10:10:55,624] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 23: [2023-05-10 10:10:55,628] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 21: [2023-05-10 10:10:55,629] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 14: [2023-05-10 10:10:55,629] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 1: [2023-05-10 10:10:55,629] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 21: [2023-05-10 10:10:55,629] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 14: [2023-05-10 10:10:55,630] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 23: [2023-05-10 10:10:55,630] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 1: [2023-05-10 10:10:55,630] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 27: [2023-05-10 10:10:55,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 14: [2023-05-10 10:10:55,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 23: [2023-05-10 10:10:55,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 27: [2023-05-10 10:10:55,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 27: [2023-05-10 10:10:55,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 7: [2023-05-10 10:10:55,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 1: [2023-05-10 10:10:55,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 23: [2023-05-10 10:10:55,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 27: [2023-05-10 10:10:55,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 23: [2023-05-10 10:10:55,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 15: [2023-05-10 10:10:55,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 21: [2023-05-10 10:10:55,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 14: [2023-05-10 10:10:55,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 1: [2023-05-10 10:10:55,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 31: [2023-05-10 10:10:55,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 14: [2023-05-10 10:10:55,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 14: [2023-05-10 10:10:55,634] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 20: [2023-05-10 10:10:55,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 20: [2023-05-10 10:10:55,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 20: [2023-05-10 10:10:55,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 14: [2023-05-10 10:10:55,634] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 20: [2023-05-10 10:10:55,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 20: [2023-05-10 10:10:55,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 20: [2023-05-10 10:10:55,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 20: [2023-05-10 10:10:55,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 15: [2023-05-10 10:10:55,634] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 20: [2023-05-10 10:10:55,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 27: [2023-05-10 10:10:55,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 21: [2023-05-10 10:10:55,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 14: [2023-05-10 10:10:55,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 21: [2023-05-10 10:10:55,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 7: [2023-05-10 10:10:55,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 15: [2023-05-10 10:10:55,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 15: [2023-05-10 10:10:55,636] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 20: [2023-05-10 10:10:55,636] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 21: [2023-05-10 10:10:55,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 7: [2023-05-10 10:10:55,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 20: [2023-05-10 10:10:55,641] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 7: [2023-05-10 10:10:55,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 7: [2023-05-10 10:10:55,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 31: [2023-05-10 10:10:55,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 31: [2023-05-10 10:10:55,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 20: [2023-05-10 10:10:55,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 20: [2023-05-10 10:10:55,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 20: [2023-05-10 10:10:55,645] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 20: [2023-05-10 10:10:55,645] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 31: [2023-05-10 10:10:55,646] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 20: [2023-05-10 10:10:55,646] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 31: [2023-05-10 10:10:55,646] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 31: [2023-05-10 10:10:55,646] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 20: [2023-05-10 10:10:55,647] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 18: [2023-05-10 10:10:55,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 24: [2023-05-10 10:10:55,650] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 24: [2023-05-10 10:10:55,650] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 6: [2023-05-10 10:10:55,655] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 6: [2023-05-10 10:10:55,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 11: [2023-05-10 10:10:55,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 11: [2023-05-10 10:10:55,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 26: [2023-05-10 10:10:55,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 26: [2023-05-10 10:10:55,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 5: [2023-05-10 10:10:55,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 5: [2023-05-10 10:10:55,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 13: [2023-05-10 10:10:55,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 13: [2023-05-10 10:10:55,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 13: [2023-05-10 10:10:55,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 13: [2023-05-10 10:10:55,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 13: [2023-05-10 10:10:55,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 13: [2023-05-10 10:10:55,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 13: [2023-05-10 10:10:55,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 13: [2023-05-10 10:10:55,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 26: [2023-05-10 10:10:55,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 26: [2023-05-10 10:10:55,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 26: [2023-05-10 10:10:55,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 18: [2023-05-10 10:10:55,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 18: [2023-05-10 10:10:55,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 18: [2023-05-10 10:10:55,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 18: [2023-05-10 10:10:55,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 26: [2023-05-10 10:10:55,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 26: [2023-05-10 10:10:55,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 26: [2023-05-10 10:10:55,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 5: [2023-05-10 10:10:55,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 26: [2023-05-10 10:10:55,663] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 26: [2023-05-10 10:10:55,664] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 18: [2023-05-10 10:10:55,664] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 3: [2023-05-10 10:10:55,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 3: [2023-05-10 10:10:55,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 3: [2023-05-10 10:10:55,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 3: [2023-05-10 10:10:55,666] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 3: [2023-05-10 10:10:55,666] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 3: [2023-05-10 10:10:55,666] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 3: [2023-05-10 10:10:55,666] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 3: [2023-05-10 10:10:55,666] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 18: [2023-05-10 10:10:55,666] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 18: [2023-05-10 10:10:55,667] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 26: [2023-05-10 10:10:55,667] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 26: [2023-05-10 10:10:55,668] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 13: [2023-05-10 10:10:55,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 26: [2023-05-10 10:10:55,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 26: [2023-05-10 10:10:55,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 13: [2023-05-10 10:10:55,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 24: [2023-05-10 10:10:55,670] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 18: [2023-05-10 10:10:55,670] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 26: [2023-05-10 10:10:55,670] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 26: [2023-05-10 10:10:55,670] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 3: [2023-05-10 10:10:55,670] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 3: [2023-05-10 10:10:55,670] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 3: [2023-05-10 10:10:55,671] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 13: [2023-05-10 10:10:55,671] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 13: [2023-05-10 10:10:55,671] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 13: [2023-05-10 10:10:55,672] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 24: [2023-05-10 10:10:55,672] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 13: [2023-05-10 10:10:55,673] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 24: [2023-05-10 10:10:55,673] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 13: [2023-05-10 10:10:55,674] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 5: [2023-05-10 10:10:55,674] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 5: [2023-05-10 10:10:55,674] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 3: [2023-05-10 10:10:55,675] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 3: [2023-05-10 10:10:55,675] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 3: [2023-05-10 10:10:55,675] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 5: [2023-05-10 10:10:55,675] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 5: [2023-05-10 10:10:55,675] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 5: [2023-05-10 10:10:55,675] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 13: [2023-05-10 10:10:55,675] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 3: [2023-05-10 10:10:55,676] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 14: [2023-05-10 10:10:55,676] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 3: [2023-05-10 10:10:55,677] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 6: [2023-05-10 10:10:55,678] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 6: [2023-05-10 10:10:55,678] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 11: [2023-05-10 10:10:55,679] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 20: [2023-05-10 10:10:55,678] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 11: [2023-05-10 10:10:55,679] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 24: [2023-05-10 10:10:55,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 24: [2023-05-10 10:10:55,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 24: [2023-05-10 10:10:55,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 24: [2023-05-10 10:10:55,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 24: [2023-05-10 10:10:55,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 6: [2023-05-10 10:10:55,680] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 6: [2023-05-10 10:10:55,681] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 11: [2023-05-10 10:10:55,682] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 11: [2023-05-10 10:10:55,682] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 11: [2023-05-10 10:10:55,682] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 11: [2023-05-10 10:10:55,683] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 11: [2023-05-10 10:10:55,683] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 11: [2023-05-10 10:10:55,683] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 18: [2023-05-10 10:10:55,684] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 6: [2023-05-10 10:10:55,684] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 6: [2023-05-10 10:10:55,684] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 6: [2023-05-10 10:10:55,684] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 6: [2023-05-10 10:10:55,684] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 18: [2023-05-10 10:10:55,685] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 9: [2023-05-10 10:10:55,685] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 9: [2023-05-10 10:10:55,685] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 14: [2023-05-10 10:10:55,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 5: [2023-05-10 10:10:55,686] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 5: [2023-05-10 10:10:55,688] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 18: [2023-05-10 10:10:55,688] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 5: [2023-05-10 10:10:55,688] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 9: [2023-05-10 10:10:55,688] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 9: [2023-05-10 10:10:55,688] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 9: [2023-05-10 10:10:55,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 9: [2023-05-10 10:10:55,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 9: [2023-05-10 10:10:55,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 9: [2023-05-10 10:10:55,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 18: [2023-05-10 10:10:55,689] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 9: [2023-05-10 10:10:55,691] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 24: [2023-05-10 10:10:55,691] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 14: [2023-05-10 10:10:55,691] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 9: [2023-05-10 10:10:55,691] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 9: [2023-05-10 10:10:55,693] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 9: [2023-05-10 10:10:55,693] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 18: [2023-05-10 10:10:55,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 18: [2023-05-10 10:10:55,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 9: [2023-05-10 10:10:55,695] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 18: [2023-05-10 10:10:55,696] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 9: [2023-05-10 10:10:55,697] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 9: [2023-05-10 10:10:55,698] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 9: [2023-05-10 10:10:55,698] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 20: [2023-05-10 10:10:55,699] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 14: [2023-05-10 10:10:55,699] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 14: [2023-05-10 10:10:55,700] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 14: [2023-05-10 10:10:55,700] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 14: [2023-05-10 10:10:55,700] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 14: [2023-05-10 10:10:55,700] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 14: [2023-05-10 10:10:55,701] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 20: [2023-05-10 10:10:55,705] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 5: [2023-05-10 10:10:55,706] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 5: [2023-05-10 10:10:55,706] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 5: [2023-05-10 10:10:55,706] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 19: [2023-05-10 10:10:55,707] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 19: [2023-05-10 10:10:55,707] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 19: [2023-05-10 10:10:55,708] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 19: [2023-05-10 10:10:55,708] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 19: [2023-05-10 10:10:55,708] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 19: [2023-05-10 10:10:55,708] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 24: [2023-05-10 10:10:55,708] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 19: [2023-05-10 10:10:55,708] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 19: [2023-05-10 10:10:55,708] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 25: [2023-05-10 10:10:55,707] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 25: [2023-05-10 10:10:55,707] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 25: [2023-05-10 10:10:55,707] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 25: [2023-05-10 10:10:55,707] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 25: [2023-05-10 10:10:55,707] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 25: [2023-05-10 10:10:55,707] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 25: [2023-05-10 10:10:55,707] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 25: [2023-05-10 10:10:55,708] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 5: [2023-05-10 10:10:55,708] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 6: [2023-05-10 10:10:55,708] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 5: [2023-05-10 10:10:55,708] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 19: [2023-05-10 10:10:55,711] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 24: [2023-05-10 10:10:55,711] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 6: [2023-05-10 10:10:55,711] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 24: [2023-05-10 10:10:55,712] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 24: [2023-05-10 10:10:55,713] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 19: [2023-05-10 10:10:55,713] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 24: [2023-05-10 10:10:55,713] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 19: [2023-05-10 10:10:55,713] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 14: [2023-05-10 10:10:55,714] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 25: [2023-05-10 10:10:55,714] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 6: [2023-05-10 10:10:55,713] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 25: [2023-05-10 10:10:55,714] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 25: [2023-05-10 10:10:55,714] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 25: [2023-05-10 10:10:55,715] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 25: [2023-05-10 10:10:55,715] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 19: [2023-05-10 10:10:55,715] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 25: [2023-05-10 10:10:55,715] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 11: [2023-05-10 10:10:55,715] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 25: [2023-05-10 10:10:55,715] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 11: [2023-05-10 10:10:55,716] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 25: [2023-05-10 10:10:55,716] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 19: [2023-05-10 10:10:55,716] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 19: [2023-05-10 10:10:55,717] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 19: [2023-05-10 10:10:55,717] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 26: [2023-05-10 10:10:55,717] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 26: [2023-05-10 10:10:55,717] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 19: [2023-05-10 10:10:55,718] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 6: [2023-05-10 10:10:55,717] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 6: [2023-05-10 10:10:55,718] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 14: [2023-05-10 10:10:55,718] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 11: [2023-05-10 10:10:55,719] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 11: [2023-05-10 10:10:55,719] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 11: [2023-05-10 10:10:55,720] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 6: [2023-05-10 10:10:55,720] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 11: [2023-05-10 10:10:55,721] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 20: [2023-05-10 10:10:55,722] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 20: [2023-05-10 10:10:55,723] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 3: [2023-05-10 10:10:55,725] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 3: [2023-05-10 10:10:55,725] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 3: [2023-05-10 10:10:55,725] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 20: [2023-05-10 10:10:55,728] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 14: [2023-05-10 10:10:55,731] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 14: [2023-05-10 10:10:55,731] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 26: [2023-05-10 10:10:55,732] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 26: [2023-05-10 10:10:55,733] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 3: [2023-05-10 10:10:55,734] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 30: [2023-05-10 10:10:55,733] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 30: [2023-05-10 10:10:55,733] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 30: [2023-05-10 10:10:55,734] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 30: [2023-05-10 10:10:55,734] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 30: [2023-05-10 10:10:55,734] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 30: [2023-05-10 10:10:55,734] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 30: [2023-05-10 10:10:55,734] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 30: [2023-05-10 10:10:55,734] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 14: [2023-05-10 10:10:55,734] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 14: [2023-05-10 10:10:55,734] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 20: [2023-05-10 10:10:55,736] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 20: [2023-05-10 10:10:55,736] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 20: [2023-05-10 10:10:55,736] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 20: [2023-05-10 10:10:55,736] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 14: [2023-05-10 10:10:55,736] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 9: [2023-05-10 10:10:55,740] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 30: [2023-05-10 10:10:55,741] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 17: [2023-05-10 10:10:55,742] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 26: [2023-05-10 10:10:55,742] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 26: [2023-05-10 10:10:55,742] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 26: [2023-05-10 10:10:55,742] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 17: [2023-05-10 10:10:55,742] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 17: [2023-05-10 10:10:55,742] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 26: [2023-05-10 10:10:55,743] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 17: [2023-05-10 10:10:55,743] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 17: [2023-05-10 10:10:55,743] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 17: [2023-05-10 10:10:55,743] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 17: [2023-05-10 10:10:55,743] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 17: [2023-05-10 10:10:55,743] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 3: [2023-05-10 10:10:55,743] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 3: [2023-05-10 10:10:55,743] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 3: [2023-05-10 10:10:55,743] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 3: [2023-05-10 10:10:55,743] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 17: [2023-05-10 10:10:55,746] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 30: [2023-05-10 10:10:55,745] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 30: [2023-05-10 10:10:55,746] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 26: [2023-05-10 10:10:55,746] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 26: [2023-05-10 10:10:55,746] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 30: [2023-05-10 10:10:55,747] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 30: [2023-05-10 10:10:55,747] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 3: [2023-05-10 10:10:55,747] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 30: [2023-05-10 10:10:55,748] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 30: [2023-05-10 10:10:55,748] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 30: [2023-05-10 10:10:55,748] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 3: [2023-05-10 10:10:55,749] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 3: [2023-05-10 10:10:55,749] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 13: [2023-05-10 10:10:55,749] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 13: [2023-05-10 10:10:55,749] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 13: [2023-05-10 10:10:55,749] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 13: [2023-05-10 10:10:55,750] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 20: [2023-05-10 10:10:55,750] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 13: [2023-05-10 10:10:55,751] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 9: [2023-05-10 10:10:55,751] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 9: [2023-05-10 10:10:55,751] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 9: [2023-05-10 10:10:55,751] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 20: [2023-05-10 10:10:55,753] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 17: [2023-05-10 10:10:55,751] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 17: [2023-05-10 10:10:55,754] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 17: [2023-05-10 10:10:55,754] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 17: [2023-05-10 10:10:55,755] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 17: [2023-05-10 10:10:55,756] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 17: [2023-05-10 10:10:55,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 10: [2023-05-10 10:10:55,757] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 10: [2023-05-10 10:10:55,757] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 10: [2023-05-10 10:10:55,758] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 10: [2023-05-10 10:10:55,758] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 17: [2023-05-10 10:10:55,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 10: [2023-05-10 10:10:55,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 10: [2023-05-10 10:10:55,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 10: [2023-05-10 10:10:55,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 9: [2023-05-10 10:10:55,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 9: [2023-05-10 10:10:55,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 9: [2023-05-10 10:10:55,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 10: [2023-05-10 10:10:55,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 26: [2023-05-10 10:10:55,759] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 9: [2023-05-10 10:10:55,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 3: [2023-05-10 10:10:55,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 26: [2023-05-10 10:10:55,761] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 9: [2023-05-10 10:10:55,762] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 10: [2023-05-10 10:10:55,762] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 10: [2023-05-10 10:10:55,762] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 10: [2023-05-10 10:10:55,765] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 19: [2023-05-10 10:10:55,767] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 10: [2023-05-10 10:10:55,767] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 10: [2023-05-10 10:10:55,767] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 10: [2023-05-10 10:10:55,767] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 10: [2023-05-10 10:10:55,768] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 19: [2023-05-10 10:10:55,768] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 20: [2023-05-10 10:10:55,768] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 19: [2023-05-10 10:10:55,768] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 20: [2023-05-10 10:10:55,768] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 19: [2023-05-10 10:10:55,768] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 20: [2023-05-10 10:10:55,769] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 22: [2023-05-10 10:10:55,768] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 22: [2023-05-10 10:10:55,768] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 22: [2023-05-10 10:10:55,768] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 22: [2023-05-10 10:10:55,768] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 22: [2023-05-10 10:10:55,769] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 22: [2023-05-10 10:10:55,769] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 22: [2023-05-10 10:10:55,769] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 22: [2023-05-10 10:10:55,769] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 10: [2023-05-10 10:10:55,769] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 3: [2023-05-10 10:10:55,769] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 20: [2023-05-10 10:10:55,771] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 26: [2023-05-10 10:10:55,771] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 22: [2023-05-10 10:10:55,772] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 25: [2023-05-10 10:10:55,772] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 13: [2023-05-10 10:10:55,773] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 13: [2023-05-10 10:10:55,773] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 13: [2023-05-10 10:10:55,773] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 3: [2023-05-10 10:10:55,774] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 25: [2023-05-10 10:10:55,774] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 3: [2023-05-10 10:10:55,774] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 28: [2023-05-10 10:10:55,774] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 28: [2023-05-10 10:10:55,774] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 3: [2023-05-10 10:10:55,775] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 22: [2023-05-10 10:10:55,774] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 25: [2023-05-10 10:10:55,775] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 25: [2023-05-10 10:10:55,775] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 25: [2023-05-10 10:10:55,775] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 25: [2023-05-10 10:10:55,775] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 25: [2023-05-10 10:10:55,775] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 25: [2023-05-10 10:10:55,775] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 26: [2023-05-10 10:10:55,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 26: [2023-05-10 10:10:55,777] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 28: [2023-05-10 10:10:55,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 28: [2023-05-10 10:10:55,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 28: [2023-05-10 10:10:55,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 28: [2023-05-10 10:10:55,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 28: [2023-05-10 10:10:55,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 9: [2023-05-10 10:10:55,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 28: [2023-05-10 10:10:55,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 22: [2023-05-10 10:10:55,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 22: [2023-05-10 10:10:55,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 22: [2023-05-10 10:10:55,779] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 26: [2023-05-10 10:10:55,779] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 28: [2023-05-10 10:10:55,779] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 28: [2023-05-10 10:10:55,779] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 22: [2023-05-10 10:10:55,779] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 13: [2023-05-10 10:10:55,779] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 22: [2023-05-10 10:10:55,780] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 22: [2023-05-10 10:10:55,780] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 13: [2023-05-10 10:10:55,783] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 9: [2023-05-10 10:10:55,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 13: [2023-05-10 10:10:55,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 19: [2023-05-10 10:10:55,784] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 9: [2023-05-10 10:10:55,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 19: [2023-05-10 10:10:55,784] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 19: [2023-05-10 10:10:55,784] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 0: [2023-05-10 10:10:55,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 19: [2023-05-10 10:10:55,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 0: [2023-05-10 10:10:55,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 0: [2023-05-10 10:10:55,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 0: [2023-05-10 10:10:55,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 0: [2023-05-10 10:10:55,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 0: [2023-05-10 10:10:55,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 0: [2023-05-10 10:10:55,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 0: [2023-05-10 10:10:55,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 28: [2023-05-10 10:10:55,787] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 28: [2023-05-10 10:10:55,787] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 28: [2023-05-10 10:10:55,788] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 13: [2023-05-10 10:10:55,788] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 13: [2023-05-10 10:10:55,788] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 0: [2023-05-10 10:10:55,788] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 28: [2023-05-10 10:10:55,788] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 28: [2023-05-10 10:10:55,789] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 28: [2023-05-10 10:10:55,789] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 9: [2023-05-10 10:10:55,790] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 0: [2023-05-10 10:10:55,790] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 9: [2023-05-10 10:10:55,790] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 0: [2023-05-10 10:10:55,791] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 9: [2023-05-10 10:10:55,791] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 9: [2023-05-10 10:10:55,791] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 0: [2023-05-10 10:10:55,792] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 19: [2023-05-10 10:10:55,792] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 0: [2023-05-10 10:10:55,792] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 19: [2023-05-10 10:10:55,792] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 0: [2023-05-10 10:10:55,792] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 19: [2023-05-10 10:10:55,793] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 0: [2023-05-10 10:10:55,793] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 0: [2023-05-10 10:10:55,793] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 19: [2023-05-10 10:10:55,794] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 17: [2023-05-10 10:10:55,795] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 25: [2023-05-10 10:10:55,800] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 13: [2023-05-10 10:10:55,803] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 13: [2023-05-10 10:10:55,804] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 30: [2023-05-10 10:10:55,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 25: [2023-05-10 10:10:55,805] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 25: [2023-05-10 10:10:55,805] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 13: [2023-05-10 10:10:55,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 25: [2023-05-10 10:10:55,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 25: [2023-05-10 10:10:55,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 25: [2023-05-10 10:10:55,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 25: [2023-05-10 10:10:55,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 2: [2023-05-10 10:10:55,809] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 2: [2023-05-10 10:10:55,809] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 25: [2023-05-10 10:10:55,810] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 19: [2023-05-10 10:10:55,811] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 2: [2023-05-10 10:10:55,813] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 2: [2023-05-10 10:10:55,813] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 2: [2023-05-10 10:10:55,813] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 2: [2023-05-10 10:10:55,813] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 2: [2023-05-10 10:10:55,813] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 2: [2023-05-10 10:10:55,814] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 2: [2023-05-10 10:10:55,814] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 2: [2023-05-10 10:10:55,814] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 17: [2023-05-10 10:10:55,816] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 19: [2023-05-10 10:10:55,816] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 19: [2023-05-10 10:10:55,816] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 10: [2023-05-10 10:10:55,817] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 10: [2023-05-10 10:10:55,817] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 19: [2023-05-10 10:10:55,818] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 22: [2023-05-10 10:10:55,818] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 10: [2023-05-10 10:10:55,820] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 2: [2023-05-10 10:10:55,823] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 2: [2023-05-10 10:10:55,823] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 2: [2023-05-10 10:10:55,824] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 2: [2023-05-10 10:10:55,824] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 2: [2023-05-10 10:10:55,825] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 17: [2023-05-10 10:10:55,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 2: [2023-05-10 10:10:55,825] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt... 30: [2023-05-10 10:10:55,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 17: [2023-05-10 10:10:55,827] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 10: [2023-05-10 10:10:55,828] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 30: [2023-05-10 10:10:55,829] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 28: [2023-05-10 10:10:55,830] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 22: [2023-05-10 10:10:55,831] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 28: [2023-05-10 10:10:55,832] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 30: [2023-05-10 10:10:55,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 10: [2023-05-10 10:10:55,834] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 10: [2023-05-10 10:10:55,834] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 10: [2023-05-10 10:10:55,834] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 10: [2023-05-10 10:10:55,834] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 30: [2023-05-10 10:10:55,835] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 17: [2023-05-10 10:10:55,838] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 17: [2023-05-10 10:10:55,838] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 17: [2023-05-10 10:10:55,838] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 17: [2023-05-10 10:10:55,838] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 0: [2023-05-10 10:10:55,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 17: [2023-05-10 10:10:55,841] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 10: [2023-05-10 10:10:55,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 10: [2023-05-10 10:10:55,842] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 22: [2023-05-10 10:10:55,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 10: [2023-05-10 10:10:55,850] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 30: [2023-05-10 10:10:55,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 30: [2023-05-10 10:10:55,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 30: [2023-05-10 10:10:55,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 30: [2023-05-10 10:10:55,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 17: [2023-05-10 10:10:55,851] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 22: [2023-05-10 10:10:55,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 22: [2023-05-10 10:10:55,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 22: [2023-05-10 10:10:55,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 22: [2023-05-10 10:10:55,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 22: [2023-05-10 10:10:55,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 22: [2023-05-10 10:10:55,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 0: [2023-05-10 10:10:55,852] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 0: [2023-05-10 10:10:55,852] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 28: [2023-05-10 10:10:55,854] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 28: [2023-05-10 10:10:55,854] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 17: [2023-05-10 10:10:55,855] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 28: [2023-05-10 10:10:55,856] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 22: [2023-05-10 10:10:55,856] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 30: [2023-05-10 10:10:55,857] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 10: [2023-05-10 10:10:55,857] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 28: [2023-05-10 10:10:55,857] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 28: [2023-05-10 10:10:55,857] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 28: [2023-05-10 10:10:55,858] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 28: [2023-05-10 10:10:55,858] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 28: [2023-05-10 10:10:55,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 30: [2023-05-10 10:10:55,860] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 0: [2023-05-10 10:10:55,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 0: [2023-05-10 10:10:55,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 0: [2023-05-10 10:10:55,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 0: [2023-05-10 10:10:55,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 0: [2023-05-10 10:10:55,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 0: [2023-05-10 10:10:55,863] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 10: [2023-05-10 10:10:55,866] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 2: [2023-05-10 10:10:55,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 2: [2023-05-10 10:10:55,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 10: [2023-05-10 10:10:55,866] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 10: [2023-05-10 10:10:55,866] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 10: [2023-05-10 10:10:55,867] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 30: [2023-05-10 10:10:55,868] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 17: [2023-05-10 10:10:55,870] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 17: [2023-05-10 10:10:55,871] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 17: [2023-05-10 10:10:55,872] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 17: [2023-05-10 10:10:55,875] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 17: [2023-05-10 10:10:55,875] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 0: [2023-05-10 10:10:55,878] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 0: [2023-05-10 10:10:55,878] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 30: [2023-05-10 10:10:55,879] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 22: [2023-05-10 10:10:55,882] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 30: [2023-05-10 10:10:55,883] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 30: [2023-05-10 10:10:55,883] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 30: [2023-05-10 10:10:55,883] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 22: [2023-05-10 10:10:55,885] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 22: [2023-05-10 10:10:55,885] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 28: [2023-05-10 10:10:55,886] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 2: [2023-05-10 10:10:55,886] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 2: [2023-05-10 10:10:55,886] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 22: [2023-05-10 10:10:55,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 28: [2023-05-10 10:10:55,888] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 22: [2023-05-10 10:10:55,888] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 22: [2023-05-10 10:10:55,889] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 28: [2023-05-10 10:10:55,891] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 28: [2023-05-10 10:10:55,892] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 2: [2023-05-10 10:10:55,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 2: [2023-05-10 10:10:55,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 2: [2023-05-10 10:10:55,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 28: [2023-05-10 10:10:55,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 0: [2023-05-10 10:10:55,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 0: [2023-05-10 10:10:55,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 0: [2023-05-10 10:10:55,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 28: [2023-05-10 10:10:55,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 0: [2023-05-10 10:10:55,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 0: [2023-05-10 10:10:55,896] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 2: [2023-05-10 10:10:55,896] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 2: [2023-05-10 10:10:55,896] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 2: [2023-05-10 10:10:55,896] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_21-model_00-model_states.pt. 2: [2023-05-10 10:10:55,926] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 2: [2023-05-10 10:10:55,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 2: [2023-05-10 10:10:55,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 2: [2023-05-10 10:10:55,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 2: [2023-05-10 10:10:55,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 2: [2023-05-10 10:10:55,932] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 18: [2023-05-10 10:10:56,118] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 18: [2023-05-10 10:10:56,118] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 18: [2023-05-10 10:10:56,118] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 18: [2023-05-10 10:10:56,118] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 18: [2023-05-10 10:10:56,118] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 18: [2023-05-10 10:10:56,118] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 18: [2023-05-10 10:10:56,118] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 18: [2023-05-10 10:10:56,118] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 18: [2023-05-10 10:10:56,122] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 18: [2023-05-10 10:10:56,123] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 18: [2023-05-10 10:10:56,123] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 18: [2023-05-10 10:10:56,123] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 18: [2023-05-10 10:10:56,124] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 18: [2023-05-10 10:10:56,125] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 18: [2023-05-10 10:10:56,126] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 18: [2023-05-10 10:10:56,126] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 16: [2023-05-10 10:10:56,145] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 16: [2023-05-10 10:10:56,145] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 16: [2023-05-10 10:10:56,145] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 16: [2023-05-10 10:10:56,145] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 16: [2023-05-10 10:10:56,145] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 16: [2023-05-10 10:10:56,145] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 16: [2023-05-10 10:10:56,146] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 16: [2023-05-10 10:10:56,146] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 16: [2023-05-10 10:10:56,150] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 16: [2023-05-10 10:10:56,151] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 16: [2023-05-10 10:10:56,154] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 16: [2023-05-10 10:10:56,154] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 16: [2023-05-10 10:10:56,155] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 16: [2023-05-10 10:10:56,155] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 16: [2023-05-10 10:10:56,155] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 16: [2023-05-10 10:10:56,156] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 23: [2023-05-10 10:10:56,166] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 23: [2023-05-10 10:10:56,166] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 23: [2023-05-10 10:10:56,167] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 23: [2023-05-10 10:10:56,167] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 23: [2023-05-10 10:10:56,167] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 23: [2023-05-10 10:10:56,167] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 23: [2023-05-10 10:10:56,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 23: [2023-05-10 10:10:56,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 8: [2023-05-10 10:10:56,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 8: [2023-05-10 10:10:56,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 8: [2023-05-10 10:10:56,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 8: [2023-05-10 10:10:56,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 8: [2023-05-10 10:10:56,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 8: [2023-05-10 10:10:56,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 8: [2023-05-10 10:10:56,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 8: [2023-05-10 10:10:56,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 23: [2023-05-10 10:10:56,170] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 23: [2023-05-10 10:10:56,171] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 23: [2023-05-10 10:10:56,171] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 4: [2023-05-10 10:10:56,171] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 4: [2023-05-10 10:10:56,171] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 4: [2023-05-10 10:10:56,171] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 4: [2023-05-10 10:10:56,171] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 4: [2023-05-10 10:10:56,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 4: [2023-05-10 10:10:56,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 4: [2023-05-10 10:10:56,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 4: [2023-05-10 10:10:56,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 18: [2023-05-10 10:10:56,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 23: [2023-05-10 10:10:56,175] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 8: [2023-05-10 10:10:56,175] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 15: [2023-05-10 10:10:56,175] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 8: [2023-05-10 10:10:56,175] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 8: [2023-05-10 10:10:56,175] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 15: [2023-05-10 10:10:56,175] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 15: [2023-05-10 10:10:56,175] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 15: [2023-05-10 10:10:56,175] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 15: [2023-05-10 10:10:56,175] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 15: [2023-05-10 10:10:56,175] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 15: [2023-05-10 10:10:56,175] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 8: [2023-05-10 10:10:56,176] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 14: [2023-05-10 10:10:56,175] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 14: [2023-05-10 10:10:56,175] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 14: [2023-05-10 10:10:56,175] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 14: [2023-05-10 10:10:56,175] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 14: [2023-05-10 10:10:56,175] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 14: [2023-05-10 10:10:56,175] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 14: [2023-05-10 10:10:56,175] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 14: [2023-05-10 10:10:56,176] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 15: [2023-05-10 10:10:56,176] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 8: [2023-05-10 10:10:56,176] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 8: [2023-05-10 10:10:56,176] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 4: [2023-05-10 10:10:56,176] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 8: [2023-05-10 10:10:56,176] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 23: [2023-05-10 10:10:56,177] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 8: [2023-05-10 10:10:56,177] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 23: [2023-05-10 10:10:56,177] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 23: [2023-05-10 10:10:56,177] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 23: [2023-05-10 10:10:56,178] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 4: [2023-05-10 10:10:56,177] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 4: [2023-05-10 10:10:56,177] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 4: [2023-05-10 10:10:56,178] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 4: [2023-05-10 10:10:56,178] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 4: [2023-05-10 10:10:56,178] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 4: [2023-05-10 10:10:56,178] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 15: [2023-05-10 10:10:56,179] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 4: [2023-05-10 10:10:56,180] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 14: [2023-05-10 10:10:56,181] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 14: [2023-05-10 10:10:56,181] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 18: [2023-05-10 10:10:56,180] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 14: [2023-05-10 10:10:56,182] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 31: [2023-05-10 10:10:56,184] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 14: [2023-05-10 10:10:56,184] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 15: [2023-05-10 10:10:56,185] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 14: [2023-05-10 10:10:56,185] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 14: [2023-05-10 10:10:56,185] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 31: [2023-05-10 10:10:56,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 15: [2023-05-10 10:10:56,186] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 31: [2023-05-10 10:10:56,186] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 31: [2023-05-10 10:10:56,186] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 31: [2023-05-10 10:10:56,186] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 31: [2023-05-10 10:10:56,186] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 31: [2023-05-10 10:10:56,186] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 14: [2023-05-10 10:10:56,186] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 15: [2023-05-10 10:10:56,186] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 31: [2023-05-10 10:10:56,187] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 14: [2023-05-10 10:10:56,187] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 15: [2023-05-10 10:10:56,188] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 15: [2023-05-10 10:10:56,188] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 15: [2023-05-10 10:10:56,188] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 31: [2023-05-10 10:10:56,189] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 18: [2023-05-10 10:10:56,189] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 18: [2023-05-10 10:10:56,189] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 18: [2023-05-10 10:10:56,189] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 15: [2023-05-10 10:10:56,189] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 18: [2023-05-10 10:10:56,189] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 31: [2023-05-10 10:10:56,190] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 18: [2023-05-10 10:10:56,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 18: [2023-05-10 10:10:56,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 18: [2023-05-10 10:10:56,194] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 31: [2023-05-10 10:10:56,195] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 31: [2023-05-10 10:10:56,195] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 31: [2023-05-10 10:10:56,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 16: [2023-05-10 10:10:56,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 31: [2023-05-10 10:10:56,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 31: [2023-05-10 10:10:56,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 31: [2023-05-10 10:10:56,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 21: [2023-05-10 10:10:56,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 21: [2023-05-10 10:10:56,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 21: [2023-05-10 10:10:56,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 21: [2023-05-10 10:10:56,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 21: [2023-05-10 10:10:56,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 21: [2023-05-10 10:10:56,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 21: [2023-05-10 10:10:56,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 21: [2023-05-10 10:10:56,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 19: [2023-05-10 10:10:56,203] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 19: [2023-05-10 10:10:56,203] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 21: [2023-05-10 10:10:56,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 16: [2023-05-10 10:10:56,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 21: [2023-05-10 10:10:56,206] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 1: [2023-05-10 10:10:56,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 1: [2023-05-10 10:10:56,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 1: [2023-05-10 10:10:56,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 1: [2023-05-10 10:10:56,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 1: [2023-05-10 10:10:56,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 1: [2023-05-10 10:10:56,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 1: [2023-05-10 10:10:56,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 1: [2023-05-10 10:10:56,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 19: [2023-05-10 10:10:56,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 19: [2023-05-10 10:10:56,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 19: [2023-05-10 10:10:56,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 19: [2023-05-10 10:10:56,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 19: [2023-05-10 10:10:56,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 19: [2023-05-10 10:10:56,207] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 21: [2023-05-10 10:10:56,207] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 21: [2023-05-10 10:10:56,207] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 21: [2023-05-10 10:10:56,207] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 19: [2023-05-10 10:10:56,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 21: [2023-05-10 10:10:56,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 19: [2023-05-10 10:10:56,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 21: [2023-05-10 10:10:56,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 18: [2023-05-10 10:10:56,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 21: [2023-05-10 10:10:56,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 5: [2023-05-10 10:10:56,210] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 5: [2023-05-10 10:10:56,210] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 27: [2023-05-10 10:10:56,210] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 27: [2023-05-10 10:10:56,210] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 27: [2023-05-10 10:10:56,210] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 27: [2023-05-10 10:10:56,210] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 27: [2023-05-10 10:10:56,210] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 27: [2023-05-10 10:10:56,210] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 27: [2023-05-10 10:10:56,210] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 27: [2023-05-10 10:10:56,210] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 1: [2023-05-10 10:10:56,211] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 5: [2023-05-10 10:10:56,211] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 1: [2023-05-10 10:10:56,212] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 1: [2023-05-10 10:10:56,212] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 5: [2023-05-10 10:10:56,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 5: [2023-05-10 10:10:56,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 5: [2023-05-10 10:10:56,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 19: [2023-05-10 10:10:56,211] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 5: [2023-05-10 10:10:56,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 5: [2023-05-10 10:10:56,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 1: [2023-05-10 10:10:56,213] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 1: [2023-05-10 10:10:56,214] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 1: [2023-05-10 10:10:56,214] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 1: [2023-05-10 10:10:56,214] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 18: [2023-05-10 10:10:56,214] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 27: [2023-05-10 10:10:56,214] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 27: [2023-05-10 10:10:56,214] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 19: [2023-05-10 10:10:56,215] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 18: [2023-05-10 10:10:56,215] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 1: [2023-05-10 10:10:56,215] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 19: [2023-05-10 10:10:56,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 27: [2023-05-10 10:10:56,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 19: [2023-05-10 10:10:56,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 19: [2023-05-10 10:10:56,217] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 19: [2023-05-10 10:10:56,217] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 12: [2023-05-10 10:10:56,217] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 5: [2023-05-10 10:10:56,217] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 29: [2023-05-10 10:10:56,217] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 18: [2023-05-10 10:10:56,217] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 5: [2023-05-10 10:10:56,218] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 12: [2023-05-10 10:10:56,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 7: [2023-05-10 10:10:56,217] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 7: [2023-05-10 10:10:56,217] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 7: [2023-05-10 10:10:56,217] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 7: [2023-05-10 10:10:56,217] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 7: [2023-05-10 10:10:56,217] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 7: [2023-05-10 10:10:56,217] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 7: [2023-05-10 10:10:56,217] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 29: [2023-05-10 10:10:56,217] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 29: [2023-05-10 10:10:56,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 29: [2023-05-10 10:10:56,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 7: [2023-05-10 10:10:56,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 29: [2023-05-10 10:10:56,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 29: [2023-05-10 10:10:56,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 29: [2023-05-10 10:10:56,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 5: [2023-05-10 10:10:56,218] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 29: [2023-05-10 10:10:56,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 27: [2023-05-10 10:10:56,218] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 27: [2023-05-10 10:10:56,219] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 16: [2023-05-10 10:10:56,219] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 27: [2023-05-10 10:10:56,219] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 5: [2023-05-10 10:10:56,219] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 5: [2023-05-10 10:10:56,219] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 5: [2023-05-10 10:10:56,220] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 12: [2023-05-10 10:10:56,219] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 12: [2023-05-10 10:10:56,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 12: [2023-05-10 10:10:56,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 12: [2023-05-10 10:10:56,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 12: [2023-05-10 10:10:56,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 23: [2023-05-10 10:10:56,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 12: [2023-05-10 10:10:56,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 27: [2023-05-10 10:10:56,220] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 27: [2023-05-10 10:10:56,220] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 18: [2023-05-10 10:10:56,221] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 18: [2023-05-10 10:10:56,221] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 5: [2023-05-10 10:10:56,221] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 29: [2023-05-10 10:10:56,221] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 5: [2023-05-10 10:10:56,221] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 12: [2023-05-10 10:10:56,222] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 29: [2023-05-10 10:10:56,222] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 11: [2023-05-10 10:10:56,222] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 11: [2023-05-10 10:10:56,222] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 7: [2023-05-10 10:10:56,223] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 7: [2023-05-10 10:10:56,223] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 18: [2023-05-10 10:10:56,223] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 7: [2023-05-10 10:10:56,223] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 11: [2023-05-10 10:10:56,223] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 11: [2023-05-10 10:10:56,223] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 11: [2023-05-10 10:10:56,223] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 11: [2023-05-10 10:10:56,223] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 11: [2023-05-10 10:10:56,223] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 11: [2023-05-10 10:10:56,223] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 7: [2023-05-10 10:10:56,224] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 12: [2023-05-10 10:10:56,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 16: [2023-05-10 10:10:56,225] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 7: [2023-05-10 10:10:56,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 29: [2023-05-10 10:10:56,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 7: [2023-05-10 10:10:56,226] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 29: [2023-05-10 10:10:56,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 29: [2023-05-10 10:10:56,226] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 4: [2023-05-10 10:10:56,226] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 7: [2023-05-10 10:10:56,227] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 7: [2023-05-10 10:10:56,227] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 29: [2023-05-10 10:10:56,227] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 3: [2023-05-10 10:10:56,227] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 3: [2023-05-10 10:10:56,227] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 11: [2023-05-10 10:10:56,227] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 11: [2023-05-10 10:10:56,227] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 29: [2023-05-10 10:10:56,227] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 29: [2023-05-10 10:10:56,228] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 16: [2023-05-10 10:10:56,228] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 16: [2023-05-10 10:10:56,228] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 16: [2023-05-10 10:10:56,228] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 16: [2023-05-10 10:10:56,228] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 16: [2023-05-10 10:10:56,228] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 23: [2023-05-10 10:10:56,228] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 15: [2023-05-10 10:10:56,228] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 3: [2023-05-10 10:10:56,229] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 3: [2023-05-10 10:10:56,229] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 3: [2023-05-10 10:10:56,230] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 3: [2023-05-10 10:10:56,230] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 3: [2023-05-10 10:10:56,230] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 13: [2023-05-10 10:10:56,230] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 13: [2023-05-10 10:10:56,230] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 13: [2023-05-10 10:10:56,230] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 3: [2023-05-10 10:10:56,230] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 13: [2023-05-10 10:10:56,230] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 13: [2023-05-10 10:10:56,230] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 13: [2023-05-10 10:10:56,230] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 13: [2023-05-10 10:10:56,230] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 13: [2023-05-10 10:10:56,230] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 16: [2023-05-10 10:10:56,230] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 23: [2023-05-10 10:10:56,231] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 12: [2023-05-10 10:10:56,231] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 8: [2023-05-10 10:10:56,231] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 12: [2023-05-10 10:10:56,232] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 12: [2023-05-10 10:10:56,232] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 12: [2023-05-10 10:10:56,232] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 3: [2023-05-10 10:10:56,232] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 3: [2023-05-10 10:10:56,232] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 8: [2023-05-10 10:10:56,233] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 12: [2023-05-10 10:10:56,233] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 12: [2023-05-10 10:10:56,234] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 3: [2023-05-10 10:10:56,234] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 23: [2023-05-10 10:10:56,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 11: [2023-05-10 10:10:56,233] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 11: [2023-05-10 10:10:56,234] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 11: [2023-05-10 10:10:56,235] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 11: [2023-05-10 10:10:56,235] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 11: [2023-05-10 10:10:56,235] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 4: [2023-05-10 10:10:56,236] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 4: [2023-05-10 10:10:56,236] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 4: [2023-05-10 10:10:56,236] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 4: [2023-05-10 10:10:56,236] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 11: [2023-05-10 10:10:56,236] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 14: [2023-05-10 10:10:56,236] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 8: [2023-05-10 10:10:56,236] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 8: [2023-05-10 10:10:56,236] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 8: [2023-05-10 10:10:56,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 14: [2023-05-10 10:10:56,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 14: [2023-05-10 10:10:56,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 3: [2023-05-10 10:10:56,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 3: [2023-05-10 10:10:56,238] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 3: [2023-05-10 10:10:56,239] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 3: [2023-05-10 10:10:56,239] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 10: [2023-05-10 10:10:56,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 10: [2023-05-10 10:10:56,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 3: [2023-05-10 10:10:56,240] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 10: [2023-05-10 10:10:56,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 10: [2023-05-10 10:10:56,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 10: [2023-05-10 10:10:56,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 10: [2023-05-10 10:10:56,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 31: [2023-05-10 10:10:56,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 10: [2023-05-10 10:10:56,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 10: [2023-05-10 10:10:56,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 31: [2023-05-10 10:10:56,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 23: [2023-05-10 10:10:56,242] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 4: [2023-05-10 10:10:56,242] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 4: [2023-05-10 10:10:56,242] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 4: [2023-05-10 10:10:56,242] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 13: [2023-05-10 10:10:56,242] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 13: [2023-05-10 10:10:56,243] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 13: [2023-05-10 10:10:56,243] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 13: [2023-05-10 10:10:56,243] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 13: [2023-05-10 10:10:56,244] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 13: [2023-05-10 10:10:56,244] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 23: [2023-05-10 10:10:56,244] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 13: [2023-05-10 10:10:56,244] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 10: [2023-05-10 10:10:56,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 13: [2023-05-10 10:10:56,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 10: [2023-05-10 10:10:56,246] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 10: [2023-05-10 10:10:56,247] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 8: [2023-05-10 10:10:56,247] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 8: [2023-05-10 10:10:56,247] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 8: [2023-05-10 10:10:56,248] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 10: [2023-05-10 10:10:56,248] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 10: [2023-05-10 10:10:56,248] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 10: [2023-05-10 10:10:56,249] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 23: [2023-05-10 10:10:56,249] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 23: [2023-05-10 10:10:56,249] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 23: [2023-05-10 10:10:56,249] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 14: [2023-05-10 10:10:56,249] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 10: [2023-05-10 10:10:56,250] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 15: [2023-05-10 10:10:56,250] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 10: [2023-05-10 10:10:56,250] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 14: [2023-05-10 10:10:56,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 14: [2023-05-10 10:10:56,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 14: [2023-05-10 10:10:56,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 16: [2023-05-10 10:10:56,253] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 14: [2023-05-10 10:10:56,253] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 4: [2023-05-10 10:10:56,254] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 23: [2023-05-10 10:10:56,255] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 23: [2023-05-10 10:10:56,255] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 15: [2023-05-10 10:10:56,256] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 16: [2023-05-10 10:10:56,258] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 15: [2023-05-10 10:10:56,258] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 15: [2023-05-10 10:10:56,258] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 27: [2023-05-10 10:10:56,259] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 4: [2023-05-10 10:10:56,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 16: [2023-05-10 10:10:56,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 14: [2023-05-10 10:10:56,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 16: [2023-05-10 10:10:56,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 31: [2023-05-10 10:10:56,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 8: [2023-05-10 10:10:56,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 26: [2023-05-10 10:10:56,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 26: [2023-05-10 10:10:56,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 19: [2023-05-10 10:10:56,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 19: [2023-05-10 10:10:56,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 14: [2023-05-10 10:10:56,262] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 21: [2023-05-10 10:10:56,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 21: [2023-05-10 10:10:56,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 21: [2023-05-10 10:10:56,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 4: [2023-05-10 10:10:56,263] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 8: [2023-05-10 10:10:56,263] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 23: [2023-05-10 10:10:56,263] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 6: [2023-05-10 10:10:56,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 6: [2023-05-10 10:10:56,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 31: [2023-05-10 10:10:56,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 26: [2023-05-10 10:10:56,264] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 26: [2023-05-10 10:10:56,264] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 26: [2023-05-10 10:10:56,264] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 31: [2023-05-10 10:10:56,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 26: [2023-05-10 10:10:56,264] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 26: [2023-05-10 10:10:56,264] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 8: [2023-05-10 10:10:56,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 8: [2023-05-10 10:10:56,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 26: [2023-05-10 10:10:56,264] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 8: [2023-05-10 10:10:56,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 4: [2023-05-10 10:10:56,265] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 4: [2023-05-10 10:10:56,265] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 14: [2023-05-10 10:10:56,265] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 21: [2023-05-10 10:10:56,264] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 16: [2023-05-10 10:10:56,265] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 6: [2023-05-10 10:10:56,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 6: [2023-05-10 10:10:56,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 6: [2023-05-10 10:10:56,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 6: [2023-05-10 10:10:56,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 6: [2023-05-10 10:10:56,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 29: [2023-05-10 10:10:56,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 16: [2023-05-10 10:10:56,266] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 6: [2023-05-10 10:10:56,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 1: [2023-05-10 10:10:56,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 1: [2023-05-10 10:10:56,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 1: [2023-05-10 10:10:56,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 26: [2023-05-10 10:10:56,266] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 26: [2023-05-10 10:10:56,267] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 15: [2023-05-10 10:10:56,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 15: [2023-05-10 10:10:56,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 15: [2023-05-10 10:10:56,267] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 21: [2023-05-10 10:10:56,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 31: [2023-05-10 10:10:56,267] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 31: [2023-05-10 10:10:56,267] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 31: [2023-05-10 10:10:56,267] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 31: [2023-05-10 10:10:56,267] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 31: [2023-05-10 10:10:56,267] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 27: [2023-05-10 10:10:56,267] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 6: [2023-05-10 10:10:56,268] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 6: [2023-05-10 10:10:56,268] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 1: [2023-05-10 10:10:56,268] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 4: [2023-05-10 10:10:56,270] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 4: [2023-05-10 10:10:56,270] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 27: [2023-05-10 10:10:56,271] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 19: [2023-05-10 10:10:56,272] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 5: [2023-05-10 10:10:56,272] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 5: [2023-05-10 10:10:56,272] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 19: [2023-05-10 10:10:56,273] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 12: [2023-05-10 10:10:56,273] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 15: [2023-05-10 10:10:56,273] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 8: [2023-05-10 10:10:56,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 23: [2023-05-10 10:10:56,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 4: [2023-05-10 10:10:56,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 26: [2023-05-10 10:10:56,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 8: [2023-05-10 10:10:56,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 8: [2023-05-10 10:10:56,275] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 5: [2023-05-10 10:10:56,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 6: [2023-05-10 10:10:56,275] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 6: [2023-05-10 10:10:56,275] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 26: [2023-05-10 10:10:56,275] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 26: [2023-05-10 10:10:56,275] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 26: [2023-05-10 10:10:56,276] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 26: [2023-05-10 10:10:56,276] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 6: [2023-05-10 10:10:56,276] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 26: [2023-05-10 10:10:56,276] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 6: [2023-05-10 10:10:56,276] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 29: [2023-05-10 10:10:56,277] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 6: [2023-05-10 10:10:56,277] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 6: [2023-05-10 10:10:56,277] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 21: [2023-05-10 10:10:56,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 23: [2023-05-10 10:10:56,278] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 23: [2023-05-10 10:10:56,279] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 23: [2023-05-10 10:10:56,279] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 11: [2023-05-10 10:10:56,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 11: [2023-05-10 10:10:56,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 14: [2023-05-10 10:10:56,280] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 7: [2023-05-10 10:10:56,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 7: [2023-05-10 10:10:56,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 7: [2023-05-10 10:10:56,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 7: [2023-05-10 10:10:56,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 14: [2023-05-10 10:10:56,281] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 14: [2023-05-10 10:10:56,281] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 27: [2023-05-10 10:10:56,281] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 1: [2023-05-10 10:10:56,282] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 14: [2023-05-10 10:10:56,282] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 21: [2023-05-10 10:10:56,282] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 21: [2023-05-10 10:10:56,282] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 5: [2023-05-10 10:10:56,282] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 5: [2023-05-10 10:10:56,282] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 5: [2023-05-10 10:10:56,282] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 19: [2023-05-10 10:10:56,283] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 1: [2023-05-10 10:10:56,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 1: [2023-05-10 10:10:56,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 1: [2023-05-10 10:10:56,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 7: [2023-05-10 10:10:56,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 19: [2023-05-10 10:10:56,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 7: [2023-05-10 10:10:56,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 7: [2023-05-10 10:10:56,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 7: [2023-05-10 10:10:56,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 5: [2023-05-10 10:10:56,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 5: [2023-05-10 10:10:56,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 19: [2023-05-10 10:10:56,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 19: [2023-05-10 10:10:56,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 19: [2023-05-10 10:10:56,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 21: [2023-05-10 10:10:56,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 19: [2023-05-10 10:10:56,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 14: [2023-05-10 10:10:56,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 31: [2023-05-10 10:10:56,287] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 29: [2023-05-10 10:10:56,288] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 3: [2023-05-10 10:10:56,287] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 3: [2023-05-10 10:10:56,287] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 3: [2023-05-10 10:10:56,287] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 27: [2023-05-10 10:10:56,288] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 27: [2023-05-10 10:10:56,288] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 27: [2023-05-10 10:10:56,288] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 27: [2023-05-10 10:10:56,288] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 15: [2023-05-10 10:10:56,288] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 27: [2023-05-10 10:10:56,288] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 15: [2023-05-10 10:10:56,289] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 1: [2023-05-10 10:10:56,290] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 1: [2023-05-10 10:10:56,290] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 27: [2023-05-10 10:10:56,290] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 21: [2023-05-10 10:10:56,291] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 24: [2023-05-10 10:10:56,290] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 15: [2023-05-10 10:10:56,291] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 25: [2023-05-10 10:10:56,291] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 25: [2023-05-10 10:10:56,291] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 25: [2023-05-10 10:10:56,291] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 25: [2023-05-10 10:10:56,291] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 24: [2023-05-10 10:10:56,291] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 24: [2023-05-10 10:10:56,291] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 24: [2023-05-10 10:10:56,291] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 24: [2023-05-10 10:10:56,291] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 24: [2023-05-10 10:10:56,291] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 24: [2023-05-10 10:10:56,291] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 25: [2023-05-10 10:10:56,291] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 25: [2023-05-10 10:10:56,291] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 25: [2023-05-10 10:10:56,291] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 25: [2023-05-10 10:10:56,291] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 24: [2023-05-10 10:10:56,291] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 20: [2023-05-10 10:10:56,291] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 20: [2023-05-10 10:10:56,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 20: [2023-05-10 10:10:56,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 21: [2023-05-10 10:10:56,292] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 20: [2023-05-10 10:10:56,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 20: [2023-05-10 10:10:56,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 20: [2023-05-10 10:10:56,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 20: [2023-05-10 10:10:56,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 20: [2023-05-10 10:10:56,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 21: [2023-05-10 10:10:56,293] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 1: [2023-05-10 10:10:56,294] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 29: [2023-05-10 10:10:56,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 29: [2023-05-10 10:10:56,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 29: [2023-05-10 10:10:56,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 20: [2023-05-10 10:10:56,295] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 21: [2023-05-10 10:10:56,295] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 29: [2023-05-10 10:10:56,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 24: [2023-05-10 10:10:56,295] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 24: [2023-05-10 10:10:56,296] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 12: [2023-05-10 10:10:56,296] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 3: [2023-05-10 10:10:56,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 25: [2023-05-10 10:10:56,296] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 25: [2023-05-10 10:10:56,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 25: [2023-05-10 10:10:56,298] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 29: [2023-05-10 10:10:56,298] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 27: [2023-05-10 10:10:56,298] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 15: [2023-05-10 10:10:56,298] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 25: [2023-05-10 10:10:56,298] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 25: [2023-05-10 10:10:56,298] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 24: [2023-05-10 10:10:56,298] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 25: [2023-05-10 10:10:56,299] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 3: [2023-05-10 10:10:56,299] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 1: [2023-05-10 10:10:56,299] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 15: [2023-05-10 10:10:56,299] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 20: [2023-05-10 10:10:56,299] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 24: [2023-05-10 10:10:56,299] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 28: [2023-05-10 10:10:56,299] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 24: [2023-05-10 10:10:56,299] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 25: [2023-05-10 10:10:56,299] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 28: [2023-05-10 10:10:56,299] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 24: [2023-05-10 10:10:56,299] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 25: [2023-05-10 10:10:56,299] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 31: [2023-05-10 10:10:56,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 24: [2023-05-10 10:10:56,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 19: [2023-05-10 10:10:56,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 31: [2023-05-10 10:10:56,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 24: [2023-05-10 10:10:56,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 31: [2023-05-10 10:10:56,301] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 15: [2023-05-10 10:10:56,301] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 5: [2023-05-10 10:10:56,302] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 28: [2023-05-10 10:10:56,301] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 28: [2023-05-10 10:10:56,301] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 28: [2023-05-10 10:10:56,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 28: [2023-05-10 10:10:56,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 28: [2023-05-10 10:10:56,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 31: [2023-05-10 10:10:56,302] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 5: [2023-05-10 10:10:56,302] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 31: [2023-05-10 10:10:56,302] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 28: [2023-05-10 10:10:56,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 10: [2023-05-10 10:10:56,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 28: [2023-05-10 10:10:56,302] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 0: [2023-05-10 10:10:56,303] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 12: [2023-05-10 10:10:56,303] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 19: [2023-05-10 10:10:56,303] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 0: [2023-05-10 10:10:56,303] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 0: [2023-05-10 10:10:56,303] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 10: [2023-05-10 10:10:56,303] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 10: [2023-05-10 10:10:56,303] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 0: [2023-05-10 10:10:56,303] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 0: [2023-05-10 10:10:56,303] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 0: [2023-05-10 10:10:56,303] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 0: [2023-05-10 10:10:56,303] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 0: [2023-05-10 10:10:56,303] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 12: [2023-05-10 10:10:56,303] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 20: [2023-05-10 10:10:56,303] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 20: [2023-05-10 10:10:56,304] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 20: [2023-05-10 10:10:56,304] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 20: [2023-05-10 10:10:56,304] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 2: [2023-05-10 10:10:56,303] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 2: [2023-05-10 10:10:56,303] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 2: [2023-05-10 10:10:56,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 2: [2023-05-10 10:10:56,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 2: [2023-05-10 10:10:56,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 2: [2023-05-10 10:10:56,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 2: [2023-05-10 10:10:56,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 2: [2023-05-10 10:10:56,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 11: [2023-05-10 10:10:56,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 12: [2023-05-10 10:10:56,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 5: [2023-05-10 10:10:56,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 20: [2023-05-10 10:10:56,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 28: [2023-05-10 10:10:56,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 20: [2023-05-10 10:10:56,306] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 11: [2023-05-10 10:10:56,306] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 11: [2023-05-10 10:10:56,306] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 11: [2023-05-10 10:10:56,306] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 11: [2023-05-10 10:10:56,306] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 11: [2023-05-10 10:10:56,306] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 21: [2023-05-10 10:10:56,306] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 11: [2023-05-10 10:10:56,306] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 11: [2023-05-10 10:10:56,306] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 9: [2023-05-10 10:10:56,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 9: [2023-05-10 10:10:56,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 15: [2023-05-10 10:10:56,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 0: [2023-05-10 10:10:56,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 28: [2023-05-10 10:10:56,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 0: [2023-05-10 10:10:56,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 0: [2023-05-10 10:10:56,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 2: [2023-05-10 10:10:56,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 2: [2023-05-10 10:10:56,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 3: [2023-05-10 10:10:56,309] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 0: [2023-05-10 10:10:56,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 0: [2023-05-10 10:10:56,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 3: [2023-05-10 10:10:56,310] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 3: [2023-05-10 10:10:56,310] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 3: [2023-05-10 10:10:56,310] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 0: [2023-05-10 10:10:56,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 9: [2023-05-10 10:10:56,310] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 9: [2023-05-10 10:10:56,310] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 9: [2023-05-10 10:10:56,310] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 9: [2023-05-10 10:10:56,310] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 9: [2023-05-10 10:10:56,310] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 10: [2023-05-10 10:10:56,310] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 0: [2023-05-10 10:10:56,311] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 9: [2023-05-10 10:10:56,311] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 5: [2023-05-10 10:10:56,311] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 29: [2023-05-10 10:10:56,311] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 10: [2023-05-10 10:10:56,311] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 29: [2023-05-10 10:10:56,311] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 0: [2023-05-10 10:10:56,311] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 5: [2023-05-10 10:10:56,311] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 28: [2023-05-10 10:10:56,311] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 10: [2023-05-10 10:10:56,311] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 28: [2023-05-10 10:10:56,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 28: [2023-05-10 10:10:56,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 28: [2023-05-10 10:10:56,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 5: [2023-05-10 10:10:56,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 28: [2023-05-10 10:10:56,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 21: [2023-05-10 10:10:56,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 21: [2023-05-10 10:10:56,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 7: [2023-05-10 10:10:56,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 3: [2023-05-10 10:10:56,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 3: [2023-05-10 10:10:56,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 9: [2023-05-10 10:10:56,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 1: [2023-05-10 10:10:56,314] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 9: [2023-05-10 10:10:56,314] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 2: [2023-05-10 10:10:56,314] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 5: [2023-05-10 10:10:56,314] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 1: [2023-05-10 10:10:56,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 12: [2023-05-10 10:10:56,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 12: [2023-05-10 10:10:56,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 12: [2023-05-10 10:10:56,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 12: [2023-05-10 10:10:56,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 2: [2023-05-10 10:10:56,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 7: [2023-05-10 10:10:56,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 7: [2023-05-10 10:10:56,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 7: [2023-05-10 10:10:56,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 1: [2023-05-10 10:10:56,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 19: [2023-05-10 10:10:56,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 9: [2023-05-10 10:10:56,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 7: [2023-05-10 10:10:56,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 2: [2023-05-10 10:10:56,316] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 19: [2023-05-10 10:10:56,316] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 2: [2023-05-10 10:10:56,316] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 5: [2023-05-10 10:10:56,316] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 9: [2023-05-10 10:10:56,316] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 1: [2023-05-10 10:10:56,316] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 2: [2023-05-10 10:10:56,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 2: [2023-05-10 10:10:56,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 19: [2023-05-10 10:10:56,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 27: [2023-05-10 10:10:56,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 7: [2023-05-10 10:10:56,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 9: [2023-05-10 10:10:56,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 19: [2023-05-10 10:10:56,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 9: [2023-05-10 10:10:56,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 27: [2023-05-10 10:10:56,318] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 7: [2023-05-10 10:10:56,318] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 10: [2023-05-10 10:10:56,318] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 9: [2023-05-10 10:10:56,319] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 9: [2023-05-10 10:10:56,319] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 13: [2023-05-10 10:10:56,319] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 6: [2023-05-10 10:10:56,319] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 6: [2023-05-10 10:10:56,319] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 13: [2023-05-10 10:10:56,319] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 13: [2023-05-10 10:10:56,319] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 26: [2023-05-10 10:10:56,320] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 26: [2023-05-10 10:10:56,320] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 27: [2023-05-10 10:10:56,320] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 7: [2023-05-10 10:10:56,322] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 3: [2023-05-10 10:10:56,322] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 22: [2023-05-10 10:10:56,323] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 22: [2023-05-10 10:10:56,323] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 22: [2023-05-10 10:10:56,323] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 22: [2023-05-10 10:10:56,323] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 29: [2023-05-10 10:10:56,323] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 27: [2023-05-10 10:10:56,323] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 22: [2023-05-10 10:10:56,323] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 22: [2023-05-10 10:10:56,323] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 22: [2023-05-10 10:10:56,323] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 22: [2023-05-10 10:10:56,324] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 27: [2023-05-10 10:10:56,324] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 29: [2023-05-10 10:10:56,324] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 3: [2023-05-10 10:10:56,325] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 29: [2023-05-10 10:10:56,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 29: [2023-05-10 10:10:56,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 22: [2023-05-10 10:10:56,326] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 10: [2023-05-10 10:10:56,327] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 10: [2023-05-10 10:10:56,328] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 22: [2023-05-10 10:10:56,329] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 10: [2023-05-10 10:10:56,329] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 17: [2023-05-10 10:10:56,329] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 17: [2023-05-10 10:10:56,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 17: [2023-05-10 10:10:56,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 17: [2023-05-10 10:10:56,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 17: [2023-05-10 10:10:56,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 17: [2023-05-10 10:10:56,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 17: [2023-05-10 10:10:56,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 17: [2023-05-10 10:10:56,331] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 10: [2023-05-10 10:10:56,332] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 12: [2023-05-10 10:10:56,332] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 22: [2023-05-10 10:10:56,333] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 22: [2023-05-10 10:10:56,334] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 12: [2023-05-10 10:10:56,334] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 13: [2023-05-10 10:10:56,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 13: [2023-05-10 10:10:56,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 13: [2023-05-10 10:10:56,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 13: [2023-05-10 10:10:56,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 13: [2023-05-10 10:10:56,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 17: [2023-05-10 10:10:56,333] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 22: [2023-05-10 10:10:56,335] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 22: [2023-05-10 10:10:56,335] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 22: [2023-05-10 10:10:56,335] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 22: [2023-05-10 10:10:56,336] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 26: [2023-05-10 10:10:56,336] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 30: [2023-05-10 10:10:56,337] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 30: [2023-05-10 10:10:56,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 30: [2023-05-10 10:10:56,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 30: [2023-05-10 10:10:56,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 30: [2023-05-10 10:10:56,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 30: [2023-05-10 10:10:56,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 30: [2023-05-10 10:10:56,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 30: [2023-05-10 10:10:56,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 20: [2023-05-10 10:10:56,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 10: [2023-05-10 10:10:56,339] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 6: [2023-05-10 10:10:56,340] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 10: [2023-05-10 10:10:56,340] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 6: [2023-05-10 10:10:56,340] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 12: [2023-05-10 10:10:56,340] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 11: [2023-05-10 10:10:56,340] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 29: [2023-05-10 10:10:56,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 29: [2023-05-10 10:10:56,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 11: [2023-05-10 10:10:56,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 11: [2023-05-10 10:10:56,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 3: [2023-05-10 10:10:56,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 11: [2023-05-10 10:10:56,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 11: [2023-05-10 10:10:56,342] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 17: [2023-05-10 10:10:56,342] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 17: [2023-05-10 10:10:56,342] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 17: [2023-05-10 10:10:56,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 3: [2023-05-10 10:10:56,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 17: [2023-05-10 10:10:56,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 17: [2023-05-10 10:10:56,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 3: [2023-05-10 10:10:56,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 26: [2023-05-10 10:10:56,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 11: [2023-05-10 10:10:56,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 10: [2023-05-10 10:10:56,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 17: [2023-05-10 10:10:56,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 6: [2023-05-10 10:10:56,344] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 6: [2023-05-10 10:10:56,344] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 26: [2023-05-10 10:10:56,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 17: [2023-05-10 10:10:56,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 25: [2023-05-10 10:10:56,344] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 28: [2023-05-10 10:10:56,345] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 24: [2023-05-10 10:10:56,346] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 24: [2023-05-10 10:10:56,346] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 12: [2023-05-10 10:10:56,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 26: [2023-05-10 10:10:56,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 26: [2023-05-10 10:10:56,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 12: [2023-05-10 10:10:56,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 30: [2023-05-10 10:10:56,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 6: [2023-05-10 10:10:56,348] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 6: [2023-05-10 10:10:56,348] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 30: [2023-05-10 10:10:56,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 10: [2023-05-10 10:10:56,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 26: [2023-05-10 10:10:56,349] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 26: [2023-05-10 10:10:56,349] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 12: [2023-05-10 10:10:56,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 30: [2023-05-10 10:10:56,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 6: [2023-05-10 10:10:56,350] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 6: [2023-05-10 10:10:56,350] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 12: [2023-05-10 10:10:56,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 26: [2023-05-10 10:10:56,350] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 30: [2023-05-10 10:10:56,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 13: [2023-05-10 10:10:56,351] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 30: [2023-05-10 10:10:56,351] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 30: [2023-05-10 10:10:56,352] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 30: [2023-05-10 10:10:56,353] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 13: [2023-05-10 10:10:56,353] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 13: [2023-05-10 10:10:56,353] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 30: [2023-05-10 10:10:56,353] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt... 10: [2023-05-10 10:10:56,356] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 20: [2023-05-10 10:10:56,358] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 20: [2023-05-10 10:10:56,359] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 2: [2023-05-10 10:10:56,359] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 2: [2023-05-10 10:10:56,359] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 28: [2023-05-10 10:10:56,360] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 26: [2023-05-10 10:10:56,362] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 25: [2023-05-10 10:10:56,361] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 25: [2023-05-10 10:10:56,362] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 25: [2023-05-10 10:10:56,362] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 25: [2023-05-10 10:10:56,362] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 25: [2023-05-10 10:10:56,362] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 25: [2023-05-10 10:10:56,362] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 25: [2023-05-10 10:10:56,362] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 25: [2023-05-10 10:10:56,364] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 0: [2023-05-10 10:10:56,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 0: [2023-05-10 10:10:56,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 0: [2023-05-10 10:10:56,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 28: [2023-05-10 10:10:56,366] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 9: [2023-05-10 10:10:56,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 13: [2023-05-10 10:10:56,368] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 9: [2023-05-10 10:10:56,369] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 24: [2023-05-10 10:10:56,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 24: [2023-05-10 10:10:56,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 24: [2023-05-10 10:10:56,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 24: [2023-05-10 10:10:56,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 24: [2023-05-10 10:10:56,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 24: [2023-05-10 10:10:56,371] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 13: [2023-05-10 10:10:56,371] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 24: [2023-05-10 10:10:56,372] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 22: [2023-05-10 10:10:56,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 24: [2023-05-10 10:10:56,372] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 6: [2023-05-10 10:10:56,374] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 9: [2023-05-10 10:10:56,374] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 13: [2023-05-10 10:10:56,375] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 13: [2023-05-10 10:10:56,375] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 13: [2023-05-10 10:10:56,375] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 26: [2023-05-10 10:10:56,376] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 0: [2023-05-10 10:10:56,377] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 0: [2023-05-10 10:10:56,377] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 0: [2023-05-10 10:10:56,377] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 0: [2023-05-10 10:10:56,377] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 28: [2023-05-10 10:10:56,377] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 6: [2023-05-10 10:10:56,377] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 26: [2023-05-10 10:10:56,377] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 28: [2023-05-10 10:10:56,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 9: [2023-05-10 10:10:56,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 26: [2023-05-10 10:10:56,379] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 20: [2023-05-10 10:10:56,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 9: [2023-05-10 10:10:56,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 9: [2023-05-10 10:10:56,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 2: [2023-05-10 10:10:56,379] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 2: [2023-05-10 10:10:56,380] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 26: [2023-05-10 10:10:56,380] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 9: [2023-05-10 10:10:56,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 9: [2023-05-10 10:10:56,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 0: [2023-05-10 10:10:56,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 28: [2023-05-10 10:10:56,381] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 6: [2023-05-10 10:10:56,382] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 17: [2023-05-10 10:10:56,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 6: [2023-05-10 10:10:56,382] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 28: [2023-05-10 10:10:56,383] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 28: [2023-05-10 10:10:56,383] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 28: [2023-05-10 10:10:56,383] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 28: [2023-05-10 10:10:56,383] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 20: [2023-05-10 10:10:56,385] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 26: [2023-05-10 10:10:56,385] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 22: [2023-05-10 10:10:56,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 2: [2023-05-10 10:10:56,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 2: [2023-05-10 10:10:56,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 2: [2023-05-10 10:10:56,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 25: [2023-05-10 10:10:56,387] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 2: [2023-05-10 10:10:56,387] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 2: [2023-05-10 10:10:56,387] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 20: [2023-05-10 10:10:56,387] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 20: [2023-05-10 10:10:56,387] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 20: [2023-05-10 10:10:56,387] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 20: [2023-05-10 10:10:56,387] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 20: [2023-05-10 10:10:56,387] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 2: [2023-05-10 10:10:56,388] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 6: [2023-05-10 10:10:56,388] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 6: [2023-05-10 10:10:56,388] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 25: [2023-05-10 10:10:56,389] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 0: [2023-05-10 10:10:56,390] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 25: [2023-05-10 10:10:56,389] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 25: [2023-05-10 10:10:56,390] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 25: [2023-05-10 10:10:56,391] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 25: [2023-05-10 10:10:56,391] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 9: [2023-05-10 10:10:56,392] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 25: [2023-05-10 10:10:56,392] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 22: [2023-05-10 10:10:56,393] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 0: [2023-05-10 10:10:56,395] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 0: [2023-05-10 10:10:56,395] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 9: [2023-05-10 10:10:56,396] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 22: [2023-05-10 10:10:56,399] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 24: [2023-05-10 10:10:56,401] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 24: [2023-05-10 10:10:56,403] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 24: [2023-05-10 10:10:56,403] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 24: [2023-05-10 10:10:56,403] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 17: [2023-05-10 10:10:56,404] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 28: [2023-05-10 10:10:56,404] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 9: [2023-05-10 10:10:56,405] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 22: [2023-05-10 10:10:56,405] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 22: [2023-05-10 10:10:56,405] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 28: [2023-05-10 10:10:56,406] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 9: [2023-05-10 10:10:56,406] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 24: [2023-05-10 10:10:56,406] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 24: [2023-05-10 10:10:56,406] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 20: [2023-05-10 10:10:56,407] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 9: [2023-05-10 10:10:56,408] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 9: [2023-05-10 10:10:56,408] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 0: [2023-05-10 10:10:56,408] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 9: [2023-05-10 10:10:56,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 9: [2023-05-10 10:10:56,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 0: [2023-05-10 10:10:56,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 0: [2023-05-10 10:10:56,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 0: [2023-05-10 10:10:56,410] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 22: [2023-05-10 10:10:56,411] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 0: [2023-05-10 10:10:56,412] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 28: [2023-05-10 10:10:56,413] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 28: [2023-05-10 10:10:56,413] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 28: [2023-05-10 10:10:56,414] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 22: [2023-05-10 10:10:56,414] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 22: [2023-05-10 10:10:56,414] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 22: [2023-05-10 10:10:56,414] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 20: [2023-05-10 10:10:56,416] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 28: [2023-05-10 10:10:56,416] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 17: [2023-05-10 10:10:56,416] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 17: [2023-05-10 10:10:56,416] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 2: [2023-05-10 10:10:56,420] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 2: [2023-05-10 10:10:56,420] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 20: [2023-05-10 10:10:56,421] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 20: [2023-05-10 10:10:56,422] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 20: [2023-05-10 10:10:56,422] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 20: [2023-05-10 10:10:56,422] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 22: [2023-05-10 10:10:56,422] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 2: [2023-05-10 10:10:56,424] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 17: [2023-05-10 10:10:56,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 17: [2023-05-10 10:10:56,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 17: [2023-05-10 10:10:56,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 17: [2023-05-10 10:10:56,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 17: [2023-05-10 10:10:56,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 30: [2023-05-10 10:10:56,425] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 30: [2023-05-10 10:10:56,425] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 2: [2023-05-10 10:10:56,427] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 2: [2023-05-10 10:10:56,427] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 30: [2023-05-10 10:10:56,427] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 30: [2023-05-10 10:10:56,428] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 2: [2023-05-10 10:10:56,428] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 22: [2023-05-10 10:10:56,435] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 22: [2023-05-10 10:10:56,435] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 22: [2023-05-10 10:10:56,442] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 17: [2023-05-10 10:10:56,446] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 22: [2023-05-10 10:10:56,445] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 22: [2023-05-10 10:10:56,445] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 17: [2023-05-10 10:10:56,446] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 30: [2023-05-10 10:10:56,454] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 17: [2023-05-10 10:10:56,454] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 30: [2023-05-10 10:10:56,457] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 30: [2023-05-10 10:10:56,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 17: [2023-05-10 10:10:56,458] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 30: [2023-05-10 10:10:56,459] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 30: [2023-05-10 10:10:56,459] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 30: [2023-05-10 10:10:56,459] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_22-model_00-model_states.pt. 30: [2023-05-10 10:10:56,459] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 17: [2023-05-10 10:10:56,460] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 17: [2023-05-10 10:10:56,460] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 17: [2023-05-10 10:10:56,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 30: [2023-05-10 10:10:56,464] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 30: [2023-05-10 10:10:56,486] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 30: [2023-05-10 10:10:56,488] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 30: [2023-05-10 10:10:56,489] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 30: [2023-05-10 10:10:56,489] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 18: [2023-05-10 10:10:56,631] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 18: [2023-05-10 10:10:56,631] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 18: [2023-05-10 10:10:56,631] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 18: [2023-05-10 10:10:56,631] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 18: [2023-05-10 10:10:56,631] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 18: [2023-05-10 10:10:56,631] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 18: [2023-05-10 10:10:56,631] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 18: [2023-05-10 10:10:56,631] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 18: [2023-05-10 10:10:56,636] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 18: [2023-05-10 10:10:56,636] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 18: [2023-05-10 10:10:56,636] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 18: [2023-05-10 10:10:56,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 18: [2023-05-10 10:10:56,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 18: [2023-05-10 10:10:56,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 18: [2023-05-10 10:10:56,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 18: [2023-05-10 10:10:56,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 16: [2023-05-10 10:10:56,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 16: [2023-05-10 10:10:56,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 16: [2023-05-10 10:10:56,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 16: [2023-05-10 10:10:56,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 16: [2023-05-10 10:10:56,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 16: [2023-05-10 10:10:56,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 16: [2023-05-10 10:10:56,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 16: [2023-05-10 10:10:56,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 16: [2023-05-10 10:10:56,665] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 16: [2023-05-10 10:10:56,666] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 16: [2023-05-10 10:10:56,666] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 31: [2023-05-10 10:10:56,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 31: [2023-05-10 10:10:56,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 16: [2023-05-10 10:10:56,668] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 31: [2023-05-10 10:10:56,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 31: [2023-05-10 10:10:56,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 31: [2023-05-10 10:10:56,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 31: [2023-05-10 10:10:56,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 31: [2023-05-10 10:10:56,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 16: [2023-05-10 10:10:56,668] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 31: [2023-05-10 10:10:56,669] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 16: [2023-05-10 10:10:56,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 16: [2023-05-10 10:10:56,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 16: [2023-05-10 10:10:56,671] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 31: [2023-05-10 10:10:56,672] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 31: [2023-05-10 10:10:56,673] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 31: [2023-05-10 10:10:56,677] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 31: [2023-05-10 10:10:56,677] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 31: [2023-05-10 10:10:56,678] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 31: [2023-05-10 10:10:56,678] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 31: [2023-05-10 10:10:56,678] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 31: [2023-05-10 10:10:56,679] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 18: [2023-05-10 10:10:56,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 18: [2023-05-10 10:10:56,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 18: [2023-05-10 10:10:56,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 18: [2023-05-10 10:10:56,701] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 18: [2023-05-10 10:10:56,701] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 18: [2023-05-10 10:10:56,701] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 18: [2023-05-10 10:10:56,704] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 18: [2023-05-10 10:10:56,709] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 18: [2023-05-10 10:10:56,712] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 18: [2023-05-10 10:10:56,717] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 16: [2023-05-10 10:10:56,717] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 18: [2023-05-10 10:10:56,719] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 16: [2023-05-10 10:10:56,721] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 6: [2023-05-10 10:10:56,721] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 6: [2023-05-10 10:10:56,721] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 6: [2023-05-10 10:10:56,721] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 6: [2023-05-10 10:10:56,721] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 6: [2023-05-10 10:10:56,722] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 6: [2023-05-10 10:10:56,722] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 6: [2023-05-10 10:10:56,722] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 6: [2023-05-10 10:10:56,722] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 31: [2023-05-10 10:10:56,724] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 31: [2023-05-10 10:10:56,725] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 18: [2023-05-10 10:10:56,725] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 6: [2023-05-10 10:10:56,725] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 6: [2023-05-10 10:10:56,725] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 18: [2023-05-10 10:10:56,727] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 18: [2023-05-10 10:10:56,728] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 6: [2023-05-10 10:10:56,731] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 6: [2023-05-10 10:10:56,731] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 6: [2023-05-10 10:10:56,731] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 18: [2023-05-10 10:10:56,731] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 6: [2023-05-10 10:10:56,732] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 6: [2023-05-10 10:10:56,733] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 6: [2023-05-10 10:10:56,733] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 16: [2023-05-10 10:10:56,734] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 18: [2023-05-10 10:10:56,735] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 16: [2023-05-10 10:10:56,740] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 16: [2023-05-10 10:10:56,742] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 12: [2023-05-10 10:10:56,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 12: [2023-05-10 10:10:56,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 12: [2023-05-10 10:10:56,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 12: [2023-05-10 10:10:56,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 12: [2023-05-10 10:10:56,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 12: [2023-05-10 10:10:56,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 12: [2023-05-10 10:10:56,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 12: [2023-05-10 10:10:56,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 31: [2023-05-10 10:10:56,745] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 16: [2023-05-10 10:10:56,745] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 31: [2023-05-10 10:10:56,745] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 16: [2023-05-10 10:10:56,745] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 16: [2023-05-10 10:10:56,745] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 16: [2023-05-10 10:10:56,745] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 16: [2023-05-10 10:10:56,745] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 31: [2023-05-10 10:10:56,746] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 12: [2023-05-10 10:10:56,748] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 31: [2023-05-10 10:10:56,749] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 31: [2023-05-10 10:10:56,749] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 31: [2023-05-10 10:10:56,749] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 31: [2023-05-10 10:10:56,749] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 31: [2023-05-10 10:10:56,749] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 12: [2023-05-10 10:10:56,755] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 12: [2023-05-10 10:10:56,756] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 12: [2023-05-10 10:10:56,756] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 12: [2023-05-10 10:10:56,756] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 16: [2023-05-10 10:10:56,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 12: [2023-05-10 10:10:56,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 12: [2023-05-10 10:10:56,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 12: [2023-05-10 10:10:56,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 15: [2023-05-10 10:10:56,764] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 15: [2023-05-10 10:10:56,764] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 15: [2023-05-10 10:10:56,764] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 15: [2023-05-10 10:10:56,764] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 15: [2023-05-10 10:10:56,764] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 15: [2023-05-10 10:10:56,764] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 15: [2023-05-10 10:10:56,764] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 15: [2023-05-10 10:10:56,765] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 15: [2023-05-10 10:10:56,768] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 16: [2023-05-10 10:10:56,770] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 16: [2023-05-10 10:10:56,775] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 15: [2023-05-10 10:10:56,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 15: [2023-05-10 10:10:56,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 15: [2023-05-10 10:10:56,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 15: [2023-05-10 10:10:56,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 16: [2023-05-10 10:10:56,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 15: [2023-05-10 10:10:56,777] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 31: [2023-05-10 10:10:56,777] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 6: [2023-05-10 10:10:56,777] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 6: [2023-05-10 10:10:56,777] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 15: [2023-05-10 10:10:56,777] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 15: [2023-05-10 10:10:56,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 16: [2023-05-10 10:10:56,779] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 16: [2023-05-10 10:10:56,780] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 31: [2023-05-10 10:10:56,782] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 31: [2023-05-10 10:10:56,782] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 31: [2023-05-10 10:10:56,783] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 31: [2023-05-10 10:10:56,787] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 31: [2023-05-10 10:10:56,788] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 6: [2023-05-10 10:10:56,797] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 6: [2023-05-10 10:10:56,797] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 6: [2023-05-10 10:10:56,798] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 6: [2023-05-10 10:10:56,799] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 6: [2023-05-10 10:10:56,800] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 6: [2023-05-10 10:10:56,800] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 8: [2023-05-10 10:10:56,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 8: [2023-05-10 10:10:56,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 8: [2023-05-10 10:10:56,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 8: [2023-05-10 10:10:56,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 12: [2023-05-10 10:10:56,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 8: [2023-05-10 10:10:56,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 8: [2023-05-10 10:10:56,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 7: [2023-05-10 10:10:56,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 7: [2023-05-10 10:10:56,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 7: [2023-05-10 10:10:56,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 7: [2023-05-10 10:10:56,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 7: [2023-05-10 10:10:56,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 8: [2023-05-10 10:10:56,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 7: [2023-05-10 10:10:56,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 7: [2023-05-10 10:10:56,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 7: [2023-05-10 10:10:56,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 8: [2023-05-10 10:10:56,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 7: [2023-05-10 10:10:56,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 8: [2023-05-10 10:10:56,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 7: [2023-05-10 10:10:56,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 7: [2023-05-10 10:10:56,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 8: [2023-05-10 10:10:56,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 7: [2023-05-10 10:10:56,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 7: [2023-05-10 10:10:56,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 7: [2023-05-10 10:10:56,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 8: [2023-05-10 10:10:56,808] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 8: [2023-05-10 10:10:56,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 8: [2023-05-10 10:10:56,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 8: [2023-05-10 10:10:56,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 8: [2023-05-10 10:10:56,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 8: [2023-05-10 10:10:56,810] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 27: [2023-05-10 10:10:56,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 27: [2023-05-10 10:10:56,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 27: [2023-05-10 10:10:56,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 27: [2023-05-10 10:10:56,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 27: [2023-05-10 10:10:56,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 27: [2023-05-10 10:10:56,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 7: [2023-05-10 10:10:56,810] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 27: [2023-05-10 10:10:56,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 27: [2023-05-10 10:10:56,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 7: [2023-05-10 10:10:56,811] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 27: [2023-05-10 10:10:56,814] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 6: [2023-05-10 10:10:56,814] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 6: [2023-05-10 10:10:56,814] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 27: [2023-05-10 10:10:56,816] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 27: [2023-05-10 10:10:56,816] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 17: [2023-05-10 10:10:56,817] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 15: [2023-05-10 10:10:56,817] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 17: [2023-05-10 10:10:56,818] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 17: [2023-05-10 10:10:56,818] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 17: [2023-05-10 10:10:56,818] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 17: [2023-05-10 10:10:56,818] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 17: [2023-05-10 10:10:56,818] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 17: [2023-05-10 10:10:56,818] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 17: [2023-05-10 10:10:56,818] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 11: [2023-05-10 10:10:56,818] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 11: [2023-05-10 10:10:56,818] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 11: [2023-05-10 10:10:56,819] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 11: [2023-05-10 10:10:56,819] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 11: [2023-05-10 10:10:56,819] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 11: [2023-05-10 10:10:56,819] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 11: [2023-05-10 10:10:56,819] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 11: [2023-05-10 10:10:56,819] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 27: [2023-05-10 10:10:56,819] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 27: [2023-05-10 10:10:56,819] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 27: [2023-05-10 10:10:56,820] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 27: [2023-05-10 10:10:56,820] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 27: [2023-05-10 10:10:56,820] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 17: [2023-05-10 10:10:56,821] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 22: [2023-05-10 10:10:56,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 22: [2023-05-10 10:10:56,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 22: [2023-05-10 10:10:56,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 22: [2023-05-10 10:10:56,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 22: [2023-05-10 10:10:56,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 22: [2023-05-10 10:10:56,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 22: [2023-05-10 10:10:56,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 22: [2023-05-10 10:10:56,823] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 11: [2023-05-10 10:10:56,823] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 11: [2023-05-10 10:10:56,823] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 21: [2023-05-10 10:10:56,824] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 21: [2023-05-10 10:10:56,824] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 21: [2023-05-10 10:10:56,824] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 21: [2023-05-10 10:10:56,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 21: [2023-05-10 10:10:56,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 21: [2023-05-10 10:10:56,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 21: [2023-05-10 10:10:56,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 12: [2023-05-10 10:10:56,825] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 21: [2023-05-10 10:10:56,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 6: [2023-05-10 10:10:56,826] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 29: [2023-05-10 10:10:56,827] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 29: [2023-05-10 10:10:56,827] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 29: [2023-05-10 10:10:56,827] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 22: [2023-05-10 10:10:56,827] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 29: [2023-05-10 10:10:56,828] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 29: [2023-05-10 10:10:56,828] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 29: [2023-05-10 10:10:56,828] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 29: [2023-05-10 10:10:56,828] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 29: [2023-05-10 10:10:56,828] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 11: [2023-05-10 10:10:56,828] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 22: [2023-05-10 10:10:56,828] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 11: [2023-05-10 10:10:56,829] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 11: [2023-05-10 10:10:56,829] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 17: [2023-05-10 10:10:56,826] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 6: [2023-05-10 10:10:56,826] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 11: [2023-05-10 10:10:56,829] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 21: [2023-05-10 10:10:56,830] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 11: [2023-05-10 10:10:56,830] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 17: [2023-05-10 10:10:56,830] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 11: [2023-05-10 10:10:56,831] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 17: [2023-05-10 10:10:56,831] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 29: [2023-05-10 10:10:56,831] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 17: [2023-05-10 10:10:56,831] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 12: [2023-05-10 10:10:56,831] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 17: [2023-05-10 10:10:56,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 21: [2023-05-10 10:10:56,831] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 21: [2023-05-10 10:10:56,831] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 21: [2023-05-10 10:10:56,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 21: [2023-05-10 10:10:56,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 21: [2023-05-10 10:10:56,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 29: [2023-05-10 10:10:56,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 17: [2023-05-10 10:10:56,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 22: [2023-05-10 10:10:56,833] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 17: [2023-05-10 10:10:56,833] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 22: [2023-05-10 10:10:56,833] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 6: [2023-05-10 10:10:56,833] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 6: [2023-05-10 10:10:56,833] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 22: [2023-05-10 10:10:56,834] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 22: [2023-05-10 10:10:56,834] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 21: [2023-05-10 10:10:56,833] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 22: [2023-05-10 10:10:56,834] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 21: [2023-05-10 10:10:56,834] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 12: [2023-05-10 10:10:56,835] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 22: [2023-05-10 10:10:56,835] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 12: [2023-05-10 10:10:56,835] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 12: [2023-05-10 10:10:56,835] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 29: [2023-05-10 10:10:56,836] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 29: [2023-05-10 10:10:56,837] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 29: [2023-05-10 10:10:56,838] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 29: [2023-05-10 10:10:56,838] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 29: [2023-05-10 10:10:56,838] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 15: [2023-05-10 10:10:56,838] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 29: [2023-05-10 10:10:56,839] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 5: [2023-05-10 10:10:56,844] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 5: [2023-05-10 10:10:56,844] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 5: [2023-05-10 10:10:56,846] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 6: [2023-05-10 10:10:56,847] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 6: [2023-05-10 10:10:56,847] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 12: [2023-05-10 10:10:56,847] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 12: [2023-05-10 10:10:56,847] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 12: [2023-05-10 10:10:56,847] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 15: [2023-05-10 10:10:56,847] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 15: [2023-05-10 10:10:56,847] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 5: [2023-05-10 10:10:56,847] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 5: [2023-05-10 10:10:56,847] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 5: [2023-05-10 10:10:56,847] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 5: [2023-05-10 10:10:56,847] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 5: [2023-05-10 10:10:56,847] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 5: [2023-05-10 10:10:56,850] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 5: [2023-05-10 10:10:56,850] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 5: [2023-05-10 10:10:56,851] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 8: [2023-05-10 10:10:56,853] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 15: [2023-05-10 10:10:56,855] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 15: [2023-05-10 10:10:56,855] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 15: [2023-05-10 10:10:56,855] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 15: [2023-05-10 10:10:56,855] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 15: [2023-05-10 10:10:56,855] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 5: [2023-05-10 10:10:56,856] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 5: [2023-05-10 10:10:56,857] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 7: [2023-05-10 10:10:56,857] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 5: [2023-05-10 10:10:56,858] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 5: [2023-05-10 10:10:56,858] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 14: [2023-05-10 10:10:56,858] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 14: [2023-05-10 10:10:56,858] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 5: [2023-05-10 10:10:56,858] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 14: [2023-05-10 10:10:56,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 14: [2023-05-10 10:10:56,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 14: [2023-05-10 10:10:56,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 14: [2023-05-10 10:10:56,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 14: [2023-05-10 10:10:56,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 14: [2023-05-10 10:10:56,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 7: [2023-05-10 10:10:56,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 7: [2023-05-10 10:10:56,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 7: [2023-05-10 10:10:56,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 8: [2023-05-10 10:10:56,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 27: [2023-05-10 10:10:56,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 14: [2023-05-10 10:10:56,864] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 25: [2023-05-10 10:10:56,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 25: [2023-05-10 10:10:56,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 8: [2023-05-10 10:10:56,867] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 14: [2023-05-10 10:10:56,866] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 14: [2023-05-10 10:10:56,866] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 14: [2023-05-10 10:10:56,867] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 25: [2023-05-10 10:10:56,867] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 25: [2023-05-10 10:10:56,867] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 25: [2023-05-10 10:10:56,867] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 12: [2023-05-10 10:10:56,867] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 12: [2023-05-10 10:10:56,867] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 14: [2023-05-10 10:10:56,867] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 25: [2023-05-10 10:10:56,867] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 25: [2023-05-10 10:10:56,867] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 25: [2023-05-10 10:10:56,867] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 14: [2023-05-10 10:10:56,868] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 12: [2023-05-10 10:10:56,868] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 14: [2023-05-10 10:10:56,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 14: [2023-05-10 10:10:56,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 7: [2023-05-10 10:10:56,870] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 0: [2023-05-10 10:10:56,870] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 0: [2023-05-10 10:10:56,870] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 0: [2023-05-10 10:10:56,870] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 0: [2023-05-10 10:10:56,870] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 0: [2023-05-10 10:10:56,870] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 0: [2023-05-10 10:10:56,870] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 0: [2023-05-10 10:10:56,870] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 0: [2023-05-10 10:10:56,870] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 17: [2023-05-10 10:10:56,871] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 25: [2023-05-10 10:10:56,871] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 25: [2023-05-10 10:10:56,871] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 25: [2023-05-10 10:10:56,872] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 7: [2023-05-10 10:10:56,873] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 7: [2023-05-10 10:10:56,873] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 27: [2023-05-10 10:10:56,874] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 27: [2023-05-10 10:10:56,874] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 0: [2023-05-10 10:10:56,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 0: [2023-05-10 10:10:56,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 25: [2023-05-10 10:10:56,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 25: [2023-05-10 10:10:56,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 25: [2023-05-10 10:10:56,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 25: [2023-05-10 10:10:56,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 11: [2023-05-10 10:10:56,874] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 11: [2023-05-10 10:10:56,874] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 0: [2023-05-10 10:10:56,875] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 25: [2023-05-10 10:10:56,875] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 0: [2023-05-10 10:10:56,875] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 15: [2023-05-10 10:10:56,876] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 12: [2023-05-10 10:10:56,876] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 0: [2023-05-10 10:10:56,876] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 0: [2023-05-10 10:10:56,876] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 21: [2023-05-10 10:10:56,876] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 0: [2023-05-10 10:10:56,877] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 8: [2023-05-10 10:10:56,877] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 0: [2023-05-10 10:10:56,877] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 8: [2023-05-10 10:10:56,877] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 8: [2023-05-10 10:10:56,877] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 8: [2023-05-10 10:10:56,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 8: [2023-05-10 10:10:56,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 8: [2023-05-10 10:10:56,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 7: [2023-05-10 10:10:56,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 7: [2023-05-10 10:10:56,881] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 22: [2023-05-10 10:10:56,881] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 15: [2023-05-10 10:10:56,881] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 22: [2023-05-10 10:10:56,882] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 12: [2023-05-10 10:10:56,882] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 12: [2023-05-10 10:10:56,883] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 12: [2023-05-10 10:10:56,883] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 29: [2023-05-10 10:10:56,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 29: [2023-05-10 10:10:56,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 27: [2023-05-10 10:10:56,885] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 23: [2023-05-10 10:10:56,885] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 23: [2023-05-10 10:10:56,885] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 7: [2023-05-10 10:10:56,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 23: [2023-05-10 10:10:56,887] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 27: [2023-05-10 10:10:56,887] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 27: [2023-05-10 10:10:56,887] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 27: [2023-05-10 10:10:56,887] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 27: [2023-05-10 10:10:56,887] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 23: [2023-05-10 10:10:56,887] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 27: [2023-05-10 10:10:56,887] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 23: [2023-05-10 10:10:56,887] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 23: [2023-05-10 10:10:56,887] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 23: [2023-05-10 10:10:56,887] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 23: [2023-05-10 10:10:56,887] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 1: [2023-05-10 10:10:56,887] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 1: [2023-05-10 10:10:56,887] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 1: [2023-05-10 10:10:56,888] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 1: [2023-05-10 10:10:56,888] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 1: [2023-05-10 10:10:56,888] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 1: [2023-05-10 10:10:56,888] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 1: [2023-05-10 10:10:56,888] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 1: [2023-05-10 10:10:56,888] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 11: [2023-05-10 10:10:56,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 4: [2023-05-10 10:10:56,888] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 4: [2023-05-10 10:10:56,888] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 4: [2023-05-10 10:10:56,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 4: [2023-05-10 10:10:56,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 15: [2023-05-10 10:10:56,890] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 4: [2023-05-10 10:10:56,890] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 15: [2023-05-10 10:10:56,890] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 4: [2023-05-10 10:10:56,890] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 4: [2023-05-10 10:10:56,890] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 15: [2023-05-10 10:10:56,890] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 4: [2023-05-10 10:10:56,890] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 8: [2023-05-10 10:10:56,890] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 7: [2023-05-10 10:10:56,890] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 7: [2023-05-10 10:10:56,891] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 23: [2023-05-10 10:10:56,891] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 23: [2023-05-10 10:10:56,891] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 17: [2023-05-10 10:10:56,891] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 23: [2023-05-10 10:10:56,892] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 21: [2023-05-10 10:10:56,893] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 15: [2023-05-10 10:10:56,893] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 1: [2023-05-10 10:10:56,893] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 1: [2023-05-10 10:10:56,894] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 21: [2023-05-10 10:10:56,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 4: [2023-05-10 10:10:56,894] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 4: [2023-05-10 10:10:56,894] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 4: [2023-05-10 10:10:56,894] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 1: [2023-05-10 10:10:56,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 1: [2023-05-10 10:10:56,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 23: [2023-05-10 10:10:56,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 8: [2023-05-10 10:10:56,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 21: [2023-05-10 10:10:56,895] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 1: [2023-05-10 10:10:56,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 4: [2023-05-10 10:10:56,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 23: [2023-05-10 10:10:56,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 23: [2023-05-10 10:10:56,896] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 15: [2023-05-10 10:10:56,896] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 27: [2023-05-10 10:10:56,896] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 23: [2023-05-10 10:10:56,896] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 1: [2023-05-10 10:10:56,896] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 7: [2023-05-10 10:10:56,896] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 23: [2023-05-10 10:10:56,896] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 1: [2023-05-10 10:10:56,896] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 5: [2023-05-10 10:10:56,896] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 11: [2023-05-10 10:10:56,897] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 11: [2023-05-10 10:10:56,897] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 1: [2023-05-10 10:10:56,898] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 27: [2023-05-10 10:10:56,898] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 21: [2023-05-10 10:10:56,898] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 4: [2023-05-10 10:10:56,898] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 4: [2023-05-10 10:10:56,898] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 4: [2023-05-10 10:10:56,898] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 4: [2023-05-10 10:10:56,899] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 7: [2023-05-10 10:10:56,900] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 21: [2023-05-10 10:10:56,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 21: [2023-05-10 10:10:56,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 21: [2023-05-10 10:10:56,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 17: [2023-05-10 10:10:56,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 29: [2023-05-10 10:10:56,901] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 7: [2023-05-10 10:10:56,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 21: [2023-05-10 10:10:56,903] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 22: [2023-05-10 10:10:56,904] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 22: [2023-05-10 10:10:56,904] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 29: [2023-05-10 10:10:56,904] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 11: [2023-05-10 10:10:56,904] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 11: [2023-05-10 10:10:56,904] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 17: [2023-05-10 10:10:56,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 22: [2023-05-10 10:10:56,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 22: [2023-05-10 10:10:56,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 22: [2023-05-10 10:10:56,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 22: [2023-05-10 10:10:56,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 11: [2023-05-10 10:10:56,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 11: [2023-05-10 10:10:56,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 11: [2023-05-10 10:10:56,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 7: [2023-05-10 10:10:56,906] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 29: [2023-05-10 10:10:56,906] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 22: [2023-05-10 10:10:56,906] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 17: [2023-05-10 10:10:56,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 29: [2023-05-10 10:10:56,907] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 22: [2023-05-10 10:10:56,907] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 8: [2023-05-10 10:10:56,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 8: [2023-05-10 10:10:56,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 5: [2023-05-10 10:10:56,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 5: [2023-05-10 10:10:56,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 8: [2023-05-10 10:10:56,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 8: [2023-05-10 10:10:56,909] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 8: [2023-05-10 10:10:56,909] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 14: [2023-05-10 10:10:56,911] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 24: [2023-05-10 10:10:56,912] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 24: [2023-05-10 10:10:56,912] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 24: [2023-05-10 10:10:56,912] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 24: [2023-05-10 10:10:56,912] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 24: [2023-05-10 10:10:56,912] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 24: [2023-05-10 10:10:56,912] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 24: [2023-05-10 10:10:56,912] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 24: [2023-05-10 10:10:56,912] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 10: [2023-05-10 10:10:56,913] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 10: [2023-05-10 10:10:56,913] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 10: [2023-05-10 10:10:56,914] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 10: [2023-05-10 10:10:56,914] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 10: [2023-05-10 10:10:56,914] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 10: [2023-05-10 10:10:56,914] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 19: [2023-05-10 10:10:56,914] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 19: [2023-05-10 10:10:56,914] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 10: [2023-05-10 10:10:56,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 10: [2023-05-10 10:10:56,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 19: [2023-05-10 10:10:56,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 19: [2023-05-10 10:10:56,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 19: [2023-05-10 10:10:56,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 11: [2023-05-10 10:10:56,916] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 19: [2023-05-10 10:10:56,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 19: [2023-05-10 10:10:56,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 24: [2023-05-10 10:10:56,916] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 20: [2023-05-10 10:10:56,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 19: [2023-05-10 10:10:56,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 20: [2023-05-10 10:10:56,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 20: [2023-05-10 10:10:56,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 24: [2023-05-10 10:10:56,916] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 20: [2023-05-10 10:10:56,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 20: [2023-05-10 10:10:56,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 20: [2023-05-10 10:10:56,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 20: [2023-05-10 10:10:56,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 20: [2023-05-10 10:10:56,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 27: [2023-05-10 10:10:56,918] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 27: [2023-05-10 10:10:56,918] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 19: [2023-05-10 10:10:56,918] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 5: [2023-05-10 10:10:56,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 27: [2023-05-10 10:10:56,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 29: [2023-05-10 10:10:56,919] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 29: [2023-05-10 10:10:56,919] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 29: [2023-05-10 10:10:56,919] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 29: [2023-05-10 10:10:56,919] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 20: [2023-05-10 10:10:56,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 30: [2023-05-10 10:10:56,919] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 30: [2023-05-10 10:10:56,919] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 24: [2023-05-10 10:10:56,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 30: [2023-05-10 10:10:56,919] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 30: [2023-05-10 10:10:56,919] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 30: [2023-05-10 10:10:56,919] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 30: [2023-05-10 10:10:56,919] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 30: [2023-05-10 10:10:56,919] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 30: [2023-05-10 10:10:56,920] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 10: [2023-05-10 10:10:56,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 24: [2023-05-10 10:10:56,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 17: [2023-05-10 10:10:56,920] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 17: [2023-05-10 10:10:56,920] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 17: [2023-05-10 10:10:56,920] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 17: [2023-05-10 10:10:56,920] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 19: [2023-05-10 10:10:56,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 24: [2023-05-10 10:10:56,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 24: [2023-05-10 10:10:56,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 24: [2023-05-10 10:10:56,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 10: [2023-05-10 10:10:56,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 19: [2023-05-10 10:10:56,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 24: [2023-05-10 10:10:56,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 10: [2023-05-10 10:10:56,922] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 10: [2023-05-10 10:10:56,922] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 5: [2023-05-10 10:10:56,922] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 5: [2023-05-10 10:10:56,922] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 10: [2023-05-10 10:10:56,923] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 14: [2023-05-10 10:10:56,923] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 20: [2023-05-10 10:10:56,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 10: [2023-05-10 10:10:56,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 27: [2023-05-10 10:10:56,925] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 10: [2023-05-10 10:10:56,925] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 19: [2023-05-10 10:10:56,925] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 5: [2023-05-10 10:10:56,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 5: [2023-05-10 10:10:56,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 21: [2023-05-10 10:10:56,925] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 10: [2023-05-10 10:10:56,925] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 5: [2023-05-10 10:10:56,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 19: [2023-05-10 10:10:56,926] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 9: [2023-05-10 10:10:56,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 9: [2023-05-10 10:10:56,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 19: [2023-05-10 10:10:56,926] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 21: [2023-05-10 10:10:56,926] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 21: [2023-05-10 10:10:56,926] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 27: [2023-05-10 10:10:56,926] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 19: [2023-05-10 10:10:56,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 19: [2023-05-10 10:10:56,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 14: [2023-05-10 10:10:56,927] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 20: [2023-05-10 10:10:56,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 20: [2023-05-10 10:10:56,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 30: [2023-05-10 10:10:56,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 29: [2023-05-10 10:10:56,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 20: [2023-05-10 10:10:56,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 25: [2023-05-10 10:10:56,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 25: [2023-05-10 10:10:56,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 25: [2023-05-10 10:10:56,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 20: [2023-05-10 10:10:56,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 17: [2023-05-10 10:10:56,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 20: [2023-05-10 10:10:56,929] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 20: [2023-05-10 10:10:56,929] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 21: [2023-05-10 10:10:56,929] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 30: [2023-05-10 10:10:56,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 25: [2023-05-10 10:10:56,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 9: [2023-05-10 10:10:56,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 9: [2023-05-10 10:10:56,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 9: [2023-05-10 10:10:56,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 9: [2023-05-10 10:10:56,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 9: [2023-05-10 10:10:56,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 9: [2023-05-10 10:10:56,931] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 26: [2023-05-10 10:10:56,931] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 26: [2023-05-10 10:10:56,931] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 30: [2023-05-10 10:10:56,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 30: [2023-05-10 10:10:56,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 30: [2023-05-10 10:10:56,932] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 9: [2023-05-10 10:10:56,932] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 29: [2023-05-10 10:10:56,932] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 21: [2023-05-10 10:10:56,932] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 9: [2023-05-10 10:10:56,932] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 21: [2023-05-10 10:10:56,933] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 30: [2023-05-10 10:10:56,933] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 21: [2023-05-10 10:10:56,933] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 14: [2023-05-10 10:10:56,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 14: [2023-05-10 10:10:56,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 14: [2023-05-10 10:10:56,933] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 5: [2023-05-10 10:10:56,934] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 5: [2023-05-10 10:10:56,934] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 0: [2023-05-10 10:10:56,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 14: [2023-05-10 10:10:56,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 30: [2023-05-10 10:10:56,934] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 25: [2023-05-10 10:10:56,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 25: [2023-05-10 10:10:56,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 25: [2023-05-10 10:10:56,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 14: [2023-05-10 10:10:56,935] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 3: [2023-05-10 10:10:56,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 3: [2023-05-10 10:10:56,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 22: [2023-05-10 10:10:56,935] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 0: [2023-05-10 10:10:56,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 0: [2023-05-10 10:10:56,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 0: [2023-05-10 10:10:56,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 14: [2023-05-10 10:10:56,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 30: [2023-05-10 10:10:56,935] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 26: [2023-05-10 10:10:56,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 26: [2023-05-10 10:10:56,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 26: [2023-05-10 10:10:56,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 26: [2023-05-10 10:10:56,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 26: [2023-05-10 10:10:56,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 17: [2023-05-10 10:10:56,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 26: [2023-05-10 10:10:56,936] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 13: [2023-05-10 10:10:56,936] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 13: [2023-05-10 10:10:56,936] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 13: [2023-05-10 10:10:56,936] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 3: [2023-05-10 10:10:56,936] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 13: [2023-05-10 10:10:56,936] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 13: [2023-05-10 10:10:56,936] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 13: [2023-05-10 10:10:56,936] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 13: [2023-05-10 10:10:56,936] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 13: [2023-05-10 10:10:56,936] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 3: [2023-05-10 10:10:56,936] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 26: [2023-05-10 10:10:56,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 26: [2023-05-10 10:10:56,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 3: [2023-05-10 10:10:56,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 3: [2023-05-10 10:10:56,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 3: [2023-05-10 10:10:56,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 3: [2023-05-10 10:10:56,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 9: [2023-05-10 10:10:56,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 25: [2023-05-10 10:10:56,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 9: [2023-05-10 10:10:56,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 9: [2023-05-10 10:10:56,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 22: [2023-05-10 10:10:56,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 9: [2023-05-10 10:10:56,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 11: [2023-05-10 10:10:56,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 9: [2023-05-10 10:10:56,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 17: [2023-05-10 10:10:56,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 11: [2023-05-10 10:10:56,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 3: [2023-05-10 10:10:56,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 3: [2023-05-10 10:10:56,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 9: [2023-05-10 10:10:56,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 11: [2023-05-10 10:10:56,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 11: [2023-05-10 10:10:56,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 23: [2023-05-10 10:10:56,940] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 3: [2023-05-10 10:10:56,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 11: [2023-05-10 10:10:56,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 22: [2023-05-10 10:10:56,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 22: [2023-05-10 10:10:56,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 22: [2023-05-10 10:10:56,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 22: [2023-05-10 10:10:56,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 3: [2023-05-10 10:10:56,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 3: [2023-05-10 10:10:56,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 3: [2023-05-10 10:10:56,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 0: [2023-05-10 10:10:56,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 0: [2023-05-10 10:10:56,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 0: [2023-05-10 10:10:56,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 0: [2023-05-10 10:10:56,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 26: [2023-05-10 10:10:56,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 26: [2023-05-10 10:10:56,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 3: [2023-05-10 10:10:56,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 13: [2023-05-10 10:10:56,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 26: [2023-05-10 10:10:56,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 3: [2023-05-10 10:10:56,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 26: [2023-05-10 10:10:56,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 13: [2023-05-10 10:10:56,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 1: [2023-05-10 10:10:56,947] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 1: [2023-05-10 10:10:56,947] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 23: [2023-05-10 10:10:56,947] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 23: [2023-05-10 10:10:56,947] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 5: [2023-05-10 10:10:56,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 26: [2023-05-10 10:10:56,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 4: [2023-05-10 10:10:56,948] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 13: [2023-05-10 10:10:56,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 26: [2023-05-10 10:10:56,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 29: [2023-05-10 10:10:56,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 29: [2023-05-10 10:10:56,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 5: [2023-05-10 10:10:56,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 13: [2023-05-10 10:10:56,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 13: [2023-05-10 10:10:56,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 14: [2023-05-10 10:10:56,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 1: [2023-05-10 10:10:56,951] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 29: [2023-05-10 10:10:56,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 13: [2023-05-10 10:10:56,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 13: [2023-05-10 10:10:56,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 4: [2023-05-10 10:10:56,951] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 29: [2023-05-10 10:10:56,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 13: [2023-05-10 10:10:56,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 5: [2023-05-10 10:10:56,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 17: [2023-05-10 10:10:56,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 17: [2023-05-10 10:10:56,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 25: [2023-05-10 10:10:56,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 17: [2023-05-10 10:10:56,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 17: [2023-05-10 10:10:56,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 4: [2023-05-10 10:10:56,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 4: [2023-05-10 10:10:56,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 4: [2023-05-10 10:10:56,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 25: [2023-05-10 10:10:56,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 5: [2023-05-10 10:10:56,955] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 5: [2023-05-10 10:10:56,955] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 25: [2023-05-10 10:10:56,956] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 23: [2023-05-10 10:10:56,958] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 14: [2023-05-10 10:10:56,958] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 4: [2023-05-10 10:10:56,959] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 4: [2023-05-10 10:10:56,959] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 4: [2023-05-10 10:10:56,959] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 0: [2023-05-10 10:10:56,960] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 0: [2023-05-10 10:10:56,960] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 1: [2023-05-10 10:10:56,961] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 1: [2023-05-10 10:10:56,961] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 1: [2023-05-10 10:10:56,961] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 1: [2023-05-10 10:10:56,961] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 1: [2023-05-10 10:10:56,961] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 14: [2023-05-10 10:10:56,961] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 20: [2023-05-10 10:10:56,961] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 0: [2023-05-10 10:10:56,962] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 14: [2023-05-10 10:10:56,962] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 23: [2023-05-10 10:10:56,962] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 0: [2023-05-10 10:10:56,963] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 25: [2023-05-10 10:10:56,963] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 23: [2023-05-10 10:10:56,964] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 23: [2023-05-10 10:10:56,964] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 23: [2023-05-10 10:10:56,964] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 23: [2023-05-10 10:10:56,964] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 25: [2023-05-10 10:10:56,964] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 14: [2023-05-10 10:10:56,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 19: [2023-05-10 10:10:56,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 25: [2023-05-10 10:10:56,967] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 25: [2023-05-10 10:10:56,967] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 24: [2023-05-10 10:10:56,968] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 14: [2023-05-10 10:10:56,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 24: [2023-05-10 10:10:56,968] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 14: [2023-05-10 10:10:56,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 25: [2023-05-10 10:10:56,969] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 1: [2023-05-10 10:10:56,971] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 1: [2023-05-10 10:10:56,971] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 23: [2023-05-10 10:10:56,971] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 23: [2023-05-10 10:10:56,971] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 4: [2023-05-10 10:10:56,974] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 0: [2023-05-10 10:10:56,975] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 10: [2023-05-10 10:10:56,975] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 10: [2023-05-10 10:10:56,975] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 0: [2023-05-10 10:10:56,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 0: [2023-05-10 10:10:56,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 0: [2023-05-10 10:10:56,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 10: [2023-05-10 10:10:56,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 10: [2023-05-10 10:10:56,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 1: [2023-05-10 10:10:56,978] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 4: [2023-05-10 10:10:56,979] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 19: [2023-05-10 10:10:56,980] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 19: [2023-05-10 10:10:56,980] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 20: [2023-05-10 10:10:56,980] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 4: [2023-05-10 10:10:56,983] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 4: [2023-05-10 10:10:56,983] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 4: [2023-05-10 10:10:56,985] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 23: [2023-05-10 10:10:56,985] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 24: [2023-05-10 10:10:56,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 20: [2023-05-10 10:10:56,985] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 10: [2023-05-10 10:10:56,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 4: [2023-05-10 10:10:56,986] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 1: [2023-05-10 10:10:56,988] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 1: [2023-05-10 10:10:56,989] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 4: [2023-05-10 10:10:56,989] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 26: [2023-05-10 10:10:56,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 26: [2023-05-10 10:10:56,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 19: [2023-05-10 10:10:56,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 2: [2023-05-10 10:10:56,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 2: [2023-05-10 10:10:56,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 24: [2023-05-10 10:10:56,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 19: [2023-05-10 10:10:56,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 19: [2023-05-10 10:10:56,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 24: [2023-05-10 10:10:56,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 19: [2023-05-10 10:10:56,992] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 19: [2023-05-10 10:10:56,992] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 19: [2023-05-10 10:10:56,992] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 9: [2023-05-10 10:10:56,992] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 9: [2023-05-10 10:10:56,992] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 1: [2023-05-10 10:10:56,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 4: [2023-05-10 10:10:56,992] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 30: [2023-05-10 10:10:56,992] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 1: [2023-05-10 10:10:56,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 1: [2023-05-10 10:10:56,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 2: [2023-05-10 10:10:56,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 2: [2023-05-10 10:10:56,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 2: [2023-05-10 10:10:56,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 2: [2023-05-10 10:10:56,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 2: [2023-05-10 10:10:56,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 2: [2023-05-10 10:10:56,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 3: [2023-05-10 10:10:56,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 3: [2023-05-10 10:10:56,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 3: [2023-05-10 10:10:56,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 2: [2023-05-10 10:10:56,995] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 2: [2023-05-10 10:10:56,995] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 23: [2023-05-10 10:10:56,996] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 23: [2023-05-10 10:10:56,996] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 9: [2023-05-10 10:10:56,995] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 9: [2023-05-10 10:10:56,996] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 9: [2023-05-10 10:10:56,996] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 9: [2023-05-10 10:10:56,996] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 23: [2023-05-10 10:10:56,996] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 23: [2023-05-10 10:10:56,996] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 24: [2023-05-10 10:10:56,998] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 24: [2023-05-10 10:10:56,998] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 24: [2023-05-10 10:10:56,998] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 24: [2023-05-10 10:10:56,998] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 24: [2023-05-10 10:10:56,998] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 10: [2023-05-10 10:10:56,998] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 9: [2023-05-10 10:10:56,999] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 10: [2023-05-10 10:10:57,001] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 10: [2023-05-10 10:10:57,001] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 10: [2023-05-10 10:10:57,002] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 9: [2023-05-10 10:10:57,002] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 10: [2023-05-10 10:10:57,004] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 10: [2023-05-10 10:10:57,004] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 20: [2023-05-10 10:10:57,004] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 2: [2023-05-10 10:10:57,004] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 2: [2023-05-10 10:10:57,004] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 2: [2023-05-10 10:10:57,005] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 10: [2023-05-10 10:10:57,005] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 2: [2023-05-10 10:10:57,005] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 2: [2023-05-10 10:10:57,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 3: [2023-05-10 10:10:57,006] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 2: [2023-05-10 10:10:57,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 24: [2023-05-10 10:10:57,007] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 19: [2023-05-10 10:10:57,007] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 19: [2023-05-10 10:10:57,007] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 20: [2023-05-10 10:10:57,011] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 3: [2023-05-10 10:10:57,012] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 3: [2023-05-10 10:10:57,012] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 3: [2023-05-10 10:10:57,012] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 3: [2023-05-10 10:10:57,012] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 20: [2023-05-10 10:10:57,012] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 20: [2023-05-10 10:10:57,012] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 20: [2023-05-10 10:10:57,012] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 20: [2023-05-10 10:10:57,012] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 20: [2023-05-10 10:10:57,012] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 26: [2023-05-10 10:10:57,013] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 26: [2023-05-10 10:10:57,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 13: [2023-05-10 10:10:57,015] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 30: [2023-05-10 10:10:57,015] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 10: [2023-05-10 10:10:57,015] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 30: [2023-05-10 10:10:57,015] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 30: [2023-05-10 10:10:57,015] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 26: [2023-05-10 10:10:57,015] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 26: [2023-05-10 10:10:57,015] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 26: [2023-05-10 10:10:57,015] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 3: [2023-05-10 10:10:57,017] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 26: [2023-05-10 10:10:57,017] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 26: [2023-05-10 10:10:57,017] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 26: [2023-05-10 10:10:57,017] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 30: [2023-05-10 10:10:57,018] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 3: [2023-05-10 10:10:57,019] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 3: [2023-05-10 10:10:57,019] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 9: [2023-05-10 10:10:57,019] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 9: [2023-05-10 10:10:57,019] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 30: [2023-05-10 10:10:57,020] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 19: [2023-05-10 10:10:57,023] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 19: [2023-05-10 10:10:57,023] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 19: [2023-05-10 10:10:57,024] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 9: [2023-05-10 10:10:57,024] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 19: [2023-05-10 10:10:57,026] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 19: [2023-05-10 10:10:57,026] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 9: [2023-05-10 10:10:57,026] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 9: [2023-05-10 10:10:57,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 9: [2023-05-10 10:10:57,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 9: [2023-05-10 10:10:57,029] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 9: [2023-05-10 10:10:57,029] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 13: [2023-05-10 10:10:57,029] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 10: [2023-05-10 10:10:57,030] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 24: [2023-05-10 10:10:57,030] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 10: [2023-05-10 10:10:57,030] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 24: [2023-05-10 10:10:57,030] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 10: [2023-05-10 10:10:57,031] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 13: [2023-05-10 10:10:57,031] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 20: [2023-05-10 10:10:57,032] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 3: [2023-05-10 10:10:57,032] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 24: [2023-05-10 10:10:57,033] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 24: [2023-05-10 10:10:57,034] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 24: [2023-05-10 10:10:57,034] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 3: [2023-05-10 10:10:57,040] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 30: [2023-05-10 10:10:57,041] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 30: [2023-05-10 10:10:57,041] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 30: [2023-05-10 10:10:57,041] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 3: [2023-05-10 10:10:57,043] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 13: [2023-05-10 10:10:57,043] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 3: [2023-05-10 10:10:57,044] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 26: [2023-05-10 10:10:57,043] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 13: [2023-05-10 10:10:57,044] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 13: [2023-05-10 10:10:57,044] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 13: [2023-05-10 10:10:57,044] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 13: [2023-05-10 10:10:57,044] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 3: [2023-05-10 10:10:57,044] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 13: [2023-05-10 10:10:57,044] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 26: [2023-05-10 10:10:57,044] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 2: [2023-05-10 10:10:57,046] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 2: [2023-05-10 10:10:57,046] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 30: [2023-05-10 10:10:57,046] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 20: [2023-05-10 10:10:57,046] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 26: [2023-05-10 10:10:57,049] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 20: [2023-05-10 10:10:57,049] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 20: [2023-05-10 10:10:57,049] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 20: [2023-05-10 10:10:57,049] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 20: [2023-05-10 10:10:57,050] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 26: [2023-05-10 10:10:57,050] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 26: [2023-05-10 10:10:57,054] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 30: [2023-05-10 10:10:57,054] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 26: [2023-05-10 10:10:57,055] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 30: [2023-05-10 10:10:57,055] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 30: [2023-05-10 10:10:57,055] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 31: [2023-05-10 10:10:57,060] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 31: [2023-05-10 10:10:57,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 31: [2023-05-10 10:10:57,063] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 13: [2023-05-10 10:10:57,063] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 31: [2023-05-10 10:10:57,063] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 31: [2023-05-10 10:10:57,063] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 31: [2023-05-10 10:10:57,063] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 31: [2023-05-10 10:10:57,063] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 31: [2023-05-10 10:10:57,063] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 13: [2023-05-10 10:10:57,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 31: [2023-05-10 10:10:57,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 31: [2023-05-10 10:10:57,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 2: [2023-05-10 10:10:57,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 2: [2023-05-10 10:10:57,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 30: [2023-05-10 10:10:57,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 31: [2023-05-10 10:10:57,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 30: [2023-05-10 10:10:57,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 30: [2023-05-10 10:10:57,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 31: [2023-05-10 10:10:57,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 31: [2023-05-10 10:10:57,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 31: [2023-05-10 10:10:57,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 31: [2023-05-10 10:10:57,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 31: [2023-05-10 10:10:57,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 2: [2023-05-10 10:10:57,074] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 2: [2023-05-10 10:10:57,074] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 2: [2023-05-10 10:10:57,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 2: [2023-05-10 10:10:57,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 2: [2023-05-10 10:10:57,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 2: [2023-05-10 10:10:57,077] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 13: [2023-05-10 10:10:57,078] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 13: [2023-05-10 10:10:57,078] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 13: [2023-05-10 10:10:57,078] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 13: [2023-05-10 10:10:57,079] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 13: [2023-05-10 10:10:57,081] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 28: [2023-05-10 10:10:57,083] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 28: [2023-05-10 10:10:57,083] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 28: [2023-05-10 10:10:57,083] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 28: [2023-05-10 10:10:57,083] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 28: [2023-05-10 10:10:57,083] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 28: [2023-05-10 10:10:57,083] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 28: [2023-05-10 10:10:57,083] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 28: [2023-05-10 10:10:57,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 28: [2023-05-10 10:10:57,087] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 28: [2023-05-10 10:10:57,087] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 28: [2023-05-10 10:10:57,092] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 28: [2023-05-10 10:10:57,093] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 28: [2023-05-10 10:10:57,094] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 28: [2023-05-10 10:10:57,094] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 28: [2023-05-10 10:10:57,095] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 28: [2023-05-10 10:10:57,095] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt... 2: [2023-05-10 10:10:57,106] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 2: [2023-05-10 10:10:57,110] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 2: [2023-05-10 10:10:57,113] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 2: [2023-05-10 10:10:57,113] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 2: [2023-05-10 10:10:57,113] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 2: [2023-05-10 10:10:57,113] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 6: [2023-05-10 10:10:57,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 6: [2023-05-10 10:10:57,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 31: [2023-05-10 10:10:57,118] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 31: [2023-05-10 10:10:57,118] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 6: [2023-05-10 10:10:57,119] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 6: [2023-05-10 10:10:57,119] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 6: [2023-05-10 10:10:57,119] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 6: [2023-05-10 10:10:57,119] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 6: [2023-05-10 10:10:57,119] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 6: [2023-05-10 10:10:57,119] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 6: [2023-05-10 10:10:57,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 6: [2023-05-10 10:10:57,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 6: [2023-05-10 10:10:57,128] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 6: [2023-05-10 10:10:57,130] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 6: [2023-05-10 10:10:57,130] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 6: [2023-05-10 10:10:57,131] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 6: [2023-05-10 10:10:57,131] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 6: [2023-05-10 10:10:57,132] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 28: [2023-05-10 10:10:57,139] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 28: [2023-05-10 10:10:57,139] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 31: [2023-05-10 10:10:57,140] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 31: [2023-05-10 10:10:57,142] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 31: [2023-05-10 10:10:57,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 31: [2023-05-10 10:10:57,143] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 31: [2023-05-10 10:10:57,146] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 31: [2023-05-10 10:10:57,146] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 31: [2023-05-10 10:10:57,146] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 31: [2023-05-10 10:10:57,146] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 28: [2023-05-10 10:10:57,159] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 28: [2023-05-10 10:10:57,160] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 28: [2023-05-10 10:10:57,160] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 28: [2023-05-10 10:10:57,160] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 28: [2023-05-10 10:10:57,160] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 28: [2023-05-10 10:10:57,167] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 28: [2023-05-10 10:10:57,167] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 28: [2023-05-10 10:10:57,167] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_23-model_00-model_states.pt. 6: [2023-05-10 10:10:57,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 6: [2023-05-10 10:10:57,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 31: [2023-05-10 10:10:57,171] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 31: [2023-05-10 10:10:57,179] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 31: [2023-05-10 10:10:57,181] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 31: [2023-05-10 10:10:57,182] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 31: [2023-05-10 10:10:57,182] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 31: [2023-05-10 10:10:57,184] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 28: [2023-05-10 10:10:57,188] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 28: [2023-05-10 10:10:57,189] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 28: [2023-05-10 10:10:57,190] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 6: [2023-05-10 10:10:57,191] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 6: [2023-05-10 10:10:57,192] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 28: [2023-05-10 10:10:57,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 6: [2023-05-10 10:10:57,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 28: [2023-05-10 10:10:57,198] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 16: [2023-05-10 10:10:57,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 16: [2023-05-10 10:10:57,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 6: [2023-05-10 10:10:57,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 16: [2023-05-10 10:10:57,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 16: [2023-05-10 10:10:57,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 16: [2023-05-10 10:10:57,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 16: [2023-05-10 10:10:57,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 16: [2023-05-10 10:10:57,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 16: [2023-05-10 10:10:57,201] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 28: [2023-05-10 10:10:57,201] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 6: [2023-05-10 10:10:57,202] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 16: [2023-05-10 10:10:57,205] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 6: [2023-05-10 10:10:57,205] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 6: [2023-05-10 10:10:57,205] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 6: [2023-05-10 10:10:57,205] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 16: [2023-05-10 10:10:57,206] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 16: [2023-05-10 10:10:57,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 16: [2023-05-10 10:10:57,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 16: [2023-05-10 10:10:57,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 16: [2023-05-10 10:10:57,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 16: [2023-05-10 10:10:57,210] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 16: [2023-05-10 10:10:57,211] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 15: [2023-05-10 10:10:57,219] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 15: [2023-05-10 10:10:57,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 15: [2023-05-10 10:10:57,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 15: [2023-05-10 10:10:57,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 15: [2023-05-10 10:10:57,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 15: [2023-05-10 10:10:57,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 15: [2023-05-10 10:10:57,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 15: [2023-05-10 10:10:57,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 6: [2023-05-10 10:10:57,224] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 15: [2023-05-10 10:10:57,223] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 12: [2023-05-10 10:10:57,225] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 12: [2023-05-10 10:10:57,225] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 12: [2023-05-10 10:10:57,225] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 12: [2023-05-10 10:10:57,226] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 12: [2023-05-10 10:10:57,226] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 12: [2023-05-10 10:10:57,226] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 12: [2023-05-10 10:10:57,226] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 12: [2023-05-10 10:10:57,227] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 6: [2023-05-10 10:10:57,228] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 12: [2023-05-10 10:10:57,230] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 15: [2023-05-10 10:10:57,231] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 15: [2023-05-10 10:10:57,231] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 15: [2023-05-10 10:10:57,232] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 15: [2023-05-10 10:10:57,232] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 15: [2023-05-10 10:10:57,233] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 15: [2023-05-10 10:10:57,233] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 15: [2023-05-10 10:10:57,234] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 6: [2023-05-10 10:10:57,235] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 6: [2023-05-10 10:10:57,235] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 12: [2023-05-10 10:10:57,235] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 12: [2023-05-10 10:10:57,236] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 6: [2023-05-10 10:10:57,236] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 6: [2023-05-10 10:10:57,236] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 12: [2023-05-10 10:10:57,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 21: [2023-05-10 10:10:57,236] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 21: [2023-05-10 10:10:57,236] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 21: [2023-05-10 10:10:57,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 21: [2023-05-10 10:10:57,238] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 12: [2023-05-10 10:10:57,238] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 21: [2023-05-10 10:10:57,238] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 21: [2023-05-10 10:10:57,238] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 21: [2023-05-10 10:10:57,238] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 12: [2023-05-10 10:10:57,238] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 21: [2023-05-10 10:10:57,238] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 12: [2023-05-10 10:10:57,238] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 12: [2023-05-10 10:10:57,239] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 18: [2023-05-10 10:10:57,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 18: [2023-05-10 10:10:57,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 18: [2023-05-10 10:10:57,242] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 18: [2023-05-10 10:10:57,242] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 18: [2023-05-10 10:10:57,242] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 18: [2023-05-10 10:10:57,242] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 18: [2023-05-10 10:10:57,242] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 18: [2023-05-10 10:10:57,243] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 21: [2023-05-10 10:10:57,243] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 21: [2023-05-10 10:10:57,244] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 21: [2023-05-10 10:10:57,244] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 21: [2023-05-10 10:10:57,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 21: [2023-05-10 10:10:57,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 21: [2023-05-10 10:10:57,246] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 21: [2023-05-10 10:10:57,246] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 18: [2023-05-10 10:10:57,246] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 21: [2023-05-10 10:10:57,247] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 18: [2023-05-10 10:10:57,248] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 18: [2023-05-10 10:10:57,248] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 18: [2023-05-10 10:10:57,248] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 18: [2023-05-10 10:10:57,249] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 18: [2023-05-10 10:10:57,249] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 18: [2023-05-10 10:10:57,249] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 18: [2023-05-10 10:10:57,250] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 16: [2023-05-10 10:10:57,258] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 16: [2023-05-10 10:10:57,260] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 29: [2023-05-10 10:10:57,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 29: [2023-05-10 10:10:57,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 29: [2023-05-10 10:10:57,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 29: [2023-05-10 10:10:57,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 29: [2023-05-10 10:10:57,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 29: [2023-05-10 10:10:57,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 29: [2023-05-10 10:10:57,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 29: [2023-05-10 10:10:57,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 29: [2023-05-10 10:10:57,266] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 29: [2023-05-10 10:10:57,266] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 29: [2023-05-10 10:10:57,269] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 29: [2023-05-10 10:10:57,269] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 29: [2023-05-10 10:10:57,269] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 29: [2023-05-10 10:10:57,271] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 29: [2023-05-10 10:10:57,271] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 29: [2023-05-10 10:10:57,271] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 15: [2023-05-10 10:10:57,273] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 16: [2023-05-10 10:10:57,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 16: [2023-05-10 10:10:57,280] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 14: [2023-05-10 10:10:57,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 14: [2023-05-10 10:10:57,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 12: [2023-05-10 10:10:57,282] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 16: [2023-05-10 10:10:57,282] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 16: [2023-05-10 10:10:57,282] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 16: [2023-05-10 10:10:57,282] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 16: [2023-05-10 10:10:57,282] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 16: [2023-05-10 10:10:57,282] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 14: [2023-05-10 10:10:57,283] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 14: [2023-05-10 10:10:57,283] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 14: [2023-05-10 10:10:57,283] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 14: [2023-05-10 10:10:57,283] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 14: [2023-05-10 10:10:57,283] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 14: [2023-05-10 10:10:57,283] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 14: [2023-05-10 10:10:57,287] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 14: [2023-05-10 10:10:57,287] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 14: [2023-05-10 10:10:57,288] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 16: [2023-05-10 10:10:57,288] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 14: [2023-05-10 10:10:57,288] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 14: [2023-05-10 10:10:57,288] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 14: [2023-05-10 10:10:57,290] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 14: [2023-05-10 10:10:57,291] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 14: [2023-05-10 10:10:57,292] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 15: [2023-05-10 10:10:57,293] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 21: [2023-05-10 10:10:57,299] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 21: [2023-05-10 10:10:57,299] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 18: [2023-05-10 10:10:57,301] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 21: [2023-05-10 10:10:57,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 12: [2023-05-10 10:10:57,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 15: [2023-05-10 10:10:57,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 15: [2023-05-10 10:10:57,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 18: [2023-05-10 10:10:57,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 12: [2023-05-10 10:10:57,309] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 12: [2023-05-10 10:10:57,309] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 18: [2023-05-10 10:10:57,309] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 18: [2023-05-10 10:10:57,309] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 18: [2023-05-10 10:10:57,309] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 18: [2023-05-10 10:10:57,309] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 21: [2023-05-10 10:10:57,309] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 16: [2023-05-10 10:10:57,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 15: [2023-05-10 10:10:57,310] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 15: [2023-05-10 10:10:57,310] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 15: [2023-05-10 10:10:57,310] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 15: [2023-05-10 10:10:57,310] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 15: [2023-05-10 10:10:57,310] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 16: [2023-05-10 10:10:57,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 18: [2023-05-10 10:10:57,312] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 18: [2023-05-10 10:10:57,312] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 16: [2023-05-10 10:10:57,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 22: [2023-05-10 10:10:57,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 22: [2023-05-10 10:10:57,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 22: [2023-05-10 10:10:57,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 22: [2023-05-10 10:10:57,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 22: [2023-05-10 10:10:57,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 22: [2023-05-10 10:10:57,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 22: [2023-05-10 10:10:57,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 22: [2023-05-10 10:10:57,315] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 21: [2023-05-10 10:10:57,315] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 21: [2023-05-10 10:10:57,315] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 21: [2023-05-10 10:10:57,315] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 21: [2023-05-10 10:10:57,315] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 17: [2023-05-10 10:10:57,316] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 17: [2023-05-10 10:10:57,317] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 17: [2023-05-10 10:10:57,317] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 17: [2023-05-10 10:10:57,317] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 17: [2023-05-10 10:10:57,317] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 17: [2023-05-10 10:10:57,317] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 17: [2023-05-10 10:10:57,317] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 17: [2023-05-10 10:10:57,317] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 12: [2023-05-10 10:10:57,318] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 12: [2023-05-10 10:10:57,318] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 12: [2023-05-10 10:10:57,318] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 12: [2023-05-10 10:10:57,318] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 12: [2023-05-10 10:10:57,318] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 29: [2023-05-10 10:10:57,318] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 29: [2023-05-10 10:10:57,318] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 16: [2023-05-10 10:10:57,319] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 16: [2023-05-10 10:10:57,319] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 16: [2023-05-10 10:10:57,319] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 17: [2023-05-10 10:10:57,320] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 22: [2023-05-10 10:10:57,320] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 22: [2023-05-10 10:10:57,320] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 21: [2023-05-10 10:10:57,322] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 22: [2023-05-10 10:10:57,323] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 21: [2023-05-10 10:10:57,325] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 22: [2023-05-10 10:10:57,325] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 22: [2023-05-10 10:10:57,326] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 22: [2023-05-10 10:10:57,326] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 22: [2023-05-10 10:10:57,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 22: [2023-05-10 10:10:57,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 17: [2023-05-10 10:10:57,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 17: [2023-05-10 10:10:57,328] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 18: [2023-05-10 10:10:57,328] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 17: [2023-05-10 10:10:57,329] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 17: [2023-05-10 10:10:57,329] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 17: [2023-05-10 10:10:57,329] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 21: [2023-05-10 10:10:57,330] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 17: [2023-05-10 10:10:57,330] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 17: [2023-05-10 10:10:57,331] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 15: [2023-05-10 10:10:57,333] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 29: [2023-05-10 10:10:57,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 29: [2023-05-10 10:10:57,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 29: [2023-05-10 10:10:57,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 15: [2023-05-10 10:10:57,335] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 18: [2023-05-10 10:10:57,336] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 21: [2023-05-10 10:10:57,337] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 18: [2023-05-10 10:10:57,338] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 12: [2023-05-10 10:10:57,339] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 18: [2023-05-10 10:10:57,339] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 18: [2023-05-10 10:10:57,339] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 18: [2023-05-10 10:10:57,339] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 18: [2023-05-10 10:10:57,340] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 29: [2023-05-10 10:10:57,341] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 14: [2023-05-10 10:10:57,341] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 14: [2023-05-10 10:10:57,341] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 12: [2023-05-10 10:10:57,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 18: [2023-05-10 10:10:57,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 29: [2023-05-10 10:10:57,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 29: [2023-05-10 10:10:57,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 15: [2023-05-10 10:10:57,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 14: [2023-05-10 10:10:57,345] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 15: [2023-05-10 10:10:57,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 15: [2023-05-10 10:10:57,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 21: [2023-05-10 10:10:57,347] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 21: [2023-05-10 10:10:57,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 15: [2023-05-10 10:10:57,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 15: [2023-05-10 10:10:57,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 21: [2023-05-10 10:10:57,351] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 21: [2023-05-10 10:10:57,351] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 7: [2023-05-10 10:10:57,352] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 7: [2023-05-10 10:10:57,352] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 12: [2023-05-10 10:10:57,352] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 7: [2023-05-10 10:10:57,353] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 7: [2023-05-10 10:10:57,353] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 7: [2023-05-10 10:10:57,353] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 7: [2023-05-10 10:10:57,354] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 7: [2023-05-10 10:10:57,354] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 7: [2023-05-10 10:10:57,354] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 14: [2023-05-10 10:10:57,354] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 14: [2023-05-10 10:10:57,354] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 14: [2023-05-10 10:10:57,354] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 12: [2023-05-10 10:10:57,354] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 14: [2023-05-10 10:10:57,354] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 14: [2023-05-10 10:10:57,355] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 29: [2023-05-10 10:10:57,356] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 29: [2023-05-10 10:10:57,356] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 12: [2023-05-10 10:10:57,357] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 12: [2023-05-10 10:10:57,357] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 7: [2023-05-10 10:10:57,358] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 7: [2023-05-10 10:10:57,358] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 12: [2023-05-10 10:10:57,359] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 7: [2023-05-10 10:10:57,359] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 7: [2023-05-10 10:10:57,359] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 7: [2023-05-10 10:10:57,360] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 11: [2023-05-10 10:10:57,361] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 11: [2023-05-10 10:10:57,361] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 7: [2023-05-10 10:10:57,362] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 11: [2023-05-10 10:10:57,362] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 11: [2023-05-10 10:10:57,362] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 11: [2023-05-10 10:10:57,362] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 11: [2023-05-10 10:10:57,362] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 11: [2023-05-10 10:10:57,362] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 11: [2023-05-10 10:10:57,362] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 29: [2023-05-10 10:10:57,363] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 29: [2023-05-10 10:10:57,363] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 7: [2023-05-10 10:10:57,363] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 7: [2023-05-10 10:10:57,363] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 14: [2023-05-10 10:10:57,364] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 14: [2023-05-10 10:10:57,364] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 4: [2023-05-10 10:10:57,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 4: [2023-05-10 10:10:57,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 4: [2023-05-10 10:10:57,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 4: [2023-05-10 10:10:57,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 4: [2023-05-10 10:10:57,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 4: [2023-05-10 10:10:57,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 4: [2023-05-10 10:10:57,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 4: [2023-05-10 10:10:57,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 11: [2023-05-10 10:10:57,366] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 11: [2023-05-10 10:10:57,366] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 29: [2023-05-10 10:10:57,367] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 17: [2023-05-10 10:10:57,368] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 4: [2023-05-10 10:10:57,371] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 4: [2023-05-10 10:10:57,371] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 4: [2023-05-10 10:10:57,372] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 4: [2023-05-10 10:10:57,372] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 4: [2023-05-10 10:10:57,372] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 4: [2023-05-10 10:10:57,372] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 11: [2023-05-10 10:10:57,372] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 11: [2023-05-10 10:10:57,372] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 4: [2023-05-10 10:10:57,373] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 11: [2023-05-10 10:10:57,373] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 11: [2023-05-10 10:10:57,373] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 11: [2023-05-10 10:10:57,374] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 4: [2023-05-10 10:10:57,374] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 11: [2023-05-10 10:10:57,374] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 22: [2023-05-10 10:10:57,374] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 29: [2023-05-10 10:10:57,375] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 14: [2023-05-10 10:10:57,377] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 24: [2023-05-10 10:10:57,377] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 24: [2023-05-10 10:10:57,377] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 24: [2023-05-10 10:10:57,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 24: [2023-05-10 10:10:57,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 24: [2023-05-10 10:10:57,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 24: [2023-05-10 10:10:57,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 24: [2023-05-10 10:10:57,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 24: [2023-05-10 10:10:57,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 22: [2023-05-10 10:10:57,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 5: [2023-05-10 10:10:57,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 24: [2023-05-10 10:10:57,381] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 5: [2023-05-10 10:10:57,382] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 5: [2023-05-10 10:10:57,382] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 29: [2023-05-10 10:10:57,382] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 24: [2023-05-10 10:10:57,382] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 5: [2023-05-10 10:10:57,382] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 5: [2023-05-10 10:10:57,382] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 5: [2023-05-10 10:10:57,382] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 5: [2023-05-10 10:10:57,382] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 5: [2023-05-10 10:10:57,382] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 29: [2023-05-10 10:10:57,385] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 24: [2023-05-10 10:10:57,385] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 24: [2023-05-10 10:10:57,385] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 24: [2023-05-10 10:10:57,386] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 24: [2023-05-10 10:10:57,386] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 24: [2023-05-10 10:10:57,386] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 24: [2023-05-10 10:10:57,386] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 14: [2023-05-10 10:10:57,387] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 14: [2023-05-10 10:10:57,387] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 5: [2023-05-10 10:10:57,387] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 17: [2023-05-10 10:10:57,388] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 5: [2023-05-10 10:10:57,388] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 5: [2023-05-10 10:10:57,388] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 14: [2023-05-10 10:10:57,389] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 14: [2023-05-10 10:10:57,389] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 14: [2023-05-10 10:10:57,389] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 22: [2023-05-10 10:10:57,389] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 5: [2023-05-10 10:10:57,391] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 5: [2023-05-10 10:10:57,392] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 5: [2023-05-10 10:10:57,392] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 5: [2023-05-10 10:10:57,393] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 5: [2023-05-10 10:10:57,393] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 22: [2023-05-10 10:10:57,397] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 22: [2023-05-10 10:10:57,397] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 22: [2023-05-10 10:10:57,399] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 17: [2023-05-10 10:10:57,400] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 17: [2023-05-10 10:10:57,401] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 22: [2023-05-10 10:10:57,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 22: [2023-05-10 10:10:57,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 22: [2023-05-10 10:10:57,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 25: [2023-05-10 10:10:57,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 25: [2023-05-10 10:10:57,402] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 22: [2023-05-10 10:10:57,403] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 25: [2023-05-10 10:10:57,403] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 25: [2023-05-10 10:10:57,403] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 25: [2023-05-10 10:10:57,403] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 25: [2023-05-10 10:10:57,404] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 25: [2023-05-10 10:10:57,404] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 25: [2023-05-10 10:10:57,404] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 7: [2023-05-10 10:10:57,406] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 25: [2023-05-10 10:10:57,408] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 25: [2023-05-10 10:10:57,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 25: [2023-05-10 10:10:57,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 25: [2023-05-10 10:10:57,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 25: [2023-05-10 10:10:57,410] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 17: [2023-05-10 10:10:57,409] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 17: [2023-05-10 10:10:57,409] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 17: [2023-05-10 10:10:57,409] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 17: [2023-05-10 10:10:57,409] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 25: [2023-05-10 10:10:57,410] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 25: [2023-05-10 10:10:57,411] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 25: [2023-05-10 10:10:57,411] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 22: [2023-05-10 10:10:57,414] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 2: [2023-05-10 10:10:57,415] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 2: [2023-05-10 10:10:57,415] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 2: [2023-05-10 10:10:57,415] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 2: [2023-05-10 10:10:57,415] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 2: [2023-05-10 10:10:57,415] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 2: [2023-05-10 10:10:57,415] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 2: [2023-05-10 10:10:57,415] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 2: [2023-05-10 10:10:57,416] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 11: [2023-05-10 10:10:57,418] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 11: [2023-05-10 10:10:57,419] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 2: [2023-05-10 10:10:57,419] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 2: [2023-05-10 10:10:57,420] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 17: [2023-05-10 10:10:57,420] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 7: [2023-05-10 10:10:57,420] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 7: [2023-05-10 10:10:57,420] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 7: [2023-05-10 10:10:57,420] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 7: [2023-05-10 10:10:57,420] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 2: [2023-05-10 10:10:57,423] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 2: [2023-05-10 10:10:57,424] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 2: [2023-05-10 10:10:57,425] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 7: [2023-05-10 10:10:57,425] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 2: [2023-05-10 10:10:57,426] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 2: [2023-05-10 10:10:57,426] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 7: [2023-05-10 10:10:57,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 2: [2023-05-10 10:10:57,427] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 8: [2023-05-10 10:10:57,427] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 8: [2023-05-10 10:10:57,428] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 8: [2023-05-10 10:10:57,428] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 8: [2023-05-10 10:10:57,428] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 8: [2023-05-10 10:10:57,428] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 8: [2023-05-10 10:10:57,428] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 8: [2023-05-10 10:10:57,428] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 8: [2023-05-10 10:10:57,428] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 7: [2023-05-10 10:10:57,428] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 22: [2023-05-10 10:10:57,429] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 22: [2023-05-10 10:10:57,429] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 27: [2023-05-10 10:10:57,430] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 27: [2023-05-10 10:10:57,430] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 27: [2023-05-10 10:10:57,430] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 27: [2023-05-10 10:10:57,430] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 27: [2023-05-10 10:10:57,430] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 27: [2023-05-10 10:10:57,430] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 27: [2023-05-10 10:10:57,430] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 17: [2023-05-10 10:10:57,430] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 27: [2023-05-10 10:10:57,430] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 22: [2023-05-10 10:10:57,430] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 4: [2023-05-10 10:10:57,430] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 4: [2023-05-10 10:10:57,430] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 4: [2023-05-10 10:10:57,430] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 7: [2023-05-10 10:10:57,430] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 17: [2023-05-10 10:10:57,430] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 8: [2023-05-10 10:10:57,433] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 24: [2023-05-10 10:10:57,433] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 24: [2023-05-10 10:10:57,433] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 8: [2023-05-10 10:10:57,434] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 4: [2023-05-10 10:10:57,434] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 4: [2023-05-10 10:10:57,434] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 4: [2023-05-10 10:10:57,434] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 4: [2023-05-10 10:10:57,434] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 4: [2023-05-10 10:10:57,434] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 8: [2023-05-10 10:10:57,434] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 8: [2023-05-10 10:10:57,434] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 27: [2023-05-10 10:10:57,434] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 8: [2023-05-10 10:10:57,434] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 8: [2023-05-10 10:10:57,435] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 8: [2023-05-10 10:10:57,435] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 27: [2023-05-10 10:10:57,435] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 8: [2023-05-10 10:10:57,435] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 22: [2023-05-10 10:10:57,435] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 22: [2023-05-10 10:10:57,435] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 27: [2023-05-10 10:10:57,436] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 27: [2023-05-10 10:10:57,437] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 27: [2023-05-10 10:10:57,438] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 27: [2023-05-10 10:10:57,438] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 27: [2023-05-10 10:10:57,438] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 27: [2023-05-10 10:10:57,439] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 5: [2023-05-10 10:10:57,439] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 11: [2023-05-10 10:10:57,441] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 11: [2023-05-10 10:10:57,441] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 5: [2023-05-10 10:10:57,442] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 5: [2023-05-10 10:10:57,442] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 11: [2023-05-10 10:10:57,443] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 11: [2023-05-10 10:10:57,443] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 11: [2023-05-10 10:10:57,443] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 11: [2023-05-10 10:10:57,443] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 11: [2023-05-10 10:10:57,443] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 11: [2023-05-10 10:10:57,444] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 17: [2023-05-10 10:10:57,445] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 7: [2023-05-10 10:10:57,445] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 17: [2023-05-10 10:10:57,445] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 7: [2023-05-10 10:10:57,447] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 7: [2023-05-10 10:10:57,448] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 17: [2023-05-10 10:10:57,448] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 7: [2023-05-10 10:10:57,448] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 17: [2023-05-10 10:10:57,448] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 17: [2023-05-10 10:10:57,453] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 7: [2023-05-10 10:10:57,454] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 7: [2023-05-10 10:10:57,454] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 7: [2023-05-10 10:10:57,454] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 24: [2023-05-10 10:10:57,456] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 24: [2023-05-10 10:10:57,457] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 4: [2023-05-10 10:10:57,457] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 4: [2023-05-10 10:10:57,457] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 5: [2023-05-10 10:10:57,457] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 5: [2023-05-10 10:10:57,457] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 24: [2023-05-10 10:10:57,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 24: [2023-05-10 10:10:57,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 24: [2023-05-10 10:10:57,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 24: [2023-05-10 10:10:57,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 24: [2023-05-10 10:10:57,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 24: [2023-05-10 10:10:57,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 5: [2023-05-10 10:10:57,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 5: [2023-05-10 10:10:57,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 5: [2023-05-10 10:10:57,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 0: [2023-05-10 10:10:57,459] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 0: [2023-05-10 10:10:57,459] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 0: [2023-05-10 10:10:57,459] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 4: [2023-05-10 10:10:57,459] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 25: [2023-05-10 10:10:57,459] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 0: [2023-05-10 10:10:57,460] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 0: [2023-05-10 10:10:57,460] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 0: [2023-05-10 10:10:57,460] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 0: [2023-05-10 10:10:57,460] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 0: [2023-05-10 10:10:57,460] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 4: [2023-05-10 10:10:57,460] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 5: [2023-05-10 10:10:57,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 4: [2023-05-10 10:10:57,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 4: [2023-05-10 10:10:57,462] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 4: [2023-05-10 10:10:57,462] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 25: [2023-05-10 10:10:57,462] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 0: [2023-05-10 10:10:57,464] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 0: [2023-05-10 10:10:57,464] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 0: [2023-05-10 10:10:57,465] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 0: [2023-05-10 10:10:57,465] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 4: [2023-05-10 10:10:57,465] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 0: [2023-05-10 10:10:57,466] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 0: [2023-05-10 10:10:57,466] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 5: [2023-05-10 10:10:57,467] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 5: [2023-05-10 10:10:57,467] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 0: [2023-05-10 10:10:57,467] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 0: [2023-05-10 10:10:57,467] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 2: [2023-05-10 10:10:57,471] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 2: [2023-05-10 10:10:57,471] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 13: [2023-05-10 10:10:57,472] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 13: [2023-05-10 10:10:57,472] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 13: [2023-05-10 10:10:57,472] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 13: [2023-05-10 10:10:57,472] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 13: [2023-05-10 10:10:57,472] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 13: [2023-05-10 10:10:57,472] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 13: [2023-05-10 10:10:57,472] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 25: [2023-05-10 10:10:57,472] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 25: [2023-05-10 10:10:57,472] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 25: [2023-05-10 10:10:57,472] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 25: [2023-05-10 10:10:57,472] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 13: [2023-05-10 10:10:57,473] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 25: [2023-05-10 10:10:57,476] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 25: [2023-05-10 10:10:57,476] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 11: [2023-05-10 10:10:57,479] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 11: [2023-05-10 10:10:57,481] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 11: [2023-05-10 10:10:57,481] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 25: [2023-05-10 10:10:57,481] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 11: [2023-05-10 10:10:57,481] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 11: [2023-05-10 10:10:57,481] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 11: [2023-05-10 10:10:57,482] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 30: [2023-05-10 10:10:57,482] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 30: [2023-05-10 10:10:57,482] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 30: [2023-05-10 10:10:57,482] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 30: [2023-05-10 10:10:57,482] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 30: [2023-05-10 10:10:57,482] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 30: [2023-05-10 10:10:57,482] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 30: [2023-05-10 10:10:57,482] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 30: [2023-05-10 10:10:57,482] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 27: [2023-05-10 10:10:57,484] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 13: [2023-05-10 10:10:57,484] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 9: [2023-05-10 10:10:57,484] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 9: [2023-05-10 10:10:57,484] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 10: [2023-05-10 10:10:57,484] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 10: [2023-05-10 10:10:57,484] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 10: [2023-05-10 10:10:57,485] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 13: [2023-05-10 10:10:57,484] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 10: [2023-05-10 10:10:57,485] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 13: [2023-05-10 10:10:57,485] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 10: [2023-05-10 10:10:57,485] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 10: [2023-05-10 10:10:57,485] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 10: [2023-05-10 10:10:57,485] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 10: [2023-05-10 10:10:57,486] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 13: [2023-05-10 10:10:57,486] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 2: [2023-05-10 10:10:57,486] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 13: [2023-05-10 10:10:57,486] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 13: [2023-05-10 10:10:57,486] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 24: [2023-05-10 10:10:57,487] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 13: [2023-05-10 10:10:57,487] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 25: [2023-05-10 10:10:57,487] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 9: [2023-05-10 10:10:57,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 9: [2023-05-10 10:10:57,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 9: [2023-05-10 10:10:57,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 9: [2023-05-10 10:10:57,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 9: [2023-05-10 10:10:57,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 24: [2023-05-10 10:10:57,488] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 9: [2023-05-10 10:10:57,488] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 8: [2023-05-10 10:10:57,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 8: [2023-05-10 10:10:57,488] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 27: [2023-05-10 10:10:57,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 13: [2023-05-10 10:10:57,488] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 8: [2023-05-10 10:10:57,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 2: [2023-05-10 10:10:57,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 10: [2023-05-10 10:10:57,490] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 10: [2023-05-10 10:10:57,490] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 24: [2023-05-10 10:10:57,491] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 5: [2023-05-10 10:10:57,491] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 9: [2023-05-10 10:10:57,491] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 5: [2023-05-10 10:10:57,491] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 5: [2023-05-10 10:10:57,491] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 9: [2023-05-10 10:10:57,491] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 9: [2023-05-10 10:10:57,491] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 10: [2023-05-10 10:10:57,491] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 10: [2023-05-10 10:10:57,492] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 5: [2023-05-10 10:10:57,492] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 2: [2023-05-10 10:10:57,492] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 24: [2023-05-10 10:10:57,492] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 5: [2023-05-10 10:10:57,492] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 2: [2023-05-10 10:10:57,492] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 24: [2023-05-10 10:10:57,492] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 3: [2023-05-10 10:10:57,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 3: [2023-05-10 10:10:57,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 24: [2023-05-10 10:10:57,493] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 3: [2023-05-10 10:10:57,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 10: [2023-05-10 10:10:57,493] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 10: [2023-05-10 10:10:57,494] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 3: [2023-05-10 10:10:57,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 1: [2023-05-10 10:10:57,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 1: [2023-05-10 10:10:57,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 1: [2023-05-10 10:10:57,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 1: [2023-05-10 10:10:57,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 1: [2023-05-10 10:10:57,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 27: [2023-05-10 10:10:57,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 30: [2023-05-10 10:10:57,494] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 1: [2023-05-10 10:10:57,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 1: [2023-05-10 10:10:57,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 3: [2023-05-10 10:10:57,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 3: [2023-05-10 10:10:57,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 3: [2023-05-10 10:10:57,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 30: [2023-05-10 10:10:57,494] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 10: [2023-05-10 10:10:57,494] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 1: [2023-05-10 10:10:57,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 3: [2023-05-10 10:10:57,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 30: [2023-05-10 10:10:57,494] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 30: [2023-05-10 10:10:57,495] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 9: [2023-05-10 10:10:57,495] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 9: [2023-05-10 10:10:57,495] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 9: [2023-05-10 10:10:57,495] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 10: [2023-05-10 10:10:57,495] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 30: [2023-05-10 10:10:57,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 9: [2023-05-10 10:10:57,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 30: [2023-05-10 10:10:57,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 9: [2023-05-10 10:10:57,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 30: [2023-05-10 10:10:57,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 30: [2023-05-10 10:10:57,497] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 3: [2023-05-10 10:10:57,497] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 3: [2023-05-10 10:10:57,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 3: [2023-05-10 10:10:57,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 27: [2023-05-10 10:10:57,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 1: [2023-05-10 10:10:57,499] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 1: [2023-05-10 10:10:57,500] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 1: [2023-05-10 10:10:57,500] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 1: [2023-05-10 10:10:57,500] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 20: [2023-05-10 10:10:57,500] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 20: [2023-05-10 10:10:57,500] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 20: [2023-05-10 10:10:57,501] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 1: [2023-05-10 10:10:57,501] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 20: [2023-05-10 10:10:57,501] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 20: [2023-05-10 10:10:57,501] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 20: [2023-05-10 10:10:57,501] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 20: [2023-05-10 10:10:57,501] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 1: [2023-05-10 10:10:57,501] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 20: [2023-05-10 10:10:57,501] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 1: [2023-05-10 10:10:57,501] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 8: [2023-05-10 10:10:57,501] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 8: [2023-05-10 10:10:57,501] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 8: [2023-05-10 10:10:57,501] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 8: [2023-05-10 10:10:57,502] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 3: [2023-05-10 10:10:57,502] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 1: [2023-05-10 10:10:57,503] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 2: [2023-05-10 10:10:57,503] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 20: [2023-05-10 10:10:57,503] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 2: [2023-05-10 10:10:57,503] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 3: [2023-05-10 10:10:57,504] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 3: [2023-05-10 10:10:57,504] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 25: [2023-05-10 10:10:57,504] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 3: [2023-05-10 10:10:57,504] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 25: [2023-05-10 10:10:57,504] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 25: [2023-05-10 10:10:57,504] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 3: [2023-05-10 10:10:57,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 27: [2023-05-10 10:10:57,506] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 26: [2023-05-10 10:10:57,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 25: [2023-05-10 10:10:57,507] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 25: [2023-05-10 10:10:57,507] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 26: [2023-05-10 10:10:57,508] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 27: [2023-05-10 10:10:57,508] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 27: [2023-05-10 10:10:57,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 27: [2023-05-10 10:10:57,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 27: [2023-05-10 10:10:57,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 25: [2023-05-10 10:10:57,510] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 2: [2023-05-10 10:10:57,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 2: [2023-05-10 10:10:57,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 26: [2023-05-10 10:10:57,511] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 26: [2023-05-10 10:10:57,511] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 26: [2023-05-10 10:10:57,512] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 26: [2023-05-10 10:10:57,512] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 26: [2023-05-10 10:10:57,512] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 27: [2023-05-10 10:10:57,512] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 26: [2023-05-10 10:10:57,512] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 6: [2023-05-10 10:10:57,511] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 6: [2023-05-10 10:10:57,511] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 6: [2023-05-10 10:10:57,512] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 6: [2023-05-10 10:10:57,512] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 6: [2023-05-10 10:10:57,512] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 6: [2023-05-10 10:10:57,512] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 6: [2023-05-10 10:10:57,512] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 6: [2023-05-10 10:10:57,512] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 20: [2023-05-10 10:10:57,512] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 20: [2023-05-10 10:10:57,512] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 20: [2023-05-10 10:10:57,512] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 20: [2023-05-10 10:10:57,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 26: [2023-05-10 10:10:57,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 20: [2023-05-10 10:10:57,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 20: [2023-05-10 10:10:57,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 20: [2023-05-10 10:10:57,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 26: [2023-05-10 10:10:57,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 2: [2023-05-10 10:10:57,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 2: [2023-05-10 10:10:57,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 6: [2023-05-10 10:10:57,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 6: [2023-05-10 10:10:57,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 26: [2023-05-10 10:10:57,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 0: [2023-05-10 10:10:57,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 27: [2023-05-10 10:10:57,518] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 26: [2023-05-10 10:10:57,519] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 6: [2023-05-10 10:10:57,519] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 23: [2023-05-10 10:10:57,520] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 23: [2023-05-10 10:10:57,520] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 26: [2023-05-10 10:10:57,521] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 6: [2023-05-10 10:10:57,521] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 26: [2023-05-10 10:10:57,522] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 6: [2023-05-10 10:10:57,522] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 26: [2023-05-10 10:10:57,522] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 6: [2023-05-10 10:10:57,522] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 26: [2023-05-10 10:10:57,523] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 6: [2023-05-10 10:10:57,523] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 27: [2023-05-10 10:10:57,523] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 23: [2023-05-10 10:10:57,523] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 23: [2023-05-10 10:10:57,523] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 23: [2023-05-10 10:10:57,523] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 23: [2023-05-10 10:10:57,523] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 23: [2023-05-10 10:10:57,523] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 23: [2023-05-10 10:10:57,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 6: [2023-05-10 10:10:57,524] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 0: [2023-05-10 10:10:57,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 23: [2023-05-10 10:10:57,526] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 23: [2023-05-10 10:10:57,526] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 0: [2023-05-10 10:10:57,527] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 0: [2023-05-10 10:10:57,527] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 23: [2023-05-10 10:10:57,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 23: [2023-05-10 10:10:57,531] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 23: [2023-05-10 10:10:57,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 23: [2023-05-10 10:10:57,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 23: [2023-05-10 10:10:57,533] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 2: [2023-05-10 10:10:57,533] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 23: [2023-05-10 10:10:57,533] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 2: [2023-05-10 10:10:57,535] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 27: [2023-05-10 10:10:57,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 27: [2023-05-10 10:10:57,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 0: [2023-05-10 10:10:57,539] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 27: [2023-05-10 10:10:57,539] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 0: [2023-05-10 10:10:57,539] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 0: [2023-05-10 10:10:57,539] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 0: [2023-05-10 10:10:57,539] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 0: [2023-05-10 10:10:57,539] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 2: [2023-05-10 10:10:57,539] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 2: [2023-05-10 10:10:57,540] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 27: [2023-05-10 10:10:57,541] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 8: [2023-05-10 10:10:57,502] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 8: [2023-05-10 10:10:57,512] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 8: [2023-05-10 10:10:57,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 8: [2023-05-10 10:10:57,518] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 8: [2023-05-10 10:10:57,529] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 8: [2023-05-10 10:10:57,531] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 8: [2023-05-10 10:10:57,531] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 8: [2023-05-10 10:10:57,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 8: [2023-05-10 10:10:57,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 15: [2023-05-10 10:10:57,545] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 15: [2023-05-10 10:10:57,545] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 15: [2023-05-10 10:10:57,545] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 15: [2023-05-10 10:10:57,545] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 15: [2023-05-10 10:10:57,545] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 15: [2023-05-10 10:10:57,545] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 15: [2023-05-10 10:10:57,545] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 10: [2023-05-10 10:10:57,545] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 10: [2023-05-10 10:10:57,545] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 15: [2023-05-10 10:10:57,546] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 20: [2023-05-10 10:10:57,545] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 10: [2023-05-10 10:10:57,546] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 9: [2023-05-10 10:10:57,546] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 15: [2023-05-10 10:10:57,549] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 0: [2023-05-10 10:10:57,550] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 10: [2023-05-10 10:10:57,552] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 0: [2023-05-10 10:10:57,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 0: [2023-05-10 10:10:57,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 9: [2023-05-10 10:10:57,552] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 9: [2023-05-10 10:10:57,552] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 19: [2023-05-10 10:10:57,552] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 19: [2023-05-10 10:10:57,552] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 3: [2023-05-10 10:10:57,553] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 3: [2023-05-10 10:10:57,553] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 3: [2023-05-10 10:10:57,553] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 10: [2023-05-10 10:10:57,554] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 1: [2023-05-10 10:10:57,554] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 1: [2023-05-10 10:10:57,554] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 31: [2023-05-10 10:10:57,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 19: [2023-05-10 10:10:57,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 31: [2023-05-10 10:10:57,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 31: [2023-05-10 10:10:57,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 31: [2023-05-10 10:10:57,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 31: [2023-05-10 10:10:57,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 31: [2023-05-10 10:10:57,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 31: [2023-05-10 10:10:57,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 19: [2023-05-10 10:10:57,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 19: [2023-05-10 10:10:57,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 19: [2023-05-10 10:10:57,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 15: [2023-05-10 10:10:57,555] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 19: [2023-05-10 10:10:57,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 19: [2023-05-10 10:10:57,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 9: [2023-05-10 10:10:57,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 9: [2023-05-10 10:10:57,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 9: [2023-05-10 10:10:57,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 9: [2023-05-10 10:10:57,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 31: [2023-05-10 10:10:57,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 15: [2023-05-10 10:10:57,556] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 1: [2023-05-10 10:10:57,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 9: [2023-05-10 10:10:57,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 19: [2023-05-10 10:10:57,557] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 15: [2023-05-10 10:10:57,557] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 31: [2023-05-10 10:10:57,558] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 1: [2023-05-10 10:10:57,558] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 1: [2023-05-10 10:10:57,558] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 15: [2023-05-10 10:10:57,558] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 15: [2023-05-10 10:10:57,558] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 19: [2023-05-10 10:10:57,558] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 3: [2023-05-10 10:10:57,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 15: [2023-05-10 10:10:57,559] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 19: [2023-05-10 10:10:57,559] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 15: [2023-05-10 10:10:57,559] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 31: [2023-05-10 10:10:57,560] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 31: [2023-05-10 10:10:57,561] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 13: [2023-05-10 10:10:57,561] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 13: [2023-05-10 10:10:57,563] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 13: [2023-05-10 10:10:57,563] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 13: [2023-05-10 10:10:57,563] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 19: [2023-05-10 10:10:57,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 31: [2023-05-10 10:10:57,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 31: [2023-05-10 10:10:57,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 19: [2023-05-10 10:10:57,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 31: [2023-05-10 10:10:57,564] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 31: [2023-05-10 10:10:57,564] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 19: [2023-05-10 10:10:57,565] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 10: [2023-05-10 10:10:57,565] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 10: [2023-05-10 10:10:57,565] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 10: [2023-05-10 10:10:57,565] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 19: [2023-05-10 10:10:57,565] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 19: [2023-05-10 10:10:57,565] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 31: [2023-05-10 10:10:57,565] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 26: [2023-05-10 10:10:57,565] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 26: [2023-05-10 10:10:57,565] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 3: [2023-05-10 10:10:57,566] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 0: [2023-05-10 10:10:57,568] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 6: [2023-05-10 10:10:57,568] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 6: [2023-05-10 10:10:57,568] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 0: [2023-05-10 10:10:57,569] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 30: [2023-05-10 10:10:57,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 20: [2023-05-10 10:10:57,569] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 0: [2023-05-10 10:10:57,569] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 30: [2023-05-10 10:10:57,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 9: [2023-05-10 10:10:57,570] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 10: [2023-05-10 10:10:57,570] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 1: [2023-05-10 10:10:57,571] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 0: [2023-05-10 10:10:57,571] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 10: [2023-05-10 10:10:57,573] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 30: [2023-05-10 10:10:57,573] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 3: [2023-05-10 10:10:57,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 3: [2023-05-10 10:10:57,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 3: [2023-05-10 10:10:57,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 10: [2023-05-10 10:10:57,575] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 3: [2023-05-10 10:10:57,576] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 3: [2023-05-10 10:10:57,577] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 3: [2023-05-10 10:10:57,577] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 1: [2023-05-10 10:10:57,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 1: [2023-05-10 10:10:57,578] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 1: [2023-05-10 10:10:57,579] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 1: [2023-05-10 10:10:57,580] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 9: [2023-05-10 10:10:57,580] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 10: [2023-05-10 10:10:57,581] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 10: [2023-05-10 10:10:57,581] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 9: [2023-05-10 10:10:57,582] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 23: [2023-05-10 10:10:57,582] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 23: [2023-05-10 10:10:57,582] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 23: [2023-05-10 10:10:57,582] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 13: [2023-05-10 10:10:57,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 6: [2023-05-10 10:10:57,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 13: [2023-05-10 10:10:57,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 1: [2023-05-10 10:10:57,585] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 13: [2023-05-10 10:10:57,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 13: [2023-05-10 10:10:57,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 1: [2023-05-10 10:10:57,585] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 28: [2023-05-10 10:10:57,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 28: [2023-05-10 10:10:57,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 20: [2023-05-10 10:10:57,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 20: [2023-05-10 10:10:57,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 26: [2023-05-10 10:10:57,587] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 20: [2023-05-10 10:10:57,587] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 1: [2023-05-10 10:10:57,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 3: [2023-05-10 10:10:57,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 9: [2023-05-10 10:10:57,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 28: [2023-05-10 10:10:57,588] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 28: [2023-05-10 10:10:57,588] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 28: [2023-05-10 10:10:57,588] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 28: [2023-05-10 10:10:57,588] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 28: [2023-05-10 10:10:57,588] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 28: [2023-05-10 10:10:57,589] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 9: [2023-05-10 10:10:57,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 6: [2023-05-10 10:10:57,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 9: [2023-05-10 10:10:57,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 30: [2023-05-10 10:10:57,589] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 30: [2023-05-10 10:10:57,589] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 30: [2023-05-10 10:10:57,589] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 6: [2023-05-10 10:10:57,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 26: [2023-05-10 10:10:57,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 26: [2023-05-10 10:10:57,590] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 26: [2023-05-10 10:10:57,590] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 30: [2023-05-10 10:10:57,590] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 28: [2023-05-10 10:10:57,590] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 28: [2023-05-10 10:10:57,590] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 26: [2023-05-10 10:10:57,590] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 30: [2023-05-10 10:10:57,590] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 9: [2023-05-10 10:10:57,591] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 9: [2023-05-10 10:10:57,591] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 13: [2023-05-10 10:10:57,592] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 6: [2023-05-10 10:10:57,592] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 6: [2023-05-10 10:10:57,592] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 23: [2023-05-10 10:10:57,592] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 10: [2023-05-10 10:10:57,593] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 26: [2023-05-10 10:10:57,593] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 26: [2023-05-10 10:10:57,593] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 26: [2023-05-10 10:10:57,593] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 16: [2023-05-10 10:10:57,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 13: [2023-05-10 10:10:57,594] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 16: [2023-05-10 10:10:57,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 16: [2023-05-10 10:10:57,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 16: [2023-05-10 10:10:57,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 13: [2023-05-10 10:10:57,594] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 16: [2023-05-10 10:10:57,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 16: [2023-05-10 10:10:57,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 16: [2023-05-10 10:10:57,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 16: [2023-05-10 10:10:57,595] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 28: [2023-05-10 10:10:57,595] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 13: [2023-05-10 10:10:57,597] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 30: [2023-05-10 10:10:57,597] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 10: [2023-05-10 10:10:57,597] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 10: [2023-05-10 10:10:57,597] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 3: [2023-05-10 10:10:57,597] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 20: [2023-05-10 10:10:57,598] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 20: [2023-05-10 10:10:57,598] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 20: [2023-05-10 10:10:57,598] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 28: [2023-05-10 10:10:57,598] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 28: [2023-05-10 10:10:57,598] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 20: [2023-05-10 10:10:57,598] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 28: [2023-05-10 10:10:57,598] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 30: [2023-05-10 10:10:57,598] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 28: [2023-05-10 10:10:57,599] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 28: [2023-05-10 10:10:57,599] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt... 15: [2023-05-10 10:10:57,599] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 23: [2023-05-10 10:10:57,599] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 16: [2023-05-10 10:10:57,599] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 1: [2023-05-10 10:10:57,600] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 16: [2023-05-10 10:10:57,601] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 6: [2023-05-10 10:10:57,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 6: [2023-05-10 10:10:57,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 6: [2023-05-10 10:10:57,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 23: [2023-05-10 10:10:57,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 16: [2023-05-10 10:10:57,602] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 16: [2023-05-10 10:10:57,602] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 16: [2023-05-10 10:10:57,603] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 16: [2023-05-10 10:10:57,603] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 23: [2023-05-10 10:10:57,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 16: [2023-05-10 10:10:57,604] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 31: [2023-05-10 10:10:57,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 1: [2023-05-10 10:10:57,604] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 1: [2023-05-10 10:10:57,604] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 23: [2023-05-10 10:10:57,604] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 23: [2023-05-10 10:10:57,605] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 16: [2023-05-10 10:10:57,605] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 30: [2023-05-10 10:10:57,606] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 23: [2023-05-10 10:10:57,606] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 3: [2023-05-10 10:10:57,606] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 3: [2023-05-10 10:10:57,606] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 23: [2023-05-10 10:10:57,608] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 3: [2023-05-10 10:10:57,609] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 6: [2023-05-10 10:10:57,611] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 20: [2023-05-10 10:10:57,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 20: [2023-05-10 10:10:57,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 19: [2023-05-10 10:10:57,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 19: [2023-05-10 10:10:57,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 19: [2023-05-10 10:10:57,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 31: [2023-05-10 10:10:57,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 13: [2023-05-10 10:10:57,617] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 26: [2023-05-10 10:10:57,618] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 13: [2023-05-10 10:10:57,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 20: [2023-05-10 10:10:57,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 23: [2023-05-10 10:10:57,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 26: [2023-05-10 10:10:57,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 19: [2023-05-10 10:10:57,619] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 13: [2023-05-10 10:10:57,620] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 13: [2023-05-10 10:10:57,620] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 6: [2023-05-10 10:10:57,621] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 15: [2023-05-10 10:10:57,622] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 6: [2023-05-10 10:10:57,622] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 30: [2023-05-10 10:10:57,622] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 30: [2023-05-10 10:10:57,623] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 30: [2023-05-10 10:10:57,623] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 31: [2023-05-10 10:10:57,623] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 30: [2023-05-10 10:10:57,624] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 30: [2023-05-10 10:10:57,625] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 31: [2023-05-10 10:10:57,625] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 19: [2023-05-10 10:10:57,626] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 26: [2023-05-10 10:10:57,627] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 23: [2023-05-10 10:10:57,629] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 15: [2023-05-10 10:10:57,629] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 15: [2023-05-10 10:10:57,629] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 12: [2023-05-10 10:10:57,629] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 12: [2023-05-10 10:10:57,629] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 12: [2023-05-10 10:10:57,629] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 12: [2023-05-10 10:10:57,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 12: [2023-05-10 10:10:57,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 12: [2023-05-10 10:10:57,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 12: [2023-05-10 10:10:57,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 12: [2023-05-10 10:10:57,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 26: [2023-05-10 10:10:57,630] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 23: [2023-05-10 10:10:57,630] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 26: [2023-05-10 10:10:57,630] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 26: [2023-05-10 10:10:57,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 6: [2023-05-10 10:10:57,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 6: [2023-05-10 10:10:57,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 20: [2023-05-10 10:10:57,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 6: [2023-05-10 10:10:57,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 19: [2023-05-10 10:10:57,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 23: [2023-05-10 10:10:57,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 19: [2023-05-10 10:10:57,633] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 12: [2023-05-10 10:10:57,634] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 19: [2023-05-10 10:10:57,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 23: [2023-05-10 10:10:57,636] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 20: [2023-05-10 10:10:57,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 20: [2023-05-10 10:10:57,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 20: [2023-05-10 10:10:57,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 31: [2023-05-10 10:10:57,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 15: [2023-05-10 10:10:57,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 15: [2023-05-10 10:10:57,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 15: [2023-05-10 10:10:57,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 15: [2023-05-10 10:10:57,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 15: [2023-05-10 10:10:57,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 19: [2023-05-10 10:10:57,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 19: [2023-05-10 10:10:57,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 19: [2023-05-10 10:10:57,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 31: [2023-05-10 10:10:57,639] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 31: [2023-05-10 10:10:57,639] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 31: [2023-05-10 10:10:57,639] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 31: [2023-05-10 10:10:57,639] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 12: [2023-05-10 10:10:57,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 12: [2023-05-10 10:10:57,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 31: [2023-05-10 10:10:57,640] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 12: [2023-05-10 10:10:57,641] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 12: [2023-05-10 10:10:57,642] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 28: [2023-05-10 10:10:57,642] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 28: [2023-05-10 10:10:57,642] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 12: [2023-05-10 10:10:57,642] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 12: [2023-05-10 10:10:57,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 12: [2023-05-10 10:10:57,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 31: [2023-05-10 10:10:57,649] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 19: [2023-05-10 10:10:57,651] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 16: [2023-05-10 10:10:57,653] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 16: [2023-05-10 10:10:57,655] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 15: [2023-05-10 10:10:57,658] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 19: [2023-05-10 10:10:57,660] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 15: [2023-05-10 10:10:57,661] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 28: [2023-05-10 10:10:57,663] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 21: [2023-05-10 10:10:57,664] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 21: [2023-05-10 10:10:57,664] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 21: [2023-05-10 10:10:57,664] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 21: [2023-05-10 10:10:57,664] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 21: [2023-05-10 10:10:57,664] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 21: [2023-05-10 10:10:57,664] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 21: [2023-05-10 10:10:57,664] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 19: [2023-05-10 10:10:57,664] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 28: [2023-05-10 10:10:57,664] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 21: [2023-05-10 10:10:57,664] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 28: [2023-05-10 10:10:57,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 28: [2023-05-10 10:10:57,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 19: [2023-05-10 10:10:57,666] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 19: [2023-05-10 10:10:57,666] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 31: [2023-05-10 10:10:57,667] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 31: [2023-05-10 10:10:57,668] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 28: [2023-05-10 10:10:57,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 28: [2023-05-10 10:10:57,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 28: [2023-05-10 10:10:57,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 28: [2023-05-10 10:10:57,669] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_24-model_00-model_states.pt. 15: [2023-05-10 10:10:57,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 21: [2023-05-10 10:10:57,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 21: [2023-05-10 10:10:57,670] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 21: [2023-05-10 10:10:57,670] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 15: [2023-05-10 10:10:57,671] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 21: [2023-05-10 10:10:57,671] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 15: [2023-05-10 10:10:57,672] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 21: [2023-05-10 10:10:57,673] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 31: [2023-05-10 10:10:57,673] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 31: [2023-05-10 10:10:57,673] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 15: [2023-05-10 10:10:57,673] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 21: [2023-05-10 10:10:57,673] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 21: [2023-05-10 10:10:57,673] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 21: [2023-05-10 10:10:57,673] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 31: [2023-05-10 10:10:57,675] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 15: [2023-05-10 10:10:57,675] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 16: [2023-05-10 10:10:57,675] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 16: [2023-05-10 10:10:57,676] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 16: [2023-05-10 10:10:57,676] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 16: [2023-05-10 10:10:57,676] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 16: [2023-05-10 10:10:57,676] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 16: [2023-05-10 10:10:57,676] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 16: [2023-05-10 10:10:57,676] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 16: [2023-05-10 10:10:57,683] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 12: [2023-05-10 10:10:57,687] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 28: [2023-05-10 10:10:57,697] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 28: [2023-05-10 10:10:57,698] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 28: [2023-05-10 10:10:57,702] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 28: [2023-05-10 10:10:57,703] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 28: [2023-05-10 10:10:57,705] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 28: [2023-05-10 10:10:57,706] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 16: [2023-05-10 10:10:57,707] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 16: [2023-05-10 10:10:57,708] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 12: [2023-05-10 10:10:57,709] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 16: [2023-05-10 10:10:57,710] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 16: [2023-05-10 10:10:57,711] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 16: [2023-05-10 10:10:57,714] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 16: [2023-05-10 10:10:57,714] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 12: [2023-05-10 10:10:57,716] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 21: [2023-05-10 10:10:57,716] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 12: [2023-05-10 10:10:57,716] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 12: [2023-05-10 10:10:57,716] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 21: [2023-05-10 10:10:57,719] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 7: [2023-05-10 10:10:57,729] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 7: [2023-05-10 10:10:57,729] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 7: [2023-05-10 10:10:57,729] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 7: [2023-05-10 10:10:57,729] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 7: [2023-05-10 10:10:57,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 7: [2023-05-10 10:10:57,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 7: [2023-05-10 10:10:57,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 7: [2023-05-10 10:10:57,730] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 12: [2023-05-10 10:10:57,731] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 12: [2023-05-10 10:10:57,731] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 12: [2023-05-10 10:10:57,731] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 12: [2023-05-10 10:10:57,731] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 7: [2023-05-10 10:10:57,735] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 7: [2023-05-10 10:10:57,735] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 7: [2023-05-10 10:10:57,735] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 7: [2023-05-10 10:10:57,736] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 7: [2023-05-10 10:10:57,736] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 7: [2023-05-10 10:10:57,737] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 7: [2023-05-10 10:10:57,738] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 21: [2023-05-10 10:10:57,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 7: [2023-05-10 10:10:57,739] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 21: [2023-05-10 10:10:57,740] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 21: [2023-05-10 10:10:57,740] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 21: [2023-05-10 10:10:57,740] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 21: [2023-05-10 10:10:57,740] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 21: [2023-05-10 10:10:57,740] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 21: [2023-05-10 10:10:57,740] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 21: [2023-05-10 10:10:57,744] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 12: [2023-05-10 10:10:57,747] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 12: [2023-05-10 10:10:57,747] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 12: [2023-05-10 10:10:57,750] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 29: [2023-05-10 10:10:57,753] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 29: [2023-05-10 10:10:57,753] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 29: [2023-05-10 10:10:57,753] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 29: [2023-05-10 10:10:57,754] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 29: [2023-05-10 10:10:57,754] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 29: [2023-05-10 10:10:57,754] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 29: [2023-05-10 10:10:57,754] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 29: [2023-05-10 10:10:57,754] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 29: [2023-05-10 10:10:57,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 29: [2023-05-10 10:10:57,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 29: [2023-05-10 10:10:57,762] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 22: [2023-05-10 10:10:57,762] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 22: [2023-05-10 10:10:57,762] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 22: [2023-05-10 10:10:57,763] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 22: [2023-05-10 10:10:57,763] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 29: [2023-05-10 10:10:57,763] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 22: [2023-05-10 10:10:57,763] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 22: [2023-05-10 10:10:57,763] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 22: [2023-05-10 10:10:57,763] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 22: [2023-05-10 10:10:57,763] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 29: [2023-05-10 10:10:57,764] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 29: [2023-05-10 10:10:57,764] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 29: [2023-05-10 10:10:57,765] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 12: [2023-05-10 10:10:57,765] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 29: [2023-05-10 10:10:57,765] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 12: [2023-05-10 10:10:57,766] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 22: [2023-05-10 10:10:57,768] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 12: [2023-05-10 10:10:57,769] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 21: [2023-05-10 10:10:57,769] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 22: [2023-05-10 10:10:57,769] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 12: [2023-05-10 10:10:57,771] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 22: [2023-05-10 10:10:57,773] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 22: [2023-05-10 10:10:57,773] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 22: [2023-05-10 10:10:57,774] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 22: [2023-05-10 10:10:57,774] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 22: [2023-05-10 10:10:57,774] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 22: [2023-05-10 10:10:57,775] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 21: [2023-05-10 10:10:57,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 21: [2023-05-10 10:10:57,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 14: [2023-05-10 10:10:57,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 14: [2023-05-10 10:10:57,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 21: [2023-05-10 10:10:57,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 14: [2023-05-10 10:10:57,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 14: [2023-05-10 10:10:57,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 14: [2023-05-10 10:10:57,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 14: [2023-05-10 10:10:57,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 14: [2023-05-10 10:10:57,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 14: [2023-05-10 10:10:57,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 21: [2023-05-10 10:10:57,779] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 21: [2023-05-10 10:10:57,779] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 17: [2023-05-10 10:10:57,779] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 17: [2023-05-10 10:10:57,780] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 17: [2023-05-10 10:10:57,780] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 17: [2023-05-10 10:10:57,780] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 17: [2023-05-10 10:10:57,780] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 17: [2023-05-10 10:10:57,780] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 17: [2023-05-10 10:10:57,780] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 17: [2023-05-10 10:10:57,781] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 14: [2023-05-10 10:10:57,783] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 17: [2023-05-10 10:10:57,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 14: [2023-05-10 10:10:57,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 7: [2023-05-10 10:10:57,784] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 14: [2023-05-10 10:10:57,785] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 14: [2023-05-10 10:10:57,785] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 14: [2023-05-10 10:10:57,786] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 14: [2023-05-10 10:10:57,786] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 14: [2023-05-10 10:10:57,787] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 14: [2023-05-10 10:10:57,787] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 7: [2023-05-10 10:10:57,789] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 18: [2023-05-10 10:10:57,791] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 18: [2023-05-10 10:10:57,791] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 17: [2023-05-10 10:10:57,792] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 17: [2023-05-10 10:10:57,792] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 18: [2023-05-10 10:10:57,792] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 18: [2023-05-10 10:10:57,792] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 18: [2023-05-10 10:10:57,792] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 18: [2023-05-10 10:10:57,792] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 18: [2023-05-10 10:10:57,792] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 18: [2023-05-10 10:10:57,793] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 17: [2023-05-10 10:10:57,794] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 17: [2023-05-10 10:10:57,794] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 17: [2023-05-10 10:10:57,794] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 18: [2023-05-10 10:10:57,795] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 17: [2023-05-10 10:10:57,795] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 17: [2023-05-10 10:10:57,796] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 18: [2023-05-10 10:10:57,797] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 18: [2023-05-10 10:10:57,798] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 18: [2023-05-10 10:10:57,798] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 7: [2023-05-10 10:10:57,799] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 7: [2023-05-10 10:10:57,799] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 7: [2023-05-10 10:10:57,799] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 18: [2023-05-10 10:10:57,799] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 7: [2023-05-10 10:10:57,800] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 18: [2023-05-10 10:10:57,800] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 7: [2023-05-10 10:10:57,801] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 7: [2023-05-10 10:10:57,801] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 18: [2023-05-10 10:10:57,801] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 18: [2023-05-10 10:10:57,802] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 7: [2023-05-10 10:10:57,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 29: [2023-05-10 10:10:57,811] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 29: [2023-05-10 10:10:57,811] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 7: [2023-05-10 10:10:57,812] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 29: [2023-05-10 10:10:57,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 22: [2023-05-10 10:10:57,823] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 22: [2023-05-10 10:10:57,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 7: [2023-05-10 10:10:57,826] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 7: [2023-05-10 10:10:57,827] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 7: [2023-05-10 10:10:57,828] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 7: [2023-05-10 10:10:57,828] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 29: [2023-05-10 10:10:57,828] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 7: [2023-05-10 10:10:57,829] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 14: [2023-05-10 10:10:57,829] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 7: [2023-05-10 10:10:57,831] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 27: [2023-05-10 10:10:57,833] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 27: [2023-05-10 10:10:57,833] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 29: [2023-05-10 10:10:57,833] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 29: [2023-05-10 10:10:57,833] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 27: [2023-05-10 10:10:57,833] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 27: [2023-05-10 10:10:57,833] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 27: [2023-05-10 10:10:57,833] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 27: [2023-05-10 10:10:57,833] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 27: [2023-05-10 10:10:57,833] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 27: [2023-05-10 10:10:57,834] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 2: [2023-05-10 10:10:57,835] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 2: [2023-05-10 10:10:57,835] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 4: [2023-05-10 10:10:57,835] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 4: [2023-05-10 10:10:57,835] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 4: [2023-05-10 10:10:57,835] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 4: [2023-05-10 10:10:57,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 4: [2023-05-10 10:10:57,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 4: [2023-05-10 10:10:57,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 4: [2023-05-10 10:10:57,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 4: [2023-05-10 10:10:57,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 17: [2023-05-10 10:10:57,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 2: [2023-05-10 10:10:57,837] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 2: [2023-05-10 10:10:57,837] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 2: [2023-05-10 10:10:57,837] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 2: [2023-05-10 10:10:57,837] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 2: [2023-05-10 10:10:57,837] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 2: [2023-05-10 10:10:57,837] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 27: [2023-05-10 10:10:57,837] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 27: [2023-05-10 10:10:57,839] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 27: [2023-05-10 10:10:57,840] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 2: [2023-05-10 10:10:57,839] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 27: [2023-05-10 10:10:57,840] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 4: [2023-05-10 10:10:57,840] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 4: [2023-05-10 10:10:57,840] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 2: [2023-05-10 10:10:57,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 4: [2023-05-10 10:10:57,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 27: [2023-05-10 10:10:57,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 11: [2023-05-10 10:10:57,841] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 11: [2023-05-10 10:10:57,841] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 25: [2023-05-10 10:10:57,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 25: [2023-05-10 10:10:57,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 27: [2023-05-10 10:10:57,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 4: [2023-05-10 10:10:57,841] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 11: [2023-05-10 10:10:57,841] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 11: [2023-05-10 10:10:57,841] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 11: [2023-05-10 10:10:57,841] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 11: [2023-05-10 10:10:57,841] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 11: [2023-05-10 10:10:57,841] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 11: [2023-05-10 10:10:57,842] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 4: [2023-05-10 10:10:57,842] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 25: [2023-05-10 10:10:57,842] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 22: [2023-05-10 10:10:57,842] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 22: [2023-05-10 10:10:57,842] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 25: [2023-05-10 10:10:57,842] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 25: [2023-05-10 10:10:57,842] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 27: [2023-05-10 10:10:57,842] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 25: [2023-05-10 10:10:57,842] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 25: [2023-05-10 10:10:57,842] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 25: [2023-05-10 10:10:57,842] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 27: [2023-05-10 10:10:57,843] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 4: [2023-05-10 10:10:57,843] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 4: [2023-05-10 10:10:57,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 29: [2023-05-10 10:10:57,844] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 29: [2023-05-10 10:10:57,844] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 4: [2023-05-10 10:10:57,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 29: [2023-05-10 10:10:57,844] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 29: [2023-05-10 10:10:57,844] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 14: [2023-05-10 10:10:57,844] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 2: [2023-05-10 10:10:57,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 11: [2023-05-10 10:10:57,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 14: [2023-05-10 10:10:57,845] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 11: [2023-05-10 10:10:57,845] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 14: [2023-05-10 10:10:57,846] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 2: [2023-05-10 10:10:57,846] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 22: [2023-05-10 10:10:57,846] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 22: [2023-05-10 10:10:57,846] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 22: [2023-05-10 10:10:57,846] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 22: [2023-05-10 10:10:57,846] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 25: [2023-05-10 10:10:57,847] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 2: [2023-05-10 10:10:57,847] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 25: [2023-05-10 10:10:57,847] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 22: [2023-05-10 10:10:57,848] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 2: [2023-05-10 10:10:57,848] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 29: [2023-05-10 10:10:57,848] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 24: [2023-05-10 10:10:57,848] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 2: [2023-05-10 10:10:57,849] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 2: [2023-05-10 10:10:57,849] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 25: [2023-05-10 10:10:57,849] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 24: [2023-05-10 10:10:57,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 22: [2023-05-10 10:10:57,849] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 25: [2023-05-10 10:10:57,850] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 18: [2023-05-10 10:10:57,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 25: [2023-05-10 10:10:57,850] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 25: [2023-05-10 10:10:57,850] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 25: [2023-05-10 10:10:57,850] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 25: [2023-05-10 10:10:57,851] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 24: [2023-05-10 10:10:57,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 24: [2023-05-10 10:10:57,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 24: [2023-05-10 10:10:57,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 24: [2023-05-10 10:10:57,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 24: [2023-05-10 10:10:57,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 11: [2023-05-10 10:10:57,851] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 24: [2023-05-10 10:10:57,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 11: [2023-05-10 10:10:57,852] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 18: [2023-05-10 10:10:57,852] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 11: [2023-05-10 10:10:57,852] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 11: [2023-05-10 10:10:57,853] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 14: [2023-05-10 10:10:57,853] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 24: [2023-05-10 10:10:57,853] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 11: [2023-05-10 10:10:57,853] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 24: [2023-05-10 10:10:57,853] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 14: [2023-05-10 10:10:57,853] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 11: [2023-05-10 10:10:57,854] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 14: [2023-05-10 10:10:57,855] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 14: [2023-05-10 10:10:57,855] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 14: [2023-05-10 10:10:57,856] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 17: [2023-05-10 10:10:57,856] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 29: [2023-05-10 10:10:57,857] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 24: [2023-05-10 10:10:57,858] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 24: [2023-05-10 10:10:57,858] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 24: [2023-05-10 10:10:57,858] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 24: [2023-05-10 10:10:57,859] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 24: [2023-05-10 10:10:57,859] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 24: [2023-05-10 10:10:57,860] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 18: [2023-05-10 10:10:57,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 18: [2023-05-10 10:10:57,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 18: [2023-05-10 10:10:57,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 18: [2023-05-10 10:10:57,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 18: [2023-05-10 10:10:57,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 18: [2023-05-10 10:10:57,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 17: [2023-05-10 10:10:57,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 17: [2023-05-10 10:10:57,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 14: [2023-05-10 10:10:57,870] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 18: [2023-05-10 10:10:57,872] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 29: [2023-05-10 10:10:57,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 14: [2023-05-10 10:10:57,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 14: [2023-05-10 10:10:57,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 22: [2023-05-10 10:10:57,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 22: [2023-05-10 10:10:57,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 22: [2023-05-10 10:10:57,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 29: [2023-05-10 10:10:57,875] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 29: [2023-05-10 10:10:57,875] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 29: [2023-05-10 10:10:57,876] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 22: [2023-05-10 10:10:57,877] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 18: [2023-05-10 10:10:57,877] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 17: [2023-05-10 10:10:57,877] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 17: [2023-05-10 10:10:57,877] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 17: [2023-05-10 10:10:57,877] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 17: [2023-05-10 10:10:57,877] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 17: [2023-05-10 10:10:57,877] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 22: [2023-05-10 10:10:57,882] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 22: [2023-05-10 10:10:57,884] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 14: [2023-05-10 10:10:57,885] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 14: [2023-05-10 10:10:57,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 27: [2023-05-10 10:10:57,887] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 14: [2023-05-10 10:10:57,888] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 11: [2023-05-10 10:10:57,888] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 18: [2023-05-10 10:10:57,890] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 14: [2023-05-10 10:10:57,891] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 4: [2023-05-10 10:10:57,891] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 2: [2023-05-10 10:10:57,891] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 2: [2023-05-10 10:10:57,891] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 17: [2023-05-10 10:10:57,892] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 18: [2023-05-10 10:10:57,894] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 18: [2023-05-10 10:10:57,894] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 18: [2023-05-10 10:10:57,894] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 27: [2023-05-10 10:10:57,893] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 17: [2023-05-10 10:10:57,894] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 4: [2023-05-10 10:10:57,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 4: [2023-05-10 10:10:57,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 18: [2023-05-10 10:10:57,894] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 18: [2023-05-10 10:10:57,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 27: [2023-05-10 10:10:57,896] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 11: [2023-05-10 10:10:57,899] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 24: [2023-05-10 10:10:57,904] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 24: [2023-05-10 10:10:57,904] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 4: [2023-05-10 10:10:57,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 4: [2023-05-10 10:10:57,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 25: [2023-05-10 10:10:57,907] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 25: [2023-05-10 10:10:57,907] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 25: [2023-05-10 10:10:57,907] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 2: [2023-05-10 10:10:57,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 27: [2023-05-10 10:10:57,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 27: [2023-05-10 10:10:57,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 27: [2023-05-10 10:10:57,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 27: [2023-05-10 10:10:57,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 27: [2023-05-10 10:10:57,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 27: [2023-05-10 10:10:57,910] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 4: [2023-05-10 10:10:57,910] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 4: [2023-05-10 10:10:57,910] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 4: [2023-05-10 10:10:57,910] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 25: [2023-05-10 10:10:57,911] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 25: [2023-05-10 10:10:57,911] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 25: [2023-05-10 10:10:57,911] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 25: [2023-05-10 10:10:57,911] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 11: [2023-05-10 10:10:57,912] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 11: [2023-05-10 10:10:57,912] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 17: [2023-05-10 10:10:57,913] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 2: [2023-05-10 10:10:57,914] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 2: [2023-05-10 10:10:57,914] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 25: [2023-05-10 10:10:57,914] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 4: [2023-05-10 10:10:57,916] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 17: [2023-05-10 10:10:57,916] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 17: [2023-05-10 10:10:57,916] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 27: [2023-05-10 10:10:57,917] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 17: [2023-05-10 10:10:57,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 17: [2023-05-10 10:10:57,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 2: [2023-05-10 10:10:57,919] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 2: [2023-05-10 10:10:57,919] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 4: [2023-05-10 10:10:57,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 27: [2023-05-10 10:10:57,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 4: [2023-05-10 10:10:57,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 2: [2023-05-10 10:10:57,922] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 2: [2023-05-10 10:10:57,922] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 11: [2023-05-10 10:10:57,923] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 5: [2023-05-10 10:10:57,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 5: [2023-05-10 10:10:57,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 24: [2023-05-10 10:10:57,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 24: [2023-05-10 10:10:57,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 24: [2023-05-10 10:10:57,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 15: [2023-05-10 10:10:57,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 15: [2023-05-10 10:10:57,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 15: [2023-05-10 10:10:57,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 15: [2023-05-10 10:10:57,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 15: [2023-05-10 10:10:57,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 15: [2023-05-10 10:10:57,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 15: [2023-05-10 10:10:57,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 15: [2023-05-10 10:10:57,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 11: [2023-05-10 10:10:57,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 11: [2023-05-10 10:10:57,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 11: [2023-05-10 10:10:57,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 11: [2023-05-10 10:10:57,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 5: [2023-05-10 10:10:57,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 5: [2023-05-10 10:10:57,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 5: [2023-05-10 10:10:57,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 5: [2023-05-10 10:10:57,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 5: [2023-05-10 10:10:57,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 4: [2023-05-10 10:10:57,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 5: [2023-05-10 10:10:57,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 5: [2023-05-10 10:10:57,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 5: [2023-05-10 10:10:57,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 4: [2023-05-10 10:10:57,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 15: [2023-05-10 10:10:57,932] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 2: [2023-05-10 10:10:57,933] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 5: [2023-05-10 10:10:57,933] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 11: [2023-05-10 10:10:57,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 24: [2023-05-10 10:10:57,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 24: [2023-05-10 10:10:57,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 24: [2023-05-10 10:10:57,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 24: [2023-05-10 10:10:57,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 24: [2023-05-10 10:10:57,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 11: [2023-05-10 10:10:57,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 31: [2023-05-10 10:10:57,936] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 31: [2023-05-10 10:10:57,936] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 31: [2023-05-10 10:10:57,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 31: [2023-05-10 10:10:57,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 31: [2023-05-10 10:10:57,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 31: [2023-05-10 10:10:57,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 31: [2023-05-10 10:10:57,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 27: [2023-05-10 10:10:57,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 27: [2023-05-10 10:10:57,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 31: [2023-05-10 10:10:57,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 27: [2023-05-10 10:10:57,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 15: [2023-05-10 10:10:57,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 4: [2023-05-10 10:10:57,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 5: [2023-05-10 10:10:57,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 25: [2023-05-10 10:10:57,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 25: [2023-05-10 10:10:57,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 25: [2023-05-10 10:10:57,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 15: [2023-05-10 10:10:57,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 25: [2023-05-10 10:10:57,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 5: [2023-05-10 10:10:57,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 15: [2023-05-10 10:10:57,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 5: [2023-05-10 10:10:57,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 15: [2023-05-10 10:10:57,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 5: [2023-05-10 10:10:57,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 15: [2023-05-10 10:10:57,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 5: [2023-05-10 10:10:57,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 31: [2023-05-10 10:10:57,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 2: [2023-05-10 10:10:57,941] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 15: [2023-05-10 10:10:57,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 4: [2023-05-10 10:10:57,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 27: [2023-05-10 10:10:57,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 4: [2023-05-10 10:10:57,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 31: [2023-05-10 10:10:57,942] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 15: [2023-05-10 10:10:57,942] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 25: [2023-05-10 10:10:57,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 25: [2023-05-10 10:10:57,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 31: [2023-05-10 10:10:57,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 25: [2023-05-10 10:10:57,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 27: [2023-05-10 10:10:57,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 31: [2023-05-10 10:10:57,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 25: [2023-05-10 10:10:57,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 31: [2023-05-10 10:10:57,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 31: [2023-05-10 10:10:57,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 31: [2023-05-10 10:10:57,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 31: [2023-05-10 10:10:57,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 2: [2023-05-10 10:10:57,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 2: [2023-05-10 10:10:57,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 24: [2023-05-10 10:10:57,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 2: [2023-05-10 10:10:57,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 2: [2023-05-10 10:10:57,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 8: [2023-05-10 10:10:57,959] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 8: [2023-05-10 10:10:57,959] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 8: [2023-05-10 10:10:57,959] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 11: [2023-05-10 10:10:57,959] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 8: [2023-05-10 10:10:57,960] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 8: [2023-05-10 10:10:57,960] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 8: [2023-05-10 10:10:57,960] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 8: [2023-05-10 10:10:57,960] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 8: [2023-05-10 10:10:57,960] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 16: [2023-05-10 10:10:57,961] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 16: [2023-05-10 10:10:57,962] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 16: [2023-05-10 10:10:57,962] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 16: [2023-05-10 10:10:57,962] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 16: [2023-05-10 10:10:57,962] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 16: [2023-05-10 10:10:57,962] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 16: [2023-05-10 10:10:57,962] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 11: [2023-05-10 10:10:57,962] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 16: [2023-05-10 10:10:57,963] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 8: [2023-05-10 10:10:57,964] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 11: [2023-05-10 10:10:57,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 11: [2023-05-10 10:10:57,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 11: [2023-05-10 10:10:57,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 24: [2023-05-10 10:10:57,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 8: [2023-05-10 10:10:57,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 2: [2023-05-10 10:10:57,966] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 8: [2023-05-10 10:10:57,966] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 19: [2023-05-10 10:10:57,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 19: [2023-05-10 10:10:57,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 19: [2023-05-10 10:10:57,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 19: [2023-05-10 10:10:57,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 19: [2023-05-10 10:10:57,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 19: [2023-05-10 10:10:57,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 8: [2023-05-10 10:10:57,966] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 19: [2023-05-10 10:10:57,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 24: [2023-05-10 10:10:57,966] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 19: [2023-05-10 10:10:57,967] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 8: [2023-05-10 10:10:57,967] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 8: [2023-05-10 10:10:57,967] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 8: [2023-05-10 10:10:57,967] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 24: [2023-05-10 10:10:57,967] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 8: [2023-05-10 10:10:57,967] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 16: [2023-05-10 10:10:57,967] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 16: [2023-05-10 10:10:57,968] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 13: [2023-05-10 10:10:57,969] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 13: [2023-05-10 10:10:57,969] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 13: [2023-05-10 10:10:57,969] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 13: [2023-05-10 10:10:57,969] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 13: [2023-05-10 10:10:57,969] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 13: [2023-05-10 10:10:57,969] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 13: [2023-05-10 10:10:57,969] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 13: [2023-05-10 10:10:57,970] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 24: [2023-05-10 10:10:57,970] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 16: [2023-05-10 10:10:57,970] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 24: [2023-05-10 10:10:57,970] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 19: [2023-05-10 10:10:57,970] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 16: [2023-05-10 10:10:57,971] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 16: [2023-05-10 10:10:57,971] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 19: [2023-05-10 10:10:57,971] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 16: [2023-05-10 10:10:57,971] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 16: [2023-05-10 10:10:57,972] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 19: [2023-05-10 10:10:57,972] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 16: [2023-05-10 10:10:57,973] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 19: [2023-05-10 10:10:57,975] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 19: [2023-05-10 10:10:57,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 1: [2023-05-10 10:10:57,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 1: [2023-05-10 10:10:57,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 19: [2023-05-10 10:10:57,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 19: [2023-05-10 10:10:57,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 1: [2023-05-10 10:10:57,977] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 1: [2023-05-10 10:10:57,977] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 1: [2023-05-10 10:10:57,977] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 1: [2023-05-10 10:10:57,977] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 1: [2023-05-10 10:10:57,977] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 19: [2023-05-10 10:10:57,977] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 1: [2023-05-10 10:10:57,977] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 13: [2023-05-10 10:10:57,978] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 5: [2023-05-10 10:10:57,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 9: [2023-05-10 10:10:57,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 9: [2023-05-10 10:10:57,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 15: [2023-05-10 10:10:57,981] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 28: [2023-05-10 10:10:57,982] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 28: [2023-05-10 10:10:57,982] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 1: [2023-05-10 10:10:57,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 13: [2023-05-10 10:10:57,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 9: [2023-05-10 10:10:57,982] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 9: [2023-05-10 10:10:57,982] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 9: [2023-05-10 10:10:57,982] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 9: [2023-05-10 10:10:57,982] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 9: [2023-05-10 10:10:57,982] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 13: [2023-05-10 10:10:57,983] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 9: [2023-05-10 10:10:57,983] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 1: [2023-05-10 10:10:57,983] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 13: [2023-05-10 10:10:57,983] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 1: [2023-05-10 10:10:57,983] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 13: [2023-05-10 10:10:57,983] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 1: [2023-05-10 10:10:57,984] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 1: [2023-05-10 10:10:57,984] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 13: [2023-05-10 10:10:57,984] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 1: [2023-05-10 10:10:57,984] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 13: [2023-05-10 10:10:57,984] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 1: [2023-05-10 10:10:57,985] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 28: [2023-05-10 10:10:57,985] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 28: [2023-05-10 10:10:57,985] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 28: [2023-05-10 10:10:57,985] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 28: [2023-05-10 10:10:57,985] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 28: [2023-05-10 10:10:57,985] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 1: [2023-05-10 10:10:57,986] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 28: [2023-05-10 10:10:57,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 13: [2023-05-10 10:10:57,986] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 9: [2023-05-10 10:10:57,985] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 9: [2023-05-10 10:10:57,986] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 28: [2023-05-10 10:10:57,987] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 28: [2023-05-10 10:10:57,987] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 5: [2023-05-10 10:10:57,988] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 5: [2023-05-10 10:10:57,988] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 9: [2023-05-10 10:10:57,989] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 9: [2023-05-10 10:10:57,990] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 9: [2023-05-10 10:10:57,990] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 9: [2023-05-10 10:10:57,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 9: [2023-05-10 10:10:57,992] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 9: [2023-05-10 10:10:57,992] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 28: [2023-05-10 10:10:57,992] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 31: [2023-05-10 10:10:57,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 30: [2023-05-10 10:10:57,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 30: [2023-05-10 10:10:57,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 30: [2023-05-10 10:10:57,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 30: [2023-05-10 10:10:57,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 30: [2023-05-10 10:10:57,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 30: [2023-05-10 10:10:57,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 30: [2023-05-10 10:10:57,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 30: [2023-05-10 10:10:57,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 21: [2023-05-10 10:10:57,995] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 21: [2023-05-10 10:10:57,995] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 21: [2023-05-10 10:10:57,995] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 21: [2023-05-10 10:10:57,995] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 21: [2023-05-10 10:10:57,995] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 21: [2023-05-10 10:10:57,995] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 21: [2023-05-10 10:10:57,995] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 21: [2023-05-10 10:10:57,996] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 28: [2023-05-10 10:10:57,995] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 28: [2023-05-10 10:10:57,995] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 31: [2023-05-10 10:10:57,996] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 28: [2023-05-10 10:10:57,996] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 28: [2023-05-10 10:10:57,996] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 28: [2023-05-10 10:10:57,997] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 31: [2023-05-10 10:10:57,999] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 21: [2023-05-10 10:10:58,000] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 5: [2023-05-10 10:10:58,001] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 15: [2023-05-10 10:10:58,001] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 30: [2023-05-10 10:10:58,002] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 21: [2023-05-10 10:10:58,003] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 21: [2023-05-10 10:10:58,003] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 21: [2023-05-10 10:10:58,003] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 21: [2023-05-10 10:10:58,003] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 5: [2023-05-10 10:10:58,004] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 5: [2023-05-10 10:10:58,004] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 21: [2023-05-10 10:10:58,004] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 21: [2023-05-10 10:10:58,005] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 21: [2023-05-10 10:10:58,005] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 5: [2023-05-10 10:10:58,005] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 5: [2023-05-10 10:10:58,005] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 5: [2023-05-10 10:10:58,005] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 30: [2023-05-10 10:10:58,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 30: [2023-05-10 10:10:58,007] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 30: [2023-05-10 10:10:58,007] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 30: [2023-05-10 10:10:58,008] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 30: [2023-05-10 10:10:58,008] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 30: [2023-05-10 10:10:58,008] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 30: [2023-05-10 10:10:58,009] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 8: [2023-05-10 10:10:58,010] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 15: [2023-05-10 10:10:58,011] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 15: [2023-05-10 10:10:58,011] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 31: [2023-05-10 10:10:58,013] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 6: [2023-05-10 10:10:58,012] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 6: [2023-05-10 10:10:58,012] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 6: [2023-05-10 10:10:58,013] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 6: [2023-05-10 10:10:58,013] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 6: [2023-05-10 10:10:58,013] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 6: [2023-05-10 10:10:58,013] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 6: [2023-05-10 10:10:58,013] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 6: [2023-05-10 10:10:58,013] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 12: [2023-05-10 10:10:58,013] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 12: [2023-05-10 10:10:58,013] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 12: [2023-05-10 10:10:58,013] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 31: [2023-05-10 10:10:58,013] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 12: [2023-05-10 10:10:58,014] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 12: [2023-05-10 10:10:58,014] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 12: [2023-05-10 10:10:58,014] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 12: [2023-05-10 10:10:58,014] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 12: [2023-05-10 10:10:58,014] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 5: [2023-05-10 10:10:58,016] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 5: [2023-05-10 10:10:58,016] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 23: [2023-05-10 10:10:58,016] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 23: [2023-05-10 10:10:58,016] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 6: [2023-05-10 10:10:58,016] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 6: [2023-05-10 10:10:58,017] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 31: [2023-05-10 10:10:58,017] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 23: [2023-05-10 10:10:58,018] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 19: [2023-05-10 10:10:58,018] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 23: [2023-05-10 10:10:58,018] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 12: [2023-05-10 10:10:58,018] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 23: [2023-05-10 10:10:58,018] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 23: [2023-05-10 10:10:58,018] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 23: [2023-05-10 10:10:58,018] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 15: [2023-05-10 10:10:58,017] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 15: [2023-05-10 10:10:58,017] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 15: [2023-05-10 10:10:58,017] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 15: [2023-05-10 10:10:58,017] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 15: [2023-05-10 10:10:58,017] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 23: [2023-05-10 10:10:58,018] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 20: [2023-05-10 10:10:58,018] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 20: [2023-05-10 10:10:58,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 20: [2023-05-10 10:10:58,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 10: [2023-05-10 10:10:58,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 10: [2023-05-10 10:10:58,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 20: [2023-05-10 10:10:58,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 20: [2023-05-10 10:10:58,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 20: [2023-05-10 10:10:58,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 10: [2023-05-10 10:10:58,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 20: [2023-05-10 10:10:58,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 3: [2023-05-10 10:10:58,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 3: [2023-05-10 10:10:58,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 10: [2023-05-10 10:10:58,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 10: [2023-05-10 10:10:58,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 10: [2023-05-10 10:10:58,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 20: [2023-05-10 10:10:58,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 3: [2023-05-10 10:10:58,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 3: [2023-05-10 10:10:58,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 10: [2023-05-10 10:10:58,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 10: [2023-05-10 10:10:58,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 3: [2023-05-10 10:10:58,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 3: [2023-05-10 10:10:58,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 3: [2023-05-10 10:10:58,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 3: [2023-05-10 10:10:58,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 16: [2023-05-10 10:10:58,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 0: [2023-05-10 10:10:58,021] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 0: [2023-05-10 10:10:58,021] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 0: [2023-05-10 10:10:58,021] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 0: [2023-05-10 10:10:58,021] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 0: [2023-05-10 10:10:58,021] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 0: [2023-05-10 10:10:58,021] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 0: [2023-05-10 10:10:58,021] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 0: [2023-05-10 10:10:58,021] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 20: [2023-05-10 10:10:58,021] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 23: [2023-05-10 10:10:58,022] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 23: [2023-05-10 10:10:58,022] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 6: [2023-05-10 10:10:58,022] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 6: [2023-05-10 10:10:58,023] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 23: [2023-05-10 10:10:58,023] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 6: [2023-05-10 10:10:58,023] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 6: [2023-05-10 10:10:58,023] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 31: [2023-05-10 10:10:58,024] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 8: [2023-05-10 10:10:58,024] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 12: [2023-05-10 10:10:58,024] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 6: [2023-05-10 10:10:58,024] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 3: [2023-05-10 10:10:58,024] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 3: [2023-05-10 10:10:58,024] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 3: [2023-05-10 10:10:58,025] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 6: [2023-05-10 10:10:58,025] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 12: [2023-05-10 10:10:58,025] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 0: [2023-05-10 10:10:58,025] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 16: [2023-05-10 10:10:58,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 12: [2023-05-10 10:10:58,025] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 10: [2023-05-10 10:10:58,025] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 12: [2023-05-10 10:10:58,026] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 12: [2023-05-10 10:10:58,026] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 0: [2023-05-10 10:10:58,026] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 23: [2023-05-10 10:10:58,026] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 0: [2023-05-10 10:10:58,026] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 23: [2023-05-10 10:10:58,026] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 10: [2023-05-10 10:10:58,026] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 10: [2023-05-10 10:10:58,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 31: [2023-05-10 10:10:58,026] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 31: [2023-05-10 10:10:58,026] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 31: [2023-05-10 10:10:58,026] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 31: [2023-05-10 10:10:58,026] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 20: [2023-05-10 10:10:58,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 12: [2023-05-10 10:10:58,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 23: [2023-05-10 10:10:58,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 3: [2023-05-10 10:10:58,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 23: [2023-05-10 10:10:58,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 0: [2023-05-10 10:10:58,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 12: [2023-05-10 10:10:58,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 0: [2023-05-10 10:10:58,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 23: [2023-05-10 10:10:58,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 10: [2023-05-10 10:10:58,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 10: [2023-05-10 10:10:58,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 0: [2023-05-10 10:10:58,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 20: [2023-05-10 10:10:58,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 3: [2023-05-10 10:10:58,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 0: [2023-05-10 10:10:58,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 10: [2023-05-10 10:10:58,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 3: [2023-05-10 10:10:58,029] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 3: [2023-05-10 10:10:58,029] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 0: [2023-05-10 10:10:58,029] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 10: [2023-05-10 10:10:58,029] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 3: [2023-05-10 10:10:58,029] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 10: [2023-05-10 10:10:58,030] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 19: [2023-05-10 10:10:58,030] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 20: [2023-05-10 10:10:58,030] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 20: [2023-05-10 10:10:58,030] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 20: [2023-05-10 10:10:58,030] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 5: [2023-05-10 10:10:58,031] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 20: [2023-05-10 10:10:58,031] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 5: [2023-05-10 10:10:58,031] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 20: [2023-05-10 10:10:58,032] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 8: [2023-05-10 10:10:58,032] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 19: [2023-05-10 10:10:58,033] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 8: [2023-05-10 10:10:58,034] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 8: [2023-05-10 10:10:58,034] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 8: [2023-05-10 10:10:58,034] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 8: [2023-05-10 10:10:58,034] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 8: [2023-05-10 10:10:58,034] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 8: [2023-05-10 10:10:58,035] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 26: [2023-05-10 10:10:58,034] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 26: [2023-05-10 10:10:58,034] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 5: [2023-05-10 10:10:58,037] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 5: [2023-05-10 10:10:58,037] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 5: [2023-05-10 10:10:58,037] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 1: [2023-05-10 10:10:58,038] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 1: [2023-05-10 10:10:58,038] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 26: [2023-05-10 10:10:58,038] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 26: [2023-05-10 10:10:58,038] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 26: [2023-05-10 10:10:58,038] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 26: [2023-05-10 10:10:58,038] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 26: [2023-05-10 10:10:58,038] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 26: [2023-05-10 10:10:58,038] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 1: [2023-05-10 10:10:58,039] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 19: [2023-05-10 10:10:58,039] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 28: [2023-05-10 10:10:58,038] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 28: [2023-05-10 10:10:58,038] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 19: [2023-05-10 10:10:58,039] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 26: [2023-05-10 10:10:58,040] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 26: [2023-05-10 10:10:58,040] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 19: [2023-05-10 10:10:58,040] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 15: [2023-05-10 10:10:58,040] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 31: [2023-05-10 10:10:58,040] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 16: [2023-05-10 10:10:58,042] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 16: [2023-05-10 10:10:58,042] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 19: [2023-05-10 10:10:58,042] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 19: [2023-05-10 10:10:58,042] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 16: [2023-05-10 10:10:58,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 16: [2023-05-10 10:10:58,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 16: [2023-05-10 10:10:58,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 16: [2023-05-10 10:10:58,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 16: [2023-05-10 10:10:58,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 15: [2023-05-10 10:10:58,044] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 19: [2023-05-10 10:10:58,044] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 13: [2023-05-10 10:10:58,045] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 9: [2023-05-10 10:10:58,045] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 9: [2023-05-10 10:10:58,045] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 9: [2023-05-10 10:10:58,045] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 21: [2023-05-10 10:10:58,046] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 26: [2023-05-10 10:10:58,047] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 26: [2023-05-10 10:10:58,048] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 28: [2023-05-10 10:10:58,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 9: [2023-05-10 10:10:58,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 9: [2023-05-10 10:10:58,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 9: [2023-05-10 10:10:58,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 26: [2023-05-10 10:10:58,049] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 26: [2023-05-10 10:10:58,049] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 9: [2023-05-10 10:10:58,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 9: [2023-05-10 10:10:58,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 16: [2023-05-10 10:10:58,050] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 26: [2023-05-10 10:10:58,050] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 15: [2023-05-10 10:10:58,050] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 26: [2023-05-10 10:10:58,050] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt... 1: [2023-05-10 10:10:58,050] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 1: [2023-05-10 10:10:58,051] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 1: [2023-05-10 10:10:58,051] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 1: [2023-05-10 10:10:58,051] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 1: [2023-05-10 10:10:58,051] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 15: [2023-05-10 10:10:58,052] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 15: [2023-05-10 10:10:58,053] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 15: [2023-05-10 10:10:58,053] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 8: [2023-05-10 10:10:58,053] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 15: [2023-05-10 10:10:58,054] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 31: [2023-05-10 10:10:58,054] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 31: [2023-05-10 10:10:58,054] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 19: [2023-05-10 10:10:58,057] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 31: [2023-05-10 10:10:58,057] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 31: [2023-05-10 10:10:58,060] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 19: [2023-05-10 10:10:58,060] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 13: [2023-05-10 10:10:58,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 21: [2023-05-10 10:10:58,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 13: [2023-05-10 10:10:58,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 8: [2023-05-10 10:10:58,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 1: [2023-05-10 10:10:58,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 1: [2023-05-10 10:10:58,063] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 20: [2023-05-10 10:10:58,063] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 21: [2023-05-10 10:10:58,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 28: [2023-05-10 10:10:58,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 28: [2023-05-10 10:10:58,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 30: [2023-05-10 10:10:58,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 8: [2023-05-10 10:10:58,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 8: [2023-05-10 10:10:58,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 28: [2023-05-10 10:10:58,066] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 8: [2023-05-10 10:10:58,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 8: [2023-05-10 10:10:58,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 1: [2023-05-10 10:10:58,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 8: [2023-05-10 10:10:58,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 6: [2023-05-10 10:10:58,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 6: [2023-05-10 10:10:58,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 21: [2023-05-10 10:10:58,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 28: [2023-05-10 10:10:58,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 28: [2023-05-10 10:10:58,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 28: [2023-05-10 10:10:58,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 21: [2023-05-10 10:10:58,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 19: [2023-05-10 10:10:58,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 12: [2023-05-10 10:10:58,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 21: [2023-05-10 10:10:58,071] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 21: [2023-05-10 10:10:58,071] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 21: [2023-05-10 10:10:58,071] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 21: [2023-05-10 10:10:58,071] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 19: [2023-05-10 10:10:58,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 19: [2023-05-10 10:10:58,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 16: [2023-05-10 10:10:58,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 19: [2023-05-10 10:10:58,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 28: [2023-05-10 10:10:58,074] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 19: [2023-05-10 10:10:58,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 16: [2023-05-10 10:10:58,075] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 13: [2023-05-10 10:10:58,075] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 9: [2023-05-10 10:10:58,075] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 9: [2023-05-10 10:10:58,075] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 28: [2023-05-10 10:10:58,076] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 29: [2023-05-10 10:10:58,075] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 29: [2023-05-10 10:10:58,075] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 9: [2023-05-10 10:10:58,076] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 16: [2023-05-10 10:10:58,077] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 23: [2023-05-10 10:10:58,077] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 23: [2023-05-10 10:10:58,077] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 23: [2023-05-10 10:10:58,077] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 16: [2023-05-10 10:10:58,077] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 9: [2023-05-10 10:10:58,078] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 29: [2023-05-10 10:10:58,078] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 16: [2023-05-10 10:10:58,078] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 3: [2023-05-10 10:10:58,078] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 3: [2023-05-10 10:10:58,078] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 3: [2023-05-10 10:10:58,078] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 29: [2023-05-10 10:10:58,079] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 29: [2023-05-10 10:10:58,079] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 29: [2023-05-10 10:10:58,079] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 29: [2023-05-10 10:10:58,079] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 1: [2023-05-10 10:10:58,079] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 29: [2023-05-10 10:10:58,079] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 13: [2023-05-10 10:10:58,079] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 13: [2023-05-10 10:10:58,079] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 13: [2023-05-10 10:10:58,079] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 13: [2023-05-10 10:10:58,079] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 13: [2023-05-10 10:10:58,079] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 16: [2023-05-10 10:10:58,079] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 9: [2023-05-10 10:10:58,080] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 29: [2023-05-10 10:10:58,080] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 1: [2023-05-10 10:10:58,080] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 29: [2023-05-10 10:10:58,080] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 1: [2023-05-10 10:10:58,081] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 10: [2023-05-10 10:10:58,081] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 9: [2023-05-10 10:10:58,081] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 9: [2023-05-10 10:10:58,082] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 9: [2023-05-10 10:10:58,082] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 10: [2023-05-10 10:10:58,082] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 10: [2023-05-10 10:10:58,082] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 1: [2023-05-10 10:10:58,083] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 1: [2023-05-10 10:10:58,083] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 0: [2023-05-10 10:10:58,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 0: [2023-05-10 10:10:58,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 0: [2023-05-10 10:10:58,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 20: [2023-05-10 10:10:58,084] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 30: [2023-05-10 10:10:58,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 29: [2023-05-10 10:10:58,085] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 21: [2023-05-10 10:10:58,086] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 29: [2023-05-10 10:10:58,086] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 29: [2023-05-10 10:10:58,087] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 29: [2023-05-10 10:10:58,087] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 6: [2023-05-10 10:10:58,087] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 29: [2023-05-10 10:10:58,088] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 6: [2023-05-10 10:10:58,088] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 29: [2023-05-10 10:10:58,089] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 23: [2023-05-10 10:10:58,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 3: [2023-05-10 10:10:58,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 6: [2023-05-10 10:10:58,089] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 13: [2023-05-10 10:10:58,090] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 13: [2023-05-10 10:10:58,090] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 20: [2023-05-10 10:10:58,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 6: [2023-05-10 10:10:58,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 6: [2023-05-10 10:10:58,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 10: [2023-05-10 10:10:58,091] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 30: [2023-05-10 10:10:58,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 21: [2023-05-10 10:10:58,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 12: [2023-05-10 10:10:58,093] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 23: [2023-05-10 10:10:58,094] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 23: [2023-05-10 10:10:58,094] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 23: [2023-05-10 10:10:58,094] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 23: [2023-05-10 10:10:58,094] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 10: [2023-05-10 10:10:58,094] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 10: [2023-05-10 10:10:58,094] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 10: [2023-05-10 10:10:58,094] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 10: [2023-05-10 10:10:58,094] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 3: [2023-05-10 10:10:58,094] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 3: [2023-05-10 10:10:58,094] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 3: [2023-05-10 10:10:58,094] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 3: [2023-05-10 10:10:58,094] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 26: [2023-05-10 10:10:58,095] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 26: [2023-05-10 10:10:58,095] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 0: [2023-05-10 10:10:58,095] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 0: [2023-05-10 10:10:58,095] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 0: [2023-05-10 10:10:58,095] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 0: [2023-05-10 10:10:58,095] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 0: [2023-05-10 10:10:58,095] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 12: [2023-05-10 10:10:58,098] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 12: [2023-05-10 10:10:58,098] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 28: [2023-05-10 10:10:58,099] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 28: [2023-05-10 10:10:58,099] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 30: [2023-05-10 10:10:58,098] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 30: [2023-05-10 10:10:58,099] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 30: [2023-05-10 10:10:58,099] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 30: [2023-05-10 10:10:58,099] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 30: [2023-05-10 10:10:58,099] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 30: [2023-05-10 10:10:58,099] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 28: [2023-05-10 10:10:58,100] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 23: [2023-05-10 10:10:58,100] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 6: [2023-05-10 10:10:58,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 6: [2023-05-10 10:10:58,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 6: [2023-05-10 10:10:58,101] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 23: [2023-05-10 10:10:58,101] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 20: [2023-05-10 10:10:58,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 28: [2023-05-10 10:10:58,102] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 23: [2023-05-10 10:10:58,102] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 21: [2023-05-10 10:10:58,102] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 3: [2023-05-10 10:10:58,103] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 3: [2023-05-10 10:10:58,103] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 21: [2023-05-10 10:10:58,104] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 3: [2023-05-10 10:10:58,104] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 21: [2023-05-10 10:10:58,105] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 20: [2023-05-10 10:10:58,105] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 12: [2023-05-10 10:10:58,106] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 12: [2023-05-10 10:10:58,106] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 12: [2023-05-10 10:10:58,106] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 12: [2023-05-10 10:10:58,106] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 12: [2023-05-10 10:10:58,106] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 21: [2023-05-10 10:10:58,106] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 28: [2023-05-10 10:10:58,106] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 21: [2023-05-10 10:10:58,107] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 10: [2023-05-10 10:10:58,107] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 10: [2023-05-10 10:10:58,107] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 10: [2023-05-10 10:10:58,110] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 0: [2023-05-10 10:10:58,111] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 0: [2023-05-10 10:10:58,111] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 0: [2023-05-10 10:10:58,111] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 30: [2023-05-10 10:10:58,112] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 13: [2023-05-10 10:10:58,113] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 13: [2023-05-10 10:10:58,113] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 13: [2023-05-10 10:10:58,114] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 13: [2023-05-10 10:10:58,114] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 13: [2023-05-10 10:10:58,114] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 6: [2023-05-10 10:10:58,114] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 20: [2023-05-10 10:10:58,116] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 26: [2023-05-10 10:10:58,117] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 26: [2023-05-10 10:10:58,117] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 26: [2023-05-10 10:10:58,117] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 23: [2023-05-10 10:10:58,117] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 26: [2023-05-10 10:10:58,119] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 10: [2023-05-10 10:10:58,119] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 3: [2023-05-10 10:10:58,119] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 20: [2023-05-10 10:10:58,119] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 20: [2023-05-10 10:10:58,119] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 26: [2023-05-10 10:10:58,119] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 20: [2023-05-10 10:10:58,120] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 20: [2023-05-10 10:10:58,120] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 26: [2023-05-10 10:10:58,120] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 26: [2023-05-10 10:10:58,120] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 26: [2023-05-10 10:10:58,120] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_25-model_00-model_states.pt. 6: [2023-05-10 10:10:58,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 6: [2023-05-10 10:10:58,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 0: [2023-05-10 10:10:58,122] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 23: [2023-05-10 10:10:58,122] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 23: [2023-05-10 10:10:58,122] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 0: [2023-05-10 10:10:58,124] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 0: [2023-05-10 10:10:58,124] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 23: [2023-05-10 10:10:58,124] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 23: [2023-05-10 10:10:58,124] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 3: [2023-05-10 10:10:58,125] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 0: [2023-05-10 10:10:58,126] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 0: [2023-05-10 10:10:58,126] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 3: [2023-05-10 10:10:58,126] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 3: [2023-05-10 10:10:58,126] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 10: [2023-05-10 10:10:58,126] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 3: [2023-05-10 10:10:58,127] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 12: [2023-05-10 10:10:58,128] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 10: [2023-05-10 10:10:58,129] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 12: [2023-05-10 10:10:58,129] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 10: [2023-05-10 10:10:58,129] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 10: [2023-05-10 10:10:58,129] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 22: [2023-05-10 10:10:58,129] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 22: [2023-05-10 10:10:58,129] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 22: [2023-05-10 10:10:58,129] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 22: [2023-05-10 10:10:58,129] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 22: [2023-05-10 10:10:58,129] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 22: [2023-05-10 10:10:58,129] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 22: [2023-05-10 10:10:58,129] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 22: [2023-05-10 10:10:58,130] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 6: [2023-05-10 10:10:58,130] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 20: [2023-05-10 10:10:58,132] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 29: [2023-05-10 10:10:58,132] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 29: [2023-05-10 10:10:58,132] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 22: [2023-05-10 10:10:58,133] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 22: [2023-05-10 10:10:58,133] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 6: [2023-05-10 10:10:58,133] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 6: [2023-05-10 10:10:58,134] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 20: [2023-05-10 10:10:58,135] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 30: [2023-05-10 10:10:58,136] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 30: [2023-05-10 10:10:58,136] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 30: [2023-05-10 10:10:58,136] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 30: [2023-05-10 10:10:58,139] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 30: [2023-05-10 10:10:58,139] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 30: [2023-05-10 10:10:58,139] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 22: [2023-05-10 10:10:58,139] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 22: [2023-05-10 10:10:58,140] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 12: [2023-05-10 10:10:58,140] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 22: [2023-05-10 10:10:58,140] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 22: [2023-05-10 10:10:58,141] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 22: [2023-05-10 10:10:58,141] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 22: [2023-05-10 10:10:58,142] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 12: [2023-05-10 10:10:58,142] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 12: [2023-05-10 10:10:58,143] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 12: [2023-05-10 10:10:58,143] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 12: [2023-05-10 10:10:58,144] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 26: [2023-05-10 10:10:58,151] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 20: [2023-05-10 10:10:58,150] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 20: [2023-05-10 10:10:58,152] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 26: [2023-05-10 10:10:58,152] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 26: [2023-05-10 10:10:58,152] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 20: [2023-05-10 10:10:58,153] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 20: [2023-05-10 10:10:58,153] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 29: [2023-05-10 10:10:58,153] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 29: [2023-05-10 10:10:58,154] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 29: [2023-05-10 10:10:58,154] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 26: [2023-05-10 10:10:58,154] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 29: [2023-05-10 10:10:58,155] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 26: [2023-05-10 10:10:58,155] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 29: [2023-05-10 10:10:58,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 26: [2023-05-10 10:10:58,158] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 29: [2023-05-10 10:10:58,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 29: [2023-05-10 10:10:58,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 29: [2023-05-10 10:10:58,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 22: [2023-05-10 10:10:58,179] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 29: [2023-05-10 10:10:58,180] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 22: [2023-05-10 10:10:58,180] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 29: [2023-05-10 10:10:58,182] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 29: [2023-05-10 10:10:58,189] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 29: [2023-05-10 10:10:58,205] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 29: [2023-05-10 10:10:58,205] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 29: [2023-05-10 10:10:58,205] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 22: [2023-05-10 10:10:58,206] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 22: [2023-05-10 10:10:58,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 22: [2023-05-10 10:10:58,210] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 22: [2023-05-10 10:10:58,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 22: [2023-05-10 10:10:58,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 22: [2023-05-10 10:10:58,217] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 22: [2023-05-10 10:10:58,217] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 22: [2023-05-10 10:10:58,217] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 22: [2023-05-10 10:10:58,236] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 14: [2023-05-10 10:10:58,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 14: [2023-05-10 10:10:58,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 14: [2023-05-10 10:10:58,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 14: [2023-05-10 10:10:58,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 14: [2023-05-10 10:10:58,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 14: [2023-05-10 10:10:58,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 14: [2023-05-10 10:10:58,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 14: [2023-05-10 10:10:58,242] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 22: [2023-05-10 10:10:58,243] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 22: [2023-05-10 10:10:58,244] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 14: [2023-05-10 10:10:58,246] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 22: [2023-05-10 10:10:58,247] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 14: [2023-05-10 10:10:58,247] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 14: [2023-05-10 10:10:58,249] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 22: [2023-05-10 10:10:58,250] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 22: [2023-05-10 10:10:58,250] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 14: [2023-05-10 10:10:58,250] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 14: [2023-05-10 10:10:58,251] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 14: [2023-05-10 10:10:58,252] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 14: [2023-05-10 10:10:58,253] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 14: [2023-05-10 10:10:58,253] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 7: [2023-05-10 10:10:58,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 7: [2023-05-10 10:10:58,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 7: [2023-05-10 10:10:58,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 7: [2023-05-10 10:10:58,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 7: [2023-05-10 10:10:58,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 7: [2023-05-10 10:10:58,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 7: [2023-05-10 10:10:58,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 7: [2023-05-10 10:10:58,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 7: [2023-05-10 10:10:58,266] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 7: [2023-05-10 10:10:58,266] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 7: [2023-05-10 10:10:58,266] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 17: [2023-05-10 10:10:58,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 4: [2023-05-10 10:10:58,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 4: [2023-05-10 10:10:58,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 4: [2023-05-10 10:10:58,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 4: [2023-05-10 10:10:58,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 4: [2023-05-10 10:10:58,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 4: [2023-05-10 10:10:58,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 4: [2023-05-10 10:10:58,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 4: [2023-05-10 10:10:58,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 7: [2023-05-10 10:10:58,267] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 17: [2023-05-10 10:10:58,267] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 17: [2023-05-10 10:10:58,267] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 7: [2023-05-10 10:10:58,267] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 17: [2023-05-10 10:10:58,267] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 17: [2023-05-10 10:10:58,267] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 17: [2023-05-10 10:10:58,267] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 17: [2023-05-10 10:10:58,267] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 17: [2023-05-10 10:10:58,267] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 7: [2023-05-10 10:10:58,268] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 17: [2023-05-10 10:10:58,269] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 7: [2023-05-10 10:10:58,269] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 7: [2023-05-10 10:10:58,270] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 4: [2023-05-10 10:10:58,271] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 4: [2023-05-10 10:10:58,272] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 4: [2023-05-10 10:10:58,272] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 4: [2023-05-10 10:10:58,272] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 4: [2023-05-10 10:10:58,273] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 4: [2023-05-10 10:10:58,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 4: [2023-05-10 10:10:58,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 4: [2023-05-10 10:10:58,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 17: [2023-05-10 10:10:58,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 17: [2023-05-10 10:10:58,279] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 17: [2023-05-10 10:10:58,279] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 17: [2023-05-10 10:10:58,279] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 17: [2023-05-10 10:10:58,280] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 17: [2023-05-10 10:10:58,280] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 17: [2023-05-10 10:10:58,282] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 5: [2023-05-10 10:10:58,287] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 5: [2023-05-10 10:10:58,287] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 5: [2023-05-10 10:10:58,288] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 5: [2023-05-10 10:10:58,289] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 5: [2023-05-10 10:10:58,289] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 5: [2023-05-10 10:10:58,289] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 5: [2023-05-10 10:10:58,289] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 5: [2023-05-10 10:10:58,289] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 5: [2023-05-10 10:10:58,293] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 5: [2023-05-10 10:10:58,293] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 5: [2023-05-10 10:10:58,295] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 5: [2023-05-10 10:10:58,298] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 5: [2023-05-10 10:10:58,299] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 5: [2023-05-10 10:10:58,299] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 5: [2023-05-10 10:10:58,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 5: [2023-05-10 10:10:58,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 14: [2023-05-10 10:10:58,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 14: [2023-05-10 10:10:58,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 18: [2023-05-10 10:10:58,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 18: [2023-05-10 10:10:58,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 18: [2023-05-10 10:10:58,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 18: [2023-05-10 10:10:58,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 18: [2023-05-10 10:10:58,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 18: [2023-05-10 10:10:58,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 18: [2023-05-10 10:10:58,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 18: [2023-05-10 10:10:58,302] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 14: [2023-05-10 10:10:58,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 18: [2023-05-10 10:10:58,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 18: [2023-05-10 10:10:58,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 11: [2023-05-10 10:10:58,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 11: [2023-05-10 10:10:58,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 11: [2023-05-10 10:10:58,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 11: [2023-05-10 10:10:58,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 11: [2023-05-10 10:10:58,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 18: [2023-05-10 10:10:58,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 11: [2023-05-10 10:10:58,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 11: [2023-05-10 10:10:58,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 11: [2023-05-10 10:10:58,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 18: [2023-05-10 10:10:58,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 18: [2023-05-10 10:10:58,309] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 18: [2023-05-10 10:10:58,309] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 18: [2023-05-10 10:10:58,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 17: [2023-05-10 10:10:58,310] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 18: [2023-05-10 10:10:58,311] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 11: [2023-05-10 10:10:58,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 11: [2023-05-10 10:10:58,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 14: [2023-05-10 10:10:58,313] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 7: [2023-05-10 10:10:58,315] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 14: [2023-05-10 10:10:58,316] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 11: [2023-05-10 10:10:58,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 11: [2023-05-10 10:10:58,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 11: [2023-05-10 10:10:58,318] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 14: [2023-05-10 10:10:58,318] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 14: [2023-05-10 10:10:58,318] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 14: [2023-05-10 10:10:58,318] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 11: [2023-05-10 10:10:58,318] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 11: [2023-05-10 10:10:58,318] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 11: [2023-05-10 10:10:58,319] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 7: [2023-05-10 10:10:58,321] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 14: [2023-05-10 10:10:58,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 14: [2023-05-10 10:10:58,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 17: [2023-05-10 10:10:58,330] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 4: [2023-05-10 10:10:58,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 4: [2023-05-10 10:10:58,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 4: [2023-05-10 10:10:58,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 4: [2023-05-10 10:10:58,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 7: [2023-05-10 10:10:58,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 7: [2023-05-10 10:10:58,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 7: [2023-05-10 10:10:58,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 15: [2023-05-10 10:10:58,332] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 15: [2023-05-10 10:10:58,332] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 15: [2023-05-10 10:10:58,333] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 27: [2023-05-10 10:10:58,333] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 15: [2023-05-10 10:10:58,333] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 15: [2023-05-10 10:10:58,333] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 15: [2023-05-10 10:10:58,333] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 27: [2023-05-10 10:10:58,333] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 27: [2023-05-10 10:10:58,333] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 27: [2023-05-10 10:10:58,333] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 15: [2023-05-10 10:10:58,333] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 27: [2023-05-10 10:10:58,333] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 27: [2023-05-10 10:10:58,333] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 27: [2023-05-10 10:10:58,333] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 15: [2023-05-10 10:10:58,333] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 27: [2023-05-10 10:10:58,333] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 7: [2023-05-10 10:10:58,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 4: [2023-05-10 10:10:58,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 4: [2023-05-10 10:10:58,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 4: [2023-05-10 10:10:58,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 4: [2023-05-10 10:10:58,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 7: [2023-05-10 10:10:58,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 7: [2023-05-10 10:10:58,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 14: [2023-05-10 10:10:58,335] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 31: [2023-05-10 10:10:58,335] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 31: [2023-05-10 10:10:58,336] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 31: [2023-05-10 10:10:58,336] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 31: [2023-05-10 10:10:58,336] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 31: [2023-05-10 10:10:58,336] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 31: [2023-05-10 10:10:58,336] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 31: [2023-05-10 10:10:58,336] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 17: [2023-05-10 10:10:58,336] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 15: [2023-05-10 10:10:58,336] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 31: [2023-05-10 10:10:58,337] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 27: [2023-05-10 10:10:58,337] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 27: [2023-05-10 10:10:58,337] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 27: [2023-05-10 10:10:58,339] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 7: [2023-05-10 10:10:58,340] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 31: [2023-05-10 10:10:58,340] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 31: [2023-05-10 10:10:58,340] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 27: [2023-05-10 10:10:58,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 27: [2023-05-10 10:10:58,342] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 27: [2023-05-10 10:10:58,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 27: [2023-05-10 10:10:58,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 15: [2023-05-10 10:10:58,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 27: [2023-05-10 10:10:58,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 15: [2023-05-10 10:10:58,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 15: [2023-05-10 10:10:58,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 31: [2023-05-10 10:10:58,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 7: [2023-05-10 10:10:58,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 31: [2023-05-10 10:10:58,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 14: [2023-05-10 10:10:58,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 15: [2023-05-10 10:10:58,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 31: [2023-05-10 10:10:58,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 15: [2023-05-10 10:10:58,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 31: [2023-05-10 10:10:58,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 15: [2023-05-10 10:10:58,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 31: [2023-05-10 10:10:58,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 14: [2023-05-10 10:10:58,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 31: [2023-05-10 10:10:58,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 15: [2023-05-10 10:10:58,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 5: [2023-05-10 10:10:58,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 5: [2023-05-10 10:10:58,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 14: [2023-05-10 10:10:58,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 14: [2023-05-10 10:10:58,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 5: [2023-05-10 10:10:58,348] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 14: [2023-05-10 10:10:58,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 4: [2023-05-10 10:10:58,353] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 4: [2023-05-10 10:10:58,354] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 7: [2023-05-10 10:10:58,356] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 5: [2023-05-10 10:10:58,357] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 4: [2023-05-10 10:10:58,357] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 4: [2023-05-10 10:10:58,357] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 7: [2023-05-10 10:10:58,358] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 7: [2023-05-10 10:10:58,358] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 18: [2023-05-10 10:10:58,358] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 17: [2023-05-10 10:10:58,358] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 8: [2023-05-10 10:10:58,359] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 8: [2023-05-10 10:10:58,359] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 8: [2023-05-10 10:10:58,359] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 8: [2023-05-10 10:10:58,359] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 8: [2023-05-10 10:10:58,359] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 8: [2023-05-10 10:10:58,359] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 8: [2023-05-10 10:10:58,359] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 8: [2023-05-10 10:10:58,360] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 4: [2023-05-10 10:10:58,360] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 7: [2023-05-10 10:10:58,361] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 7: [2023-05-10 10:10:58,361] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 17: [2023-05-10 10:10:58,363] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 18: [2023-05-10 10:10:58,362] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 11: [2023-05-10 10:10:58,363] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 7: [2023-05-10 10:10:58,364] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 11: [2023-05-10 10:10:58,364] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 4: [2023-05-10 10:10:58,364] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 4: [2023-05-10 10:10:58,365] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 4: [2023-05-10 10:10:58,365] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 8: [2023-05-10 10:10:58,366] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 8: [2023-05-10 10:10:58,366] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 8: [2023-05-10 10:10:58,366] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 8: [2023-05-10 10:10:58,366] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 8: [2023-05-10 10:10:58,366] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 8: [2023-05-10 10:10:58,366] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 17: [2023-05-10 10:10:58,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 17: [2023-05-10 10:10:58,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 17: [2023-05-10 10:10:58,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 17: [2023-05-10 10:10:58,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 17: [2023-05-10 10:10:58,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 8: [2023-05-10 10:10:58,366] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 8: [2023-05-10 10:10:58,367] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 5: [2023-05-10 10:10:58,367] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 5: [2023-05-10 10:10:58,367] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 18: [2023-05-10 10:10:58,368] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 18: [2023-05-10 10:10:58,368] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 18: [2023-05-10 10:10:58,368] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 18: [2023-05-10 10:10:58,369] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 5: [2023-05-10 10:10:58,369] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 5: [2023-05-10 10:10:58,369] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 5: [2023-05-10 10:10:58,373] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 5: [2023-05-10 10:10:58,373] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 5: [2023-05-10 10:10:58,373] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 24: [2023-05-10 10:10:58,374] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 24: [2023-05-10 10:10:58,374] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 24: [2023-05-10 10:10:58,375] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 24: [2023-05-10 10:10:58,375] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 24: [2023-05-10 10:10:58,375] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 24: [2023-05-10 10:10:58,375] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 24: [2023-05-10 10:10:58,375] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 24: [2023-05-10 10:10:58,375] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 18: [2023-05-10 10:10:58,377] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 18: [2023-05-10 10:10:58,377] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 24: [2023-05-10 10:10:58,378] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 24: [2023-05-10 10:10:58,379] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 5: [2023-05-10 10:10:58,380] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 18: [2023-05-10 10:10:58,380] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 11: [2023-05-10 10:10:58,380] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 27: [2023-05-10 10:10:58,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 24: [2023-05-10 10:10:58,382] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 24: [2023-05-10 10:10:58,382] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 24: [2023-05-10 10:10:58,383] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 24: [2023-05-10 10:10:58,383] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 24: [2023-05-10 10:10:58,383] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 24: [2023-05-10 10:10:58,384] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 9: [2023-05-10 10:10:58,384] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 9: [2023-05-10 10:10:58,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 9: [2023-05-10 10:10:58,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 9: [2023-05-10 10:10:58,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 9: [2023-05-10 10:10:58,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 9: [2023-05-10 10:10:58,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 9: [2023-05-10 10:10:58,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 16: [2023-05-10 10:10:58,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 9: [2023-05-10 10:10:58,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 16: [2023-05-10 10:10:58,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 25: [2023-05-10 10:10:58,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 25: [2023-05-10 10:10:58,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 25: [2023-05-10 10:10:58,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 25: [2023-05-10 10:10:58,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 25: [2023-05-10 10:10:58,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 25: [2023-05-10 10:10:58,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 25: [2023-05-10 10:10:58,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 16: [2023-05-10 10:10:58,386] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 16: [2023-05-10 10:10:58,386] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 16: [2023-05-10 10:10:58,386] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 25: [2023-05-10 10:10:58,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 16: [2023-05-10 10:10:58,386] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 16: [2023-05-10 10:10:58,386] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 15: [2023-05-10 10:10:58,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 11: [2023-05-10 10:10:58,386] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 11: [2023-05-10 10:10:58,386] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 31: [2023-05-10 10:10:58,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 16: [2023-05-10 10:10:58,386] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 17: [2023-05-10 10:10:58,387] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 25: [2023-05-10 10:10:58,389] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 27: [2023-05-10 10:10:58,390] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 11: [2023-05-10 10:10:58,391] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 11: [2023-05-10 10:10:58,391] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 11: [2023-05-10 10:10:58,391] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 25: [2023-05-10 10:10:58,391] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 9: [2023-05-10 10:10:58,391] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 25: [2023-05-10 10:10:58,391] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 9: [2023-05-10 10:10:58,391] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 16: [2023-05-10 10:10:58,391] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 25: [2023-05-10 10:10:58,392] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 25: [2023-05-10 10:10:58,392] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 16: [2023-05-10 10:10:58,392] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 11: [2023-05-10 10:10:58,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 25: [2023-05-10 10:10:58,392] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 25: [2023-05-10 10:10:58,392] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 9: [2023-05-10 10:10:58,392] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 25: [2023-05-10 10:10:58,392] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 9: [2023-05-10 10:10:58,393] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 9: [2023-05-10 10:10:58,393] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 9: [2023-05-10 10:10:58,393] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 18: [2023-05-10 10:10:58,393] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 11: [2023-05-10 10:10:58,393] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 16: [2023-05-10 10:10:58,394] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 9: [2023-05-10 10:10:58,394] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 9: [2023-05-10 10:10:58,394] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 16: [2023-05-10 10:10:58,394] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 31: [2023-05-10 10:10:58,394] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 16: [2023-05-10 10:10:58,395] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 16: [2023-05-10 10:10:58,395] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 16: [2023-05-10 10:10:58,395] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 5: [2023-05-10 10:10:58,396] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 27: [2023-05-10 10:10:58,396] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 18: [2023-05-10 10:10:58,396] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 5: [2023-05-10 10:10:58,396] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 16: [2023-05-10 10:10:58,397] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 5: [2023-05-10 10:10:58,399] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 5: [2023-05-10 10:10:58,399] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 18: [2023-05-10 10:10:58,400] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 18: [2023-05-10 10:10:58,400] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 18: [2023-05-10 10:10:58,400] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 17: [2023-05-10 10:10:58,401] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 17: [2023-05-10 10:10:58,402] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 27: [2023-05-10 10:10:58,403] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 11: [2023-05-10 10:10:58,404] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 17: [2023-05-10 10:10:58,404] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 17: [2023-05-10 10:10:58,405] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 17: [2023-05-10 10:10:58,405] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 31: [2023-05-10 10:10:58,406] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 20: [2023-05-10 10:10:58,406] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 20: [2023-05-10 10:10:58,406] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 20: [2023-05-10 10:10:58,406] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 20: [2023-05-10 10:10:58,407] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 20: [2023-05-10 10:10:58,407] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 20: [2023-05-10 10:10:58,407] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 20: [2023-05-10 10:10:58,407] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 20: [2023-05-10 10:10:58,407] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 2: [2023-05-10 10:10:58,407] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 2: [2023-05-10 10:10:58,407] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 15: [2023-05-10 10:10:58,408] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 18: [2023-05-10 10:10:58,408] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 18: [2023-05-10 10:10:58,408] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 12: [2023-05-10 10:10:58,409] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 12: [2023-05-10 10:10:58,409] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 12: [2023-05-10 10:10:58,409] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 12: [2023-05-10 10:10:58,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 12: [2023-05-10 10:10:58,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 12: [2023-05-10 10:10:58,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 12: [2023-05-10 10:10:58,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 2: [2023-05-10 10:10:58,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 2: [2023-05-10 10:10:58,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 2: [2023-05-10 10:10:58,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 2: [2023-05-10 10:10:58,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 2: [2023-05-10 10:10:58,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 12: [2023-05-10 10:10:58,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 2: [2023-05-10 10:10:58,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 20: [2023-05-10 10:10:58,410] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 31: [2023-05-10 10:10:58,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 2: [2023-05-10 10:10:58,411] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 2: [2023-05-10 10:10:58,412] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 27: [2023-05-10 10:10:58,412] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 27: [2023-05-10 10:10:58,412] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 27: [2023-05-10 10:10:58,412] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 27: [2023-05-10 10:10:58,412] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 27: [2023-05-10 10:10:58,412] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 27: [2023-05-10 10:10:58,412] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 12: [2023-05-10 10:10:58,414] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 31: [2023-05-10 10:10:58,415] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 20: [2023-05-10 10:10:58,416] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 20: [2023-05-10 10:10:58,416] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 20: [2023-05-10 10:10:58,417] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 31: [2023-05-10 10:10:58,417] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 20: [2023-05-10 10:10:58,418] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 20: [2023-05-10 10:10:58,418] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 15: [2023-05-10 10:10:58,418] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 15: [2023-05-10 10:10:58,418] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 2: [2023-05-10 10:10:58,418] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 2: [2023-05-10 10:10:58,419] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 20: [2023-05-10 10:10:58,419] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 2: [2023-05-10 10:10:58,419] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 12: [2023-05-10 10:10:58,419] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 20: [2023-05-10 10:10:58,420] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 2: [2023-05-10 10:10:58,420] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 12: [2023-05-10 10:10:58,420] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 2: [2023-05-10 10:10:58,420] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 8: [2023-05-10 10:10:58,420] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 8: [2023-05-10 10:10:58,421] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 15: [2023-05-10 10:10:58,421] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 15: [2023-05-10 10:10:58,421] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 15: [2023-05-10 10:10:58,421] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 27: [2023-05-10 10:10:58,421] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 15: [2023-05-10 10:10:58,421] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 15: [2023-05-10 10:10:58,421] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 12: [2023-05-10 10:10:58,422] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 8: [2023-05-10 10:10:58,422] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 6: [2023-05-10 10:10:58,422] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 31: [2023-05-10 10:10:58,422] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 31: [2023-05-10 10:10:58,422] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 31: [2023-05-10 10:10:58,422] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 2: [2023-05-10 10:10:58,422] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 12: [2023-05-10 10:10:58,422] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 31: [2023-05-10 10:10:58,422] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 6: [2023-05-10 10:10:58,422] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 12: [2023-05-10 10:10:58,423] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 11: [2023-05-10 10:10:58,423] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 12: [2023-05-10 10:10:58,424] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 11: [2023-05-10 10:10:58,424] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 11: [2023-05-10 10:10:58,424] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 12: [2023-05-10 10:10:58,425] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 11: [2023-05-10 10:10:58,425] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 1: [2023-05-10 10:10:58,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 1: [2023-05-10 10:10:58,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 1: [2023-05-10 10:10:58,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 1: [2023-05-10 10:10:58,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 1: [2023-05-10 10:10:58,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 1: [2023-05-10 10:10:58,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 13: [2023-05-10 10:10:58,425] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 6: [2023-05-10 10:10:58,425] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 6: [2023-05-10 10:10:58,425] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 1: [2023-05-10 10:10:58,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 13: [2023-05-10 10:10:58,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 13: [2023-05-10 10:10:58,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 6: [2023-05-10 10:10:58,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 6: [2023-05-10 10:10:58,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 6: [2023-05-10 10:10:58,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 6: [2023-05-10 10:10:58,426] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 6: [2023-05-10 10:10:58,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 1: [2023-05-10 10:10:58,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 13: [2023-05-10 10:10:58,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 13: [2023-05-10 10:10:58,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 13: [2023-05-10 10:10:58,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 13: [2023-05-10 10:10:58,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 13: [2023-05-10 10:10:58,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 11: [2023-05-10 10:10:58,429] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 6: [2023-05-10 10:10:58,429] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 1: [2023-05-10 10:10:58,430] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 24: [2023-05-10 10:10:58,431] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 24: [2023-05-10 10:10:58,431] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 6: [2023-05-10 10:10:58,431] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 21: [2023-05-10 10:10:58,432] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 21: [2023-05-10 10:10:58,432] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 21: [2023-05-10 10:10:58,432] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 21: [2023-05-10 10:10:58,432] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 21: [2023-05-10 10:10:58,432] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 21: [2023-05-10 10:10:58,432] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 1: [2023-05-10 10:10:58,432] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 21: [2023-05-10 10:10:58,432] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 21: [2023-05-10 10:10:58,432] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 1: [2023-05-10 10:10:58,432] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 1: [2023-05-10 10:10:58,433] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 1: [2023-05-10 10:10:58,433] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 6: [2023-05-10 10:10:58,434] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 8: [2023-05-10 10:10:58,434] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 8: [2023-05-10 10:10:58,434] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 8: [2023-05-10 10:10:58,434] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 8: [2023-05-10 10:10:58,434] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 1: [2023-05-10 10:10:58,434] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 28: [2023-05-10 10:10:58,434] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 28: [2023-05-10 10:10:58,434] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 1: [2023-05-10 10:10:58,435] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 8: [2023-05-10 10:10:58,435] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 1: [2023-05-10 10:10:58,436] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 6: [2023-05-10 10:10:58,436] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 6: [2023-05-10 10:10:58,436] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 31: [2023-05-10 10:10:58,437] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 6: [2023-05-10 10:10:58,437] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 6: [2023-05-10 10:10:58,438] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 28: [2023-05-10 10:10:58,438] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 28: [2023-05-10 10:10:58,438] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 28: [2023-05-10 10:10:58,438] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 28: [2023-05-10 10:10:58,438] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 28: [2023-05-10 10:10:58,438] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 21: [2023-05-10 10:10:58,438] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 21: [2023-05-10 10:10:58,438] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 28: [2023-05-10 10:10:58,438] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 13: [2023-05-10 10:10:58,438] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 13: [2023-05-10 10:10:58,439] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 21: [2023-05-10 10:10:58,439] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 28: [2023-05-10 10:10:58,439] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 13: [2023-05-10 10:10:58,439] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 21: [2023-05-10 10:10:58,439] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 28: [2023-05-10 10:10:58,439] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 25: [2023-05-10 10:10:58,439] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 21: [2023-05-10 10:10:58,439] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 13: [2023-05-10 10:10:58,440] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 13: [2023-05-10 10:10:58,440] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 21: [2023-05-10 10:10:58,440] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 13: [2023-05-10 10:10:58,440] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 21: [2023-05-10 10:10:58,441] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 31: [2023-05-10 10:10:58,441] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 21: [2023-05-10 10:10:58,441] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 9: [2023-05-10 10:10:58,441] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 13: [2023-05-10 10:10:58,441] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 13: [2023-05-10 10:10:58,441] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 27: [2023-05-10 10:10:58,443] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 27: [2023-05-10 10:10:58,444] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 25: [2023-05-10 10:10:58,444] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 29: [2023-05-10 10:10:58,444] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 29: [2023-05-10 10:10:58,444] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 27: [2023-05-10 10:10:58,444] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 9: [2023-05-10 10:10:58,445] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 16: [2023-05-10 10:10:58,445] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 27: [2023-05-10 10:10:58,446] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 16: [2023-05-10 10:10:58,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 19: [2023-05-10 10:10:58,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 19: [2023-05-10 10:10:58,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 19: [2023-05-10 10:10:58,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 19: [2023-05-10 10:10:58,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 19: [2023-05-10 10:10:58,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 19: [2023-05-10 10:10:58,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 19: [2023-05-10 10:10:58,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 28: [2023-05-10 10:10:58,447] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 19: [2023-05-10 10:10:58,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 29: [2023-05-10 10:10:58,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 27: [2023-05-10 10:10:58,447] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 29: [2023-05-10 10:10:58,448] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 29: [2023-05-10 10:10:58,448] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 29: [2023-05-10 10:10:58,448] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 29: [2023-05-10 10:10:58,448] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 29: [2023-05-10 10:10:58,448] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 9: [2023-05-10 10:10:58,448] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 29: [2023-05-10 10:10:58,449] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 8: [2023-05-10 10:10:58,449] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 8: [2023-05-10 10:10:58,449] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 28: [2023-05-10 10:10:58,448] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 28: [2023-05-10 10:10:58,449] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 29: [2023-05-10 10:10:58,449] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 28: [2023-05-10 10:10:58,449] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 28: [2023-05-10 10:10:58,449] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 28: [2023-05-10 10:10:58,450] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 25: [2023-05-10 10:10:58,450] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 30: [2023-05-10 10:10:58,450] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 30: [2023-05-10 10:10:58,450] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 30: [2023-05-10 10:10:58,451] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 30: [2023-05-10 10:10:58,451] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 30: [2023-05-10 10:10:58,451] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 30: [2023-05-10 10:10:58,451] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 30: [2023-05-10 10:10:58,451] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 31: [2023-05-10 10:10:58,451] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 30: [2023-05-10 10:10:58,451] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 8: [2023-05-10 10:10:58,451] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 19: [2023-05-10 10:10:58,452] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 31: [2023-05-10 10:10:58,452] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 25: [2023-05-10 10:10:58,452] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 25: [2023-05-10 10:10:58,452] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 25: [2023-05-10 10:10:58,452] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 31: [2023-05-10 10:10:58,452] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 19: [2023-05-10 10:10:58,453] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 19: [2023-05-10 10:10:58,453] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 26: [2023-05-10 10:10:58,453] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 26: [2023-05-10 10:10:58,453] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 31: [2023-05-10 10:10:58,453] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 15: [2023-05-10 10:10:58,454] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 29: [2023-05-10 10:10:58,455] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 24: [2023-05-10 10:10:58,455] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 24: [2023-05-10 10:10:58,455] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 24: [2023-05-10 10:10:58,455] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 24: [2023-05-10 10:10:58,455] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 24: [2023-05-10 10:10:58,455] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 24: [2023-05-10 10:10:58,455] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 26: [2023-05-10 10:10:58,455] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 26: [2023-05-10 10:10:58,455] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 26: [2023-05-10 10:10:58,455] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 19: [2023-05-10 10:10:58,455] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 19: [2023-05-10 10:10:58,455] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 26: [2023-05-10 10:10:58,455] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 26: [2023-05-10 10:10:58,455] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 26: [2023-05-10 10:10:58,456] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 9: [2023-05-10 10:10:58,456] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 24: [2023-05-10 10:10:58,456] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 24: [2023-05-10 10:10:58,456] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 15: [2023-05-10 10:10:58,456] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 29: [2023-05-10 10:10:58,456] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 26: [2023-05-10 10:10:58,457] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 26: [2023-05-10 10:10:58,457] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 19: [2023-05-10 10:10:58,456] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 19: [2023-05-10 10:10:58,457] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 29: [2023-05-10 10:10:58,458] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 9: [2023-05-10 10:10:58,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 19: [2023-05-10 10:10:58,458] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 29: [2023-05-10 10:10:58,458] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 9: [2023-05-10 10:10:58,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 29: [2023-05-10 10:10:58,458] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 29: [2023-05-10 10:10:58,459] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 15: [2023-05-10 10:10:58,459] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 25: [2023-05-10 10:10:58,459] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 25: [2023-05-10 10:10:58,459] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 15: [2023-05-10 10:10:58,459] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 15: [2023-05-10 10:10:58,460] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 15: [2023-05-10 10:10:58,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 15: [2023-05-10 10:10:58,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 20: [2023-05-10 10:10:58,460] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 30: [2023-05-10 10:10:58,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 30: [2023-05-10 10:10:58,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 25: [2023-05-10 10:10:58,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 9: [2023-05-10 10:10:58,462] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 9: [2023-05-10 10:10:58,462] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 2: [2023-05-10 10:10:58,462] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 2: [2023-05-10 10:10:58,462] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 9: [2023-05-10 10:10:58,464] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 30: [2023-05-10 10:10:58,463] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 30: [2023-05-10 10:10:58,464] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 30: [2023-05-10 10:10:58,465] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 26: [2023-05-10 10:10:58,465] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 30: [2023-05-10 10:10:58,465] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 8: [2023-05-10 10:10:58,466] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 26: [2023-05-10 10:10:58,466] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 26: [2023-05-10 10:10:58,466] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 16: [2023-05-10 10:10:58,467] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 26: [2023-05-10 10:10:58,467] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 12: [2023-05-10 10:10:58,466] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 26: [2023-05-10 10:10:58,467] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 26: [2023-05-10 10:10:58,467] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 8: [2023-05-10 10:10:58,467] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 8: [2023-05-10 10:10:58,468] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 16: [2023-05-10 10:10:58,468] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 16: [2023-05-10 10:10:58,468] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 16: [2023-05-10 10:10:58,468] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 16: [2023-05-10 10:10:58,468] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 16: [2023-05-10 10:10:58,468] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 16: [2023-05-10 10:10:58,468] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 30: [2023-05-10 10:10:58,467] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 30: [2023-05-10 10:10:58,468] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 8: [2023-05-10 10:10:58,469] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 8: [2023-05-10 10:10:58,469] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 9: [2023-05-10 10:10:58,470] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 25: [2023-05-10 10:10:58,471] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 3: [2023-05-10 10:10:58,471] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 3: [2023-05-10 10:10:58,471] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 3: [2023-05-10 10:10:58,471] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 3: [2023-05-10 10:10:58,471] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 3: [2023-05-10 10:10:58,471] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 3: [2023-05-10 10:10:58,471] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 3: [2023-05-10 10:10:58,471] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 16: [2023-05-10 10:10:58,471] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 3: [2023-05-10 10:10:58,472] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 9: [2023-05-10 10:10:58,475] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 3: [2023-05-10 10:10:58,476] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 3: [2023-05-10 10:10:58,476] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 3: [2023-05-10 10:10:58,476] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 25: [2023-05-10 10:10:58,477] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 6: [2023-05-10 10:10:58,477] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 6: [2023-05-10 10:10:58,478] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 3: [2023-05-10 10:10:58,479] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 3: [2023-05-10 10:10:58,479] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 20: [2023-05-10 10:10:58,479] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 3: [2023-05-10 10:10:58,481] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 3: [2023-05-10 10:10:58,481] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 3: [2023-05-10 10:10:58,481] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 2: [2023-05-10 10:10:58,481] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 2: [2023-05-10 10:10:58,481] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 25: [2023-05-10 10:10:58,482] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 25: [2023-05-10 10:10:58,482] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 25: [2023-05-10 10:10:58,482] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 2: [2023-05-10 10:10:58,483] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 2: [2023-05-10 10:10:58,483] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 20: [2023-05-10 10:10:58,484] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 9: [2023-05-10 10:10:58,484] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 24: [2023-05-10 10:10:58,485] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 25: [2023-05-10 10:10:58,485] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 25: [2023-05-10 10:10:58,485] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 24: [2023-05-10 10:10:58,486] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 9: [2023-05-10 10:10:58,486] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 9: [2023-05-10 10:10:58,486] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 24: [2023-05-10 10:10:58,486] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 1: [2023-05-10 10:10:58,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 1: [2023-05-10 10:10:58,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 1: [2023-05-10 10:10:58,487] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 12: [2023-05-10 10:10:58,489] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 24: [2023-05-10 10:10:58,490] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 9: [2023-05-10 10:10:58,491] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 28: [2023-05-10 10:10:58,490] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 28: [2023-05-10 10:10:58,490] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 1: [2023-05-10 10:10:58,491] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 20: [2023-05-10 10:10:58,491] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 20: [2023-05-10 10:10:58,491] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 9: [2023-05-10 10:10:58,491] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 12: [2023-05-10 10:10:58,492] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 1: [2023-05-10 10:10:58,492] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 24: [2023-05-10 10:10:58,492] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 24: [2023-05-10 10:10:58,492] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 20: [2023-05-10 10:10:58,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 0: [2023-05-10 10:10:58,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 0: [2023-05-10 10:10:58,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 0: [2023-05-10 10:10:58,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 21: [2023-05-10 10:10:58,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 21: [2023-05-10 10:10:58,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 0: [2023-05-10 10:10:58,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 0: [2023-05-10 10:10:58,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 0: [2023-05-10 10:10:58,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 0: [2023-05-10 10:10:58,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 0: [2023-05-10 10:10:58,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 21: [2023-05-10 10:10:58,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 2: [2023-05-10 10:10:58,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 2: [2023-05-10 10:10:58,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 2: [2023-05-10 10:10:58,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 21: [2023-05-10 10:10:58,497] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 12: [2023-05-10 10:10:58,497] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 12: [2023-05-10 10:10:58,497] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 0: [2023-05-10 10:10:58,497] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 0: [2023-05-10 10:10:58,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 6: [2023-05-10 10:10:58,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 0: [2023-05-10 10:10:58,499] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 0: [2023-05-10 10:10:58,499] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 6: [2023-05-10 10:10:58,499] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 19: [2023-05-10 10:10:58,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 22: [2023-05-10 10:10:58,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 0: [2023-05-10 10:10:58,499] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 22: [2023-05-10 10:10:58,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 6: [2023-05-10 10:10:58,500] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 22: [2023-05-10 10:10:58,500] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 22: [2023-05-10 10:10:58,500] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 22: [2023-05-10 10:10:58,500] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 22: [2023-05-10 10:10:58,500] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 22: [2023-05-10 10:10:58,500] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 0: [2023-05-10 10:10:58,500] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 22: [2023-05-10 10:10:58,500] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 0: [2023-05-10 10:10:58,501] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 16: [2023-05-10 10:10:58,501] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 16: [2023-05-10 10:10:58,501] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 16: [2023-05-10 10:10:58,501] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 0: [2023-05-10 10:10:58,501] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 26: [2023-05-10 10:10:58,501] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 26: [2023-05-10 10:10:58,501] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 1: [2023-05-10 10:10:58,502] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 29: [2023-05-10 10:10:58,502] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 29: [2023-05-10 10:10:58,503] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 21: [2023-05-10 10:10:58,504] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 16: [2023-05-10 10:10:58,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 16: [2023-05-10 10:10:58,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 22: [2023-05-10 10:10:58,505] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 22: [2023-05-10 10:10:58,506] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 6: [2023-05-10 10:10:58,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 1: [2023-05-10 10:10:58,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 1: [2023-05-10 10:10:58,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 2: [2023-05-10 10:10:58,506] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 16: [2023-05-10 10:10:58,507] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 6: [2023-05-10 10:10:58,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 21: [2023-05-10 10:10:58,508] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 21: [2023-05-10 10:10:58,508] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 21: [2023-05-10 10:10:58,508] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 2: [2023-05-10 10:10:58,509] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 2: [2023-05-10 10:10:58,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 22: [2023-05-10 10:10:58,509] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 19: [2023-05-10 10:10:58,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 22: [2023-05-10 10:10:58,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 1: [2023-05-10 10:10:58,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 19: [2023-05-10 10:10:58,511] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 19: [2023-05-10 10:10:58,511] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 22: [2023-05-10 10:10:58,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 1: [2023-05-10 10:10:58,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 12: [2023-05-10 10:10:58,511] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 12: [2023-05-10 10:10:58,511] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 12: [2023-05-10 10:10:58,511] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 22: [2023-05-10 10:10:58,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 12: [2023-05-10 10:10:58,511] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 22: [2023-05-10 10:10:58,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 28: [2023-05-10 10:10:58,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 28: [2023-05-10 10:10:58,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 20: [2023-05-10 10:10:58,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 22: [2023-05-10 10:10:58,512] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 6: [2023-05-10 10:10:58,512] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 23: [2023-05-10 10:10:58,513] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 23: [2023-05-10 10:10:58,513] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 23: [2023-05-10 10:10:58,513] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 20: [2023-05-10 10:10:58,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 20: [2023-05-10 10:10:58,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 20: [2023-05-10 10:10:58,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 29: [2023-05-10 10:10:58,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 19: [2023-05-10 10:10:58,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 1: [2023-05-10 10:10:58,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 6: [2023-05-10 10:10:58,515] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 6: [2023-05-10 10:10:58,515] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 23: [2023-05-10 10:10:58,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 23: [2023-05-10 10:10:58,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 23: [2023-05-10 10:10:58,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 23: [2023-05-10 10:10:58,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 23: [2023-05-10 10:10:58,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 21: [2023-05-10 10:10:58,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 1: [2023-05-10 10:10:58,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 28: [2023-05-10 10:10:58,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 13: [2023-05-10 10:10:58,517] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 13: [2023-05-10 10:10:58,517] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 13: [2023-05-10 10:10:58,517] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 28: [2023-05-10 10:10:58,518] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 28: [2023-05-10 10:10:58,518] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 28: [2023-05-10 10:10:58,518] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 23: [2023-05-10 10:10:58,518] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 23: [2023-05-10 10:10:58,519] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 23: [2023-05-10 10:10:58,520] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 28: [2023-05-10 10:10:58,520] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 28: [2023-05-10 10:10:58,520] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 12: [2023-05-10 10:10:58,521] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 1: [2023-05-10 10:10:58,521] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 21: [2023-05-10 10:10:58,522] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 19: [2023-05-10 10:10:58,523] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 2: [2023-05-10 10:10:58,523] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 29: [2023-05-10 10:10:58,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 29: [2023-05-10 10:10:58,525] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 29: [2023-05-10 10:10:58,525] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 2: [2023-05-10 10:10:58,525] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 21: [2023-05-10 10:10:58,525] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 23: [2023-05-10 10:10:58,525] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 23: [2023-05-10 10:10:58,525] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 23: [2023-05-10 10:10:58,526] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 21: [2023-05-10 10:10:58,526] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 23: [2023-05-10 10:10:58,526] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 20: [2023-05-10 10:10:58,526] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 23: [2023-05-10 10:10:58,527] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 20: [2023-05-10 10:10:58,527] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 6: [2023-05-10 10:10:58,527] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 20: [2023-05-10 10:10:58,527] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 2: [2023-05-10 10:10:58,527] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 26: [2023-05-10 10:10:58,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 19: [2023-05-10 10:10:58,528] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 26: [2023-05-10 10:10:58,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 13: [2023-05-10 10:10:58,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 13: [2023-05-10 10:10:58,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 13: [2023-05-10 10:10:58,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 13: [2023-05-10 10:10:58,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 13: [2023-05-10 10:10:58,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 3: [2023-05-10 10:10:58,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 3: [2023-05-10 10:10:58,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 3: [2023-05-10 10:10:58,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 19: [2023-05-10 10:10:58,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 12: [2023-05-10 10:10:58,531] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 12: [2023-05-10 10:10:58,533] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 1: [2023-05-10 10:10:58,533] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 21: [2023-05-10 10:10:58,533] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 26: [2023-05-10 10:10:58,534] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 26: [2023-05-10 10:10:58,534] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 19: [2023-05-10 10:10:58,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 26: [2023-05-10 10:10:58,534] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 1: [2023-05-10 10:10:58,535] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 19: [2023-05-10 10:10:58,535] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 1: [2023-05-10 10:10:58,536] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 21: [2023-05-10 10:10:58,536] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 19: [2023-05-10 10:10:58,536] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 2: [2023-05-10 10:10:58,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 6: [2023-05-10 10:10:58,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 26: [2023-05-10 10:10:58,537] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 26: [2023-05-10 10:10:58,537] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 26: [2023-05-10 10:10:58,538] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 21: [2023-05-10 10:10:58,538] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 21: [2023-05-10 10:10:58,538] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 29: [2023-05-10 10:10:58,538] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 29: [2023-05-10 10:10:58,538] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 29: [2023-05-10 10:10:58,538] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 29: [2023-05-10 10:10:58,538] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 19: [2023-05-10 10:10:58,538] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 30: [2023-05-10 10:10:58,539] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 30: [2023-05-10 10:10:58,539] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 30: [2023-05-10 10:10:58,539] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 6: [2023-05-10 10:10:58,540] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 3: [2023-05-10 10:10:58,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 29: [2023-05-10 10:10:58,541] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 10: [2023-05-10 10:10:58,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 10: [2023-05-10 10:10:58,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 30: [2023-05-10 10:10:58,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 10: [2023-05-10 10:10:58,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 10: [2023-05-10 10:10:58,542] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 10: [2023-05-10 10:10:58,542] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 10: [2023-05-10 10:10:58,542] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 10: [2023-05-10 10:10:58,542] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 10: [2023-05-10 10:10:58,542] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 3: [2023-05-10 10:10:58,542] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 20: [2023-05-10 10:10:58,544] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 12: [2023-05-10 10:10:58,544] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 12: [2023-05-10 10:10:58,544] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 13: [2023-05-10 10:10:58,545] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 13: [2023-05-10 10:10:58,547] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 20: [2023-05-10 10:10:58,546] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 20: [2023-05-10 10:10:58,547] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 12: [2023-05-10 10:10:58,547] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 19: [2023-05-10 10:10:58,548] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 6: [2023-05-10 10:10:58,548] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 6: [2023-05-10 10:10:58,548] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 10: [2023-05-10 10:10:58,548] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 10: [2023-05-10 10:10:58,549] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 10: [2023-05-10 10:10:58,549] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 10: [2023-05-10 10:10:58,550] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 10: [2023-05-10 10:10:58,550] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 13: [2023-05-10 10:10:58,550] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 12: [2023-05-10 10:10:58,550] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 6: [2023-05-10 10:10:58,551] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 10: [2023-05-10 10:10:58,551] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 3: [2023-05-10 10:10:58,551] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 3: [2023-05-10 10:10:58,551] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 3: [2023-05-10 10:10:58,551] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 10: [2023-05-10 10:10:58,551] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 0: [2023-05-10 10:10:58,551] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 0: [2023-05-10 10:10:58,551] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 3: [2023-05-10 10:10:58,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 10: [2023-05-10 10:10:58,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt... 28: [2023-05-10 10:10:58,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 3: [2023-05-10 10:10:58,553] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 3: [2023-05-10 10:10:58,553] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 28: [2023-05-10 10:10:58,555] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 28: [2023-05-10 10:10:58,555] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 28: [2023-05-10 10:10:58,555] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 29: [2023-05-10 10:10:58,556] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 19: [2023-05-10 10:10:58,557] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 22: [2023-05-10 10:10:58,557] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 28: [2023-05-10 10:10:58,558] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 28: [2023-05-10 10:10:58,559] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 0: [2023-05-10 10:10:58,561] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 0: [2023-05-10 10:10:58,561] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 30: [2023-05-10 10:10:58,561] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 19: [2023-05-10 10:10:58,562] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 22: [2023-05-10 10:10:58,562] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 19: [2023-05-10 10:10:58,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 13: [2023-05-10 10:10:58,566] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 0: [2023-05-10 10:10:58,567] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 30: [2023-05-10 10:10:58,568] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 29: [2023-05-10 10:10:58,568] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 13: [2023-05-10 10:10:58,568] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 13: [2023-05-10 10:10:58,569] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 13: [2023-05-10 10:10:58,569] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 26: [2023-05-10 10:10:58,569] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 26: [2023-05-10 10:10:58,569] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 13: [2023-05-10 10:10:58,569] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 30: [2023-05-10 10:10:58,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 30: [2023-05-10 10:10:58,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 29: [2023-05-10 10:10:58,570] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 29: [2023-05-10 10:10:58,571] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 26: [2023-05-10 10:10:58,571] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 29: [2023-05-10 10:10:58,571] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 26: [2023-05-10 10:10:58,572] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 26: [2023-05-10 10:10:58,572] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 23: [2023-05-10 10:10:58,572] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 23: [2023-05-10 10:10:58,572] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 23: [2023-05-10 10:10:58,572] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 3: [2023-05-10 10:10:58,573] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 30: [2023-05-10 10:10:58,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 3: [2023-05-10 10:10:58,575] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 22: [2023-05-10 10:10:58,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 0: [2023-05-10 10:10:58,575] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 30: [2023-05-10 10:10:58,575] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 30: [2023-05-10 10:10:58,575] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 30: [2023-05-10 10:10:58,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 26: [2023-05-10 10:10:58,577] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 0: [2023-05-10 10:10:58,578] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 0: [2023-05-10 10:10:58,579] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 0: [2023-05-10 10:10:58,579] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 0: [2023-05-10 10:10:58,579] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 3: [2023-05-10 10:10:58,580] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 3: [2023-05-10 10:10:58,581] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 22: [2023-05-10 10:10:58,581] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 3: [2023-05-10 10:10:58,582] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 22: [2023-05-10 10:10:58,582] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 22: [2023-05-10 10:10:58,587] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 22: [2023-05-10 10:10:58,587] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 22: [2023-05-10 10:10:58,587] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 22: [2023-05-10 10:10:58,587] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 0: [2023-05-10 10:10:58,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 0: [2023-05-10 10:10:58,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 22: [2023-05-10 10:10:58,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 23: [2023-05-10 10:10:58,589] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 23: [2023-05-10 10:10:58,592] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 23: [2023-05-10 10:10:58,592] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 23: [2023-05-10 10:10:58,592] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 23: [2023-05-10 10:10:58,592] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 0: [2023-05-10 10:10:58,594] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 30: [2023-05-10 10:10:58,596] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 30: [2023-05-10 10:10:58,600] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 17: [2023-05-10 10:10:58,599] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 17: [2023-05-10 10:10:58,600] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 17: [2023-05-10 10:10:58,600] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 17: [2023-05-10 10:10:58,600] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 17: [2023-05-10 10:10:58,600] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 17: [2023-05-10 10:10:58,600] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 17: [2023-05-10 10:10:58,600] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 17: [2023-05-10 10:10:58,600] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 23: [2023-05-10 10:10:58,600] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 23: [2023-05-10 10:10:58,600] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 23: [2023-05-10 10:10:58,601] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 22: [2023-05-10 10:10:58,602] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 30: [2023-05-10 10:10:58,603] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 17: [2023-05-10 10:10:58,603] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 30: [2023-05-10 10:10:58,605] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 10: [2023-05-10 10:10:58,605] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 10: [2023-05-10 10:10:58,605] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 10: [2023-05-10 10:10:58,606] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 0: [2023-05-10 10:10:58,606] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 0: [2023-05-10 10:10:58,607] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 0: [2023-05-10 10:10:58,608] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 10: [2023-05-10 10:10:58,609] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 17: [2023-05-10 10:10:58,611] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 17: [2023-05-10 10:10:58,612] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 17: [2023-05-10 10:10:58,612] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 17: [2023-05-10 10:10:58,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 22: [2023-05-10 10:10:58,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 17: [2023-05-10 10:10:58,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 17: [2023-05-10 10:10:58,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 10: [2023-05-10 10:10:58,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 17: [2023-05-10 10:10:58,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 23: [2023-05-10 10:10:58,621] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 23: [2023-05-10 10:10:58,622] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 10: [2023-05-10 10:10:58,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 10: [2023-05-10 10:10:58,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 10: [2023-05-10 10:10:58,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_26-model_00-model_states.pt. 22: [2023-05-10 10:10:58,622] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 22: [2023-05-10 10:10:58,622] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 23: [2023-05-10 10:10:58,623] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 23: [2023-05-10 10:10:58,623] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 23: [2023-05-10 10:10:58,624] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 22: [2023-05-10 10:10:58,625] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 22: [2023-05-10 10:10:58,627] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 10: [2023-05-10 10:10:58,630] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 10: [2023-05-10 10:10:58,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 10: [2023-05-10 10:10:58,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 10: [2023-05-10 10:10:58,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 10: [2023-05-10 10:10:58,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 10: [2023-05-10 10:10:58,653] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 10: [2023-05-10 10:10:58,654] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 17: [2023-05-10 10:10:58,655] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 10: [2023-05-10 10:10:58,656] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 17: [2023-05-10 10:10:58,674] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 17: [2023-05-10 10:10:58,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 17: [2023-05-10 10:10:58,687] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 17: [2023-05-10 10:10:58,693] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 17: [2023-05-10 10:10:58,693] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 17: [2023-05-10 10:10:58,693] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 17: [2023-05-10 10:10:58,693] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 17: [2023-05-10 10:10:58,693] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 17: [2023-05-10 10:10:58,714] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 17: [2023-05-10 10:10:58,716] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 14: [2023-05-10 10:10:58,725] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 14: [2023-05-10 10:10:58,725] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 17: [2023-05-10 10:10:58,726] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 14: [2023-05-10 10:10:58,727] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 14: [2023-05-10 10:10:58,727] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 14: [2023-05-10 10:10:58,727] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 14: [2023-05-10 10:10:58,727] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 14: [2023-05-10 10:10:58,727] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 14: [2023-05-10 10:10:58,727] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 17: [2023-05-10 10:10:58,728] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 17: [2023-05-10 10:10:58,729] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 14: [2023-05-10 10:10:58,730] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 17: [2023-05-10 10:10:58,732] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 17: [2023-05-10 10:10:58,732] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 14: [2023-05-10 10:10:58,733] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 14: [2023-05-10 10:10:58,733] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 14: [2023-05-10 10:10:58,734] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 14: [2023-05-10 10:10:58,735] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 14: [2023-05-10 10:10:58,735] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 14: [2023-05-10 10:10:58,736] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 14: [2023-05-10 10:10:58,736] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 7: [2023-05-10 10:10:58,754] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 7: [2023-05-10 10:10:58,754] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 7: [2023-05-10 10:10:58,754] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 7: [2023-05-10 10:10:58,754] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 7: [2023-05-10 10:10:58,754] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 7: [2023-05-10 10:10:58,754] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 7: [2023-05-10 10:10:58,754] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 7: [2023-05-10 10:10:58,755] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 9: [2023-05-10 10:10:58,756] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 9: [2023-05-10 10:10:58,756] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 9: [2023-05-10 10:10:58,758] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 9: [2023-05-10 10:10:58,758] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 9: [2023-05-10 10:10:58,758] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 9: [2023-05-10 10:10:58,758] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 9: [2023-05-10 10:10:58,758] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 9: [2023-05-10 10:10:58,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 7: [2023-05-10 10:10:58,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 7: [2023-05-10 10:10:58,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 7: [2023-05-10 10:10:58,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 7: [2023-05-10 10:10:58,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 7: [2023-05-10 10:10:58,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 7: [2023-05-10 10:10:58,761] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 7: [2023-05-10 10:10:58,761] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 7: [2023-05-10 10:10:58,762] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 9: [2023-05-10 10:10:58,763] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 9: [2023-05-10 10:10:58,763] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 9: [2023-05-10 10:10:58,765] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 18: [2023-05-10 10:10:58,766] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 18: [2023-05-10 10:10:58,766] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 9: [2023-05-10 10:10:58,766] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 18: [2023-05-10 10:10:58,766] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 18: [2023-05-10 10:10:58,766] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 18: [2023-05-10 10:10:58,766] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 18: [2023-05-10 10:10:58,766] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 18: [2023-05-10 10:10:58,766] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 9: [2023-05-10 10:10:58,766] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 18: [2023-05-10 10:10:58,766] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 9: [2023-05-10 10:10:58,767] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 9: [2023-05-10 10:10:58,768] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 9: [2023-05-10 10:10:58,768] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 18: [2023-05-10 10:10:58,770] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 18: [2023-05-10 10:10:58,772] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 18: [2023-05-10 10:10:58,772] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 18: [2023-05-10 10:10:58,773] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 18: [2023-05-10 10:10:58,774] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 18: [2023-05-10 10:10:58,774] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 18: [2023-05-10 10:10:58,775] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 27: [2023-05-10 10:10:58,775] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 27: [2023-05-10 10:10:58,775] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 18: [2023-05-10 10:10:58,775] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 27: [2023-05-10 10:10:58,775] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 27: [2023-05-10 10:10:58,775] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 27: [2023-05-10 10:10:58,775] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 27: [2023-05-10 10:10:58,775] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 27: [2023-05-10 10:10:58,775] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 27: [2023-05-10 10:10:58,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 27: [2023-05-10 10:10:58,779] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 27: [2023-05-10 10:10:58,781] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 27: [2023-05-10 10:10:58,782] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 27: [2023-05-10 10:10:58,783] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 27: [2023-05-10 10:10:58,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 27: [2023-05-10 10:10:58,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 27: [2023-05-10 10:10:58,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 27: [2023-05-10 10:10:58,785] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 14: [2023-05-10 10:10:58,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 14: [2023-05-10 10:10:58,789] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 14: [2023-05-10 10:10:58,789] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 4: [2023-05-10 10:10:58,790] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 14: [2023-05-10 10:10:58,789] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 4: [2023-05-10 10:10:58,790] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 4: [2023-05-10 10:10:58,790] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 4: [2023-05-10 10:10:58,790] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 4: [2023-05-10 10:10:58,790] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 4: [2023-05-10 10:10:58,790] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 4: [2023-05-10 10:10:58,790] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 4: [2023-05-10 10:10:58,790] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 4: [2023-05-10 10:10:58,795] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 4: [2023-05-10 10:10:58,796] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 4: [2023-05-10 10:10:58,796] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 4: [2023-05-10 10:10:58,797] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 4: [2023-05-10 10:10:58,797] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 4: [2023-05-10 10:10:58,798] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 4: [2023-05-10 10:10:58,798] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 4: [2023-05-10 10:10:58,798] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 14: [2023-05-10 10:10:58,804] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 14: [2023-05-10 10:10:58,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 14: [2023-05-10 10:10:58,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 14: [2023-05-10 10:10:58,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 7: [2023-05-10 10:10:58,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 14: [2023-05-10 10:10:58,811] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 24: [2023-05-10 10:10:58,813] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 24: [2023-05-10 10:10:58,813] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 24: [2023-05-10 10:10:58,813] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 24: [2023-05-10 10:10:58,813] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 24: [2023-05-10 10:10:58,813] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 5: [2023-05-10 10:10:58,813] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 5: [2023-05-10 10:10:58,813] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 24: [2023-05-10 10:10:58,813] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 24: [2023-05-10 10:10:58,813] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 5: [2023-05-10 10:10:58,813] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 24: [2023-05-10 10:10:58,814] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 5: [2023-05-10 10:10:58,814] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 5: [2023-05-10 10:10:58,814] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 5: [2023-05-10 10:10:58,814] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 5: [2023-05-10 10:10:58,814] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 5: [2023-05-10 10:10:58,814] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 14: [2023-05-10 10:10:58,814] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 7: [2023-05-10 10:10:58,815] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 24: [2023-05-10 10:10:58,817] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 5: [2023-05-10 10:10:58,817] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 14: [2023-05-10 10:10:58,818] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 14: [2023-05-10 10:10:58,818] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 11: [2023-05-10 10:10:58,817] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 11: [2023-05-10 10:10:58,817] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 24: [2023-05-10 10:10:58,818] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 11: [2023-05-10 10:10:58,818] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 11: [2023-05-10 10:10:58,818] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 11: [2023-05-10 10:10:58,818] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 11: [2023-05-10 10:10:58,818] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 11: [2023-05-10 10:10:58,818] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 11: [2023-05-10 10:10:58,819] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 5: [2023-05-10 10:10:58,820] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 5: [2023-05-10 10:10:58,820] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 11: [2023-05-10 10:10:58,821] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 24: [2023-05-10 10:10:58,821] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 24: [2023-05-10 10:10:58,821] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 24: [2023-05-10 10:10:58,821] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 24: [2023-05-10 10:10:58,822] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 24: [2023-05-10 10:10:58,822] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 9: [2023-05-10 10:10:58,821] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 9: [2023-05-10 10:10:58,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 9: [2023-05-10 10:10:58,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 11: [2023-05-10 10:10:58,822] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 7: [2023-05-10 10:10:58,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 7: [2023-05-10 10:10:58,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 7: [2023-05-10 10:10:58,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 7: [2023-05-10 10:10:58,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 18: [2023-05-10 10:10:58,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 24: [2023-05-10 10:10:58,823] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 5: [2023-05-10 10:10:58,823] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 5: [2023-05-10 10:10:58,824] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 5: [2023-05-10 10:10:58,824] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 5: [2023-05-10 10:10:58,824] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 8: [2023-05-10 10:10:58,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 8: [2023-05-10 10:10:58,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 8: [2023-05-10 10:10:58,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 5: [2023-05-10 10:10:58,825] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 8: [2023-05-10 10:10:58,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 8: [2023-05-10 10:10:58,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 8: [2023-05-10 10:10:58,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 8: [2023-05-10 10:10:58,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 8: [2023-05-10 10:10:58,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 9: [2023-05-10 10:10:58,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 9: [2023-05-10 10:10:58,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 9: [2023-05-10 10:10:58,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 9: [2023-05-10 10:10:58,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 9: [2023-05-10 10:10:58,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 7: [2023-05-10 10:10:58,828] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 7: [2023-05-10 10:10:58,828] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 11: [2023-05-10 10:10:58,828] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 11: [2023-05-10 10:10:58,829] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 11: [2023-05-10 10:10:58,829] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 27: [2023-05-10 10:10:58,829] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 11: [2023-05-10 10:10:58,830] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 8: [2023-05-10 10:10:58,830] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 11: [2023-05-10 10:10:58,830] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 11: [2023-05-10 10:10:58,831] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 8: [2023-05-10 10:10:58,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 8: [2023-05-10 10:10:58,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 8: [2023-05-10 10:10:58,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 8: [2023-05-10 10:10:58,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 8: [2023-05-10 10:10:58,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 8: [2023-05-10 10:10:58,833] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 8: [2023-05-10 10:10:58,833] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 18: [2023-05-10 10:10:58,833] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 18: [2023-05-10 10:10:58,833] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 18: [2023-05-10 10:10:58,833] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 14: [2023-05-10 10:10:58,834] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 7: [2023-05-10 10:10:58,834] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 26: [2023-05-10 10:10:58,834] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 26: [2023-05-10 10:10:58,834] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 14: [2023-05-10 10:10:58,835] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 20: [2023-05-10 10:10:58,835] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 20: [2023-05-10 10:10:58,835] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 20: [2023-05-10 10:10:58,835] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 20: [2023-05-10 10:10:58,835] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 20: [2023-05-10 10:10:58,835] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 20: [2023-05-10 10:10:58,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 20: [2023-05-10 10:10:58,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 14: [2023-05-10 10:10:58,835] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 20: [2023-05-10 10:10:58,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 18: [2023-05-10 10:10:58,837] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 18: [2023-05-10 10:10:58,837] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 18: [2023-05-10 10:10:58,837] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 14: [2023-05-10 10:10:58,837] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 18: [2023-05-10 10:10:58,837] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 20: [2023-05-10 10:10:58,838] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 26: [2023-05-10 10:10:58,838] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 26: [2023-05-10 10:10:58,838] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 26: [2023-05-10 10:10:58,839] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 27: [2023-05-10 10:10:58,838] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 27: [2023-05-10 10:10:58,838] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 26: [2023-05-10 10:10:58,839] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 26: [2023-05-10 10:10:58,839] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 26: [2023-05-10 10:10:58,839] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 26: [2023-05-10 10:10:58,840] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 26: [2023-05-10 10:10:58,840] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 18: [2023-05-10 10:10:58,843] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 7: [2023-05-10 10:10:58,845] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 20: [2023-05-10 10:10:58,845] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 20: [2023-05-10 10:10:58,846] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 9: [2023-05-10 10:10:58,846] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 20: [2023-05-10 10:10:58,847] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 26: [2023-05-10 10:10:58,847] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 4: [2023-05-10 10:10:58,847] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 20: [2023-05-10 10:10:58,847] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 20: [2023-05-10 10:10:58,848] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 20: [2023-05-10 10:10:58,848] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 9: [2023-05-10 10:10:58,848] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 25: [2023-05-10 10:10:58,848] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 25: [2023-05-10 10:10:58,848] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 26: [2023-05-10 10:10:58,848] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 26: [2023-05-10 10:10:58,849] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 26: [2023-05-10 10:10:58,849] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 20: [2023-05-10 10:10:58,849] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 26: [2023-05-10 10:10:58,850] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 25: [2023-05-10 10:10:58,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 27: [2023-05-10 10:10:58,850] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 26: [2023-05-10 10:10:58,850] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 9: [2023-05-10 10:10:58,850] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 25: [2023-05-10 10:10:58,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 25: [2023-05-10 10:10:58,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 27: [2023-05-10 10:10:58,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 27: [2023-05-10 10:10:58,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 27: [2023-05-10 10:10:58,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 27: [2023-05-10 10:10:58,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 27: [2023-05-10 10:10:58,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 25: [2023-05-10 10:10:58,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 25: [2023-05-10 10:10:58,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 25: [2023-05-10 10:10:58,851] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 7: [2023-05-10 10:10:58,851] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 7: [2023-05-10 10:10:58,851] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 7: [2023-05-10 10:10:58,852] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 7: [2023-05-10 10:10:58,853] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 25: [2023-05-10 10:10:58,853] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 9: [2023-05-10 10:10:58,854] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 4: [2023-05-10 10:10:58,854] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 4: [2023-05-10 10:10:58,854] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 4: [2023-05-10 10:10:58,854] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 4: [2023-05-10 10:10:58,854] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 25: [2023-05-10 10:10:58,855] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 9: [2023-05-10 10:10:58,855] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 9: [2023-05-10 10:10:58,857] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 25: [2023-05-10 10:10:58,856] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 25: [2023-05-10 10:10:58,857] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 9: [2023-05-10 10:10:58,857] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 9: [2023-05-10 10:10:58,857] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 7: [2023-05-10 10:10:58,858] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 25: [2023-05-10 10:10:58,858] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 25: [2023-05-10 10:10:58,859] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 28: [2023-05-10 10:10:58,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 28: [2023-05-10 10:10:58,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 25: [2023-05-10 10:10:58,860] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 7: [2023-05-10 10:10:58,860] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 25: [2023-05-10 10:10:58,860] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 4: [2023-05-10 10:10:58,861] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 4: [2023-05-10 10:10:58,862] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 4: [2023-05-10 10:10:58,862] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 6: [2023-05-10 10:10:58,862] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 6: [2023-05-10 10:10:58,862] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 6: [2023-05-10 10:10:58,862] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 6: [2023-05-10 10:10:58,862] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 6: [2023-05-10 10:10:58,862] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 6: [2023-05-10 10:10:58,862] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 6: [2023-05-10 10:10:58,862] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 6: [2023-05-10 10:10:58,862] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 5: [2023-05-10 10:10:58,862] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 18: [2023-05-10 10:10:58,863] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 18: [2023-05-10 10:10:58,863] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 18: [2023-05-10 10:10:58,863] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 28: [2023-05-10 10:10:58,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 28: [2023-05-10 10:10:58,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 28: [2023-05-10 10:10:58,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 28: [2023-05-10 10:10:58,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 28: [2023-05-10 10:10:58,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 28: [2023-05-10 10:10:58,864] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 28: [2023-05-10 10:10:58,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 28: [2023-05-10 10:10:58,864] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 27: [2023-05-10 10:10:58,864] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 11: [2023-05-10 10:10:58,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 6: [2023-05-10 10:10:58,866] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 6: [2023-05-10 10:10:58,866] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 18: [2023-05-10 10:10:58,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 27: [2023-05-10 10:10:58,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 18: [2023-05-10 10:10:58,869] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 18: [2023-05-10 10:10:58,870] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 18: [2023-05-10 10:10:58,870] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 24: [2023-05-10 10:10:58,870] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 24: [2023-05-10 10:10:58,870] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 6: [2023-05-10 10:10:58,871] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 6: [2023-05-10 10:10:58,871] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 6: [2023-05-10 10:10:58,872] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 28: [2023-05-10 10:10:58,872] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 28: [2023-05-10 10:10:58,872] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 28: [2023-05-10 10:10:58,872] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 6: [2023-05-10 10:10:58,872] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 6: [2023-05-10 10:10:58,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 6: [2023-05-10 10:10:58,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 28: [2023-05-10 10:10:58,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 28: [2023-05-10 10:10:58,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 28: [2023-05-10 10:10:58,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 4: [2023-05-10 10:10:58,875] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 1: [2023-05-10 10:10:58,876] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 1: [2023-05-10 10:10:58,876] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 1: [2023-05-10 10:10:58,876] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 11: [2023-05-10 10:10:58,876] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 1: [2023-05-10 10:10:58,876] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 1: [2023-05-10 10:10:58,876] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 1: [2023-05-10 10:10:58,877] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 1: [2023-05-10 10:10:58,877] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 1: [2023-05-10 10:10:58,877] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 4: [2023-05-10 10:10:58,880] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 5: [2023-05-10 10:10:58,879] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 5: [2023-05-10 10:10:58,879] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 27: [2023-05-10 10:10:58,881] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 1: [2023-05-10 10:10:58,881] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 4: [2023-05-10 10:10:58,882] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 1: [2023-05-10 10:10:58,882] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 1: [2023-05-10 10:10:58,883] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 20: [2023-05-10 10:10:58,882] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 1: [2023-05-10 10:10:58,883] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 1: [2023-05-10 10:10:58,883] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 4: [2023-05-10 10:10:58,883] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 27: [2023-05-10 10:10:58,884] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 1: [2023-05-10 10:10:58,884] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 5: [2023-05-10 10:10:58,884] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 1: [2023-05-10 10:10:58,885] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 1: [2023-05-10 10:10:58,886] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 4: [2023-05-10 10:10:58,886] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 27: [2023-05-10 10:10:58,886] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 27: [2023-05-10 10:10:58,886] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 11: [2023-05-10 10:10:58,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 8: [2023-05-10 10:10:58,888] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 27: [2023-05-10 10:10:58,889] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 8: [2023-05-10 10:10:58,889] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 5: [2023-05-10 10:10:58,890] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 5: [2023-05-10 10:10:58,890] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 5: [2023-05-10 10:10:58,891] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 5: [2023-05-10 10:10:58,891] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 5: [2023-05-10 10:10:58,891] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 4: [2023-05-10 10:10:58,891] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 4: [2023-05-10 10:10:58,891] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 8: [2023-05-10 10:10:58,891] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 8: [2023-05-10 10:10:58,891] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 24: [2023-05-10 10:10:58,892] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 26: [2023-05-10 10:10:58,892] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 26: [2023-05-10 10:10:58,892] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 24: [2023-05-10 10:10:58,893] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 4: [2023-05-10 10:10:58,893] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 24: [2023-05-10 10:10:58,895] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 24: [2023-05-10 10:10:58,895] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 24: [2023-05-10 10:10:58,895] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 24: [2023-05-10 10:10:58,895] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 24: [2023-05-10 10:10:58,895] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 24: [2023-05-10 10:10:58,895] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 11: [2023-05-10 10:10:58,899] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 11: [2023-05-10 10:10:58,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 11: [2023-05-10 10:10:58,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 11: [2023-05-10 10:10:58,901] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 20: [2023-05-10 10:10:58,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 8: [2023-05-10 10:10:58,903] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 8: [2023-05-10 10:10:58,903] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 8: [2023-05-10 10:10:58,904] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 8: [2023-05-10 10:10:58,904] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 11: [2023-05-10 10:10:58,904] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 11: [2023-05-10 10:10:58,904] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 25: [2023-05-10 10:10:58,903] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 5: [2023-05-10 10:10:58,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 19: [2023-05-10 10:10:58,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 19: [2023-05-10 10:10:58,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 11: [2023-05-10 10:10:58,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 5: [2023-05-10 10:10:58,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 19: [2023-05-10 10:10:58,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 19: [2023-05-10 10:10:58,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 19: [2023-05-10 10:10:58,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 19: [2023-05-10 10:10:58,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 19: [2023-05-10 10:10:58,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 19: [2023-05-10 10:10:58,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 19: [2023-05-10 10:10:58,909] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 26: [2023-05-10 10:10:58,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 19: [2023-05-10 10:10:58,910] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 8: [2023-05-10 10:10:58,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 19: [2023-05-10 10:10:58,913] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 8: [2023-05-10 10:10:58,914] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 26: [2023-05-10 10:10:58,915] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 20: [2023-05-10 10:10:58,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 26: [2023-05-10 10:10:58,915] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 25: [2023-05-10 10:10:58,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 25: [2023-05-10 10:10:58,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 23: [2023-05-10 10:10:58,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 23: [2023-05-10 10:10:58,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 28: [2023-05-10 10:10:58,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 28: [2023-05-10 10:10:58,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 19: [2023-05-10 10:10:58,917] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 23: [2023-05-10 10:10:58,918] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 6: [2023-05-10 10:10:58,918] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 6: [2023-05-10 10:10:58,918] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 23: [2023-05-10 10:10:58,918] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 25: [2023-05-10 10:10:58,918] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 19: [2023-05-10 10:10:58,918] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 23: [2023-05-10 10:10:58,918] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 23: [2023-05-10 10:10:58,918] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 23: [2023-05-10 10:10:58,918] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 23: [2023-05-10 10:10:58,918] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 25: [2023-05-10 10:10:58,919] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 25: [2023-05-10 10:10:58,919] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 25: [2023-05-10 10:10:58,919] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 20: [2023-05-10 10:10:58,919] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 25: [2023-05-10 10:10:58,919] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 19: [2023-05-10 10:10:58,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 19: [2023-05-10 10:10:58,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 19: [2023-05-10 10:10:58,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 8: [2023-05-10 10:10:58,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 23: [2023-05-10 10:10:58,922] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 23: [2023-05-10 10:10:58,922] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 26: [2023-05-10 10:10:58,922] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 26: [2023-05-10 10:10:58,922] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 26: [2023-05-10 10:10:58,922] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 26: [2023-05-10 10:10:58,922] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 26: [2023-05-10 10:10:58,922] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 23: [2023-05-10 10:10:58,923] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 20: [2023-05-10 10:10:58,923] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 5: [2023-05-10 10:10:58,926] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 8: [2023-05-10 10:10:58,926] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 11: [2023-05-10 10:10:58,926] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 23: [2023-05-10 10:10:58,926] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 23: [2023-05-10 10:10:58,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 23: [2023-05-10 10:10:58,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 29: [2023-05-10 10:10:58,927] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 29: [2023-05-10 10:10:58,927] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 3: [2023-05-10 10:10:58,927] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 3: [2023-05-10 10:10:58,927] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 29: [2023-05-10 10:10:58,927] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 23: [2023-05-10 10:10:58,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 23: [2023-05-10 10:10:58,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 29: [2023-05-10 10:10:58,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 29: [2023-05-10 10:10:58,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 5: [2023-05-10 10:10:58,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 29: [2023-05-10 10:10:58,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 29: [2023-05-10 10:10:58,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 5: [2023-05-10 10:10:58,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 5: [2023-05-10 10:10:58,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 29: [2023-05-10 10:10:58,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 25: [2023-05-10 10:10:58,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 5: [2023-05-10 10:10:58,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 1: [2023-05-10 10:10:58,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 3: [2023-05-10 10:10:58,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 3: [2023-05-10 10:10:58,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 3: [2023-05-10 10:10:58,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 3: [2023-05-10 10:10:58,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 3: [2023-05-10 10:10:58,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 3: [2023-05-10 10:10:58,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 20: [2023-05-10 10:10:58,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 20: [2023-05-10 10:10:58,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 20: [2023-05-10 10:10:58,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 6: [2023-05-10 10:10:58,931] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 30: [2023-05-10 10:10:58,931] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 30: [2023-05-10 10:10:58,931] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 30: [2023-05-10 10:10:58,931] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 30: [2023-05-10 10:10:58,931] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 30: [2023-05-10 10:10:58,931] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 30: [2023-05-10 10:10:58,931] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 30: [2023-05-10 10:10:58,932] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 29: [2023-05-10 10:10:58,932] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 30: [2023-05-10 10:10:58,932] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 29: [2023-05-10 10:10:58,932] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 3: [2023-05-10 10:10:58,932] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 3: [2023-05-10 10:10:58,932] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 3: [2023-05-10 10:10:58,932] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 11: [2023-05-10 10:10:58,933] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 26: [2023-05-10 10:10:58,933] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 28: [2023-05-10 10:10:58,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 28: [2023-05-10 10:10:58,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 8: [2023-05-10 10:10:58,935] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 8: [2023-05-10 10:10:58,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 8: [2023-05-10 10:10:58,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 8: [2023-05-10 10:10:58,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 3: [2023-05-10 10:10:58,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 29: [2023-05-10 10:10:58,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 13: [2023-05-10 10:10:58,936] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 13: [2023-05-10 10:10:58,936] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 13: [2023-05-10 10:10:58,936] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 13: [2023-05-10 10:10:58,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 13: [2023-05-10 10:10:58,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 13: [2023-05-10 10:10:58,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 13: [2023-05-10 10:10:58,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 13: [2023-05-10 10:10:58,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 29: [2023-05-10 10:10:58,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 29: [2023-05-10 10:10:58,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 11: [2023-05-10 10:10:58,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 28: [2023-05-10 10:10:58,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 28: [2023-05-10 10:10:58,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 1: [2023-05-10 10:10:58,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 29: [2023-05-10 10:10:58,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 20: [2023-05-10 10:10:58,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 29: [2023-05-10 10:10:58,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 3: [2023-05-10 10:10:58,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 24: [2023-05-10 10:10:58,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 24: [2023-05-10 10:10:58,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 24: [2023-05-10 10:10:58,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 24: [2023-05-10 10:10:58,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 24: [2023-05-10 10:10:58,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 24: [2023-05-10 10:10:58,932] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 3: [2023-05-10 10:10:58,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 6: [2023-05-10 10:10:58,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 1: [2023-05-10 10:10:58,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 11: [2023-05-10 10:10:58,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 3: [2023-05-10 10:10:58,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 6: [2023-05-10 10:10:58,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 30: [2023-05-10 10:10:58,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 29: [2023-05-10 10:10:58,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 3: [2023-05-10 10:10:58,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 11: [2023-05-10 10:10:58,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 11: [2023-05-10 10:10:58,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 6: [2023-05-10 10:10:58,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 25: [2023-05-10 10:10:58,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 25: [2023-05-10 10:10:58,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 0: [2023-05-10 10:10:58,944] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 0: [2023-05-10 10:10:58,944] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 0: [2023-05-10 10:10:58,944] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 0: [2023-05-10 10:10:58,944] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 0: [2023-05-10 10:10:58,944] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 0: [2023-05-10 10:10:58,944] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 0: [2023-05-10 10:10:58,944] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 0: [2023-05-10 10:10:58,944] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 28: [2023-05-10 10:10:58,944] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 28: [2023-05-10 10:10:58,944] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 30: [2023-05-10 10:10:58,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 30: [2023-05-10 10:10:58,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 30: [2023-05-10 10:10:58,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 30: [2023-05-10 10:10:58,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 30: [2023-05-10 10:10:58,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 10: [2023-05-10 10:10:58,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 10: [2023-05-10 10:10:58,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 20: [2023-05-10 10:10:58,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 30: [2023-05-10 10:10:58,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 10: [2023-05-10 10:10:58,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 10: [2023-05-10 10:10:58,947] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 10: [2023-05-10 10:10:58,947] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 10: [2023-05-10 10:10:58,947] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 25: [2023-05-10 10:10:58,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 10: [2023-05-10 10:10:58,947] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 10: [2023-05-10 10:10:58,947] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 25: [2023-05-10 10:10:58,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 30: [2023-05-10 10:10:58,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 0: [2023-05-10 10:10:58,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 25: [2023-05-10 10:10:58,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 0: [2023-05-10 10:10:58,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 0: [2023-05-10 10:10:58,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 6: [2023-05-10 10:10:58,949] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 6: [2023-05-10 10:10:58,949] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 6: [2023-05-10 10:10:58,949] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 6: [2023-05-10 10:10:58,949] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 1: [2023-05-10 10:10:58,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 25: [2023-05-10 10:10:58,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 25: [2023-05-10 10:10:58,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 1: [2023-05-10 10:10:58,950] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 20: [2023-05-10 10:10:58,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 0: [2023-05-10 10:10:58,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 1: [2023-05-10 10:10:58,951] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 1: [2023-05-10 10:10:58,951] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 13: [2023-05-10 10:10:58,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 13: [2023-05-10 10:10:58,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 0: [2023-05-10 10:10:58,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 13: [2023-05-10 10:10:58,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 0: [2023-05-10 10:10:58,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 20: [2023-05-10 10:10:58,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 0: [2023-05-10 10:10:58,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 1: [2023-05-10 10:10:58,951] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 0: [2023-05-10 10:10:58,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 13: [2023-05-10 10:10:58,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 13: [2023-05-10 10:10:58,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 1: [2023-05-10 10:10:58,952] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 13: [2023-05-10 10:10:58,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 10: [2023-05-10 10:10:58,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 13: [2023-05-10 10:10:58,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 28: [2023-05-10 10:10:58,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 10: [2023-05-10 10:10:58,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 10: [2023-05-10 10:10:58,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 10: [2023-05-10 10:10:58,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 13: [2023-05-10 10:10:58,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 10: [2023-05-10 10:10:58,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 10: [2023-05-10 10:10:58,955] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 10: [2023-05-10 10:10:58,956] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 31: [2023-05-10 10:10:58,956] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 10: [2023-05-10 10:10:58,956] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 28: [2023-05-10 10:10:58,956] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 31: [2023-05-10 10:10:58,956] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 6: [2023-05-10 10:10:58,957] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 31: [2023-05-10 10:10:58,957] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 31: [2023-05-10 10:10:58,957] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 31: [2023-05-10 10:10:58,957] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 31: [2023-05-10 10:10:58,957] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 31: [2023-05-10 10:10:58,957] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 26: [2023-05-10 10:10:58,957] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 31: [2023-05-10 10:10:58,957] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 26: [2023-05-10 10:10:58,958] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 26: [2023-05-10 10:10:58,958] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 15: [2023-05-10 10:10:58,958] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 26: [2023-05-10 10:10:58,958] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 15: [2023-05-10 10:10:58,958] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 15: [2023-05-10 10:10:58,958] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 15: [2023-05-10 10:10:58,958] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 15: [2023-05-10 10:10:58,958] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 15: [2023-05-10 10:10:58,958] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 15: [2023-05-10 10:10:58,958] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 15: [2023-05-10 10:10:58,959] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 28: [2023-05-10 10:10:58,960] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 31: [2023-05-10 10:10:58,960] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 31: [2023-05-10 10:10:58,961] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 26: [2023-05-10 10:10:58,961] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 20: [2023-05-10 10:10:58,962] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 15: [2023-05-10 10:10:58,962] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 1: [2023-05-10 10:10:58,963] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 28: [2023-05-10 10:10:58,963] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 20: [2023-05-10 10:10:58,964] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 1: [2023-05-10 10:10:58,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 31: [2023-05-10 10:10:58,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 20: [2023-05-10 10:10:58,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 31: [2023-05-10 10:10:58,966] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 20: [2023-05-10 10:10:58,966] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 31: [2023-05-10 10:10:58,966] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 19: [2023-05-10 10:10:58,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 19: [2023-05-10 10:10:58,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 31: [2023-05-10 10:10:58,966] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 31: [2023-05-10 10:10:58,967] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 31: [2023-05-10 10:10:58,967] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 19: [2023-05-10 10:10:58,969] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 15: [2023-05-10 10:10:58,969] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 15: [2023-05-10 10:10:58,969] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 22: [2023-05-10 10:10:58,969] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 22: [2023-05-10 10:10:58,969] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 15: [2023-05-10 10:10:58,970] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 22: [2023-05-10 10:10:58,970] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 22: [2023-05-10 10:10:58,970] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 22: [2023-05-10 10:10:58,970] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 22: [2023-05-10 10:10:58,970] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 22: [2023-05-10 10:10:58,970] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 22: [2023-05-10 10:10:58,970] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 15: [2023-05-10 10:10:58,971] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 15: [2023-05-10 10:10:58,971] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 15: [2023-05-10 10:10:58,971] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 15: [2023-05-10 10:10:58,972] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 6: [2023-05-10 10:10:58,972] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 22: [2023-05-10 10:10:58,974] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 19: [2023-05-10 10:10:58,975] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 22: [2023-05-10 10:10:58,975] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 19: [2023-05-10 10:10:58,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 28: [2023-05-10 10:10:58,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 23: [2023-05-10 10:10:58,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 23: [2023-05-10 10:10:58,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 23: [2023-05-10 10:10:58,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 28: [2023-05-10 10:10:58,977] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 2: [2023-05-10 10:10:58,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 2: [2023-05-10 10:10:58,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 2: [2023-05-10 10:10:58,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 2: [2023-05-10 10:10:58,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 2: [2023-05-10 10:10:58,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 22: [2023-05-10 10:10:58,978] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 2: [2023-05-10 10:10:58,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 2: [2023-05-10 10:10:58,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 2: [2023-05-10 10:10:58,979] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 1: [2023-05-10 10:10:58,979] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 22: [2023-05-10 10:10:58,980] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 3: [2023-05-10 10:10:58,981] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 1: [2023-05-10 10:10:58,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 22: [2023-05-10 10:10:58,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 6: [2023-05-10 10:10:58,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 22: [2023-05-10 10:10:58,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 2: [2023-05-10 10:10:58,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 2: [2023-05-10 10:10:58,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 22: [2023-05-10 10:10:58,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 1: [2023-05-10 10:10:58,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 19: [2023-05-10 10:10:58,983] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 22: [2023-05-10 10:10:58,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 19: [2023-05-10 10:10:58,983] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 6: [2023-05-10 10:10:58,983] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 6: [2023-05-10 10:10:58,983] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 1: [2023-05-10 10:10:58,984] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 6: [2023-05-10 10:10:58,984] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 29: [2023-05-10 10:10:58,984] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 28: [2023-05-10 10:10:58,984] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 1: [2023-05-10 10:10:58,984] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 19: [2023-05-10 10:10:58,984] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 29: [2023-05-10 10:10:58,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 28: [2023-05-10 10:10:58,986] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 19: [2023-05-10 10:10:58,988] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 2: [2023-05-10 10:10:58,988] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 2: [2023-05-10 10:10:58,990] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 2: [2023-05-10 10:10:58,990] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 2: [2023-05-10 10:10:58,990] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 23: [2023-05-10 10:10:58,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 16: [2023-05-10 10:10:58,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 16: [2023-05-10 10:10:58,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 3: [2023-05-10 10:10:58,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 16: [2023-05-10 10:10:58,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 16: [2023-05-10 10:10:58,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 16: [2023-05-10 10:10:58,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 16: [2023-05-10 10:10:58,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 16: [2023-05-10 10:10:58,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 21: [2023-05-10 10:10:58,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 21: [2023-05-10 10:10:58,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 2: [2023-05-10 10:10:58,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 2: [2023-05-10 10:10:58,992] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt... 16: [2023-05-10 10:10:58,992] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 19: [2023-05-10 10:10:58,992] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 21: [2023-05-10 10:10:58,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 23: [2023-05-10 10:10:58,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 23: [2023-05-10 10:10:58,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 23: [2023-05-10 10:10:58,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 23: [2023-05-10 10:10:58,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 21: [2023-05-10 10:10:58,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 21: [2023-05-10 10:10:58,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 21: [2023-05-10 10:10:58,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 21: [2023-05-10 10:10:58,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 21: [2023-05-10 10:10:58,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 3: [2023-05-10 10:10:58,996] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 16: [2023-05-10 10:10:58,996] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 16: [2023-05-10 10:10:58,997] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 17: [2023-05-10 10:10:58,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 17: [2023-05-10 10:10:58,998] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 17: [2023-05-10 10:10:58,998] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 17: [2023-05-10 10:10:58,998] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 17: [2023-05-10 10:10:58,998] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 17: [2023-05-10 10:10:58,998] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 17: [2023-05-10 10:10:58,998] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 17: [2023-05-10 10:10:58,998] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 19: [2023-05-10 10:10:58,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 23: [2023-05-10 10:10:58,998] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 21: [2023-05-10 10:10:58,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 16: [2023-05-10 10:10:58,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 21: [2023-05-10 10:10:58,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 16: [2023-05-10 10:10:58,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 16: [2023-05-10 10:10:58,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 23: [2023-05-10 10:10:59,000] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 16: [2023-05-10 10:10:59,000] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 23: [2023-05-10 10:10:59,000] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 17: [2023-05-10 10:10:59,000] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 21: [2023-05-10 10:10:59,000] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 21: [2023-05-10 10:10:59,001] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 21: [2023-05-10 10:10:59,001] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 16: [2023-05-10 10:10:59,001] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 21: [2023-05-10 10:10:59,001] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 30: [2023-05-10 10:10:59,001] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 3: [2023-05-10 10:10:59,001] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 3: [2023-05-10 10:10:59,001] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 16: [2023-05-10 10:10:59,002] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 29: [2023-05-10 10:10:59,002] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 21: [2023-05-10 10:10:59,002] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 21: [2023-05-10 10:10:59,002] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 19: [2023-05-10 10:10:59,005] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 19: [2023-05-10 10:10:59,005] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 29: [2023-05-10 10:10:59,005] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 3: [2023-05-10 10:10:59,006] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 29: [2023-05-10 10:10:59,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 29: [2023-05-10 10:10:59,007] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 3: [2023-05-10 10:10:59,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 3: [2023-05-10 10:10:59,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 3: [2023-05-10 10:10:59,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 0: [2023-05-10 10:10:59,008] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 10: [2023-05-10 10:10:59,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 10: [2023-05-10 10:10:59,008] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 10: [2023-05-10 10:10:59,008] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 17: [2023-05-10 10:10:59,009] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 17: [2023-05-10 10:10:59,009] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 17: [2023-05-10 10:10:59,010] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 15: [2023-05-10 10:10:59,010] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 17: [2023-05-10 10:10:59,011] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 19: [2023-05-10 10:10:59,011] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 0: [2023-05-10 10:10:59,011] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 0: [2023-05-10 10:10:59,011] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 17: [2023-05-10 10:10:59,012] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 31: [2023-05-10 10:10:59,012] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 19: [2023-05-10 10:10:59,012] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 17: [2023-05-10 10:10:59,012] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 0: [2023-05-10 10:10:59,013] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 31: [2023-05-10 10:10:59,013] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 17: [2023-05-10 10:10:59,013] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 3: [2023-05-10 10:10:59,013] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 29: [2023-05-10 10:10:59,014] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 29: [2023-05-10 10:10:59,014] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 19: [2023-05-10 10:10:59,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 29: [2023-05-10 10:10:59,014] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 29: [2023-05-10 10:10:59,014] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 10: [2023-05-10 10:10:59,015] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 22: [2023-05-10 10:10:59,017] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 0: [2023-05-10 10:10:59,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 0: [2023-05-10 10:10:59,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 3: [2023-05-10 10:10:59,020] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 12: [2023-05-10 10:10:59,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 12: [2023-05-10 10:10:59,021] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 12: [2023-05-10 10:10:59,021] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 12: [2023-05-10 10:10:59,021] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 12: [2023-05-10 10:10:59,021] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 12: [2023-05-10 10:10:59,021] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 12: [2023-05-10 10:10:59,021] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 12: [2023-05-10 10:10:59,022] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 10: [2023-05-10 10:10:59,022] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 10: [2023-05-10 10:10:59,022] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 10: [2023-05-10 10:10:59,022] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 10: [2023-05-10 10:10:59,022] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 23: [2023-05-10 10:10:59,022] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 31: [2023-05-10 10:10:59,023] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 0: [2023-05-10 10:10:59,024] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 0: [2023-05-10 10:10:59,024] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 23: [2023-05-10 10:10:59,024] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 12: [2023-05-10 10:10:59,025] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 23: [2023-05-10 10:10:59,025] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 23: [2023-05-10 10:10:59,026] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 30: [2023-05-10 10:10:59,027] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 23: [2023-05-10 10:10:59,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 13: [2023-05-10 10:10:59,029] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 3: [2023-05-10 10:10:59,029] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 13: [2023-05-10 10:10:59,029] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 30: [2023-05-10 10:10:59,030] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 29: [2023-05-10 10:10:59,031] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 13: [2023-05-10 10:10:59,032] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 0: [2023-05-10 10:10:59,032] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 12: [2023-05-10 10:10:59,032] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 12: [2023-05-10 10:10:59,032] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 12: [2023-05-10 10:10:59,033] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 13: [2023-05-10 10:10:59,033] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 2: [2023-05-10 10:10:59,033] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 2: [2023-05-10 10:10:59,033] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 12: [2023-05-10 10:10:59,033] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 22: [2023-05-10 10:10:59,033] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 12: [2023-05-10 10:10:59,034] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 29: [2023-05-10 10:10:59,034] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 12: [2023-05-10 10:10:59,034] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 12: [2023-05-10 10:10:59,034] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 0: [2023-05-10 10:10:59,035] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 10: [2023-05-10 10:10:59,036] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 30: [2023-05-10 10:10:59,036] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 30: [2023-05-10 10:10:59,036] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 30: [2023-05-10 10:10:59,036] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 30: [2023-05-10 10:10:59,036] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 30: [2023-05-10 10:10:59,036] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 10: [2023-05-10 10:10:59,036] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 15: [2023-05-10 10:10:59,036] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 31: [2023-05-10 10:10:59,036] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 3: [2023-05-10 10:10:59,037] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 30: [2023-05-10 10:10:59,037] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 31: [2023-05-10 10:10:59,038] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 3: [2023-05-10 10:10:59,040] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 10: [2023-05-10 10:10:59,040] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 0: [2023-05-10 10:10:59,040] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 3: [2023-05-10 10:10:59,040] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 3: [2023-05-10 10:10:59,041] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 22: [2023-05-10 10:10:59,041] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 17: [2023-05-10 10:10:59,041] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 15: [2023-05-10 10:10:59,042] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 15: [2023-05-10 10:10:59,042] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 10: [2023-05-10 10:10:59,043] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 31: [2023-05-10 10:10:59,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 31: [2023-05-10 10:10:59,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 31: [2023-05-10 10:10:59,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 31: [2023-05-10 10:10:59,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 31: [2023-05-10 10:10:59,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 22: [2023-05-10 10:10:59,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 15: [2023-05-10 10:10:59,045] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 15: [2023-05-10 10:10:59,045] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 29: [2023-05-10 10:10:59,045] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 0: [2023-05-10 10:10:59,045] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 13: [2023-05-10 10:10:59,046] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 13: [2023-05-10 10:10:59,046] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 13: [2023-05-10 10:10:59,046] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 13: [2023-05-10 10:10:59,046] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 15: [2023-05-10 10:10:59,046] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 29: [2023-05-10 10:10:59,047] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 29: [2023-05-10 10:10:59,047] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 31: [2023-05-10 10:10:59,048] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 29: [2023-05-10 10:10:59,048] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 0: [2023-05-10 10:10:59,049] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 0: [2023-05-10 10:10:59,050] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 0: [2023-05-10 10:10:59,050] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 16: [2023-05-10 10:10:59,050] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 10: [2023-05-10 10:10:59,051] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 0: [2023-05-10 10:10:59,051] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 16: [2023-05-10 10:10:59,052] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 10: [2023-05-10 10:10:59,052] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 10: [2023-05-10 10:10:59,053] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 10: [2023-05-10 10:10:59,053] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 2: [2023-05-10 10:10:59,054] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 2: [2023-05-10 10:10:59,054] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 22: [2023-05-10 10:10:59,054] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 21: [2023-05-10 10:10:59,054] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 21: [2023-05-10 10:10:59,055] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 22: [2023-05-10 10:10:59,058] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 22: [2023-05-10 10:10:59,058] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 22: [2023-05-10 10:10:59,058] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 22: [2023-05-10 10:10:59,058] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 13: [2023-05-10 10:10:59,059] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 30: [2023-05-10 10:10:59,059] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 13: [2023-05-10 10:10:59,060] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 2: [2023-05-10 10:10:59,060] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 2: [2023-05-10 10:10:59,060] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 22: [2023-05-10 10:10:59,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 17: [2023-05-10 10:10:59,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 2: [2023-05-10 10:10:59,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 2: [2023-05-10 10:10:59,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 2: [2023-05-10 10:10:59,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 2: [2023-05-10 10:10:59,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_27-model_00-model_states.pt. 21: [2023-05-10 10:10:59,063] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 13: [2023-05-10 10:10:59,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 21: [2023-05-10 10:10:59,066] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 15: [2023-05-10 10:10:59,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 21: [2023-05-10 10:10:59,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 21: [2023-05-10 10:10:59,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 21: [2023-05-10 10:10:59,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 21: [2023-05-10 10:10:59,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 15: [2023-05-10 10:10:59,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 22: [2023-05-10 10:10:59,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 13: [2023-05-10 10:10:59,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 16: [2023-05-10 10:10:59,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 31: [2023-05-10 10:10:59,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 30: [2023-05-10 10:10:59,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 30: [2023-05-10 10:10:59,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 16: [2023-05-10 10:10:59,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 30: [2023-05-10 10:10:59,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 31: [2023-05-10 10:10:59,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 15: [2023-05-10 10:10:59,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 16: [2023-05-10 10:10:59,074] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 16: [2023-05-10 10:10:59,074] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 16: [2023-05-10 10:10:59,074] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 16: [2023-05-10 10:10:59,074] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 16: [2023-05-10 10:10:59,074] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 30: [2023-05-10 10:10:59,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 31: [2023-05-10 10:10:59,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 31: [2023-05-10 10:10:59,075] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 16: [2023-05-10 10:10:59,077] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 15: [2023-05-10 10:10:59,076] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 12: [2023-05-10 10:10:59,077] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 30: [2023-05-10 10:10:59,077] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 30: [2023-05-10 10:10:59,077] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 15: [2023-05-10 10:10:59,077] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 21: [2023-05-10 10:10:59,078] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 31: [2023-05-10 10:10:59,079] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 13: [2023-05-10 10:10:59,080] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 21: [2023-05-10 10:10:59,080] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 15: [2023-05-10 10:10:59,080] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 13: [2023-05-10 10:10:59,080] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 13: [2023-05-10 10:10:59,080] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 15: [2023-05-10 10:10:59,080] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 17: [2023-05-10 10:10:59,081] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 13: [2023-05-10 10:10:59,082] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 22: [2023-05-10 10:10:59,084] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 14: [2023-05-10 10:10:59,083] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 14: [2023-05-10 10:10:59,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 14: [2023-05-10 10:10:59,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 14: [2023-05-10 10:10:59,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 14: [2023-05-10 10:10:59,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 14: [2023-05-10 10:10:59,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 14: [2023-05-10 10:10:59,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 17: [2023-05-10 10:10:59,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 17: [2023-05-10 10:10:59,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 14: [2023-05-10 10:10:59,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 17: [2023-05-10 10:10:59,086] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 14: [2023-05-10 10:10:59,088] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 22: [2023-05-10 10:10:59,089] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 22: [2023-05-10 10:10:59,090] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 14: [2023-05-10 10:10:59,090] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 14: [2023-05-10 10:10:59,090] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 2: [2023-05-10 10:10:59,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 14: [2023-05-10 10:10:59,092] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 14: [2023-05-10 10:10:59,092] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 14: [2023-05-10 10:10:59,093] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 22: [2023-05-10 10:10:59,093] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 22: [2023-05-10 10:10:59,093] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 14: [2023-05-10 10:10:59,094] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 14: [2023-05-10 10:10:59,094] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 21: [2023-05-10 10:10:59,095] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 2: [2023-05-10 10:10:59,096] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 2: [2023-05-10 10:10:59,096] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 2: [2023-05-10 10:10:59,097] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 15: [2023-05-10 10:10:59,098] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 15: [2023-05-10 10:10:59,098] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 2: [2023-05-10 10:10:59,098] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 2: [2023-05-10 10:10:59,098] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 21: [2023-05-10 10:10:59,099] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 12: [2023-05-10 10:10:59,099] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 21: [2023-05-10 10:10:59,100] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 7: [2023-05-10 10:10:59,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 7: [2023-05-10 10:10:59,101] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 21: [2023-05-10 10:10:59,101] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 7: [2023-05-10 10:10:59,101] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 7: [2023-05-10 10:10:59,101] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 7: [2023-05-10 10:10:59,101] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 7: [2023-05-10 10:10:59,101] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 7: [2023-05-10 10:10:59,101] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 7: [2023-05-10 10:10:59,101] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 21: [2023-05-10 10:10:59,102] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 16: [2023-05-10 10:10:59,102] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 16: [2023-05-10 10:10:59,103] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 21: [2023-05-10 10:10:59,104] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 12: [2023-05-10 10:10:59,105] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 7: [2023-05-10 10:10:59,105] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 7: [2023-05-10 10:10:59,105] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 7: [2023-05-10 10:10:59,106] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 7: [2023-05-10 10:10:59,107] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 17: [2023-05-10 10:10:59,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 17: [2023-05-10 10:10:59,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 7: [2023-05-10 10:10:59,107] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 7: [2023-05-10 10:10:59,109] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 7: [2023-05-10 10:10:59,110] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 17: [2023-05-10 10:10:59,110] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 7: [2023-05-10 10:10:59,111] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 12: [2023-05-10 10:10:59,111] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 16: [2023-05-10 10:10:59,112] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 16: [2023-05-10 10:10:59,112] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 16: [2023-05-10 10:10:59,112] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 16: [2023-05-10 10:10:59,113] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 12: [2023-05-10 10:10:59,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 12: [2023-05-10 10:10:59,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 12: [2023-05-10 10:10:59,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 12: [2023-05-10 10:10:59,114] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 12: [2023-05-10 10:10:59,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 27: [2023-05-10 10:10:59,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 27: [2023-05-10 10:10:59,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 27: [2023-05-10 10:10:59,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 27: [2023-05-10 10:10:59,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 27: [2023-05-10 10:10:59,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 27: [2023-05-10 10:10:59,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 27: [2023-05-10 10:10:59,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 27: [2023-05-10 10:10:59,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 18: [2023-05-10 10:10:59,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 18: [2023-05-10 10:10:59,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 18: [2023-05-10 10:10:59,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 18: [2023-05-10 10:10:59,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 18: [2023-05-10 10:10:59,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 18: [2023-05-10 10:10:59,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 18: [2023-05-10 10:10:59,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 18: [2023-05-10 10:10:59,116] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 17: [2023-05-10 10:10:59,117] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 17: [2023-05-10 10:10:59,118] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 17: [2023-05-10 10:10:59,118] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 27: [2023-05-10 10:10:59,119] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 18: [2023-05-10 10:10:59,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 17: [2023-05-10 10:10:59,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 18: [2023-05-10 10:10:59,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 18: [2023-05-10 10:10:59,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 18: [2023-05-10 10:10:59,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 27: [2023-05-10 10:10:59,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 27: [2023-05-10 10:10:59,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 18: [2023-05-10 10:10:59,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 27: [2023-05-10 10:10:59,122] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 18: [2023-05-10 10:10:59,123] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 27: [2023-05-10 10:10:59,123] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 18: [2023-05-10 10:10:59,123] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 27: [2023-05-10 10:10:59,124] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 27: [2023-05-10 10:10:59,124] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 18: [2023-05-10 10:10:59,125] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 27: [2023-05-10 10:10:59,125] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 12: [2023-05-10 10:10:59,135] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 17: [2023-05-10 10:10:59,138] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 17: [2023-05-10 10:10:59,142] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 14: [2023-05-10 10:10:59,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 14: [2023-05-10 10:10:59,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 12: [2023-05-10 10:10:59,145] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 14: [2023-05-10 10:10:59,145] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 12: [2023-05-10 10:10:59,147] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 17: [2023-05-10 10:10:59,145] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 5: [2023-05-10 10:10:59,148] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 5: [2023-05-10 10:10:59,148] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 5: [2023-05-10 10:10:59,149] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 5: [2023-05-10 10:10:59,150] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 5: [2023-05-10 10:10:59,150] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 5: [2023-05-10 10:10:59,150] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 5: [2023-05-10 10:10:59,150] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 5: [2023-05-10 10:10:59,150] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 12: [2023-05-10 10:10:59,150] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 12: [2023-05-10 10:10:59,152] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 12: [2023-05-10 10:10:59,152] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 12: [2023-05-10 10:10:59,153] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 5: [2023-05-10 10:10:59,154] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 5: [2023-05-10 10:10:59,154] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 5: [2023-05-10 10:10:59,156] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 7: [2023-05-10 10:10:59,156] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 14: [2023-05-10 10:10:59,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 14: [2023-05-10 10:10:59,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 14: [2023-05-10 10:10:59,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 14: [2023-05-10 10:10:59,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 5: [2023-05-10 10:10:59,159] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 14: [2023-05-10 10:10:59,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 5: [2023-05-10 10:10:59,160] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 5: [2023-05-10 10:10:59,160] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 24: [2023-05-10 10:10:59,160] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 24: [2023-05-10 10:10:59,160] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 5: [2023-05-10 10:10:59,161] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 7: [2023-05-10 10:10:59,160] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 5: [2023-05-10 10:10:59,162] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 24: [2023-05-10 10:10:59,162] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 24: [2023-05-10 10:10:59,162] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 24: [2023-05-10 10:10:59,162] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 24: [2023-05-10 10:10:59,162] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 24: [2023-05-10 10:10:59,162] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 24: [2023-05-10 10:10:59,162] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 24: [2023-05-10 10:10:59,165] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 24: [2023-05-10 10:10:59,165] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 7: [2023-05-10 10:10:59,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 7: [2023-05-10 10:10:59,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 7: [2023-05-10 10:10:59,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 14: [2023-05-10 10:10:59,168] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 14: [2023-05-10 10:10:59,169] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 24: [2023-05-10 10:10:59,170] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 18: [2023-05-10 10:10:59,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 24: [2023-05-10 10:10:59,170] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 24: [2023-05-10 10:10:59,170] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 24: [2023-05-10 10:10:59,170] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 27: [2023-05-10 10:10:59,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 24: [2023-05-10 10:10:59,171] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 24: [2023-05-10 10:10:59,172] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 7: [2023-05-10 10:10:59,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 7: [2023-05-10 10:10:59,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 7: [2023-05-10 10:10:59,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 14: [2023-05-10 10:10:59,174] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 27: [2023-05-10 10:10:59,178] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 27: [2023-05-10 10:10:59,178] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 7: [2023-05-10 10:10:59,181] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 18: [2023-05-10 10:10:59,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 18: [2023-05-10 10:10:59,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 18: [2023-05-10 10:10:59,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 18: [2023-05-10 10:10:59,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 7: [2023-05-10 10:10:59,184] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 14: [2023-05-10 10:10:59,185] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 18: [2023-05-10 10:10:59,186] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 18: [2023-05-10 10:10:59,186] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 18: [2023-05-10 10:10:59,186] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 27: [2023-05-10 10:10:59,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 27: [2023-05-10 10:10:59,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 27: [2023-05-10 10:10:59,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 27: [2023-05-10 10:10:59,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 27: [2023-05-10 10:10:59,190] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 14: [2023-05-10 10:10:59,190] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 14: [2023-05-10 10:10:59,190] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 27: [2023-05-10 10:10:59,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 14: [2023-05-10 10:10:59,193] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 18: [2023-05-10 10:10:59,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 7: [2023-05-10 10:10:59,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 14: [2023-05-10 10:10:59,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 7: [2023-05-10 10:10:59,199] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 7: [2023-05-10 10:10:59,199] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 7: [2023-05-10 10:10:59,200] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 7: [2023-05-10 10:10:59,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 7: [2023-05-10 10:10:59,203] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 27: [2023-05-10 10:10:59,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 27: [2023-05-10 10:10:59,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 18: [2023-05-10 10:10:59,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 18: [2023-05-10 10:10:59,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 5: [2023-05-10 10:10:59,209] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 18: [2023-05-10 10:10:59,210] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 5: [2023-05-10 10:10:59,211] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 5: [2023-05-10 10:10:59,211] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 24: [2023-05-10 10:10:59,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 18: [2023-05-10 10:10:59,214] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 28: [2023-05-10 10:10:59,215] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 28: [2023-05-10 10:10:59,215] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 18: [2023-05-10 10:10:59,217] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 18: [2023-05-10 10:10:59,217] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 28: [2023-05-10 10:10:59,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 28: [2023-05-10 10:10:59,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 28: [2023-05-10 10:10:59,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 28: [2023-05-10 10:10:59,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 28: [2023-05-10 10:10:59,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 28: [2023-05-10 10:10:59,219] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 28: [2023-05-10 10:10:59,220] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 24: [2023-05-10 10:10:59,220] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 28: [2023-05-10 10:10:59,220] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 18: [2023-05-10 10:10:59,220] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 27: [2023-05-10 10:10:59,221] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 27: [2023-05-10 10:10:59,221] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 27: [2023-05-10 10:10:59,221] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 27: [2023-05-10 10:10:59,222] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 27: [2023-05-10 10:10:59,224] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 5: [2023-05-10 10:10:59,225] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 28: [2023-05-10 10:10:59,226] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 5: [2023-05-10 10:10:59,227] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 5: [2023-05-10 10:10:59,227] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 5: [2023-05-10 10:10:59,227] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 5: [2023-05-10 10:10:59,227] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 28: [2023-05-10 10:10:59,229] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 28: [2023-05-10 10:10:59,229] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 28: [2023-05-10 10:10:59,229] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 28: [2023-05-10 10:10:59,229] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 28: [2023-05-10 10:10:59,230] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 24: [2023-05-10 10:10:59,235] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 5: [2023-05-10 10:10:59,236] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 19: [2023-05-10 10:10:59,236] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 19: [2023-05-10 10:10:59,236] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 5: [2023-05-10 10:10:59,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 5: [2023-05-10 10:10:59,238] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 23: [2023-05-10 10:10:59,239] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 23: [2023-05-10 10:10:59,239] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 23: [2023-05-10 10:10:59,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 23: [2023-05-10 10:10:59,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 23: [2023-05-10 10:10:59,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 23: [2023-05-10 10:10:59,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 23: [2023-05-10 10:10:59,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 19: [2023-05-10 10:10:59,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 19: [2023-05-10 10:10:59,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 19: [2023-05-10 10:10:59,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 19: [2023-05-10 10:10:59,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 19: [2023-05-10 10:10:59,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 23: [2023-05-10 10:10:59,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 19: [2023-05-10 10:10:59,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 19: [2023-05-10 10:10:59,242] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 19: [2023-05-10 10:10:59,243] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 8: [2023-05-10 10:10:59,243] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 19: [2023-05-10 10:10:59,243] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 19: [2023-05-10 10:10:59,243] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 8: [2023-05-10 10:10:59,243] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 8: [2023-05-10 10:10:59,243] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 8: [2023-05-10 10:10:59,243] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 8: [2023-05-10 10:10:59,243] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 8: [2023-05-10 10:10:59,243] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 8: [2023-05-10 10:10:59,243] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 24: [2023-05-10 10:10:59,243] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 24: [2023-05-10 10:10:59,243] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 24: [2023-05-10 10:10:59,243] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 24: [2023-05-10 10:10:59,243] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 8: [2023-05-10 10:10:59,243] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 24: [2023-05-10 10:10:59,243] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 24: [2023-05-10 10:10:59,243] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 24: [2023-05-10 10:10:59,244] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 23: [2023-05-10 10:10:59,244] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 25: [2023-05-10 10:10:59,244] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 25: [2023-05-10 10:10:59,244] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 25: [2023-05-10 10:10:59,244] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 25: [2023-05-10 10:10:59,244] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 25: [2023-05-10 10:10:59,244] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 25: [2023-05-10 10:10:59,245] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 25: [2023-05-10 10:10:59,245] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 23: [2023-05-10 10:10:59,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 25: [2023-05-10 10:10:59,245] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 23: [2023-05-10 10:10:59,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 1: [2023-05-10 10:10:59,246] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 1: [2023-05-10 10:10:59,246] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 1: [2023-05-10 10:10:59,246] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 1: [2023-05-10 10:10:59,246] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 1: [2023-05-10 10:10:59,246] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 19: [2023-05-10 10:10:59,246] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 1: [2023-05-10 10:10:59,246] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 1: [2023-05-10 10:10:59,246] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 1: [2023-05-10 10:10:59,246] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 19: [2023-05-10 10:10:59,247] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 23: [2023-05-10 10:10:59,247] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 23: [2023-05-10 10:10:59,248] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 23: [2023-05-10 10:10:59,248] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 23: [2023-05-10 10:10:59,249] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 19: [2023-05-10 10:10:59,249] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 19: [2023-05-10 10:10:59,249] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 8: [2023-05-10 10:10:59,249] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 8: [2023-05-10 10:10:59,250] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 8: [2023-05-10 10:10:59,250] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 23: [2023-05-10 10:10:59,250] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 8: [2023-05-10 10:10:59,250] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 25: [2023-05-10 10:10:59,250] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 25: [2023-05-10 10:10:59,250] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 8: [2023-05-10 10:10:59,250] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 8: [2023-05-10 10:10:59,251] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 8: [2023-05-10 10:10:59,251] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 25: [2023-05-10 10:10:59,251] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 25: [2023-05-10 10:10:59,251] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 25: [2023-05-10 10:10:59,251] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 1: [2023-05-10 10:10:59,251] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 25: [2023-05-10 10:10:59,251] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 1: [2023-05-10 10:10:59,252] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 8: [2023-05-10 10:10:59,252] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 1: [2023-05-10 10:10:59,252] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 25: [2023-05-10 10:10:59,252] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 25: [2023-05-10 10:10:59,252] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 1: [2023-05-10 10:10:59,253] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 0: [2023-05-10 10:10:59,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 0: [2023-05-10 10:10:59,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 0: [2023-05-10 10:10:59,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 0: [2023-05-10 10:10:59,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 0: [2023-05-10 10:10:59,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 0: [2023-05-10 10:10:59,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 0: [2023-05-10 10:10:59,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 0: [2023-05-10 10:10:59,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 1: [2023-05-10 10:10:59,254] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 1: [2023-05-10 10:10:59,255] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 1: [2023-05-10 10:10:59,255] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 1: [2023-05-10 10:10:59,256] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 0: [2023-05-10 10:10:59,258] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 0: [2023-05-10 10:10:59,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 5: [2023-05-10 10:10:59,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 5: [2023-05-10 10:10:59,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 5: [2023-05-10 10:10:59,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 5: [2023-05-10 10:10:59,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 0: [2023-05-10 10:10:59,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 0: [2023-05-10 10:10:59,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 0: [2023-05-10 10:10:59,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 0: [2023-05-10 10:10:59,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 0: [2023-05-10 10:10:59,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 5: [2023-05-10 10:10:59,262] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 0: [2023-05-10 10:10:59,262] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 28: [2023-05-10 10:10:59,271] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 28: [2023-05-10 10:10:59,271] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 24: [2023-05-10 10:10:59,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 24: [2023-05-10 10:10:59,276] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 24: [2023-05-10 10:10:59,277] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 24: [2023-05-10 10:10:59,277] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 4: [2023-05-10 10:10:59,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 4: [2023-05-10 10:10:59,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 4: [2023-05-10 10:10:59,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 4: [2023-05-10 10:10:59,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 4: [2023-05-10 10:10:59,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 4: [2023-05-10 10:10:59,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 4: [2023-05-10 10:10:59,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 4: [2023-05-10 10:10:59,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 24: [2023-05-10 10:10:59,281] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 24: [2023-05-10 10:10:59,282] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 30: [2023-05-10 10:10:59,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 30: [2023-05-10 10:10:59,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 11: [2023-05-10 10:10:59,283] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 11: [2023-05-10 10:10:59,283] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 30: [2023-05-10 10:10:59,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 30: [2023-05-10 10:10:59,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 30: [2023-05-10 10:10:59,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 30: [2023-05-10 10:10:59,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 30: [2023-05-10 10:10:59,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 28: [2023-05-10 10:10:59,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 11: [2023-05-10 10:10:59,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 11: [2023-05-10 10:10:59,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 11: [2023-05-10 10:10:59,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 11: [2023-05-10 10:10:59,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 11: [2023-05-10 10:10:59,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 30: [2023-05-10 10:10:59,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 11: [2023-05-10 10:10:59,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 4: [2023-05-10 10:10:59,285] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 4: [2023-05-10 10:10:59,285] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 4: [2023-05-10 10:10:59,285] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 4: [2023-05-10 10:10:59,285] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 4: [2023-05-10 10:10:59,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 4: [2023-05-10 10:10:59,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 4: [2023-05-10 10:10:59,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 4: [2023-05-10 10:10:59,287] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 11: [2023-05-10 10:10:59,288] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 11: [2023-05-10 10:10:59,288] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 23: [2023-05-10 10:10:59,291] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 28: [2023-05-10 10:10:59,292] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 2: [2023-05-10 10:10:59,293] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 2: [2023-05-10 10:10:59,293] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 20: [2023-05-10 10:10:59,293] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 20: [2023-05-10 10:10:59,293] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 20: [2023-05-10 10:10:59,293] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 11: [2023-05-10 10:10:59,294] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 20: [2023-05-10 10:10:59,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 20: [2023-05-10 10:10:59,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 20: [2023-05-10 10:10:59,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 20: [2023-05-10 10:10:59,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 28: [2023-05-10 10:10:59,294] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 19: [2023-05-10 10:10:59,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 20: [2023-05-10 10:10:59,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 11: [2023-05-10 10:10:59,295] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 11: [2023-05-10 10:10:59,296] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 11: [2023-05-10 10:10:59,296] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 2: [2023-05-10 10:10:59,296] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 2: [2023-05-10 10:10:59,296] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 2: [2023-05-10 10:10:59,296] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 2: [2023-05-10 10:10:59,296] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 2: [2023-05-10 10:10:59,296] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 30: [2023-05-10 10:10:59,296] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 11: [2023-05-10 10:10:59,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 30: [2023-05-10 10:10:59,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 30: [2023-05-10 10:10:59,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 11: [2023-05-10 10:10:59,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 3: [2023-05-10 10:10:59,297] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 3: [2023-05-10 10:10:59,297] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 20: [2023-05-10 10:10:59,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 30: [2023-05-10 10:10:59,298] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 30: [2023-05-10 10:10:59,298] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 30: [2023-05-10 10:10:59,298] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 30: [2023-05-10 10:10:59,299] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 3: [2023-05-10 10:10:59,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 3: [2023-05-10 10:10:59,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 25: [2023-05-10 10:10:59,299] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 28: [2023-05-10 10:10:59,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 3: [2023-05-10 10:10:59,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 3: [2023-05-10 10:10:59,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 3: [2023-05-10 10:10:59,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 20: [2023-05-10 10:10:59,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 3: [2023-05-10 10:10:59,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 19: [2023-05-10 10:10:59,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 19: [2023-05-10 10:10:59,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 19: [2023-05-10 10:10:59,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 19: [2023-05-10 10:10:59,301] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 30: [2023-05-10 10:10:59,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 3: [2023-05-10 10:10:59,303] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 23: [2023-05-10 10:10:59,303] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 23: [2023-05-10 10:10:59,303] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 28: [2023-05-10 10:10:59,303] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 28: [2023-05-10 10:10:59,303] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 3: [2023-05-10 10:10:59,303] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 8: [2023-05-10 10:10:59,303] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 20: [2023-05-10 10:10:59,303] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 20: [2023-05-10 10:10:59,304] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 25: [2023-05-10 10:10:59,303] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 3: [2023-05-10 10:10:59,304] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 20: [2023-05-10 10:10:59,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 20: [2023-05-10 10:10:59,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 20: [2023-05-10 10:10:59,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 20: [2023-05-10 10:10:59,306] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 8: [2023-05-10 10:10:59,306] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 28: [2023-05-10 10:10:59,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 28: [2023-05-10 10:10:59,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 3: [2023-05-10 10:10:59,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 28: [2023-05-10 10:10:59,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 1: [2023-05-10 10:10:59,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 1: [2023-05-10 10:10:59,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 3: [2023-05-10 10:10:59,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 25: [2023-05-10 10:10:59,309] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 3: [2023-05-10 10:10:59,309] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 8: [2023-05-10 10:10:59,310] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 19: [2023-05-10 10:10:59,310] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 3: [2023-05-10 10:10:59,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 3: [2023-05-10 10:10:59,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 25: [2023-05-10 10:10:59,311] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 25: [2023-05-10 10:10:59,311] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 25: [2023-05-10 10:10:59,311] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 1: [2023-05-10 10:10:59,311] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 1: [2023-05-10 10:10:59,312] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 0: [2023-05-10 10:10:59,312] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 19: [2023-05-10 10:10:59,313] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 23: [2023-05-10 10:10:59,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 23: [2023-05-10 10:10:59,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 23: [2023-05-10 10:10:59,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 23: [2023-05-10 10:10:59,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 23: [2023-05-10 10:10:59,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 0: [2023-05-10 10:10:59,316] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 9: [2023-05-10 10:10:59,316] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 9: [2023-05-10 10:10:59,316] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 8: [2023-05-10 10:10:59,317] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 8: [2023-05-10 10:10:59,317] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 8: [2023-05-10 10:10:59,317] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 8: [2023-05-10 10:10:59,317] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 1: [2023-05-10 10:10:59,317] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 8: [2023-05-10 10:10:59,317] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 25: [2023-05-10 10:10:59,317] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 25: [2023-05-10 10:10:59,317] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 23: [2023-05-10 10:10:59,317] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 19: [2023-05-10 10:10:59,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 9: [2023-05-10 10:10:59,319] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 13: [2023-05-10 10:10:59,318] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 13: [2023-05-10 10:10:59,319] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 13: [2023-05-10 10:10:59,319] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 13: [2023-05-10 10:10:59,319] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 9: [2023-05-10 10:10:59,319] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 13: [2023-05-10 10:10:59,319] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 13: [2023-05-10 10:10:59,319] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 13: [2023-05-10 10:10:59,319] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 9: [2023-05-10 10:10:59,320] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 9: [2023-05-10 10:10:59,320] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 13: [2023-05-10 10:10:59,319] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 1: [2023-05-10 10:10:59,320] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 9: [2023-05-10 10:10:59,320] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 9: [2023-05-10 10:10:59,320] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 19: [2023-05-10 10:10:59,320] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 9: [2023-05-10 10:10:59,321] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 0: [2023-05-10 10:10:59,323] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 0: [2023-05-10 10:10:59,323] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 9: [2023-05-10 10:10:59,324] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 25: [2023-05-10 10:10:59,324] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 19: [2023-05-10 10:10:59,325] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 29: [2023-05-10 10:10:59,325] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 29: [2023-05-10 10:10:59,325] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 29: [2023-05-10 10:10:59,325] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 29: [2023-05-10 10:10:59,326] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 29: [2023-05-10 10:10:59,326] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 29: [2023-05-10 10:10:59,326] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 29: [2023-05-10 10:10:59,326] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 19: [2023-05-10 10:10:59,326] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 29: [2023-05-10 10:10:59,326] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 19: [2023-05-10 10:10:59,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 9: [2023-05-10 10:10:59,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 9: [2023-05-10 10:10:59,328] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 0: [2023-05-10 10:10:59,328] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 9: [2023-05-10 10:10:59,328] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 9: [2023-05-10 10:10:59,328] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 8: [2023-05-10 10:10:59,329] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 1: [2023-05-10 10:10:59,329] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 1: [2023-05-10 10:10:59,329] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 9: [2023-05-10 10:10:59,329] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 9: [2023-05-10 10:10:59,329] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 25: [2023-05-10 10:10:59,329] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 29: [2023-05-10 10:10:59,330] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 28: [2023-05-10 10:10:59,330] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 29: [2023-05-10 10:10:59,330] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 1: [2023-05-10 10:10:59,330] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 23: [2023-05-10 10:10:59,330] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 23: [2023-05-10 10:10:59,330] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 19: [2023-05-10 10:10:59,331] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 26: [2023-05-10 10:10:59,331] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 28: [2023-05-10 10:10:59,331] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 26: [2023-05-10 10:10:59,331] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 29: [2023-05-10 10:10:59,331] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 13: [2023-05-10 10:10:59,331] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 13: [2023-05-10 10:10:59,332] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 13: [2023-05-10 10:10:59,332] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 13: [2023-05-10 10:10:59,333] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 28: [2023-05-10 10:10:59,333] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 13: [2023-05-10 10:10:59,333] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 13: [2023-05-10 10:10:59,334] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 13: [2023-05-10 10:10:59,334] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 13: [2023-05-10 10:10:59,335] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 29: [2023-05-10 10:10:59,334] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 28: [2023-05-10 10:10:59,335] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 0: [2023-05-10 10:10:59,335] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 29: [2023-05-10 10:10:59,335] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 1: [2023-05-10 10:10:59,335] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 26: [2023-05-10 10:10:59,335] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 26: [2023-05-10 10:10:59,335] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 26: [2023-05-10 10:10:59,335] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 29: [2023-05-10 10:10:59,335] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 26: [2023-05-10 10:10:59,335] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 26: [2023-05-10 10:10:59,335] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 26: [2023-05-10 10:10:59,335] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 26: [2023-05-10 10:10:59,335] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 29: [2023-05-10 10:10:59,335] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 8: [2023-05-10 10:10:59,336] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 29: [2023-05-10 10:10:59,336] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 26: [2023-05-10 10:10:59,336] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 0: [2023-05-10 10:10:59,337] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 0: [2023-05-10 10:10:59,337] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 0: [2023-05-10 10:10:59,337] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 1: [2023-05-10 10:10:59,337] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 2: [2023-05-10 10:10:59,297] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 2: [2023-05-10 10:10:59,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 2: [2023-05-10 10:10:59,298] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 2: [2023-05-10 10:10:59,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 2: [2023-05-10 10:10:59,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 2: [2023-05-10 10:10:59,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 2: [2023-05-10 10:10:59,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 2: [2023-05-10 10:10:59,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 2: [2023-05-10 10:10:59,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 28: [2023-05-10 10:10:59,338] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 25: [2023-05-10 10:10:59,339] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 8: [2023-05-10 10:10:59,340] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 25: [2023-05-10 10:10:59,339] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 11: [2023-05-10 10:10:59,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 11: [2023-05-10 10:10:59,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 0: [2023-05-10 10:10:59,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 1: [2023-05-10 10:10:59,342] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 4: [2023-05-10 10:10:59,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 4: [2023-05-10 10:10:59,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 4: [2023-05-10 10:10:59,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 25: [2023-05-10 10:10:59,342] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 8: [2023-05-10 10:10:59,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 8: [2023-05-10 10:10:59,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 23: [2023-05-10 10:10:59,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 19: [2023-05-10 10:10:59,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 19: [2023-05-10 10:10:59,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 8: [2023-05-10 10:10:59,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 23: [2023-05-10 10:10:59,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 26: [2023-05-10 10:10:59,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 26: [2023-05-10 10:10:59,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 26: [2023-05-10 10:10:59,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 26: [2023-05-10 10:10:59,347] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 26: [2023-05-10 10:10:59,347] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 8: [2023-05-10 10:10:59,347] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 25: [2023-05-10 10:10:59,347] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 26: [2023-05-10 10:10:59,347] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 23: [2023-05-10 10:10:59,347] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 4: [2023-05-10 10:10:59,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 4: [2023-05-10 10:10:59,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 4: [2023-05-10 10:10:59,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 8: [2023-05-10 10:10:59,347] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 4: [2023-05-10 10:10:59,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 1: [2023-05-10 10:10:59,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 23: [2023-05-10 10:10:59,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 4: [2023-05-10 10:10:59,348] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 20: [2023-05-10 10:10:59,348] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 25: [2023-05-10 10:10:59,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 25: [2023-05-10 10:10:59,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 23: [2023-05-10 10:10:59,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 0: [2023-05-10 10:10:59,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 19: [2023-05-10 10:10:59,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 10: [2023-05-10 10:10:59,351] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 1: [2023-05-10 10:10:59,352] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 10: [2023-05-10 10:10:59,352] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 10: [2023-05-10 10:10:59,352] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 10: [2023-05-10 10:10:59,352] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 10: [2023-05-10 10:10:59,352] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 10: [2023-05-10 10:10:59,352] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 10: [2023-05-10 10:10:59,352] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 10: [2023-05-10 10:10:59,352] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 0: [2023-05-10 10:10:59,352] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 3: [2023-05-10 10:10:59,353] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 0: [2023-05-10 10:10:59,355] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 1: [2023-05-10 10:10:59,356] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 1: [2023-05-10 10:10:59,357] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 10: [2023-05-10 10:10:59,358] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 10: [2023-05-10 10:10:59,358] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 10: [2023-05-10 10:10:59,359] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 10: [2023-05-10 10:10:59,359] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 3: [2023-05-10 10:10:59,360] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 3: [2023-05-10 10:10:59,360] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 10: [2023-05-10 10:10:59,360] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 10: [2023-05-10 10:10:59,361] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 10: [2023-05-10 10:10:59,361] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 11: [2023-05-10 10:10:59,362] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 22: [2023-05-10 10:10:59,362] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 22: [2023-05-10 10:10:59,362] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 10: [2023-05-10 10:10:59,362] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt... 11: [2023-05-10 10:10:59,362] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 11: [2023-05-10 10:10:59,362] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 20: [2023-05-10 10:10:59,364] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 22: [2023-05-10 10:10:59,364] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 0: [2023-05-10 10:10:59,364] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 22: [2023-05-10 10:10:59,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 22: [2023-05-10 10:10:59,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 22: [2023-05-10 10:10:59,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 22: [2023-05-10 10:10:59,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 22: [2023-05-10 10:10:59,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 11: [2023-05-10 10:10:59,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 20: [2023-05-10 10:10:59,367] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 22: [2023-05-10 10:10:59,367] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 4: [2023-05-10 10:10:59,367] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 11: [2023-05-10 10:10:59,368] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 11: [2023-05-10 10:10:59,368] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 11: [2023-05-10 10:10:59,368] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 11: [2023-05-10 10:10:59,368] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 0: [2023-05-10 10:10:59,369] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 0: [2023-05-10 10:10:59,369] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 22: [2023-05-10 10:10:59,369] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 3: [2023-05-10 10:10:59,370] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 22: [2023-05-10 10:10:59,370] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 3: [2023-05-10 10:10:59,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 22: [2023-05-10 10:10:59,371] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 30: [2023-05-10 10:10:59,372] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 30: [2023-05-10 10:10:59,373] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 30: [2023-05-10 10:10:59,373] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 3: [2023-05-10 10:10:59,373] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 9: [2023-05-10 10:10:59,374] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 29: [2023-05-10 10:10:59,374] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 22: [2023-05-10 10:10:59,374] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 4: [2023-05-10 10:10:59,375] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 4: [2023-05-10 10:10:59,375] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 3: [2023-05-10 10:10:59,375] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 4: [2023-05-10 10:10:59,375] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 22: [2023-05-10 10:10:59,375] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 22: [2023-05-10 10:10:59,375] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 22: [2023-05-10 10:10:59,376] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 16: [2023-05-10 10:10:59,377] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 16: [2023-05-10 10:10:59,377] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 16: [2023-05-10 10:10:59,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 16: [2023-05-10 10:10:59,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 16: [2023-05-10 10:10:59,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 16: [2023-05-10 10:10:59,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 16: [2023-05-10 10:10:59,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 16: [2023-05-10 10:10:59,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 31: [2023-05-10 10:10:59,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 20: [2023-05-10 10:10:59,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 20: [2023-05-10 10:10:59,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 31: [2023-05-10 10:10:59,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 26: [2023-05-10 10:10:59,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 31: [2023-05-10 10:10:59,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 31: [2023-05-10 10:10:59,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 31: [2023-05-10 10:10:59,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 31: [2023-05-10 10:10:59,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 31: [2023-05-10 10:10:59,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 4: [2023-05-10 10:10:59,380] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 31: [2023-05-10 10:10:59,380] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 17: [2023-05-10 10:10:59,380] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 17: [2023-05-10 10:10:59,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 17: [2023-05-10 10:10:59,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 17: [2023-05-10 10:10:59,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 17: [2023-05-10 10:10:59,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 17: [2023-05-10 10:10:59,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 17: [2023-05-10 10:10:59,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 17: [2023-05-10 10:10:59,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 2: [2023-05-10 10:10:59,349] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 2: [2023-05-10 10:10:59,349] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 2: [2023-05-10 10:10:59,367] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 3: [2023-05-10 10:10:59,382] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 2: [2023-05-10 10:10:59,370] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 2: [2023-05-10 10:10:59,370] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 2: [2023-05-10 10:10:59,380] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 2: [2023-05-10 10:10:59,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 2: [2023-05-10 10:10:59,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 2: [2023-05-10 10:10:59,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 3: [2023-05-10 10:10:59,382] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 20: [2023-05-10 10:10:59,383] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 4: [2023-05-10 10:10:59,383] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 31: [2023-05-10 10:10:59,383] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 16: [2023-05-10 10:10:59,383] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 31: [2023-05-10 10:10:59,384] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 16: [2023-05-10 10:10:59,384] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 17: [2023-05-10 10:10:59,384] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 29: [2023-05-10 10:10:59,384] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 4: [2023-05-10 10:10:59,385] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 9: [2023-05-10 10:10:59,386] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 9: [2023-05-10 10:10:59,387] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 9: [2023-05-10 10:10:59,387] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 4: [2023-05-10 10:10:59,387] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 31: [2023-05-10 10:10:59,387] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 16: [2023-05-10 10:10:59,386] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 16: [2023-05-10 10:10:59,387] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 16: [2023-05-10 10:10:59,387] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 16: [2023-05-10 10:10:59,387] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 31: [2023-05-10 10:10:59,387] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 16: [2023-05-10 10:10:59,388] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 31: [2023-05-10 10:10:59,388] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 29: [2023-05-10 10:10:59,388] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 9: [2023-05-10 10:10:59,389] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 16: [2023-05-10 10:10:59,389] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 31: [2023-05-10 10:10:59,389] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 31: [2023-05-10 10:10:59,389] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 11: [2023-05-10 10:10:59,389] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 31: [2023-05-10 10:10:59,389] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 26: [2023-05-10 10:10:59,389] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 17: [2023-05-10 10:10:59,390] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 30: [2023-05-10 10:10:59,389] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 30: [2023-05-10 10:10:59,389] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 30: [2023-05-10 10:10:59,389] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 20: [2023-05-10 10:10:59,391] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 9: [2023-05-10 10:10:59,391] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 9: [2023-05-10 10:10:59,391] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 2: [2023-05-10 10:10:59,392] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 17: [2023-05-10 10:10:59,393] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 17: [2023-05-10 10:10:59,393] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 17: [2023-05-10 10:10:59,393] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 30: [2023-05-10 10:10:59,394] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 30: [2023-05-10 10:10:59,394] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 3: [2023-05-10 10:10:59,394] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 17: [2023-05-10 10:10:59,394] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 9: [2023-05-10 10:10:59,394] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 17: [2023-05-10 10:10:59,395] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 29: [2023-05-10 10:10:59,395] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 17: [2023-05-10 10:10:59,395] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 9: [2023-05-10 10:10:59,397] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 2: [2023-05-10 10:10:59,398] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 3: [2023-05-10 10:10:59,398] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 3: [2023-05-10 10:10:59,401] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 11: [2023-05-10 10:10:59,400] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 20: [2023-05-10 10:10:59,401] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 20: [2023-05-10 10:10:59,401] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 20: [2023-05-10 10:10:59,401] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 3: [2023-05-10 10:10:59,402] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 26: [2023-05-10 10:10:59,402] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 3: [2023-05-10 10:10:59,402] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 30: [2023-05-10 10:10:59,402] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 11: [2023-05-10 10:10:59,403] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 11: [2023-05-10 10:10:59,403] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 30: [2023-05-10 10:10:59,404] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 29: [2023-05-10 10:10:59,406] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 11: [2023-05-10 10:10:59,406] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 11: [2023-05-10 10:10:59,406] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 30: [2023-05-10 10:10:59,406] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 29: [2023-05-10 10:10:59,407] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 6: [2023-05-10 10:10:59,408] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 6: [2023-05-10 10:10:59,408] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 6: [2023-05-10 10:10:59,408] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 6: [2023-05-10 10:10:59,408] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 6: [2023-05-10 10:10:59,409] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 6: [2023-05-10 10:10:59,409] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 6: [2023-05-10 10:10:59,409] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 6: [2023-05-10 10:10:59,409] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 13: [2023-05-10 10:10:59,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 13: [2023-05-10 10:10:59,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 13: [2023-05-10 10:10:59,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 20: [2023-05-10 10:10:59,410] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 20: [2023-05-10 10:10:59,412] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 6: [2023-05-10 10:10:59,412] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 6: [2023-05-10 10:10:59,413] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 2: [2023-05-10 10:10:59,414] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 29: [2023-05-10 10:10:59,414] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 20: [2023-05-10 10:10:59,414] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 10: [2023-05-10 10:10:59,414] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 10: [2023-05-10 10:10:59,414] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 10: [2023-05-10 10:10:59,415] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 26: [2023-05-10 10:10:59,416] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 26: [2023-05-10 10:10:59,416] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 13: [2023-05-10 10:10:59,416] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 26: [2023-05-10 10:10:59,416] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 26: [2023-05-10 10:10:59,416] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 2: [2023-05-10 10:10:59,417] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 2: [2023-05-10 10:10:59,417] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 9: [2023-05-10 10:10:59,417] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 29: [2023-05-10 10:10:59,417] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 29: [2023-05-10 10:10:59,417] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 29: [2023-05-10 10:10:59,417] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 29: [2023-05-10 10:10:59,417] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 26: [2023-05-10 10:10:59,417] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 26: [2023-05-10 10:10:59,417] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 26: [2023-05-10 10:10:59,417] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 10: [2023-05-10 10:10:59,417] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 6: [2023-05-10 10:10:59,419] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 6: [2023-05-10 10:10:59,419] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 2: [2023-05-10 10:10:59,419] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 6: [2023-05-10 10:10:59,419] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 6: [2023-05-10 10:10:59,420] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 6: [2023-05-10 10:10:59,420] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 9: [2023-05-10 10:10:59,421] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 6: [2023-05-10 10:10:59,421] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 22: [2023-05-10 10:10:59,421] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 9: [2023-05-10 10:10:59,422] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 9: [2023-05-10 10:10:59,422] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 3: [2023-05-10 10:10:59,424] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 30: [2023-05-10 10:10:59,424] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 10: [2023-05-10 10:10:59,424] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 3: [2023-05-10 10:10:59,424] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 9: [2023-05-10 10:10:59,425] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 9: [2023-05-10 10:10:59,426] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 9: [2023-05-10 10:10:59,426] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 30: [2023-05-10 10:10:59,426] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 30: [2023-05-10 10:10:59,426] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 13: [2023-05-10 10:10:59,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 13: [2023-05-10 10:10:59,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 13: [2023-05-10 10:10:59,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 13: [2023-05-10 10:10:59,427] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 30: [2023-05-10 10:10:59,428] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 22: [2023-05-10 10:10:59,428] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 30: [2023-05-10 10:10:59,429] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 2: [2023-05-10 10:10:59,429] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 20: [2023-05-10 10:10:59,430] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 20: [2023-05-10 10:10:59,431] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 10: [2023-05-10 10:10:59,432] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 10: [2023-05-10 10:10:59,432] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 10: [2023-05-10 10:10:59,432] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_28-model_00-model_states.pt. 20: [2023-05-10 10:10:59,433] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 17: [2023-05-10 10:10:59,433] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 29: [2023-05-10 10:10:59,434] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 22: [2023-05-10 10:10:59,435] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 31: [2023-05-10 10:10:59,435] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 31: [2023-05-10 10:10:59,436] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 13: [2023-05-10 10:10:59,437] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 16: [2023-05-10 10:10:59,438] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 13: [2023-05-10 10:10:59,438] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 13: [2023-05-10 10:10:59,438] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 16: [2023-05-10 10:10:59,439] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 10: [2023-05-10 10:10:59,439] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 10: [2023-05-10 10:10:59,443] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 10: [2023-05-10 10:10:59,443] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 22: [2023-05-10 10:10:59,443] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 22: [2023-05-10 10:10:59,443] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 26: [2023-05-10 10:10:59,445] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 26: [2023-05-10 10:10:59,446] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 29: [2023-05-10 10:10:59,447] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 29: [2023-05-10 10:10:59,447] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 29: [2023-05-10 10:10:59,447] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 29: [2023-05-10 10:10:59,448] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 22: [2023-05-10 10:10:59,449] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 10: [2023-05-10 10:10:59,449] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 26: [2023-05-10 10:10:59,450] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 26: [2023-05-10 10:10:59,450] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 31: [2023-05-10 10:10:59,451] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 26: [2023-05-10 10:10:59,451] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 22: [2023-05-10 10:10:59,452] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 10: [2023-05-10 10:10:59,452] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 17: [2023-05-10 10:10:59,452] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 26: [2023-05-10 10:10:59,453] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 22: [2023-05-10 10:10:59,453] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 13: [2023-05-10 10:10:59,454] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 31: [2023-05-10 10:10:59,456] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 31: [2023-05-10 10:10:59,457] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 16: [2023-05-10 10:10:59,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 31: [2023-05-10 10:10:59,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 16: [2023-05-10 10:10:59,459] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 16: [2023-05-10 10:10:59,459] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 16: [2023-05-10 10:10:59,459] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 16: [2023-05-10 10:10:59,459] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 16: [2023-05-10 10:10:59,459] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 22: [2023-05-10 10:10:59,460] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 22: [2023-05-10 10:10:59,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 22: [2023-05-10 10:10:59,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 16: [2023-05-10 10:10:59,462] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 16: [2023-05-10 10:10:59,463] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 13: [2023-05-10 10:10:59,463] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 13: [2023-05-10 10:10:59,463] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 13: [2023-05-10 10:10:59,463] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 31: [2023-05-10 10:10:59,464] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 31: [2023-05-10 10:10:59,464] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 31: [2023-05-10 10:10:59,464] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 13: [2023-05-10 10:10:59,464] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 6: [2023-05-10 10:10:59,464] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 6: [2023-05-10 10:10:59,464] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 17: [2023-05-10 10:10:59,465] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 17: [2023-05-10 10:10:59,466] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 31: [2023-05-10 10:10:59,466] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 10: [2023-05-10 10:10:59,466] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 10: [2023-05-10 10:10:59,466] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 10: [2023-05-10 10:10:59,467] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 17: [2023-05-10 10:10:59,469] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 22: [2023-05-10 10:10:59,469] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 12: [2023-05-10 10:10:59,470] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 12: [2023-05-10 10:10:59,470] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 12: [2023-05-10 10:10:59,470] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 12: [2023-05-10 10:10:59,470] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 12: [2023-05-10 10:10:59,470] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 12: [2023-05-10 10:10:59,470] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 12: [2023-05-10 10:10:59,470] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 12: [2023-05-10 10:10:59,470] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 24: [2023-05-10 10:10:59,473] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 24: [2023-05-10 10:10:59,474] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 24: [2023-05-10 10:10:59,474] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 24: [2023-05-10 10:10:59,474] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 24: [2023-05-10 10:10:59,474] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 24: [2023-05-10 10:10:59,474] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 24: [2023-05-10 10:10:59,474] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 24: [2023-05-10 10:10:59,474] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 12: [2023-05-10 10:10:59,474] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 31: [2023-05-10 10:10:59,475] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 17: [2023-05-10 10:10:59,476] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 17: [2023-05-10 10:10:59,476] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 17: [2023-05-10 10:10:59,476] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 24: [2023-05-10 10:10:59,478] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 24: [2023-05-10 10:10:59,478] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 18: [2023-05-10 10:10:59,478] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 18: [2023-05-10 10:10:59,478] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 18: [2023-05-10 10:10:59,478] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 18: [2023-05-10 10:10:59,478] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 18: [2023-05-10 10:10:59,478] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 18: [2023-05-10 10:10:59,478] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 18: [2023-05-10 10:10:59,478] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 18: [2023-05-10 10:10:59,479] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 12: [2023-05-10 10:10:59,480] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 27: [2023-05-10 10:10:59,480] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 27: [2023-05-10 10:10:59,480] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 27: [2023-05-10 10:10:59,480] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 27: [2023-05-10 10:10:59,480] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 27: [2023-05-10 10:10:59,480] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 27: [2023-05-10 10:10:59,480] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 27: [2023-05-10 10:10:59,480] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 12: [2023-05-10 10:10:59,480] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 27: [2023-05-10 10:10:59,480] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 22: [2023-05-10 10:10:59,481] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 17: [2023-05-10 10:10:59,481] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 12: [2023-05-10 10:10:59,482] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 24: [2023-05-10 10:10:59,482] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 24: [2023-05-10 10:10:59,482] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 12: [2023-05-10 10:10:59,482] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 22: [2023-05-10 10:10:59,482] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 24: [2023-05-10 10:10:59,482] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 24: [2023-05-10 10:10:59,483] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 12: [2023-05-10 10:10:59,483] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 24: [2023-05-10 10:10:59,483] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 18: [2023-05-10 10:10:59,483] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 18: [2023-05-10 10:10:59,484] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 18: [2023-05-10 10:10:59,484] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 12: [2023-05-10 10:10:59,484] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 24: [2023-05-10 10:10:59,484] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 18: [2023-05-10 10:10:59,484] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 18: [2023-05-10 10:10:59,484] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 27: [2023-05-10 10:10:59,484] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 12: [2023-05-10 10:10:59,484] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 6: [2023-05-10 10:10:59,485] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 15: [2023-05-10 10:10:59,484] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 15: [2023-05-10 10:10:59,485] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 15: [2023-05-10 10:10:59,485] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 31: [2023-05-10 10:10:59,485] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 15: [2023-05-10 10:10:59,485] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 15: [2023-05-10 10:10:59,485] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 15: [2023-05-10 10:10:59,485] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 15: [2023-05-10 10:10:59,485] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 15: [2023-05-10 10:10:59,485] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 27: [2023-05-10 10:10:59,486] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 6: [2023-05-10 10:10:59,485] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 6: [2023-05-10 10:10:59,485] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 6: [2023-05-10 10:10:59,485] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 18: [2023-05-10 10:10:59,486] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 18: [2023-05-10 10:10:59,487] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 27: [2023-05-10 10:10:59,487] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 18: [2023-05-10 10:10:59,487] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 21: [2023-05-10 10:10:59,488] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 21: [2023-05-10 10:10:59,488] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 16: [2023-05-10 10:10:59,488] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 27: [2023-05-10 10:10:59,488] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 15: [2023-05-10 10:10:59,489] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 16: [2023-05-10 10:10:59,489] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 6: [2023-05-10 10:10:59,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 27: [2023-05-10 10:10:59,489] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 21: [2023-05-10 10:10:59,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 21: [2023-05-10 10:10:59,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 21: [2023-05-10 10:10:59,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 21: [2023-05-10 10:10:59,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 27: [2023-05-10 10:10:59,489] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 21: [2023-05-10 10:10:59,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 27: [2023-05-10 10:10:59,489] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 21: [2023-05-10 10:10:59,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 27: [2023-05-10 10:10:59,490] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 22: [2023-05-10 10:10:59,490] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 22: [2023-05-10 10:10:59,491] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 15: [2023-05-10 10:10:59,492] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 21: [2023-05-10 10:10:59,495] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 7: [2023-05-10 10:10:59,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 7: [2023-05-10 10:10:59,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 7: [2023-05-10 10:10:59,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 7: [2023-05-10 10:10:59,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 7: [2023-05-10 10:10:59,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 7: [2023-05-10 10:10:59,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 7: [2023-05-10 10:10:59,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 7: [2023-05-10 10:10:59,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 31: [2023-05-10 10:10:59,495] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 15: [2023-05-10 10:10:59,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 17: [2023-05-10 10:10:59,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 31: [2023-05-10 10:10:59,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 21: [2023-05-10 10:10:59,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 16: [2023-05-10 10:10:59,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 16: [2023-05-10 10:10:59,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 15: [2023-05-10 10:10:59,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 21: [2023-05-10 10:10:59,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 31: [2023-05-10 10:10:59,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 21: [2023-05-10 10:10:59,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 15: [2023-05-10 10:10:59,497] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 21: [2023-05-10 10:10:59,497] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 21: [2023-05-10 10:10:59,497] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 6: [2023-05-10 10:10:59,497] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 15: [2023-05-10 10:10:59,497] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 21: [2023-05-10 10:10:59,497] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 15: [2023-05-10 10:10:59,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 17: [2023-05-10 10:10:59,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 15: [2023-05-10 10:10:59,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 16: [2023-05-10 10:10:59,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 16: [2023-05-10 10:10:59,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 31: [2023-05-10 10:10:59,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 21: [2023-05-10 10:10:59,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 7: [2023-05-10 10:10:59,499] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 7: [2023-05-10 10:10:59,500] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 7: [2023-05-10 10:10:59,501] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 7: [2023-05-10 10:10:59,501] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 7: [2023-05-10 10:10:59,501] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 17: [2023-05-10 10:10:59,502] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 7: [2023-05-10 10:10:59,502] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 7: [2023-05-10 10:10:59,503] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 6: [2023-05-10 10:10:59,503] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 7: [2023-05-10 10:10:59,504] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 17: [2023-05-10 10:10:59,509] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 17: [2023-05-10 10:10:59,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 6: [2023-05-10 10:10:59,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 17: [2023-05-10 10:10:59,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 17: [2023-05-10 10:10:59,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 6: [2023-05-10 10:10:59,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 6: [2023-05-10 10:10:59,517] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 6: [2023-05-10 10:10:59,524] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 12: [2023-05-10 10:10:59,526] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 19: [2023-05-10 10:10:59,528] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 19: [2023-05-10 10:10:59,528] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 5: [2023-05-10 10:10:59,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 5: [2023-05-10 10:10:59,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 5: [2023-05-10 10:10:59,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 5: [2023-05-10 10:10:59,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 5: [2023-05-10 10:10:59,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 5: [2023-05-10 10:10:59,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 5: [2023-05-10 10:10:59,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 24: [2023-05-10 10:10:59,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 24: [2023-05-10 10:10:59,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 5: [2023-05-10 10:10:59,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 19: [2023-05-10 10:10:59,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 19: [2023-05-10 10:10:59,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 19: [2023-05-10 10:10:59,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 19: [2023-05-10 10:10:59,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 19: [2023-05-10 10:10:59,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 23: [2023-05-10 10:10:59,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 23: [2023-05-10 10:10:59,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 19: [2023-05-10 10:10:59,531] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 23: [2023-05-10 10:10:59,531] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 23: [2023-05-10 10:10:59,531] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 23: [2023-05-10 10:10:59,531] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 23: [2023-05-10 10:10:59,531] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 23: [2023-05-10 10:10:59,531] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 23: [2023-05-10 10:10:59,531] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 6: [2023-05-10 10:10:59,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 19: [2023-05-10 10:10:59,533] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 19: [2023-05-10 10:10:59,533] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 5: [2023-05-10 10:10:59,533] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 19: [2023-05-10 10:10:59,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 5: [2023-05-10 10:10:59,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 27: [2023-05-10 10:10:59,534] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 23: [2023-05-10 10:10:59,535] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 14: [2023-05-10 10:10:59,535] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 14: [2023-05-10 10:10:59,535] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 14: [2023-05-10 10:10:59,535] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 14: [2023-05-10 10:10:59,535] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 14: [2023-05-10 10:10:59,536] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 23: [2023-05-10 10:10:59,536] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 14: [2023-05-10 10:10:59,536] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 14: [2023-05-10 10:10:59,536] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 5: [2023-05-10 10:10:59,536] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 18: [2023-05-10 10:10:59,535] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 14: [2023-05-10 10:10:59,536] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 23: [2023-05-10 10:10:59,536] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 19: [2023-05-10 10:10:59,536] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 19: [2023-05-10 10:10:59,536] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 19: [2023-05-10 10:10:59,536] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 18: [2023-05-10 10:10:59,537] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 6: [2023-05-10 10:10:59,538] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 19: [2023-05-10 10:10:59,538] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 15: [2023-05-10 10:10:59,538] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 5: [2023-05-10 10:10:59,538] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 5: [2023-05-10 10:10:59,539] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 23: [2023-05-10 10:10:59,539] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 19: [2023-05-10 10:10:59,539] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 5: [2023-05-10 10:10:59,539] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 23: [2023-05-10 10:10:59,540] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 23: [2023-05-10 10:10:59,540] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 14: [2023-05-10 10:10:59,540] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 5: [2023-05-10 10:10:59,540] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 23: [2023-05-10 10:10:59,540] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 5: [2023-05-10 10:10:59,540] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 23: [2023-05-10 10:10:59,541] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 14: [2023-05-10 10:10:59,542] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 14: [2023-05-10 10:10:59,543] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 14: [2023-05-10 10:10:59,543] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 6: [2023-05-10 10:10:59,544] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 27: [2023-05-10 10:10:59,545] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 27: [2023-05-10 10:10:59,545] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 14: [2023-05-10 10:10:59,545] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 14: [2023-05-10 10:10:59,545] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 14: [2023-05-10 10:10:59,546] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 14: [2023-05-10 10:10:59,546] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 18: [2023-05-10 10:10:59,547] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 18: [2023-05-10 10:10:59,547] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 18: [2023-05-10 10:10:59,547] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 7: [2023-05-10 10:10:59,548] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 12: [2023-05-10 10:10:59,549] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 24: [2023-05-10 10:10:59,551] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 18: [2023-05-10 10:10:59,551] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 18: [2023-05-10 10:10:59,551] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 18: [2023-05-10 10:10:59,551] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 15: [2023-05-10 10:10:59,552] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 24: [2023-05-10 10:10:59,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 21: [2023-05-10 10:10:59,552] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 21: [2023-05-10 10:10:59,553] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 27: [2023-05-10 10:10:59,553] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 21: [2023-05-10 10:10:59,554] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 27: [2023-05-10 10:10:59,554] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 12: [2023-05-10 10:10:59,554] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 27: [2023-05-10 10:10:59,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 27: [2023-05-10 10:10:59,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 27: [2023-05-10 10:10:59,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 27: [2023-05-10 10:10:59,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 24: [2023-05-10 10:10:59,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 24: [2023-05-10 10:10:59,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 24: [2023-05-10 10:10:59,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 24: [2023-05-10 10:10:59,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 12: [2023-05-10 10:10:59,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 12: [2023-05-10 10:10:59,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 24: [2023-05-10 10:10:59,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 24: [2023-05-10 10:10:59,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 21: [2023-05-10 10:10:59,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 25: [2023-05-10 10:10:59,557] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 25: [2023-05-10 10:10:59,557] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 25: [2023-05-10 10:10:59,557] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 25: [2023-05-10 10:10:59,557] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 25: [2023-05-10 10:10:59,557] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 25: [2023-05-10 10:10:59,557] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 25: [2023-05-10 10:10:59,557] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 25: [2023-05-10 10:10:59,557] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 15: [2023-05-10 10:10:59,558] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 21: [2023-05-10 10:10:59,560] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 7: [2023-05-10 10:10:59,560] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 7: [2023-05-10 10:10:59,560] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 7: [2023-05-10 10:10:59,560] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 7: [2023-05-10 10:10:59,560] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 18: [2023-05-10 10:10:59,560] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 18: [2023-05-10 10:10:59,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 21: [2023-05-10 10:10:59,563] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 25: [2023-05-10 10:10:59,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 25: [2023-05-10 10:10:59,564] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 12: [2023-05-10 10:10:59,565] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 25: [2023-05-10 10:10:59,564] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 25: [2023-05-10 10:10:59,565] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 25: [2023-05-10 10:10:59,565] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 25: [2023-05-10 10:10:59,565] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 25: [2023-05-10 10:10:59,565] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 25: [2023-05-10 10:10:59,566] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 7: [2023-05-10 10:10:59,566] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 7: [2023-05-10 10:10:59,567] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 7: [2023-05-10 10:10:59,567] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 21: [2023-05-10 10:10:59,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 21: [2023-05-10 10:10:59,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 12: [2023-05-10 10:10:59,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 12: [2023-05-10 10:10:59,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 12: [2023-05-10 10:10:59,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 27: [2023-05-10 10:10:59,573] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 7: [2023-05-10 10:10:59,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 27: [2023-05-10 10:10:59,573] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 15: [2023-05-10 10:10:59,574] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 21: [2023-05-10 10:10:59,577] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 18: [2023-05-10 10:10:59,579] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 18: [2023-05-10 10:10:59,579] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 18: [2023-05-10 10:10:59,579] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 5: [2023-05-10 10:10:59,579] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 15: [2023-05-10 10:10:59,579] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 18: [2023-05-10 10:10:59,580] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 18: [2023-05-10 10:10:59,580] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 15: [2023-05-10 10:10:59,581] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 15: [2023-05-10 10:10:59,581] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 15: [2023-05-10 10:10:59,581] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 15: [2023-05-10 10:10:59,581] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 15: [2023-05-10 10:10:59,581] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 19: [2023-05-10 10:10:59,581] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 18: [2023-05-10 10:10:59,582] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 5: [2023-05-10 10:10:59,583] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 1: [2023-05-10 10:10:59,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 1: [2023-05-10 10:10:59,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 1: [2023-05-10 10:10:59,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 1: [2023-05-10 10:10:59,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 1: [2023-05-10 10:10:59,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 1: [2023-05-10 10:10:59,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 1: [2023-05-10 10:10:59,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 8: [2023-05-10 10:10:59,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 8: [2023-05-10 10:10:59,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 8: [2023-05-10 10:10:59,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 1: [2023-05-10 10:10:59,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 8: [2023-05-10 10:10:59,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 8: [2023-05-10 10:10:59,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 8: [2023-05-10 10:10:59,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 8: [2023-05-10 10:10:59,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 8: [2023-05-10 10:10:59,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 21: [2023-05-10 10:10:59,585] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 21: [2023-05-10 10:10:59,586] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 19: [2023-05-10 10:10:59,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 21: [2023-05-10 10:10:59,586] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 27: [2023-05-10 10:10:59,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 24: [2023-05-10 10:10:59,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 24: [2023-05-10 10:10:59,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 12: [2023-05-10 10:10:59,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 19: [2023-05-10 10:10:59,588] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 27: [2023-05-10 10:10:59,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 19: [2023-05-10 10:10:59,588] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 27: [2023-05-10 10:10:59,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 27: [2023-05-10 10:10:59,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 12: [2023-05-10 10:10:59,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 1: [2023-05-10 10:10:59,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 27: [2023-05-10 10:10:59,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 7: [2023-05-10 10:10:59,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 12: [2023-05-10 10:10:59,590] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 7: [2023-05-10 10:10:59,590] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 23: [2023-05-10 10:10:59,590] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 23: [2023-05-10 10:10:59,590] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 23: [2023-05-10 10:10:59,590] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 21: [2023-05-10 10:10:59,591] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 8: [2023-05-10 10:10:59,591] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 8: [2023-05-10 10:10:59,591] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 8: [2023-05-10 10:10:59,591] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 8: [2023-05-10 10:10:59,591] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 1: [2023-05-10 10:10:59,592] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 8: [2023-05-10 10:10:59,592] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 8: [2023-05-10 10:10:59,592] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 24: [2023-05-10 10:10:59,592] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 1: [2023-05-10 10:10:59,592] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 8: [2023-05-10 10:10:59,592] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 24: [2023-05-10 10:10:59,592] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 1: [2023-05-10 10:10:59,592] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 24: [2023-05-10 10:10:59,592] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 8: [2023-05-10 10:10:59,592] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 1: [2023-05-10 10:10:59,593] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 1: [2023-05-10 10:10:59,593] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 24: [2023-05-10 10:10:59,593] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 1: [2023-05-10 10:10:59,594] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 7: [2023-05-10 10:10:59,594] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 1: [2023-05-10 10:10:59,595] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 21: [2023-05-10 10:10:59,595] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 14: [2023-05-10 10:10:59,595] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 14: [2023-05-10 10:10:59,595] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 7: [2023-05-10 10:10:59,596] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 28: [2023-05-10 10:10:59,597] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 28: [2023-05-10 10:10:59,597] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 28: [2023-05-10 10:10:59,598] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 28: [2023-05-10 10:10:59,598] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 28: [2023-05-10 10:10:59,598] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 28: [2023-05-10 10:10:59,598] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 28: [2023-05-10 10:10:59,598] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 28: [2023-05-10 10:10:59,598] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 23: [2023-05-10 10:10:59,598] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 5: [2023-05-10 10:10:59,598] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 14: [2023-05-10 10:10:59,599] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 12: [2023-05-10 10:10:59,599] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 19: [2023-05-10 10:10:59,600] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 14: [2023-05-10 10:10:59,600] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 21: [2023-05-10 10:10:59,601] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 21: [2023-05-10 10:10:59,601] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 28: [2023-05-10 10:10:59,601] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 19: [2023-05-10 10:10:59,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 23: [2023-05-10 10:10:59,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 5: [2023-05-10 10:10:59,602] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 12: [2023-05-10 10:10:59,602] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 28: [2023-05-10 10:10:59,602] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 7: [2023-05-10 10:10:59,602] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 7: [2023-05-10 10:10:59,603] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 7: [2023-05-10 10:10:59,603] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 12: [2023-05-10 10:10:59,603] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 20: [2023-05-10 10:10:59,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 20: [2023-05-10 10:10:59,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 20: [2023-05-10 10:10:59,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 20: [2023-05-10 10:10:59,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 20: [2023-05-10 10:10:59,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 20: [2023-05-10 10:10:59,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 20: [2023-05-10 10:10:59,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 20: [2023-05-10 10:10:59,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 12: [2023-05-10 10:10:59,604] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 28: [2023-05-10 10:10:59,606] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 28: [2023-05-10 10:10:59,607] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 19: [2023-05-10 10:10:59,607] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 5: [2023-05-10 10:10:59,607] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 5: [2023-05-10 10:10:59,607] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 5: [2023-05-10 10:10:59,607] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 5: [2023-05-10 10:10:59,607] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 28: [2023-05-10 10:10:59,608] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 5: [2023-05-10 10:10:59,607] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 19: [2023-05-10 10:10:59,608] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 15: [2023-05-10 10:10:59,608] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 23: [2023-05-10 10:10:59,608] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 23: [2023-05-10 10:10:59,608] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 23: [2023-05-10 10:10:59,608] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 28: [2023-05-10 10:10:59,608] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 20: [2023-05-10 10:10:59,608] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 20: [2023-05-10 10:10:59,608] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 28: [2023-05-10 10:10:59,609] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 5: [2023-05-10 10:10:59,609] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 28: [2023-05-10 10:10:59,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 19: [2023-05-10 10:10:59,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 15: [2023-05-10 10:10:59,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 19: [2023-05-10 10:10:59,611] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 14: [2023-05-10 10:10:59,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 4: [2023-05-10 10:10:59,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 4: [2023-05-10 10:10:59,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 23: [2023-05-10 10:10:59,612] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 20: [2023-05-10 10:10:59,612] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 20: [2023-05-10 10:10:59,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 20: [2023-05-10 10:10:59,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 14: [2023-05-10 10:10:59,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 20: [2023-05-10 10:10:59,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 4: [2023-05-10 10:10:59,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 20: [2023-05-10 10:10:59,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 0: [2023-05-10 10:10:59,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 4: [2023-05-10 10:10:59,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 20: [2023-05-10 10:10:59,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 0: [2023-05-10 10:10:59,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 0: [2023-05-10 10:10:59,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 4: [2023-05-10 10:10:59,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 4: [2023-05-10 10:10:59,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 4: [2023-05-10 10:10:59,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 0: [2023-05-10 10:10:59,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 0: [2023-05-10 10:10:59,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 0: [2023-05-10 10:10:59,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 0: [2023-05-10 10:10:59,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 15: [2023-05-10 10:10:59,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 4: [2023-05-10 10:10:59,614] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 0: [2023-05-10 10:10:59,615] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 15: [2023-05-10 10:10:59,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 15: [2023-05-10 10:10:59,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 23: [2023-05-10 10:10:59,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 15: [2023-05-10 10:10:59,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 14: [2023-05-10 10:10:59,617] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 0: [2023-05-10 10:10:59,618] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 23: [2023-05-10 10:10:59,618] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 14: [2023-05-10 10:10:59,618] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 4: [2023-05-10 10:10:59,618] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 4: [2023-05-10 10:10:59,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 25: [2023-05-10 10:10:59,618] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 14: [2023-05-10 10:10:59,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 4: [2023-05-10 10:10:59,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 0: [2023-05-10 10:10:59,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 19: [2023-05-10 10:10:59,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 0: [2023-05-10 10:10:59,620] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 4: [2023-05-10 10:10:59,620] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 19: [2023-05-10 10:10:59,620] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 4: [2023-05-10 10:10:59,620] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 4: [2023-05-10 10:10:59,620] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 4: [2023-05-10 10:10:59,621] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 0: [2023-05-10 10:10:59,621] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 14: [2023-05-10 10:10:59,621] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 0: [2023-05-10 10:10:59,621] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 0: [2023-05-10 10:10:59,622] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 4: [2023-05-10 10:10:59,622] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 0: [2023-05-10 10:10:59,622] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 0: [2023-05-10 10:10:59,622] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 25: [2023-05-10 10:10:59,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 25: [2023-05-10 10:10:59,624] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 25: [2023-05-10 10:10:59,624] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 25: [2023-05-10 10:10:59,624] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 5: [2023-05-10 10:10:59,625] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 23: [2023-05-10 10:10:59,625] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 25: [2023-05-10 10:10:59,625] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 14: [2023-05-10 10:10:59,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 25: [2023-05-10 10:10:59,627] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 14: [2023-05-10 10:10:59,627] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 25: [2023-05-10 10:10:59,627] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 19: [2023-05-10 10:10:59,628] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 19: [2023-05-10 10:10:59,629] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 23: [2023-05-10 10:10:59,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 19: [2023-05-10 10:10:59,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 1: [2023-05-10 10:10:59,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 23: [2023-05-10 10:10:59,635] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 23: [2023-05-10 10:10:59,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 5: [2023-05-10 10:10:59,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 23: [2023-05-10 10:10:59,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 5: [2023-05-10 10:10:59,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 5: [2023-05-10 10:10:59,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 19: [2023-05-10 10:10:59,641] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 5: [2023-05-10 10:10:59,642] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 5: [2023-05-10 10:10:59,642] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 25: [2023-05-10 10:10:59,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 14: [2023-05-10 10:10:59,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 14: [2023-05-10 10:10:59,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 8: [2023-05-10 10:10:59,645] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 14: [2023-05-10 10:10:59,646] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 14: [2023-05-10 10:10:59,647] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 8: [2023-05-10 10:10:59,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 26: [2023-05-10 10:10:59,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 26: [2023-05-10 10:10:59,648] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 25: [2023-05-10 10:10:59,649] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 8: [2023-05-10 10:10:59,649] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 26: [2023-05-10 10:10:59,650] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 26: [2023-05-10 10:10:59,650] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 26: [2023-05-10 10:10:59,650] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 26: [2023-05-10 10:10:59,650] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 26: [2023-05-10 10:10:59,650] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 26: [2023-05-10 10:10:59,651] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 2: [2023-05-10 10:10:59,651] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 2: [2023-05-10 10:10:59,651] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 11: [2023-05-10 10:10:59,651] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 11: [2023-05-10 10:10:59,651] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 1: [2023-05-10 10:10:59,651] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 1: [2023-05-10 10:10:59,652] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 11: [2023-05-10 10:10:59,651] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 11: [2023-05-10 10:10:59,651] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 11: [2023-05-10 10:10:59,652] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 1: [2023-05-10 10:10:59,652] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 11: [2023-05-10 10:10:59,652] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 11: [2023-05-10 10:10:59,652] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 25: [2023-05-10 10:10:59,652] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 11: [2023-05-10 10:10:59,652] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 26: [2023-05-10 10:10:59,653] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 26: [2023-05-10 10:10:59,653] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 28: [2023-05-10 10:10:59,653] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 25: [2023-05-10 10:10:59,654] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 9: [2023-05-10 10:10:59,655] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 9: [2023-05-10 10:10:59,655] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 2: [2023-05-10 10:10:59,654] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 2: [2023-05-10 10:10:59,654] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 2: [2023-05-10 10:10:59,654] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 2: [2023-05-10 10:10:59,654] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 2: [2023-05-10 10:10:59,654] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 2: [2023-05-10 10:10:59,655] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 25: [2023-05-10 10:10:59,655] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 25: [2023-05-10 10:10:59,655] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 28: [2023-05-10 10:10:59,655] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 2: [2023-05-10 10:10:59,655] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 2: [2023-05-10 10:10:59,656] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 11: [2023-05-10 10:10:59,656] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 11: [2023-05-10 10:10:59,656] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 25: [2023-05-10 10:10:59,657] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 25: [2023-05-10 10:10:59,657] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 26: [2023-05-10 10:10:59,657] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 26: [2023-05-10 10:10:59,658] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 9: [2023-05-10 10:10:59,657] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 9: [2023-05-10 10:10:59,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 9: [2023-05-10 10:10:59,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 9: [2023-05-10 10:10:59,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 9: [2023-05-10 10:10:59,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 9: [2023-05-10 10:10:59,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 8: [2023-05-10 10:10:59,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 8: [2023-05-10 10:10:59,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 30: [2023-05-10 10:10:59,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 30: [2023-05-10 10:10:59,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 30: [2023-05-10 10:10:59,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 30: [2023-05-10 10:10:59,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 30: [2023-05-10 10:10:59,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 8: [2023-05-10 10:10:59,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 30: [2023-05-10 10:10:59,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 30: [2023-05-10 10:10:59,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 8: [2023-05-10 10:10:59,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 8: [2023-05-10 10:10:59,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 30: [2023-05-10 10:10:59,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 1: [2023-05-10 10:10:59,660] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 1: [2023-05-10 10:10:59,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 26: [2023-05-10 10:10:59,660] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 20: [2023-05-10 10:10:59,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 20: [2023-05-10 10:10:59,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 11: [2023-05-10 10:10:59,661] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 9: [2023-05-10 10:10:59,661] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 26: [2023-05-10 10:10:59,661] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 26: [2023-05-10 10:10:59,661] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 9: [2023-05-10 10:10:59,661] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 26: [2023-05-10 10:10:59,661] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 11: [2023-05-10 10:10:59,661] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 11: [2023-05-10 10:10:59,662] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 11: [2023-05-10 10:10:59,663] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 11: [2023-05-10 10:10:59,663] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 11: [2023-05-10 10:10:59,664] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 9: [2023-05-10 10:10:59,663] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 9: [2023-05-10 10:10:59,664] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 10: [2023-05-10 10:10:59,664] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 10: [2023-05-10 10:10:59,664] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 10: [2023-05-10 10:10:59,664] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 3: [2023-05-10 10:10:59,664] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 3: [2023-05-10 10:10:59,664] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 2: [2023-05-10 10:10:59,664] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 2: [2023-05-10 10:10:59,664] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 10: [2023-05-10 10:10:59,664] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 10: [2023-05-10 10:10:59,664] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 10: [2023-05-10 10:10:59,664] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 10: [2023-05-10 10:10:59,664] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 9: [2023-05-10 10:10:59,665] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 10: [2023-05-10 10:10:59,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 2: [2023-05-10 10:10:59,665] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 9: [2023-05-10 10:10:59,665] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 3: [2023-05-10 10:10:59,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 3: [2023-05-10 10:10:59,665] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 2: [2023-05-10 10:10:59,666] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 9: [2023-05-10 10:10:59,666] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 2: [2023-05-10 10:10:59,666] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 3: [2023-05-10 10:10:59,666] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 3: [2023-05-10 10:10:59,666] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 3: [2023-05-10 10:10:59,666] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 9: [2023-05-10 10:10:59,666] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 3: [2023-05-10 10:10:59,666] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 2: [2023-05-10 10:10:59,666] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 1: [2023-05-10 10:10:59,666] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 1: [2023-05-10 10:10:59,666] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 1: [2023-05-10 10:10:59,667] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 3: [2023-05-10 10:10:59,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 3: [2023-05-10 10:10:59,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 4: [2023-05-10 10:10:59,669] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 30: [2023-05-10 10:10:59,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 3: [2023-05-10 10:10:59,670] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 30: [2023-05-10 10:10:59,671] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 28: [2023-05-10 10:10:59,671] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 10: [2023-05-10 10:10:59,671] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 10: [2023-05-10 10:10:59,671] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 30: [2023-05-10 10:10:59,671] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 10: [2023-05-10 10:10:59,671] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 30: [2023-05-10 10:10:59,672] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 30: [2023-05-10 10:10:59,672] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 10: [2023-05-10 10:10:59,672] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 30: [2023-05-10 10:10:59,673] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 10: [2023-05-10 10:10:59,673] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 30: [2023-05-10 10:10:59,673] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 10: [2023-05-10 10:10:59,673] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 10: [2023-05-10 10:10:59,674] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 1: [2023-05-10 10:10:59,674] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 8: [2023-05-10 10:10:59,674] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 28: [2023-05-10 10:10:59,675] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 3: [2023-05-10 10:10:59,674] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 3: [2023-05-10 10:10:59,675] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 3: [2023-05-10 10:10:59,675] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 10: [2023-05-10 10:10:59,675] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 28: [2023-05-10 10:10:59,676] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 3: [2023-05-10 10:10:59,676] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 3: [2023-05-10 10:10:59,676] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 30: [2023-05-10 10:10:59,675] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 0: [2023-05-10 10:10:59,678] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 0: [2023-05-10 10:10:59,678] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 0: [2023-05-10 10:10:59,678] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 28: [2023-05-10 10:10:59,678] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 28: [2023-05-10 10:10:59,678] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 8: [2023-05-10 10:10:59,679] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 20: [2023-05-10 10:10:59,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 4: [2023-05-10 10:10:59,680] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 8: [2023-05-10 10:10:59,680] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 1: [2023-05-10 10:10:59,681] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 4: [2023-05-10 10:10:59,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 4: [2023-05-10 10:10:59,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 4: [2023-05-10 10:10:59,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 4: [2023-05-10 10:10:59,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 20: [2023-05-10 10:10:59,682] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 20: [2023-05-10 10:10:59,682] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 4: [2023-05-10 10:10:59,684] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 28: [2023-05-10 10:10:59,684] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 20: [2023-05-10 10:10:59,684] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 4: [2023-05-10 10:10:59,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 28: [2023-05-10 10:10:59,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 28: [2023-05-10 10:10:59,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 8: [2023-05-10 10:10:59,687] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 8: [2023-05-10 10:10:59,688] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 0: [2023-05-10 10:10:59,688] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 0: [2023-05-10 10:10:59,688] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 0: [2023-05-10 10:10:59,688] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 0: [2023-05-10 10:10:59,688] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 0: [2023-05-10 10:10:59,688] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 1: [2023-05-10 10:10:59,689] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 8: [2023-05-10 10:10:59,690] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 8: [2023-05-10 10:10:59,690] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 8: [2023-05-10 10:10:59,690] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 20: [2023-05-10 10:10:59,691] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 20: [2023-05-10 10:10:59,691] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 4: [2023-05-10 10:10:59,692] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 20: [2023-05-10 10:10:59,693] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 20: [2023-05-10 10:10:59,693] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 1: [2023-05-10 10:10:59,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 1: [2023-05-10 10:10:59,696] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 1: [2023-05-10 10:10:59,698] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 1: [2023-05-10 10:10:59,698] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 28: [2023-05-10 10:10:59,702] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 0: [2023-05-10 10:10:59,705] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 28: [2023-05-10 10:10:59,705] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 26: [2023-05-10 10:10:59,706] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 4: [2023-05-10 10:10:59,707] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 2: [2023-05-10 10:10:59,707] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 4: [2023-05-10 10:10:59,707] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 28: [2023-05-10 10:10:59,707] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 0: [2023-05-10 10:10:59,708] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 0: [2023-05-10 10:10:59,708] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 2: [2023-05-10 10:10:59,708] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 26: [2023-05-10 10:10:59,708] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 4: [2023-05-10 10:10:59,708] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 4: [2023-05-10 10:10:59,709] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 4: [2023-05-10 10:10:59,709] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 13: [2023-05-10 10:10:59,708] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 13: [2023-05-10 10:10:59,708] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 13: [2023-05-10 10:10:59,708] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 13: [2023-05-10 10:10:59,709] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 13: [2023-05-10 10:10:59,709] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 13: [2023-05-10 10:10:59,709] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 13: [2023-05-10 10:10:59,709] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 13: [2023-05-10 10:10:59,709] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 11: [2023-05-10 10:10:59,709] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 11: [2023-05-10 10:10:59,709] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 20: [2023-05-10 10:10:59,711] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 20: [2023-05-10 10:10:59,711] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 4: [2023-05-10 10:10:59,713] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 9: [2023-05-10 10:10:59,714] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 28: [2023-05-10 10:10:59,716] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 4: [2023-05-10 10:10:59,716] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 28: [2023-05-10 10:10:59,717] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 13: [2023-05-10 10:10:59,717] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 0: [2023-05-10 10:10:59,717] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 0: [2023-05-10 10:10:59,718] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 28: [2023-05-10 10:10:59,719] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 0: [2023-05-10 10:10:59,719] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 26: [2023-05-10 10:10:59,720] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 26: [2023-05-10 10:10:59,720] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 13: [2023-05-10 10:10:59,721] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 20: [2023-05-10 10:10:59,721] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 13: [2023-05-10 10:10:59,721] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 9: [2023-05-10 10:10:59,721] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 9: [2023-05-10 10:10:59,721] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 13: [2023-05-10 10:10:59,722] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 0: [2023-05-10 10:10:59,723] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 0: [2023-05-10 10:10:59,723] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 13: [2023-05-10 10:10:59,723] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 9: [2023-05-10 10:10:59,723] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 9: [2023-05-10 10:10:59,724] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 9: [2023-05-10 10:10:59,724] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 9: [2023-05-10 10:10:59,724] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 9: [2023-05-10 10:10:59,724] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 13: [2023-05-10 10:10:59,723] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 3: [2023-05-10 10:10:59,724] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 3: [2023-05-10 10:10:59,724] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 3: [2023-05-10 10:10:59,724] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 13: [2023-05-10 10:10:59,724] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 13: [2023-05-10 10:10:59,725] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt... 20: [2023-05-10 10:10:59,725] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 10: [2023-05-10 10:10:59,725] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 10: [2023-05-10 10:10:59,726] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 10: [2023-05-10 10:10:59,726] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 20: [2023-05-10 10:10:59,726] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 20: [2023-05-10 10:10:59,726] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 2: [2023-05-10 10:10:59,728] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 2: [2023-05-10 10:10:59,728] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 26: [2023-05-10 10:10:59,731] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 2: [2023-05-10 10:10:59,731] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 11: [2023-05-10 10:10:59,731] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 11: [2023-05-10 10:10:59,731] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 11: [2023-05-10 10:10:59,731] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 10: [2023-05-10 10:10:59,733] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 3: [2023-05-10 10:10:59,733] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 26: [2023-05-10 10:10:59,733] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 26: [2023-05-10 10:10:59,734] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 26: [2023-05-10 10:10:59,734] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 26: [2023-05-10 10:10:59,734] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 11: [2023-05-10 10:10:59,734] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 11: [2023-05-10 10:10:59,734] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 11: [2023-05-10 10:10:59,735] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 2: [2023-05-10 10:10:59,736] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 2: [2023-05-10 10:10:59,736] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 2: [2023-05-10 10:10:59,736] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 11: [2023-05-10 10:10:59,737] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 11: [2023-05-10 10:10:59,737] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 26: [2023-05-10 10:10:59,737] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 9: [2023-05-10 10:10:59,740] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 10: [2023-05-10 10:10:59,740] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 10: [2023-05-10 10:10:59,740] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 10: [2023-05-10 10:10:59,740] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 10: [2023-05-10 10:10:59,740] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 2: [2023-05-10 10:10:59,740] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 3: [2023-05-10 10:10:59,741] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 3: [2023-05-10 10:10:59,741] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 3: [2023-05-10 10:10:59,741] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 3: [2023-05-10 10:10:59,741] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 30: [2023-05-10 10:10:59,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 30: [2023-05-10 10:10:59,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 3: [2023-05-10 10:10:59,748] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 30: [2023-05-10 10:10:59,748] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 26: [2023-05-10 10:10:59,748] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 26: [2023-05-10 10:10:59,749] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 3: [2023-05-10 10:10:59,749] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 3: [2023-05-10 10:10:59,749] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 10: [2023-05-10 10:10:59,751] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 9: [2023-05-10 10:10:59,752] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 9: [2023-05-10 10:10:59,753] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 10: [2023-05-10 10:10:59,753] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 9: [2023-05-10 10:10:59,754] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 10: [2023-05-10 10:10:59,755] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 9: [2023-05-10 10:10:59,755] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 9: [2023-05-10 10:10:59,756] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 9: [2023-05-10 10:10:59,756] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 9: [2023-05-10 10:10:59,756] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 30: [2023-05-10 10:10:59,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 2: [2023-05-10 10:10:59,760] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 10: [2023-05-10 10:10:59,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 11: [2023-05-10 10:10:59,761] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 11: [2023-05-10 10:10:59,761] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 3: [2023-05-10 10:10:59,762] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 11: [2023-05-10 10:10:59,763] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 26: [2023-05-10 10:10:59,764] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 26: [2023-05-10 10:10:59,764] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 11: [2023-05-10 10:10:59,764] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 30: [2023-05-10 10:10:59,765] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 30: [2023-05-10 10:10:59,765] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 30: [2023-05-10 10:10:59,765] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 11: [2023-05-10 10:10:59,767] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 2: [2023-05-10 10:10:59,767] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 2: [2023-05-10 10:10:59,767] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 11: [2023-05-10 10:10:59,768] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 26: [2023-05-10 10:10:59,768] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 3: [2023-05-10 10:10:59,769] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 3: [2023-05-10 10:10:59,769] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 3: [2023-05-10 10:10:59,770] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 26: [2023-05-10 10:10:59,772] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 10: [2023-05-10 10:10:59,772] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 3: [2023-05-10 10:10:59,772] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 10: [2023-05-10 10:10:59,772] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 10: [2023-05-10 10:10:59,772] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 2: [2023-05-10 10:10:59,773] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 10: [2023-05-10 10:10:59,773] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 2: [2023-05-10 10:10:59,773] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 2: [2023-05-10 10:10:59,777] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 30: [2023-05-10 10:10:59,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 30: [2023-05-10 10:10:59,780] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 13: [2023-05-10 10:10:59,781] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 30: [2023-05-10 10:10:59,782] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 30: [2023-05-10 10:10:59,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 2: [2023-05-10 10:10:59,787] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 30: [2023-05-10 10:10:59,795] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 30: [2023-05-10 10:10:59,801] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 13: [2023-05-10 10:10:59,800] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 30: [2023-05-10 10:10:59,802] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 30: [2023-05-10 10:10:59,802] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 13: [2023-05-10 10:10:59,804] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 13: [2023-05-10 10:10:59,804] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 13: [2023-05-10 10:10:59,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 30: [2023-05-10 10:10:59,811] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 13: [2023-05-10 10:10:59,820] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 13: [2023-05-10 10:10:59,820] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 13: [2023-05-10 10:10:59,820] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 13: [2023-05-10 10:10:59,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_29-model_00-model_states.pt. 13: [2023-05-10 10:10:59,840] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 13: [2023-05-10 10:10:59,848] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 13: [2023-05-10 10:10:59,849] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 13: [2023-05-10 10:10:59,855] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 13: [2023-05-10 10:10:59,856] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 13: [2023-05-10 10:10:59,856] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 13: [2023-05-10 10:10:59,858] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 31: [2023-05-10 10:10:59,882] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 31: [2023-05-10 10:10:59,882] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 31: [2023-05-10 10:10:59,882] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 31: [2023-05-10 10:10:59,882] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 31: [2023-05-10 10:10:59,882] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 31: [2023-05-10 10:10:59,882] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 31: [2023-05-10 10:10:59,882] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 31: [2023-05-10 10:10:59,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 31: [2023-05-10 10:10:59,886] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 31: [2023-05-10 10:10:59,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 31: [2023-05-10 10:10:59,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 21: [2023-05-10 10:10:59,890] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 21: [2023-05-10 10:10:59,890] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 31: [2023-05-10 10:10:59,890] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 31: [2023-05-10 10:10:59,890] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 31: [2023-05-10 10:10:59,891] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 31: [2023-05-10 10:10:59,891] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 31: [2023-05-10 10:10:59,892] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 21: [2023-05-10 10:10:59,892] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 21: [2023-05-10 10:10:59,893] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 21: [2023-05-10 10:10:59,893] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 21: [2023-05-10 10:10:59,893] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 21: [2023-05-10 10:10:59,893] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 21: [2023-05-10 10:10:59,893] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 21: [2023-05-10 10:10:59,897] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 21: [2023-05-10 10:10:59,898] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 21: [2023-05-10 10:10:59,899] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 21: [2023-05-10 10:10:59,900] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 21: [2023-05-10 10:10:59,900] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 21: [2023-05-10 10:10:59,900] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 21: [2023-05-10 10:10:59,900] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 21: [2023-05-10 10:10:59,901] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 24: [2023-05-10 10:10:59,913] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 24: [2023-05-10 10:10:59,913] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 24: [2023-05-10 10:10:59,913] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 24: [2023-05-10 10:10:59,913] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 24: [2023-05-10 10:10:59,913] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 24: [2023-05-10 10:10:59,913] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 24: [2023-05-10 10:10:59,913] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 24: [2023-05-10 10:10:59,913] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 24: [2023-05-10 10:10:59,917] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 24: [2023-05-10 10:10:59,917] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 24: [2023-05-10 10:10:59,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 24: [2023-05-10 10:10:59,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 24: [2023-05-10 10:10:59,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 24: [2023-05-10 10:10:59,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 24: [2023-05-10 10:10:59,922] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 24: [2023-05-10 10:10:59,923] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 12: [2023-05-10 10:10:59,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 12: [2023-05-10 10:10:59,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 12: [2023-05-10 10:10:59,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 12: [2023-05-10 10:10:59,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 12: [2023-05-10 10:10:59,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 12: [2023-05-10 10:10:59,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 12: [2023-05-10 10:10:59,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 12: [2023-05-10 10:10:59,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 12: [2023-05-10 10:10:59,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 12: [2023-05-10 10:10:59,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 12: [2023-05-10 10:10:59,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 12: [2023-05-10 10:10:59,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 12: [2023-05-10 10:10:59,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 12: [2023-05-10 10:10:59,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 12: [2023-05-10 10:10:59,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 12: [2023-05-10 10:10:59,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 31: [2023-05-10 10:10:59,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 31: [2023-05-10 10:10:59,940] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 31: [2023-05-10 10:10:59,948] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 21: [2023-05-10 10:10:59,955] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 21: [2023-05-10 10:10:59,955] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 21: [2023-05-10 10:10:59,957] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 31: [2023-05-10 10:10:59,961] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 31: [2023-05-10 10:10:59,961] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 7: [2023-05-10 10:10:59,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 7: [2023-05-10 10:10:59,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 7: [2023-05-10 10:10:59,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 21: [2023-05-10 10:10:59,965] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 7: [2023-05-10 10:10:59,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 7: [2023-05-10 10:10:59,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 7: [2023-05-10 10:10:59,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 21: [2023-05-10 10:10:59,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 21: [2023-05-10 10:10:59,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 21: [2023-05-10 10:10:59,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 21: [2023-05-10 10:10:59,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 7: [2023-05-10 10:10:59,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 7: [2023-05-10 10:10:59,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 31: [2023-05-10 10:10:59,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 31: [2023-05-10 10:10:59,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 31: [2023-05-10 10:10:59,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 31: [2023-05-10 10:10:59,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 31: [2023-05-10 10:10:59,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 24: [2023-05-10 10:10:59,970] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 24: [2023-05-10 10:10:59,970] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 7: [2023-05-10 10:10:59,971] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 7: [2023-05-10 10:10:59,971] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 7: [2023-05-10 10:10:59,972] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 7: [2023-05-10 10:10:59,972] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 7: [2023-05-10 10:10:59,972] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 7: [2023-05-10 10:10:59,973] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 31: [2023-05-10 10:10:59,974] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 7: [2023-05-10 10:10:59,974] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 7: [2023-05-10 10:10:59,975] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 29: [2023-05-10 10:10:59,981] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 29: [2023-05-10 10:10:59,981] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 29: [2023-05-10 10:10:59,982] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 12: [2023-05-10 10:10:59,982] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 29: [2023-05-10 10:10:59,982] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 21: [2023-05-10 10:10:59,983] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 21: [2023-05-10 10:10:59,983] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 29: [2023-05-10 10:10:59,983] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 29: [2023-05-10 10:10:59,983] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 29: [2023-05-10 10:10:59,983] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 29: [2023-05-10 10:10:59,983] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 29: [2023-05-10 10:10:59,985] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 29: [2023-05-10 10:10:59,986] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 29: [2023-05-10 10:10:59,987] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 21: [2023-05-10 10:10:59,989] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 29: [2023-05-10 10:10:59,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 29: [2023-05-10 10:10:59,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 29: [2023-05-10 10:10:59,992] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 29: [2023-05-10 10:10:59,992] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 24: [2023-05-10 10:10:59,992] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 29: [2023-05-10 10:10:59,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 24: [2023-05-10 10:10:59,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 24: [2023-05-10 10:10:59,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 24: [2023-05-10 10:10:59,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 24: [2023-05-10 10:10:59,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 24: [2023-05-10 10:10:59,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 24: [2023-05-10 10:10:59,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 24: [2023-05-10 10:10:59,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 21: [2023-05-10 10:10:59,997] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 21: [2023-05-10 10:10:59,998] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 21: [2023-05-10 10:10:59,998] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 1: [2023-05-10 10:10:59,998] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 1: [2023-05-10 10:10:59,998] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 1: [2023-05-10 10:10:59,999] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 1: [2023-05-10 10:10:59,999] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 1: [2023-05-10 10:10:59,999] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 1: [2023-05-10 10:10:59,999] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 1: [2023-05-10 10:10:59,999] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 1: [2023-05-10 10:10:59,999] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 21: [2023-05-10 10:10:59,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 21: [2023-05-10 10:11:00,000] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 31: [2023-05-10 10:11:00,000] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 31: [2023-05-10 10:11:00,001] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 31: [2023-05-10 10:11:00,002] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 31: [2023-05-10 10:11:00,003] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 31: [2023-05-10 10:11:00,003] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 1: [2023-05-10 10:11:00,005] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 1: [2023-05-10 10:11:00,005] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 1: [2023-05-10 10:11:00,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 1: [2023-05-10 10:11:00,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 12: [2023-05-10 10:11:00,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 1: [2023-05-10 10:11:00,007] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 1: [2023-05-10 10:11:00,007] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 1: [2023-05-10 10:11:00,007] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 1: [2023-05-10 10:11:00,008] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 12: [2023-05-10 10:11:00,012] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 12: [2023-05-10 10:11:00,012] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 23: [2023-05-10 10:11:00,013] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 23: [2023-05-10 10:11:00,013] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 23: [2023-05-10 10:11:00,015] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 23: [2023-05-10 10:11:00,015] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 23: [2023-05-10 10:11:00,015] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 23: [2023-05-10 10:11:00,015] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 23: [2023-05-10 10:11:00,015] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 23: [2023-05-10 10:11:00,015] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 23: [2023-05-10 10:11:00,018] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 23: [2023-05-10 10:11:00,018] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 23: [2023-05-10 10:11:00,019] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 12: [2023-05-10 10:11:00,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 12: [2023-05-10 10:11:00,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 12: [2023-05-10 10:11:00,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 12: [2023-05-10 10:11:00,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 12: [2023-05-10 10:11:00,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 15: [2023-05-10 10:11:00,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 15: [2023-05-10 10:11:00,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 15: [2023-05-10 10:11:00,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 15: [2023-05-10 10:11:00,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 15: [2023-05-10 10:11:00,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 15: [2023-05-10 10:11:00,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 15: [2023-05-10 10:11:00,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 15: [2023-05-10 10:11:00,021] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 23: [2023-05-10 10:11:00,023] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 23: [2023-05-10 10:11:00,024] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 23: [2023-05-10 10:11:00,024] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 15: [2023-05-10 10:11:00,024] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 23: [2023-05-10 10:11:00,025] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 23: [2023-05-10 10:11:00,025] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 24: [2023-05-10 10:11:00,026] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 7: [2023-05-10 10:11:00,028] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 7: [2023-05-10 10:11:00,029] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 24: [2023-05-10 10:11:00,029] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 7: [2023-05-10 10:11:00,029] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 24: [2023-05-10 10:11:00,030] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 15: [2023-05-10 10:11:00,031] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 15: [2023-05-10 10:11:00,031] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 7: [2023-05-10 10:11:00,031] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 24: [2023-05-10 10:11:00,031] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 7: [2023-05-10 10:11:00,031] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 7: [2023-05-10 10:11:00,031] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 7: [2023-05-10 10:11:00,031] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 7: [2023-05-10 10:11:00,031] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 15: [2023-05-10 10:11:00,032] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 29: [2023-05-10 10:11:00,032] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 15: [2023-05-10 10:11:00,032] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 24: [2023-05-10 10:11:00,032] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 15: [2023-05-10 10:11:00,032] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 24: [2023-05-10 10:11:00,033] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 15: [2023-05-10 10:11:00,033] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 5: [2023-05-10 10:11:00,033] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 5: [2023-05-10 10:11:00,033] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 15: [2023-05-10 10:11:00,033] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 5: [2023-05-10 10:11:00,036] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 5: [2023-05-10 10:11:00,036] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 5: [2023-05-10 10:11:00,036] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 5: [2023-05-10 10:11:00,036] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 5: [2023-05-10 10:11:00,036] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 5: [2023-05-10 10:11:00,036] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 29: [2023-05-10 10:11:00,039] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 29: [2023-05-10 10:11:00,039] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 5: [2023-05-10 10:11:00,041] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 5: [2023-05-10 10:11:00,041] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 5: [2023-05-10 10:11:00,042] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 12: [2023-05-10 10:11:00,042] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 5: [2023-05-10 10:11:00,043] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 5: [2023-05-10 10:11:00,043] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 5: [2023-05-10 10:11:00,044] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 5: [2023-05-10 10:11:00,045] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 5: [2023-05-10 10:11:00,046] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 12: [2023-05-10 10:11:00,049] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 12: [2023-05-10 10:11:00,053] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 12: [2023-05-10 10:11:00,055] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 12: [2023-05-10 10:11:00,056] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 29: [2023-05-10 10:11:00,056] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 7: [2023-05-10 10:11:00,057] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 12: [2023-05-10 10:11:00,058] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 12: [2023-05-10 10:11:00,058] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 29: [2023-05-10 10:11:00,058] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 22: [2023-05-10 10:11:00,059] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 7: [2023-05-10 10:11:00,059] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 22: [2023-05-10 10:11:00,059] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 22: [2023-05-10 10:11:00,059] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 22: [2023-05-10 10:11:00,059] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 22: [2023-05-10 10:11:00,060] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 22: [2023-05-10 10:11:00,060] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 22: [2023-05-10 10:11:00,060] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 1: [2023-05-10 10:11:00,060] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 1: [2023-05-10 10:11:00,060] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 22: [2023-05-10 10:11:00,060] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 1: [2023-05-10 10:11:00,063] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 29: [2023-05-10 10:11:00,063] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 29: [2023-05-10 10:11:00,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 7: [2023-05-10 10:11:00,059] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 7: [2023-05-10 10:11:00,060] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 7: [2023-05-10 10:11:00,060] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 7: [2023-05-10 10:11:00,060] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 7: [2023-05-10 10:11:00,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 7: [2023-05-10 10:11:00,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 29: [2023-05-10 10:11:00,065] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 29: [2023-05-10 10:11:00,065] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 29: [2023-05-10 10:11:00,065] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 29: [2023-05-10 10:11:00,065] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 22: [2023-05-10 10:11:00,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 22: [2023-05-10 10:11:00,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 1: [2023-05-10 10:11:00,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 22: [2023-05-10 10:11:00,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 22: [2023-05-10 10:11:00,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 22: [2023-05-10 10:11:00,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 22: [2023-05-10 10:11:00,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 23: [2023-05-10 10:11:00,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 23: [2023-05-10 10:11:00,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 23: [2023-05-10 10:11:00,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 22: [2023-05-10 10:11:00,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 1: [2023-05-10 10:11:00,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 15: [2023-05-10 10:11:00,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 22: [2023-05-10 10:11:00,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 1: [2023-05-10 10:11:00,075] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 1: [2023-05-10 10:11:00,075] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 1: [2023-05-10 10:11:00,075] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 25: [2023-05-10 10:11:00,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 25: [2023-05-10 10:11:00,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 25: [2023-05-10 10:11:00,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 25: [2023-05-10 10:11:00,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 25: [2023-05-10 10:11:00,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 25: [2023-05-10 10:11:00,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 25: [2023-05-10 10:11:00,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 25: [2023-05-10 10:11:00,077] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 1: [2023-05-10 10:11:00,082] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 25: [2023-05-10 10:11:00,081] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 29: [2023-05-10 10:11:00,082] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 25: [2023-05-10 10:11:00,082] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 25: [2023-05-10 10:11:00,082] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 25: [2023-05-10 10:11:00,083] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 25: [2023-05-10 10:11:00,083] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 25: [2023-05-10 10:11:00,083] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 25: [2023-05-10 10:11:00,083] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 1: [2023-05-10 10:11:00,084] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 25: [2023-05-10 10:11:00,084] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 23: [2023-05-10 10:11:00,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 23: [2023-05-10 10:11:00,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 23: [2023-05-10 10:11:00,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 23: [2023-05-10 10:11:00,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 1: [2023-05-10 10:11:00,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 23: [2023-05-10 10:11:00,092] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 1: [2023-05-10 10:11:00,093] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 23: [2023-05-10 10:11:00,095] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 15: [2023-05-10 10:11:00,095] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 29: [2023-05-10 10:11:00,096] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 23: [2023-05-10 10:11:00,096] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 5: [2023-05-10 10:11:00,096] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 5: [2023-05-10 10:11:00,096] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 29: [2023-05-10 10:11:00,096] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 5: [2023-05-10 10:11:00,096] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 23: [2023-05-10 10:11:00,096] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 16: [2023-05-10 10:11:00,096] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 16: [2023-05-10 10:11:00,096] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 16: [2023-05-10 10:11:00,097] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 16: [2023-05-10 10:11:00,097] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 16: [2023-05-10 10:11:00,097] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 16: [2023-05-10 10:11:00,097] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 16: [2023-05-10 10:11:00,097] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 16: [2023-05-10 10:11:00,098] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 29: [2023-05-10 10:11:00,099] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 29: [2023-05-10 10:11:00,100] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 28: [2023-05-10 10:11:00,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 16: [2023-05-10 10:11:00,100] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 28: [2023-05-10 10:11:00,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 28: [2023-05-10 10:11:00,101] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 28: [2023-05-10 10:11:00,101] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 15: [2023-05-10 10:11:00,101] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 28: [2023-05-10 10:11:00,101] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 28: [2023-05-10 10:11:00,101] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 28: [2023-05-10 10:11:00,102] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 28: [2023-05-10 10:11:00,102] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 1: [2023-05-10 10:11:00,102] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 28: [2023-05-10 10:11:00,103] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 1: [2023-05-10 10:11:00,103] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 16: [2023-05-10 10:11:00,103] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 1: [2023-05-10 10:11:00,104] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 16: [2023-05-10 10:11:00,104] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 1: [2023-05-10 10:11:00,105] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 28: [2023-05-10 10:11:00,105] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 16: [2023-05-10 10:11:00,106] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 28: [2023-05-10 10:11:00,106] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 16: [2023-05-10 10:11:00,106] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 16: [2023-05-10 10:11:00,106] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 5: [2023-05-10 10:11:00,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 5: [2023-05-10 10:11:00,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 5: [2023-05-10 10:11:00,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 16: [2023-05-10 10:11:00,107] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 15: [2023-05-10 10:11:00,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 16: [2023-05-10 10:11:00,108] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 8: [2023-05-10 10:11:00,108] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 8: [2023-05-10 10:11:00,108] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 8: [2023-05-10 10:11:00,108] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 15: [2023-05-10 10:11:00,109] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 8: [2023-05-10 10:11:00,109] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 8: [2023-05-10 10:11:00,109] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 8: [2023-05-10 10:11:00,109] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 8: [2023-05-10 10:11:00,109] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 8: [2023-05-10 10:11:00,109] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 4: [2023-05-10 10:11:00,109] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 4: [2023-05-10 10:11:00,109] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 4: [2023-05-10 10:11:00,110] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 4: [2023-05-10 10:11:00,110] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 4: [2023-05-10 10:11:00,110] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 4: [2023-05-10 10:11:00,110] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 28: [2023-05-10 10:11:00,110] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 4: [2023-05-10 10:11:00,110] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 5: [2023-05-10 10:11:00,110] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 4: [2023-05-10 10:11:00,110] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 15: [2023-05-10 10:11:00,111] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 15: [2023-05-10 10:11:00,111] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 15: [2023-05-10 10:11:00,111] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 15: [2023-05-10 10:11:00,111] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 28: [2023-05-10 10:11:00,111] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 28: [2023-05-10 10:11:00,112] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 28: [2023-05-10 10:11:00,112] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 28: [2023-05-10 10:11:00,112] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 17: [2023-05-10 10:11:00,112] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 17: [2023-05-10 10:11:00,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 17: [2023-05-10 10:11:00,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 17: [2023-05-10 10:11:00,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 17: [2023-05-10 10:11:00,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 17: [2023-05-10 10:11:00,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 17: [2023-05-10 10:11:00,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 5: [2023-05-10 10:11:00,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 17: [2023-05-10 10:11:00,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 4: [2023-05-10 10:11:00,114] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 8: [2023-05-10 10:11:00,115] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 17: [2023-05-10 10:11:00,115] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 8: [2023-05-10 10:11:00,115] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 4: [2023-05-10 10:11:00,115] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 8: [2023-05-10 10:11:00,115] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 8: [2023-05-10 10:11:00,116] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 8: [2023-05-10 10:11:00,116] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 4: [2023-05-10 10:11:00,116] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 4: [2023-05-10 10:11:00,116] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 4: [2023-05-10 10:11:00,116] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 8: [2023-05-10 10:11:00,116] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 4: [2023-05-10 10:11:00,116] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 8: [2023-05-10 10:11:00,116] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 4: [2023-05-10 10:11:00,116] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 8: [2023-05-10 10:11:00,117] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 23: [2023-05-10 10:11:00,117] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 4: [2023-05-10 10:11:00,118] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 23: [2023-05-10 10:11:00,119] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 23: [2023-05-10 10:11:00,119] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 5: [2023-05-10 10:11:00,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 19: [2023-05-10 10:11:00,120] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 19: [2023-05-10 10:11:00,120] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 22: [2023-05-10 10:11:00,120] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 22: [2023-05-10 10:11:00,121] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 5: [2023-05-10 10:11:00,122] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 5: [2023-05-10 10:11:00,122] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 23: [2023-05-10 10:11:00,123] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 17: [2023-05-10 10:11:00,123] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 19: [2023-05-10 10:11:00,124] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 19: [2023-05-10 10:11:00,124] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 19: [2023-05-10 10:11:00,124] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 19: [2023-05-10 10:11:00,124] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 19: [2023-05-10 10:11:00,124] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 19: [2023-05-10 10:11:00,124] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 17: [2023-05-10 10:11:00,125] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 19: [2023-05-10 10:11:00,126] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 17: [2023-05-10 10:11:00,125] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 17: [2023-05-10 10:11:00,126] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 19: [2023-05-10 10:11:00,126] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 17: [2023-05-10 10:11:00,126] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 17: [2023-05-10 10:11:00,127] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 23: [2023-05-10 10:11:00,128] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 17: [2023-05-10 10:11:00,128] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 19: [2023-05-10 10:11:00,128] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 19: [2023-05-10 10:11:00,129] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 19: [2023-05-10 10:11:00,131] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 22: [2023-05-10 10:11:00,131] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 19: [2023-05-10 10:11:00,132] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 19: [2023-05-10 10:11:00,133] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 15: [2023-05-10 10:11:00,133] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 19: [2023-05-10 10:11:00,133] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 5: [2023-05-10 10:11:00,133] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 26: [2023-05-10 10:11:00,133] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 26: [2023-05-10 10:11:00,133] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 25: [2023-05-10 10:11:00,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 30: [2023-05-10 10:11:00,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 30: [2023-05-10 10:11:00,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 30: [2023-05-10 10:11:00,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 30: [2023-05-10 10:11:00,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 30: [2023-05-10 10:11:00,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 30: [2023-05-10 10:11:00,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 30: [2023-05-10 10:11:00,134] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 30: [2023-05-10 10:11:00,135] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 5: [2023-05-10 10:11:00,135] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 25: [2023-05-10 10:11:00,136] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 5: [2023-05-10 10:11:00,137] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 26: [2023-05-10 10:11:00,137] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 26: [2023-05-10 10:11:00,137] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 26: [2023-05-10 10:11:00,137] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 26: [2023-05-10 10:11:00,137] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 26: [2023-05-10 10:11:00,137] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 26: [2023-05-10 10:11:00,137] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 26: [2023-05-10 10:11:00,138] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 26: [2023-05-10 10:11:00,139] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 25: [2023-05-10 10:11:00,139] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 15: [2023-05-10 10:11:00,141] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 5: [2023-05-10 10:11:00,141] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 5: [2023-05-10 10:11:00,142] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 25: [2023-05-10 10:11:00,143] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 22: [2023-05-10 10:11:00,143] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 22: [2023-05-10 10:11:00,143] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 16: [2023-05-10 10:11:00,144] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 15: [2023-05-10 10:11:00,144] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 22: [2023-05-10 10:11:00,145] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 22: [2023-05-10 10:11:00,146] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 22: [2023-05-10 10:11:00,146] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 22: [2023-05-10 10:11:00,146] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 25: [2023-05-10 10:11:00,147] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 25: [2023-05-10 10:11:00,147] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 25: [2023-05-10 10:11:00,147] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 25: [2023-05-10 10:11:00,147] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 30: [2023-05-10 10:11:00,147] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 30: [2023-05-10 10:11:00,147] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 30: [2023-05-10 10:11:00,147] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 30: [2023-05-10 10:11:00,147] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 26: [2023-05-10 10:11:00,147] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 26: [2023-05-10 10:11:00,147] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 30: [2023-05-10 10:11:00,148] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 28: [2023-05-10 10:11:00,147] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 26: [2023-05-10 10:11:00,148] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 30: [2023-05-10 10:11:00,148] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 26: [2023-05-10 10:11:00,148] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 15: [2023-05-10 10:11:00,148] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 26: [2023-05-10 10:11:00,149] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 26: [2023-05-10 10:11:00,149] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 22: [2023-05-10 10:11:00,149] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 30: [2023-05-10 10:11:00,149] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 30: [2023-05-10 10:11:00,150] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 15: [2023-05-10 10:11:00,150] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 15: [2023-05-10 10:11:00,150] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 15: [2023-05-10 10:11:00,151] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 25: [2023-05-10 10:11:00,157] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 22: [2023-05-10 10:11:00,158] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 27: [2023-05-10 10:11:00,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 27: [2023-05-10 10:11:00,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 27: [2023-05-10 10:11:00,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 27: [2023-05-10 10:11:00,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 27: [2023-05-10 10:11:00,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 27: [2023-05-10 10:11:00,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 27: [2023-05-10 10:11:00,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 27: [2023-05-10 10:11:00,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 17: [2023-05-10 10:11:00,160] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 16: [2023-05-10 10:11:00,161] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 28: [2023-05-10 10:11:00,161] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 28: [2023-05-10 10:11:00,162] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 27: [2023-05-10 10:11:00,162] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 27: [2023-05-10 10:11:00,163] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 25: [2023-05-10 10:11:00,163] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 4: [2023-05-10 10:11:00,163] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 6: [2023-05-10 10:11:00,163] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 6: [2023-05-10 10:11:00,163] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 27: [2023-05-10 10:11:00,164] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 27: [2023-05-10 10:11:00,164] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 16: [2023-05-10 10:11:00,165] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 20: [2023-05-10 10:11:00,166] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 20: [2023-05-10 10:11:00,166] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 27: [2023-05-10 10:11:00,166] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 27: [2023-05-10 10:11:00,167] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 6: [2023-05-10 10:11:00,166] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 6: [2023-05-10 10:11:00,167] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 25: [2023-05-10 10:11:00,167] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 6: [2023-05-10 10:11:00,167] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 6: [2023-05-10 10:11:00,167] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 6: [2023-05-10 10:11:00,167] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 27: [2023-05-10 10:11:00,167] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 6: [2023-05-10 10:11:00,167] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 4: [2023-05-10 10:11:00,167] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 6: [2023-05-10 10:11:00,167] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 6: [2023-05-10 10:11:00,168] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 28: [2023-05-10 10:11:00,168] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 27: [2023-05-10 10:11:00,168] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 20: [2023-05-10 10:11:00,170] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 20: [2023-05-10 10:11:00,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 20: [2023-05-10 10:11:00,171] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 20: [2023-05-10 10:11:00,171] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 20: [2023-05-10 10:11:00,171] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 20: [2023-05-10 10:11:00,171] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 20: [2023-05-10 10:11:00,171] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 20: [2023-05-10 10:11:00,171] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 22: [2023-05-10 10:11:00,171] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 22: [2023-05-10 10:11:00,172] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 8: [2023-05-10 10:11:00,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 8: [2023-05-10 10:11:00,174] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 8: [2023-05-10 10:11:00,174] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 6: [2023-05-10 10:11:00,173] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 14: [2023-05-10 10:11:00,175] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 14: [2023-05-10 10:11:00,175] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 14: [2023-05-10 10:11:00,175] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 14: [2023-05-10 10:11:00,175] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 14: [2023-05-10 10:11:00,175] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 14: [2023-05-10 10:11:00,175] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 14: [2023-05-10 10:11:00,175] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 14: [2023-05-10 10:11:00,176] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 22: [2023-05-10 10:11:00,176] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 6: [2023-05-10 10:11:00,176] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 25: [2023-05-10 10:11:00,176] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 6: [2023-05-10 10:11:00,177] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 25: [2023-05-10 10:11:00,177] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 6: [2023-05-10 10:11:00,177] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 6: [2023-05-10 10:11:00,177] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 16: [2023-05-10 10:11:00,178] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 4: [2023-05-10 10:11:00,177] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 4: [2023-05-10 10:11:00,177] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 4: [2023-05-10 10:11:00,178] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 25: [2023-05-10 10:11:00,178] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 25: [2023-05-10 10:11:00,178] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 6: [2023-05-10 10:11:00,179] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 28: [2023-05-10 10:11:00,179] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 25: [2023-05-10 10:11:00,179] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 20: [2023-05-10 10:11:00,179] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 20: [2023-05-10 10:11:00,179] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 22: [2023-05-10 10:11:00,180] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 22: [2023-05-10 10:11:00,180] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 20: [2023-05-10 10:11:00,180] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 17: [2023-05-10 10:11:00,180] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 14: [2023-05-10 10:11:00,179] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 16: [2023-05-10 10:11:00,180] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 16: [2023-05-10 10:11:00,180] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 16: [2023-05-10 10:11:00,180] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 16: [2023-05-10 10:11:00,180] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 16: [2023-05-10 10:11:00,180] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 20: [2023-05-10 10:11:00,181] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 20: [2023-05-10 10:11:00,181] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 14: [2023-05-10 10:11:00,181] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 20: [2023-05-10 10:11:00,181] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 19: [2023-05-10 10:11:00,181] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 19: [2023-05-10 10:11:00,181] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 26: [2023-05-10 10:11:00,181] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 4: [2023-05-10 10:11:00,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 4: [2023-05-10 10:11:00,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 8: [2023-05-10 10:11:00,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 8: [2023-05-10 10:11:00,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 8: [2023-05-10 10:11:00,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 4: [2023-05-10 10:11:00,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 14: [2023-05-10 10:11:00,182] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 19: [2023-05-10 10:11:00,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 8: [2023-05-10 10:11:00,183] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 8: [2023-05-10 10:11:00,183] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 28: [2023-05-10 10:11:00,183] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 14: [2023-05-10 10:11:00,183] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 19: [2023-05-10 10:11:00,184] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 14: [2023-05-10 10:11:00,184] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 14: [2023-05-10 10:11:00,184] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 16: [2023-05-10 10:11:00,185] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 28: [2023-05-10 10:11:00,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 14: [2023-05-10 10:11:00,186] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 14: [2023-05-10 10:11:00,186] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 4: [2023-05-10 10:11:00,188] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 9: [2023-05-10 10:11:00,189] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 9: [2023-05-10 10:11:00,189] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 28: [2023-05-10 10:11:00,189] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 9: [2023-05-10 10:11:00,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 10: [2023-05-10 10:11:00,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 9: [2023-05-10 10:11:00,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 4: [2023-05-10 10:11:00,190] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 9: [2023-05-10 10:11:00,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 9: [2023-05-10 10:11:00,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 9: [2023-05-10 10:11:00,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 10: [2023-05-10 10:11:00,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 10: [2023-05-10 10:11:00,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 10: [2023-05-10 10:11:00,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 10: [2023-05-10 10:11:00,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 10: [2023-05-10 10:11:00,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 10: [2023-05-10 10:11:00,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 9: [2023-05-10 10:11:00,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 10: [2023-05-10 10:11:00,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 2: [2023-05-10 10:11:00,191] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 28: [2023-05-10 10:11:00,191] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 28: [2023-05-10 10:11:00,191] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 28: [2023-05-10 10:11:00,191] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 2: [2023-05-10 10:11:00,191] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 3: [2023-05-10 10:11:00,191] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 3: [2023-05-10 10:11:00,191] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 3: [2023-05-10 10:11:00,191] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 3: [2023-05-10 10:11:00,191] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 3: [2023-05-10 10:11:00,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 3: [2023-05-10 10:11:00,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 3: [2023-05-10 10:11:00,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 3: [2023-05-10 10:11:00,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 0: [2023-05-10 10:11:00,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 2: [2023-05-10 10:11:00,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 0: [2023-05-10 10:11:00,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 0: [2023-05-10 10:11:00,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 0: [2023-05-10 10:11:00,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 0: [2023-05-10 10:11:00,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 0: [2023-05-10 10:11:00,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 0: [2023-05-10 10:11:00,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 0: [2023-05-10 10:11:00,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 2: [2023-05-10 10:11:00,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 2: [2023-05-10 10:11:00,193] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 19: [2023-05-10 10:11:00,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 19: [2023-05-10 10:11:00,193] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 2: [2023-05-10 10:11:00,193] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 2: [2023-05-10 10:11:00,193] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 2: [2023-05-10 10:11:00,193] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 26: [2023-05-10 10:11:00,193] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 2: [2023-05-10 10:11:00,194] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 19: [2023-05-10 10:11:00,194] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 17: [2023-05-10 10:11:00,195] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 0: [2023-05-10 10:11:00,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 9: [2023-05-10 10:11:00,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 9: [2023-05-10 10:11:00,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 9: [2023-05-10 10:11:00,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 2: [2023-05-10 10:11:00,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 19: [2023-05-10 10:11:00,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 9: [2023-05-10 10:11:00,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 3: [2023-05-10 10:11:00,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 3: [2023-05-10 10:11:00,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 3: [2023-05-10 10:11:00,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 11: [2023-05-10 10:11:00,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 11: [2023-05-10 10:11:00,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 10: [2023-05-10 10:11:00,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 10: [2023-05-10 10:11:00,198] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 0: [2023-05-10 10:11:00,198] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 2: [2023-05-10 10:11:00,198] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 10: [2023-05-10 10:11:00,198] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 9: [2023-05-10 10:11:00,198] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 0: [2023-05-10 10:11:00,198] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 10: [2023-05-10 10:11:00,198] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 9: [2023-05-10 10:11:00,198] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 11: [2023-05-10 10:11:00,198] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 11: [2023-05-10 10:11:00,198] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 11: [2023-05-10 10:11:00,198] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 11: [2023-05-10 10:11:00,198] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 11: [2023-05-10 10:11:00,198] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 11: [2023-05-10 10:11:00,198] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 9: [2023-05-10 10:11:00,198] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 3: [2023-05-10 10:11:00,198] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 9: [2023-05-10 10:11:00,199] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 0: [2023-05-10 10:11:00,199] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 8: [2023-05-10 10:11:00,199] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 0: [2023-05-10 10:11:00,199] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 0: [2023-05-10 10:11:00,199] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 0: [2023-05-10 10:11:00,200] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 10: [2023-05-10 10:11:00,199] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 10: [2023-05-10 10:11:00,200] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 0: [2023-05-10 10:11:00,200] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 17: [2023-05-10 10:11:00,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 2: [2023-05-10 10:11:00,200] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 17: [2023-05-10 10:11:00,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 10: [2023-05-10 10:11:00,200] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 3: [2023-05-10 10:11:00,200] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 3: [2023-05-10 10:11:00,201] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 18: [2023-05-10 10:11:00,201] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 18: [2023-05-10 10:11:00,201] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 10: [2023-05-10 10:11:00,201] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 11: [2023-05-10 10:11:00,201] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 11: [2023-05-10 10:11:00,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 18: [2023-05-10 10:11:00,202] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 18: [2023-05-10 10:11:00,202] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 18: [2023-05-10 10:11:00,202] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 18: [2023-05-10 10:11:00,202] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 18: [2023-05-10 10:11:00,202] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 3: [2023-05-10 10:11:00,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 18: [2023-05-10 10:11:00,202] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 3: [2023-05-10 10:11:00,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 2: [2023-05-10 10:11:00,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 2: [2023-05-10 10:11:00,203] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 2: [2023-05-10 10:11:00,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 2: [2023-05-10 10:11:00,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 4: [2023-05-10 10:11:00,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 19: [2023-05-10 10:11:00,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 16: [2023-05-10 10:11:00,205] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 4: [2023-05-10 10:11:00,205] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 26: [2023-05-10 10:11:00,205] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 28: [2023-05-10 10:11:00,206] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 8: [2023-05-10 10:11:00,206] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 8: [2023-05-10 10:11:00,206] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 11: [2023-05-10 10:11:00,207] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 11: [2023-05-10 10:11:00,207] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 11: [2023-05-10 10:11:00,207] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 27: [2023-05-10 10:11:00,208] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 18: [2023-05-10 10:11:00,207] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 18: [2023-05-10 10:11:00,207] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 18: [2023-05-10 10:11:00,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 19: [2023-05-10 10:11:00,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 18: [2023-05-10 10:11:00,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 18: [2023-05-10 10:11:00,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 18: [2023-05-10 10:11:00,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 11: [2023-05-10 10:11:00,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 11: [2023-05-10 10:11:00,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 19: [2023-05-10 10:11:00,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 8: [2023-05-10 10:11:00,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 16: [2023-05-10 10:11:00,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 18: [2023-05-10 10:11:00,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 4: [2023-05-10 10:11:00,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 11: [2023-05-10 10:11:00,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 18: [2023-05-10 10:11:00,210] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 19: [2023-05-10 10:11:00,210] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 4: [2023-05-10 10:11:00,211] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 4: [2023-05-10 10:11:00,211] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 8: [2023-05-10 10:11:00,212] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 4: [2023-05-10 10:11:00,213] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 20: [2023-05-10 10:11:00,214] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 20: [2023-05-10 10:11:00,214] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 16: [2023-05-10 10:11:00,215] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 16: [2023-05-10 10:11:00,215] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 28: [2023-05-10 10:11:00,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 8: [2023-05-10 10:11:00,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 8: [2023-05-10 10:11:00,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 8: [2023-05-10 10:11:00,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 17: [2023-05-10 10:11:00,216] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 17: [2023-05-10 10:11:00,216] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 17: [2023-05-10 10:11:00,216] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 17: [2023-05-10 10:11:00,216] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 16: [2023-05-10 10:11:00,217] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 26: [2023-05-10 10:11:00,217] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 27: [2023-05-10 10:11:00,217] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 16: [2023-05-10 10:11:00,218] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 19: [2023-05-10 10:11:00,218] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 26: [2023-05-10 10:11:00,219] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 26: [2023-05-10 10:11:00,219] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 6: [2023-05-10 10:11:00,219] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 6: [2023-05-10 10:11:00,219] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 26: [2023-05-10 10:11:00,219] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 26: [2023-05-10 10:11:00,219] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 26: [2023-05-10 10:11:00,219] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 26: [2023-05-10 10:11:00,219] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 19: [2023-05-10 10:11:00,220] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 28: [2023-05-10 10:11:00,221] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 28: [2023-05-10 10:11:00,222] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 27: [2023-05-10 10:11:00,222] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 28: [2023-05-10 10:11:00,222] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 30: [2023-05-10 10:11:00,222] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 30: [2023-05-10 10:11:00,223] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 30: [2023-05-10 10:11:00,224] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 17: [2023-05-10 10:11:00,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 19: [2023-05-10 10:11:00,226] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 19: [2023-05-10 10:11:00,227] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 17: [2023-05-10 10:11:00,228] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 27: [2023-05-10 10:11:00,229] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 17: [2023-05-10 10:11:00,229] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 27: [2023-05-10 10:11:00,230] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 6: [2023-05-10 10:11:00,231] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 27: [2023-05-10 10:11:00,233] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 14: [2023-05-10 10:11:00,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 14: [2023-05-10 10:11:00,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 2: [2023-05-10 10:11:00,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 14: [2023-05-10 10:11:00,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 20: [2023-05-10 10:11:00,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 20: [2023-05-10 10:11:00,238] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 27: [2023-05-10 10:11:00,238] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 6: [2023-05-10 10:11:00,239] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 27: [2023-05-10 10:11:00,239] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 6: [2023-05-10 10:11:00,239] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 27: [2023-05-10 10:11:00,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 27: [2023-05-10 10:11:00,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 20: [2023-05-10 10:11:00,246] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 20: [2023-05-10 10:11:00,246] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 0: [2023-05-10 10:11:00,247] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 26: [2023-05-10 10:11:00,247] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 27: [2023-05-10 10:11:00,248] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 6: [2023-05-10 10:11:00,248] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 14: [2023-05-10 10:11:00,248] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 14: [2023-05-10 10:11:00,248] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 14: [2023-05-10 10:11:00,248] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 20: [2023-05-10 10:11:00,249] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 30: [2023-05-10 10:11:00,249] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 30: [2023-05-10 10:11:00,249] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 30: [2023-05-10 10:11:00,249] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 30: [2023-05-10 10:11:00,249] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 17: [2023-05-10 10:11:00,250] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 17: [2023-05-10 10:11:00,250] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 14: [2023-05-10 10:11:00,250] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 17: [2023-05-10 10:11:00,251] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 14: [2023-05-10 10:11:00,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 3: [2023-05-10 10:11:00,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 3: [2023-05-10 10:11:00,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 3: [2023-05-10 10:11:00,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 26: [2023-05-10 10:11:00,252] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 2: [2023-05-10 10:11:00,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 9: [2023-05-10 10:11:00,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 9: [2023-05-10 10:11:00,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 27: [2023-05-10 10:11:00,253] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 30: [2023-05-10 10:11:00,253] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 10: [2023-05-10 10:11:00,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 10: [2023-05-10 10:11:00,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 30: [2023-05-10 10:11:00,253] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 17: [2023-05-10 10:11:00,253] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 26: [2023-05-10 10:11:00,253] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 10: [2023-05-10 10:11:00,253] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 26: [2023-05-10 10:11:00,254] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 6: [2023-05-10 10:11:00,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 6: [2023-05-10 10:11:00,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 6: [2023-05-10 10:11:00,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 6: [2023-05-10 10:11:00,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 3: [2023-05-10 10:11:00,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 30: [2023-05-10 10:11:00,255] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 26: [2023-05-10 10:11:00,255] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 11: [2023-05-10 10:11:00,255] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 11: [2023-05-10 10:11:00,255] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 9: [2023-05-10 10:11:00,256] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 9: [2023-05-10 10:11:00,256] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 2: [2023-05-10 10:11:00,257] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 6: [2023-05-10 10:11:00,257] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 9: [2023-05-10 10:11:00,257] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 9: [2023-05-10 10:11:00,257] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 9: [2023-05-10 10:11:00,257] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 9: [2023-05-10 10:11:00,257] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 26: [2023-05-10 10:11:00,258] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 10: [2023-05-10 10:11:00,259] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 27: [2023-05-10 10:11:00,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 14: [2023-05-10 10:11:00,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 30: [2023-05-10 10:11:00,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 10: [2023-05-10 10:11:00,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 3: [2023-05-10 10:11:00,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 0: [2023-05-10 10:11:00,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 0: [2023-05-10 10:11:00,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 20: [2023-05-10 10:11:00,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 20: [2023-05-10 10:11:00,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 14: [2023-05-10 10:11:00,263] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 0: [2023-05-10 10:11:00,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 20: [2023-05-10 10:11:00,264] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 2: [2023-05-10 10:11:00,264] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 14: [2023-05-10 10:11:00,267] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 18: [2023-05-10 10:11:00,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 18: [2023-05-10 10:11:00,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 18: [2023-05-10 10:11:00,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 18: [2023-05-10 10:11:00,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 2: [2023-05-10 10:11:00,268] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 0: [2023-05-10 10:11:00,270] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 0: [2023-05-10 10:11:00,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 0: [2023-05-10 10:11:00,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 0: [2023-05-10 10:11:00,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 0: [2023-05-10 10:11:00,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 10: [2023-05-10 10:11:00,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 18: [2023-05-10 10:11:00,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 18: [2023-05-10 10:11:00,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 18: [2023-05-10 10:11:00,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 18: [2023-05-10 10:11:00,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 2: [2023-05-10 10:11:00,272] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 27: [2023-05-10 10:11:00,272] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 11: [2023-05-10 10:11:00,273] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 11: [2023-05-10 10:11:00,273] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 11: [2023-05-10 10:11:00,273] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 27: [2023-05-10 10:11:00,273] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 13: [2023-05-10 10:11:00,273] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 13: [2023-05-10 10:11:00,273] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 13: [2023-05-10 10:11:00,273] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 13: [2023-05-10 10:11:00,273] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 13: [2023-05-10 10:11:00,273] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 13: [2023-05-10 10:11:00,273] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 13: [2023-05-10 10:11:00,273] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 13: [2023-05-10 10:11:00,274] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 3: [2023-05-10 10:11:00,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 14: [2023-05-10 10:11:00,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 3: [2023-05-10 10:11:00,275] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 10: [2023-05-10 10:11:00,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 10: [2023-05-10 10:11:00,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 3: [2023-05-10 10:11:00,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 3: [2023-05-10 10:11:00,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 3: [2023-05-10 10:11:00,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 3: [2023-05-10 10:11:00,275] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 14: [2023-05-10 10:11:00,275] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 27: [2023-05-10 10:11:00,276] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 20: [2023-05-10 10:11:00,276] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 9: [2023-05-10 10:11:00,278] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 11: [2023-05-10 10:11:00,278] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 10: [2023-05-10 10:11:00,277] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 14: [2023-05-10 10:11:00,277] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 11: [2023-05-10 10:11:00,278] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 20: [2023-05-10 10:11:00,279] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 6: [2023-05-10 10:11:00,280] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 10: [2023-05-10 10:11:00,281] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 14: [2023-05-10 10:11:00,281] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 10: [2023-05-10 10:11:00,281] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 20: [2023-05-10 10:11:00,281] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 9: [2023-05-10 10:11:00,281] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 30: [2023-05-10 10:11:00,282] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 2: [2023-05-10 10:11:00,282] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 30: [2023-05-10 10:11:00,283] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 14: [2023-05-10 10:11:00,283] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 3: [2023-05-10 10:11:00,283] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 2: [2023-05-10 10:11:00,283] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 2: [2023-05-10 10:11:00,283] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 2: [2023-05-10 10:11:00,283] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 30: [2023-05-10 10:11:00,283] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 9: [2023-05-10 10:11:00,284] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 9: [2023-05-10 10:11:00,284] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 11: [2023-05-10 10:11:00,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 11: [2023-05-10 10:11:00,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 11: [2023-05-10 10:11:00,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 13: [2023-05-10 10:11:00,285] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 13: [2023-05-10 10:11:00,285] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 13: [2023-05-10 10:11:00,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 30: [2023-05-10 10:11:00,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 30: [2023-05-10 10:11:00,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 13: [2023-05-10 10:11:00,287] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 9: [2023-05-10 10:11:00,287] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 13: [2023-05-10 10:11:00,287] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 6: [2023-05-10 10:11:00,287] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 6: [2023-05-10 10:11:00,287] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 13: [2023-05-10 10:11:00,288] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 13: [2023-05-10 10:11:00,288] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 0: [2023-05-10 10:11:00,288] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 13: [2023-05-10 10:11:00,288] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt... 9: [2023-05-10 10:11:00,289] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 3: [2023-05-10 10:11:00,289] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 2: [2023-05-10 10:11:00,290] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 9: [2023-05-10 10:11:00,290] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 6: [2023-05-10 10:11:00,290] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 9: [2023-05-10 10:11:00,290] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 6: [2023-05-10 10:11:00,290] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 10: [2023-05-10 10:11:00,290] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 0: [2023-05-10 10:11:00,290] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 0: [2023-05-10 10:11:00,290] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 10: [2023-05-10 10:11:00,291] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 18: [2023-05-10 10:11:00,293] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 20: [2023-05-10 10:11:00,295] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 2: [2023-05-10 10:11:00,295] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 18: [2023-05-10 10:11:00,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 18: [2023-05-10 10:11:00,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 18: [2023-05-10 10:11:00,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 20: [2023-05-10 10:11:00,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 20: [2023-05-10 10:11:00,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 0: [2023-05-10 10:11:00,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 0: [2023-05-10 10:11:00,301] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 0: [2023-05-10 10:11:00,301] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 18: [2023-05-10 10:11:00,301] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 0: [2023-05-10 10:11:00,302] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 18: [2023-05-10 10:11:00,302] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 10: [2023-05-10 10:11:00,302] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 18: [2023-05-10 10:11:00,303] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 18: [2023-05-10 10:11:00,303] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 10: [2023-05-10 10:11:00,304] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 10: [2023-05-10 10:11:00,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 3: [2023-05-10 10:11:00,306] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 3: [2023-05-10 10:11:00,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 3: [2023-05-10 10:11:00,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 11: [2023-05-10 10:11:00,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 11: [2023-05-10 10:11:00,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 11: [2023-05-10 10:11:00,310] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 2: [2023-05-10 10:11:00,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 2: [2023-05-10 10:11:00,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 11: [2023-05-10 10:11:00,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 11: [2023-05-10 10:11:00,316] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 2: [2023-05-10 10:11:00,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 2: [2023-05-10 10:11:00,318] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 11: [2023-05-10 10:11:00,319] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 21: [2023-05-10 10:11:00,360] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 21: [2023-05-10 10:11:00,360] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 21: [2023-05-10 10:11:00,360] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 21: [2023-05-10 10:11:00,360] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 21: [2023-05-10 10:11:00,360] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 21: [2023-05-10 10:11:00,360] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 21: [2023-05-10 10:11:00,360] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 21: [2023-05-10 10:11:00,360] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 13: [2023-05-10 10:11:00,362] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 13: [2023-05-10 10:11:00,364] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 13: [2023-05-10 10:11:00,364] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 13: [2023-05-10 10:11:00,364] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 21: [2023-05-10 10:11:00,366] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 21: [2023-05-10 10:11:00,367] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 21: [2023-05-10 10:11:00,367] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 21: [2023-05-10 10:11:00,368] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 21: [2023-05-10 10:11:00,369] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 21: [2023-05-10 10:11:00,369] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 21: [2023-05-10 10:11:00,370] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 7: [2023-05-10 10:11:00,370] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 7: [2023-05-10 10:11:00,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 21: [2023-05-10 10:11:00,371] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 7: [2023-05-10 10:11:00,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 7: [2023-05-10 10:11:00,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 7: [2023-05-10 10:11:00,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 7: [2023-05-10 10:11:00,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 7: [2023-05-10 10:11:00,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 7: [2023-05-10 10:11:00,371] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 1: [2023-05-10 10:11:00,375] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 7: [2023-05-10 10:11:00,375] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 1: [2023-05-10 10:11:00,375] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 1: [2023-05-10 10:11:00,375] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 1: [2023-05-10 10:11:00,375] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 1: [2023-05-10 10:11:00,375] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 1: [2023-05-10 10:11:00,375] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 1: [2023-05-10 10:11:00,375] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 1: [2023-05-10 10:11:00,375] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 7: [2023-05-10 10:11:00,376] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 7: [2023-05-10 10:11:00,376] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 7: [2023-05-10 10:11:00,377] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 7: [2023-05-10 10:11:00,377] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 7: [2023-05-10 10:11:00,379] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 7: [2023-05-10 10:11:00,379] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 1: [2023-05-10 10:11:00,379] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 1: [2023-05-10 10:11:00,380] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 7: [2023-05-10 10:11:00,380] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 1: [2023-05-10 10:11:00,381] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 1: [2023-05-10 10:11:00,382] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 13: [2023-05-10 10:11:00,382] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 13: [2023-05-10 10:11:00,382] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 13: [2023-05-10 10:11:00,382] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 13: [2023-05-10 10:11:00,382] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_30-model_00-model_states.pt. 1: [2023-05-10 10:11:00,383] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 1: [2023-05-10 10:11:00,384] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 1: [2023-05-10 10:11:00,384] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 1: [2023-05-10 10:11:00,385] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 13: [2023-05-10 10:11:00,391] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 13: [2023-05-10 10:11:00,398] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 13: [2023-05-10 10:11:00,398] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 13: [2023-05-10 10:11:00,399] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 23: [2023-05-10 10:11:00,409] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 23: [2023-05-10 10:11:00,409] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 23: [2023-05-10 10:11:00,409] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 23: [2023-05-10 10:11:00,409] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 31: [2023-05-10 10:11:00,409] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 31: [2023-05-10 10:11:00,409] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 23: [2023-05-10 10:11:00,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 23: [2023-05-10 10:11:00,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 23: [2023-05-10 10:11:00,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 31: [2023-05-10 10:11:00,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 31: [2023-05-10 10:11:00,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 31: [2023-05-10 10:11:00,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 23: [2023-05-10 10:11:00,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 31: [2023-05-10 10:11:00,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 31: [2023-05-10 10:11:00,410] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 31: [2023-05-10 10:11:00,411] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 23: [2023-05-10 10:11:00,413] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 31: [2023-05-10 10:11:00,414] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 23: [2023-05-10 10:11:00,414] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 31: [2023-05-10 10:11:00,414] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 23: [2023-05-10 10:11:00,414] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 23: [2023-05-10 10:11:00,415] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 31: [2023-05-10 10:11:00,415] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 13: [2023-05-10 10:11:00,416] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 13: [2023-05-10 10:11:00,417] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 23: [2023-05-10 10:11:00,417] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 23: [2023-05-10 10:11:00,417] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 23: [2023-05-10 10:11:00,418] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 13: [2023-05-10 10:11:00,418] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 13: [2023-05-10 10:11:00,418] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 31: [2023-05-10 10:11:00,418] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 31: [2023-05-10 10:11:00,418] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 31: [2023-05-10 10:11:00,418] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 23: [2023-05-10 10:11:00,419] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 31: [2023-05-10 10:11:00,419] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 31: [2023-05-10 10:11:00,420] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 21: [2023-05-10 10:11:00,421] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 21: [2023-05-10 10:11:00,421] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 21: [2023-05-10 10:11:00,427] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 1: [2023-05-10 10:11:00,427] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 15: [2023-05-10 10:11:00,427] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 15: [2023-05-10 10:11:00,428] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 15: [2023-05-10 10:11:00,428] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 15: [2023-05-10 10:11:00,428] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 15: [2023-05-10 10:11:00,428] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 15: [2023-05-10 10:11:00,428] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 15: [2023-05-10 10:11:00,428] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 15: [2023-05-10 10:11:00,428] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 7: [2023-05-10 10:11:00,429] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 21: [2023-05-10 10:11:00,431] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 15: [2023-05-10 10:11:00,431] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 7: [2023-05-10 10:11:00,431] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 7: [2023-05-10 10:11:00,432] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 7: [2023-05-10 10:11:00,434] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 15: [2023-05-10 10:11:00,434] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 21: [2023-05-10 10:11:00,438] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 21: [2023-05-10 10:11:00,438] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 15: [2023-05-10 10:11:00,438] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 21: [2023-05-10 10:11:00,438] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 15: [2023-05-10 10:11:00,438] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 21: [2023-05-10 10:11:00,439] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 15: [2023-05-10 10:11:00,440] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 15: [2023-05-10 10:11:00,440] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 7: [2023-05-10 10:11:00,440] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 7: [2023-05-10 10:11:00,440] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 7: [2023-05-10 10:11:00,440] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 15: [2023-05-10 10:11:00,440] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 1: [2023-05-10 10:11:00,440] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 1: [2023-05-10 10:11:00,442] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 1: [2023-05-10 10:11:00,442] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 15: [2023-05-10 10:11:00,442] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 7: [2023-05-10 10:11:00,443] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 1: [2023-05-10 10:11:00,446] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 21: [2023-05-10 10:11:00,446] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 21: [2023-05-10 10:11:00,448] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 1: [2023-05-10 10:11:00,448] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 1: [2023-05-10 10:11:00,451] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 7: [2023-05-10 10:11:00,454] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 21: [2023-05-10 10:11:00,459] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 7: [2023-05-10 10:11:00,459] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 21: [2023-05-10 10:11:00,460] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 8: [2023-05-10 10:11:00,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 8: [2023-05-10 10:11:00,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 8: [2023-05-10 10:11:00,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 12: [2023-05-10 10:11:00,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 8: [2023-05-10 10:11:00,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 8: [2023-05-10 10:11:00,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 8: [2023-05-10 10:11:00,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 8: [2023-05-10 10:11:00,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 8: [2023-05-10 10:11:00,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 12: [2023-05-10 10:11:00,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 12: [2023-05-10 10:11:00,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 12: [2023-05-10 10:11:00,462] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 12: [2023-05-10 10:11:00,462] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 12: [2023-05-10 10:11:00,462] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 12: [2023-05-10 10:11:00,462] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 7: [2023-05-10 10:11:00,462] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 12: [2023-05-10 10:11:00,462] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 23: [2023-05-10 10:11:00,463] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 7: [2023-05-10 10:11:00,463] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 1: [2023-05-10 10:11:00,466] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 1: [2023-05-10 10:11:00,466] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 8: [2023-05-10 10:11:00,466] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 12: [2023-05-10 10:11:00,466] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 8: [2023-05-10 10:11:00,467] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 31: [2023-05-10 10:11:00,466] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 7: [2023-05-10 10:11:00,467] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 8: [2023-05-10 10:11:00,467] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 31: [2023-05-10 10:11:00,467] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 7: [2023-05-10 10:11:00,468] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 8: [2023-05-10 10:11:00,468] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 8: [2023-05-10 10:11:00,468] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 8: [2023-05-10 10:11:00,468] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 7: [2023-05-10 10:11:00,468] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 8: [2023-05-10 10:11:00,468] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 8: [2023-05-10 10:11:00,468] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 21: [2023-05-10 10:11:00,469] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 12: [2023-05-10 10:11:00,468] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 21: [2023-05-10 10:11:00,470] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 23: [2023-05-10 10:11:00,469] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 1: [2023-05-10 10:11:00,470] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 31: [2023-05-10 10:11:00,471] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 1: [2023-05-10 10:11:00,472] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 21: [2023-05-10 10:11:00,472] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 23: [2023-05-10 10:11:00,472] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 7: [2023-05-10 10:11:00,472] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 1: [2023-05-10 10:11:00,473] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 21: [2023-05-10 10:11:00,473] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 12: [2023-05-10 10:11:00,473] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 12: [2023-05-10 10:11:00,474] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 12: [2023-05-10 10:11:00,474] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 12: [2023-05-10 10:11:00,474] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 12: [2023-05-10 10:11:00,474] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 1: [2023-05-10 10:11:00,475] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 12: [2023-05-10 10:11:00,476] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 23: [2023-05-10 10:11:00,478] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 1: [2023-05-10 10:11:00,481] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 15: [2023-05-10 10:11:00,482] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 23: [2023-05-10 10:11:00,485] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 25: [2023-05-10 10:11:00,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 25: [2023-05-10 10:11:00,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 25: [2023-05-10 10:11:00,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 25: [2023-05-10 10:11:00,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 25: [2023-05-10 10:11:00,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 25: [2023-05-10 10:11:00,490] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 25: [2023-05-10 10:11:00,490] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 31: [2023-05-10 10:11:00,490] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 31: [2023-05-10 10:11:00,490] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 25: [2023-05-10 10:11:00,490] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 23: [2023-05-10 10:11:00,491] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 23: [2023-05-10 10:11:00,491] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 23: [2023-05-10 10:11:00,492] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 1: [2023-05-10 10:11:00,493] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 1: [2023-05-10 10:11:00,493] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 23: [2023-05-10 10:11:00,495] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 23: [2023-05-10 10:11:00,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 31: [2023-05-10 10:11:00,495] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 31: [2023-05-10 10:11:00,496] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 31: [2023-05-10 10:11:00,496] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 31: [2023-05-10 10:11:00,496] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 31: [2023-05-10 10:11:00,496] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 31: [2023-05-10 10:11:00,496] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 25: [2023-05-10 10:11:00,495] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 25: [2023-05-10 10:11:00,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 23: [2023-05-10 10:11:00,497] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 25: [2023-05-10 10:11:00,497] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 25: [2023-05-10 10:11:00,497] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 25: [2023-05-10 10:11:00,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 15: [2023-05-10 10:11:00,498] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 25: [2023-05-10 10:11:00,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 25: [2023-05-10 10:11:00,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 25: [2023-05-10 10:11:00,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 23: [2023-05-10 10:11:00,503] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 15: [2023-05-10 10:11:00,503] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 29: [2023-05-10 10:11:00,503] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 29: [2023-05-10 10:11:00,503] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 29: [2023-05-10 10:11:00,503] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 29: [2023-05-10 10:11:00,503] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 29: [2023-05-10 10:11:00,503] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 29: [2023-05-10 10:11:00,504] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 29: [2023-05-10 10:11:00,504] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 29: [2023-05-10 10:11:00,504] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 26: [2023-05-10 10:11:00,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 26: [2023-05-10 10:11:00,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 29: [2023-05-10 10:11:00,507] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 6: [2023-05-10 10:11:00,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 6: [2023-05-10 10:11:00,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 29: [2023-05-10 10:11:00,508] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 29: [2023-05-10 10:11:00,509] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 15: [2023-05-10 10:11:00,508] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 26: [2023-05-10 10:11:00,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 26: [2023-05-10 10:11:00,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 26: [2023-05-10 10:11:00,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 26: [2023-05-10 10:11:00,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 26: [2023-05-10 10:11:00,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 6: [2023-05-10 10:11:00,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 6: [2023-05-10 10:11:00,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 6: [2023-05-10 10:11:00,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 6: [2023-05-10 10:11:00,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 6: [2023-05-10 10:11:00,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 6: [2023-05-10 10:11:00,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 26: [2023-05-10 10:11:00,511] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 6: [2023-05-10 10:11:00,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 6: [2023-05-10 10:11:00,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 26: [2023-05-10 10:11:00,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 26: [2023-05-10 10:11:00,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 16: [2023-05-10 10:11:00,512] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 16: [2023-05-10 10:11:00,512] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 29: [2023-05-10 10:11:00,512] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 29: [2023-05-10 10:11:00,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 29: [2023-05-10 10:11:00,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 16: [2023-05-10 10:11:00,513] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 16: [2023-05-10 10:11:00,513] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 16: [2023-05-10 10:11:00,513] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 16: [2023-05-10 10:11:00,513] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 16: [2023-05-10 10:11:00,513] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 29: [2023-05-10 10:11:00,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 16: [2023-05-10 10:11:00,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 29: [2023-05-10 10:11:00,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 17: [2023-05-10 10:11:00,515] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 17: [2023-05-10 10:11:00,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 17: [2023-05-10 10:11:00,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 17: [2023-05-10 10:11:00,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 17: [2023-05-10 10:11:00,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 17: [2023-05-10 10:11:00,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 17: [2023-05-10 10:11:00,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 17: [2023-05-10 10:11:00,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 26: [2023-05-10 10:11:00,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 17: [2023-05-10 10:11:00,518] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 12: [2023-05-10 10:11:00,518] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 16: [2023-05-10 10:11:00,518] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 6: [2023-05-10 10:11:00,519] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 26: [2023-05-10 10:11:00,519] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 16: [2023-05-10 10:11:00,519] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 26: [2023-05-10 10:11:00,519] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 6: [2023-05-10 10:11:00,519] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 6: [2023-05-10 10:11:00,520] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 16: [2023-05-10 10:11:00,520] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 23: [2023-05-10 10:11:00,520] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 15: [2023-05-10 10:11:00,520] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 26: [2023-05-10 10:11:00,520] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 6: [2023-05-10 10:11:00,521] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 6: [2023-05-10 10:11:00,521] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 6: [2023-05-10 10:11:00,521] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 26: [2023-05-10 10:11:00,522] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 8: [2023-05-10 10:11:00,521] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 8: [2023-05-10 10:11:00,521] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 8: [2023-05-10 10:11:00,522] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 23: [2023-05-10 10:11:00,522] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 16: [2023-05-10 10:11:00,522] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 16: [2023-05-10 10:11:00,522] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 16: [2023-05-10 10:11:00,523] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 23: [2023-05-10 10:11:00,523] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 23: [2023-05-10 10:11:00,523] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 26: [2023-05-10 10:11:00,524] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 16: [2023-05-10 10:11:00,525] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 16: [2023-05-10 10:11:00,526] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 8: [2023-05-10 10:11:00,526] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 15: [2023-05-10 10:11:00,526] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 15: [2023-05-10 10:11:00,528] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 12: [2023-05-10 10:11:00,528] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 17: [2023-05-10 10:11:00,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 17: [2023-05-10 10:11:00,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 17: [2023-05-10 10:11:00,529] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 17: [2023-05-10 10:11:00,529] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 17: [2023-05-10 10:11:00,529] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 31: [2023-05-10 10:11:00,529] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 31: [2023-05-10 10:11:00,529] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 31: [2023-05-10 10:11:00,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 31: [2023-05-10 10:11:00,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 31: [2023-05-10 10:11:00,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 17: [2023-05-10 10:11:00,531] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 17: [2023-05-10 10:11:00,531] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 8: [2023-05-10 10:11:00,537] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 8: [2023-05-10 10:11:00,537] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 8: [2023-05-10 10:11:00,537] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 8: [2023-05-10 10:11:00,537] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 15: [2023-05-10 10:11:00,537] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 15: [2023-05-10 10:11:00,537] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 15: [2023-05-10 10:11:00,537] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 12: [2023-05-10 10:11:00,540] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 15: [2023-05-10 10:11:00,540] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 8: [2023-05-10 10:11:00,549] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 8: [2023-05-10 10:11:00,549] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 8: [2023-05-10 10:11:00,549] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 15: [2023-05-10 10:11:00,550] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 25: [2023-05-10 10:11:00,549] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 8: [2023-05-10 10:11:00,551] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 12: [2023-05-10 10:11:00,556] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 25: [2023-05-10 10:11:00,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 25: [2023-05-10 10:11:00,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 25: [2023-05-10 10:11:00,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 15: [2023-05-10 10:11:00,556] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 12: [2023-05-10 10:11:00,556] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 29: [2023-05-10 10:11:00,557] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 12: [2023-05-10 10:11:00,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 12: [2023-05-10 10:11:00,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 12: [2023-05-10 10:11:00,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 12: [2023-05-10 10:11:00,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 12: [2023-05-10 10:11:00,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 25: [2023-05-10 10:11:00,559] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 25: [2023-05-10 10:11:00,560] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 25: [2023-05-10 10:11:00,560] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 25: [2023-05-10 10:11:00,560] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 6: [2023-05-10 10:11:00,562] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 6: [2023-05-10 10:11:00,562] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 8: [2023-05-10 10:11:00,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 8: [2023-05-10 10:11:00,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 29: [2023-05-10 10:11:00,563] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 29: [2023-05-10 10:11:00,563] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 8: [2023-05-10 10:11:00,564] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 8: [2023-05-10 10:11:00,564] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 26: [2023-05-10 10:11:00,564] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 26: [2023-05-10 10:11:00,564] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 17: [2023-05-10 10:11:00,567] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 2: [2023-05-10 10:11:00,568] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 2: [2023-05-10 10:11:00,568] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 22: [2023-05-10 10:11:00,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 22: [2023-05-10 10:11:00,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 22: [2023-05-10 10:11:00,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 22: [2023-05-10 10:11:00,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 22: [2023-05-10 10:11:00,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 22: [2023-05-10 10:11:00,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 22: [2023-05-10 10:11:00,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 28: [2023-05-10 10:11:00,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 28: [2023-05-10 10:11:00,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 22: [2023-05-10 10:11:00,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 15: [2023-05-10 10:11:00,570] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 20: [2023-05-10 10:11:00,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 20: [2023-05-10 10:11:00,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 20: [2023-05-10 10:11:00,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 15: [2023-05-10 10:11:00,570] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 20: [2023-05-10 10:11:00,571] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 20: [2023-05-10 10:11:00,571] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 20: [2023-05-10 10:11:00,571] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 2: [2023-05-10 10:11:00,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 2: [2023-05-10 10:11:00,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 2: [2023-05-10 10:11:00,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 2: [2023-05-10 10:11:00,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 2: [2023-05-10 10:11:00,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 20: [2023-05-10 10:11:00,571] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 28: [2023-05-10 10:11:00,571] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 28: [2023-05-10 10:11:00,571] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 28: [2023-05-10 10:11:00,571] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 28: [2023-05-10 10:11:00,571] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 20: [2023-05-10 10:11:00,571] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 2: [2023-05-10 10:11:00,571] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 28: [2023-05-10 10:11:00,571] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 28: [2023-05-10 10:11:00,571] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 25: [2023-05-10 10:11:00,571] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 15: [2023-05-10 10:11:00,572] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 2: [2023-05-10 10:11:00,572] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 2: [2023-05-10 10:11:00,573] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 16: [2023-05-10 10:11:00,572] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 4: [2023-05-10 10:11:00,574] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 28: [2023-05-10 10:11:00,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 28: [2023-05-10 10:11:00,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 4: [2023-05-10 10:11:00,574] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 20: [2023-05-10 10:11:00,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 22: [2023-05-10 10:11:00,575] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 20: [2023-05-10 10:11:00,575] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 22: [2023-05-10 10:11:00,575] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 4: [2023-05-10 10:11:00,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 4: [2023-05-10 10:11:00,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 30: [2023-05-10 10:11:00,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 30: [2023-05-10 10:11:00,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 30: [2023-05-10 10:11:00,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 30: [2023-05-10 10:11:00,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 30: [2023-05-10 10:11:00,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 30: [2023-05-10 10:11:00,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 30: [2023-05-10 10:11:00,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 30: [2023-05-10 10:11:00,575] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 4: [2023-05-10 10:11:00,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 4: [2023-05-10 10:11:00,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 4: [2023-05-10 10:11:00,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 4: [2023-05-10 10:11:00,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 29: [2023-05-10 10:11:00,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 22: [2023-05-10 10:11:00,576] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 16: [2023-05-10 10:11:00,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 2: [2023-05-10 10:11:00,578] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 20: [2023-05-10 10:11:00,578] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 20: [2023-05-10 10:11:00,579] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 4: [2023-05-10 10:11:00,579] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 22: [2023-05-10 10:11:00,579] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 22: [2023-05-10 10:11:00,580] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 29: [2023-05-10 10:11:00,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 29: [2023-05-10 10:11:00,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 29: [2023-05-10 10:11:00,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 29: [2023-05-10 10:11:00,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 24: [2023-05-10 10:11:00,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 20: [2023-05-10 10:11:00,580] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 22: [2023-05-10 10:11:00,581] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 24: [2023-05-10 10:11:00,581] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 2: [2023-05-10 10:11:00,580] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 20: [2023-05-10 10:11:00,581] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 2: [2023-05-10 10:11:00,581] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 24: [2023-05-10 10:11:00,581] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 24: [2023-05-10 10:11:00,581] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 24: [2023-05-10 10:11:00,581] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 24: [2023-05-10 10:11:00,581] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 28: [2023-05-10 10:11:00,581] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 28: [2023-05-10 10:11:00,581] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 22: [2023-05-10 10:11:00,581] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 24: [2023-05-10 10:11:00,581] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 24: [2023-05-10 10:11:00,581] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 20: [2023-05-10 10:11:00,581] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 2: [2023-05-10 10:11:00,582] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 4: [2023-05-10 10:11:00,582] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 4: [2023-05-10 10:11:00,582] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 22: [2023-05-10 10:11:00,582] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 28: [2023-05-10 10:11:00,582] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 4: [2023-05-10 10:11:00,582] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 4: [2023-05-10 10:11:00,582] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 26: [2023-05-10 10:11:00,582] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 4: [2023-05-10 10:11:00,582] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 28: [2023-05-10 10:11:00,582] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 20: [2023-05-10 10:11:00,582] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 6: [2023-05-10 10:11:00,582] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 2: [2023-05-10 10:11:00,583] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 4: [2023-05-10 10:11:00,582] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 28: [2023-05-10 10:11:00,583] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 2: [2023-05-10 10:11:00,583] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 28: [2023-05-10 10:11:00,583] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 6: [2023-05-10 10:11:00,583] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 29: [2023-05-10 10:11:00,583] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 4: [2023-05-10 10:11:00,584] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 24: [2023-05-10 10:11:00,584] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 24: [2023-05-10 10:11:00,585] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 25: [2023-05-10 10:11:00,585] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 17: [2023-05-10 10:11:00,586] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 6: [2023-05-10 10:11:00,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 6: [2023-05-10 10:11:00,586] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 30: [2023-05-10 10:11:00,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 30: [2023-05-10 10:11:00,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 26: [2023-05-10 10:11:00,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 30: [2023-05-10 10:11:00,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 25: [2023-05-10 10:11:00,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 26: [2023-05-10 10:11:00,588] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 29: [2023-05-10 10:11:00,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 26: [2023-05-10 10:11:00,588] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 26: [2023-05-10 10:11:00,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 29: [2023-05-10 10:11:00,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 25: [2023-05-10 10:11:00,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 24: [2023-05-10 10:11:00,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 30: [2023-05-10 10:11:00,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 24: [2023-05-10 10:11:00,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 30: [2023-05-10 10:11:00,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 24: [2023-05-10 10:11:00,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 19: [2023-05-10 10:11:00,589] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 19: [2023-05-10 10:11:00,589] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 19: [2023-05-10 10:11:00,589] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 19: [2023-05-10 10:11:00,589] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 19: [2023-05-10 10:11:00,589] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 30: [2023-05-10 10:11:00,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 19: [2023-05-10 10:11:00,589] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 19: [2023-05-10 10:11:00,589] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 24: [2023-05-10 10:11:00,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 19: [2023-05-10 10:11:00,589] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 24: [2023-05-10 10:11:00,590] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 6: [2023-05-10 10:11:00,590] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 6: [2023-05-10 10:11:00,590] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 30: [2023-05-10 10:11:00,590] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 24: [2023-05-10 10:11:00,591] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 19: [2023-05-10 10:11:00,592] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 16: [2023-05-10 10:11:00,592] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 25: [2023-05-10 10:11:00,592] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 19: [2023-05-10 10:11:00,593] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 30: [2023-05-10 10:11:00,592] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 16: [2023-05-10 10:11:00,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 19: [2023-05-10 10:11:00,594] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 25: [2023-05-10 10:11:00,594] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 26: [2023-05-10 10:11:00,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 19: [2023-05-10 10:11:00,595] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 16: [2023-05-10 10:11:00,595] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 19: [2023-05-10 10:11:00,595] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 25: [2023-05-10 10:11:00,595] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 25: [2023-05-10 10:11:00,595] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 19: [2023-05-10 10:11:00,595] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 26: [2023-05-10 10:11:00,597] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 19: [2023-05-10 10:11:00,597] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 16: [2023-05-10 10:11:00,598] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 16: [2023-05-10 10:11:00,598] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 16: [2023-05-10 10:11:00,598] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 19: [2023-05-10 10:11:00,598] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 16: [2023-05-10 10:11:00,598] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 12: [2023-05-10 10:11:00,596] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 12: [2023-05-10 10:11:00,596] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 12: [2023-05-10 10:11:00,596] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 12: [2023-05-10 10:11:00,596] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 12: [2023-05-10 10:11:00,599] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 12: [2023-05-10 10:11:00,599] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 17: [2023-05-10 10:11:00,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 17: [2023-05-10 10:11:00,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 6: [2023-05-10 10:11:00,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 6: [2023-05-10 10:11:00,603] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 5: [2023-05-10 10:11:00,606] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 5: [2023-05-10 10:11:00,606] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 5: [2023-05-10 10:11:00,606] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 17: [2023-05-10 10:11:00,606] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 29: [2023-05-10 10:11:00,607] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 5: [2023-05-10 10:11:00,607] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 5: [2023-05-10 10:11:00,607] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 5: [2023-05-10 10:11:00,607] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 5: [2023-05-10 10:11:00,607] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 5: [2023-05-10 10:11:00,607] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 3: [2023-05-10 10:11:00,610] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 3: [2023-05-10 10:11:00,610] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 26: [2023-05-10 10:11:00,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 26: [2023-05-10 10:11:00,610] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 3: [2023-05-10 10:11:00,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 18: [2023-05-10 10:11:00,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 3: [2023-05-10 10:11:00,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 18: [2023-05-10 10:11:00,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 0: [2023-05-10 10:11:00,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 5: [2023-05-10 10:11:00,612] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 18: [2023-05-10 10:11:00,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 18: [2023-05-10 10:11:00,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 0: [2023-05-10 10:11:00,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 0: [2023-05-10 10:11:00,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 18: [2023-05-10 10:11:00,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 18: [2023-05-10 10:11:00,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 18: [2023-05-10 10:11:00,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 29: [2023-05-10 10:11:00,612] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 18: [2023-05-10 10:11:00,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 5: [2023-05-10 10:11:00,612] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 0: [2023-05-10 10:11:00,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 0: [2023-05-10 10:11:00,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 0: [2023-05-10 10:11:00,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 0: [2023-05-10 10:11:00,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 3: [2023-05-10 10:11:00,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 3: [2023-05-10 10:11:00,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 3: [2023-05-10 10:11:00,612] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 0: [2023-05-10 10:11:00,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 3: [2023-05-10 10:11:00,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 5: [2023-05-10 10:11:00,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 17: [2023-05-10 10:11:00,613] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 3: [2023-05-10 10:11:00,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 5: [2023-05-10 10:11:00,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 29: [2023-05-10 10:11:00,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 16: [2023-05-10 10:11:00,615] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 3: [2023-05-10 10:11:00,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 29: [2023-05-10 10:11:00,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 6: [2023-05-10 10:11:00,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 29: [2023-05-10 10:11:00,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 5: [2023-05-10 10:11:00,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 17: [2023-05-10 10:11:00,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 17: [2023-05-10 10:11:00,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 17: [2023-05-10 10:11:00,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 0: [2023-05-10 10:11:00,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 0: [2023-05-10 10:11:00,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 5: [2023-05-10 10:11:00,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 10: [2023-05-10 10:11:00,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 10: [2023-05-10 10:11:00,617] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 10: [2023-05-10 10:11:00,617] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 10: [2023-05-10 10:11:00,617] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 10: [2023-05-10 10:11:00,617] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 10: [2023-05-10 10:11:00,617] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 10: [2023-05-10 10:11:00,617] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 5: [2023-05-10 10:11:00,617] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 26: [2023-05-10 10:11:00,617] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 18: [2023-05-10 10:11:00,617] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 10: [2023-05-10 10:11:00,617] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 3: [2023-05-10 10:11:00,617] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 5: [2023-05-10 10:11:00,617] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 0: [2023-05-10 10:11:00,617] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 18: [2023-05-10 10:11:00,618] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 18: [2023-05-10 10:11:00,618] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 18: [2023-05-10 10:11:00,618] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 6: [2023-05-10 10:11:00,618] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 6: [2023-05-10 10:11:00,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 0: [2023-05-10 10:11:00,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 18: [2023-05-10 10:11:00,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 6: [2023-05-10 10:11:00,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 0: [2023-05-10 10:11:00,620] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 0: [2023-05-10 10:11:00,620] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 14: [2023-05-10 10:11:00,620] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 14: [2023-05-10 10:11:00,620] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 0: [2023-05-10 10:11:00,620] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 0: [2023-05-10 10:11:00,620] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 18: [2023-05-10 10:11:00,620] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 18: [2023-05-10 10:11:00,621] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 16: [2023-05-10 10:11:00,621] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 3: [2023-05-10 10:11:00,621] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 3: [2023-05-10 10:11:00,622] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 3: [2023-05-10 10:11:00,622] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 18: [2023-05-10 10:11:00,622] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 3: [2023-05-10 10:11:00,622] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 11: [2023-05-10 10:11:00,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 11: [2023-05-10 10:11:00,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 11: [2023-05-10 10:11:00,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 11: [2023-05-10 10:11:00,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 11: [2023-05-10 10:11:00,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 11: [2023-05-10 10:11:00,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 11: [2023-05-10 10:11:00,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 14: [2023-05-10 10:11:00,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 11: [2023-05-10 10:11:00,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 14: [2023-05-10 10:11:00,622] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 14: [2023-05-10 10:11:00,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 14: [2023-05-10 10:11:00,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 14: [2023-05-10 10:11:00,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 3: [2023-05-10 10:11:00,623] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 14: [2023-05-10 10:11:00,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 26: [2023-05-10 10:11:00,623] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 10: [2023-05-10 10:11:00,623] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 10: [2023-05-10 10:11:00,624] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 16: [2023-05-10 10:11:00,624] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 10: [2023-05-10 10:11:00,624] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 10: [2023-05-10 10:11:00,624] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 10: [2023-05-10 10:11:00,624] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 2: [2023-05-10 10:11:00,624] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 26: [2023-05-10 10:11:00,625] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 10: [2023-05-10 10:11:00,625] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 10: [2023-05-10 10:11:00,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 28: [2023-05-10 10:11:00,626] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 11: [2023-05-10 10:11:00,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 10: [2023-05-10 10:11:00,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 14: [2023-05-10 10:11:00,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 11: [2023-05-10 10:11:00,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 2: [2023-05-10 10:11:00,626] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 20: [2023-05-10 10:11:00,626] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 20: [2023-05-10 10:11:00,626] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 14: [2023-05-10 10:11:00,627] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 28: [2023-05-10 10:11:00,627] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 22: [2023-05-10 10:11:00,629] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 4: [2023-05-10 10:11:00,629] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 26: [2023-05-10 10:11:00,629] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 14: [2023-05-10 10:11:00,629] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 16: [2023-05-10 10:11:00,630] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 22: [2023-05-10 10:11:00,630] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 14: [2023-05-10 10:11:00,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 11: [2023-05-10 10:11:00,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 11: [2023-05-10 10:11:00,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 14: [2023-05-10 10:11:00,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 11: [2023-05-10 10:11:00,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 16: [2023-05-10 10:11:00,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 14: [2023-05-10 10:11:00,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 14: [2023-05-10 10:11:00,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 11: [2023-05-10 10:11:00,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 16: [2023-05-10 10:11:00,634] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 6: [2023-05-10 10:11:00,634] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 19: [2023-05-10 10:11:00,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 6: [2023-05-10 10:11:00,634] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 11: [2023-05-10 10:11:00,634] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 14: [2023-05-10 10:11:00,634] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 11: [2023-05-10 10:11:00,634] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 24: [2023-05-10 10:11:00,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 24: [2023-05-10 10:11:00,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 2: [2023-05-10 10:11:00,640] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 26: [2023-05-10 10:11:00,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 17: [2023-05-10 10:11:00,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 4: [2023-05-10 10:11:00,640] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 4: [2023-05-10 10:11:00,640] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 4: [2023-05-10 10:11:00,640] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 4: [2023-05-10 10:11:00,640] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 17: [2023-05-10 10:11:00,641] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 22: [2023-05-10 10:11:00,641] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 16: [2023-05-10 10:11:00,642] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 17: [2023-05-10 10:11:00,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 19: [2023-05-10 10:11:00,644] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 20: [2023-05-10 10:11:00,644] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 2: [2023-05-10 10:11:00,646] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 19: [2023-05-10 10:11:00,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 28: [2023-05-10 10:11:00,647] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 28: [2023-05-10 10:11:00,647] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 2: [2023-05-10 10:11:00,648] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 20: [2023-05-10 10:11:00,649] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 28: [2023-05-10 10:11:00,649] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 28: [2023-05-10 10:11:00,649] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 20: [2023-05-10 10:11:00,649] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 4: [2023-05-10 10:11:00,649] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 4: [2023-05-10 10:11:00,649] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 4: [2023-05-10 10:11:00,649] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 17: [2023-05-10 10:11:00,650] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 20: [2023-05-10 10:11:00,650] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 17: [2023-05-10 10:11:00,653] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 19: [2023-05-10 10:11:00,653] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 28: [2023-05-10 10:11:00,652] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 28: [2023-05-10 10:11:00,652] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 28: [2023-05-10 10:11:00,653] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 28: [2023-05-10 10:11:00,653] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 4: [2023-05-10 10:11:00,653] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 22: [2023-05-10 10:11:00,654] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 2: [2023-05-10 10:11:00,654] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 2: [2023-05-10 10:11:00,654] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 2: [2023-05-10 10:11:00,654] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 22: [2023-05-10 10:11:00,655] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 22: [2023-05-10 10:11:00,656] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 17: [2023-05-10 10:11:00,656] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 17: [2023-05-10 10:11:00,657] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 22: [2023-05-10 10:11:00,657] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 24: [2023-05-10 10:11:00,657] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 2: [2023-05-10 10:11:00,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 24: [2023-05-10 10:11:00,658] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 20: [2023-05-10 10:11:00,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 20: [2023-05-10 10:11:00,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 20: [2023-05-10 10:11:00,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 22: [2023-05-10 10:11:00,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 22: [2023-05-10 10:11:00,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 22: [2023-05-10 10:11:00,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 19: [2023-05-10 10:11:00,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 19: [2023-05-10 10:11:00,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 3: [2023-05-10 10:11:00,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 19: [2023-05-10 10:11:00,661] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 19: [2023-05-10 10:11:00,661] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 24: [2023-05-10 10:11:00,662] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 24: [2023-05-10 10:11:00,662] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 24: [2023-05-10 10:11:00,662] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 24: [2023-05-10 10:11:00,662] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 24: [2023-05-10 10:11:00,662] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 24: [2023-05-10 10:11:00,662] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 30: [2023-05-10 10:11:00,663] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 30: [2023-05-10 10:11:00,663] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 19: [2023-05-10 10:11:00,664] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 4: [2023-05-10 10:11:00,665] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 2: [2023-05-10 10:11:00,665] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 4: [2023-05-10 10:11:00,666] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 30: [2023-05-10 10:11:00,666] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 4: [2023-05-10 10:11:00,666] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 30: [2023-05-10 10:11:00,666] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 19: [2023-05-10 10:11:00,667] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 22: [2023-05-10 10:11:00,667] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 4: [2023-05-10 10:11:00,667] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 5: [2023-05-10 10:11:00,667] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 5: [2023-05-10 10:11:00,667] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 5: [2023-05-10 10:11:00,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 18: [2023-05-10 10:11:00,670] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 19: [2023-05-10 10:11:00,671] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 3: [2023-05-10 10:11:00,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 3: [2023-05-10 10:11:00,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 2: [2023-05-10 10:11:00,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 9: [2023-05-10 10:11:00,675] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 9: [2023-05-10 10:11:00,675] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 0: [2023-05-10 10:11:00,675] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 0: [2023-05-10 10:11:00,675] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 0: [2023-05-10 10:11:00,675] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 4: [2023-05-10 10:11:00,676] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 27: [2023-05-10 10:11:00,676] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 27: [2023-05-10 10:11:00,676] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 4: [2023-05-10 10:11:00,677] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 27: [2023-05-10 10:11:00,677] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 27: [2023-05-10 10:11:00,677] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 27: [2023-05-10 10:11:00,677] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 27: [2023-05-10 10:11:00,677] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 27: [2023-05-10 10:11:00,677] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 27: [2023-05-10 10:11:00,678] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 11: [2023-05-10 10:11:00,678] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 11: [2023-05-10 10:11:00,678] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 9: [2023-05-10 10:11:00,678] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 9: [2023-05-10 10:11:00,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 9: [2023-05-10 10:11:00,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 9: [2023-05-10 10:11:00,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 9: [2023-05-10 10:11:00,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 9: [2023-05-10 10:11:00,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 4: [2023-05-10 10:11:00,680] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 10: [2023-05-10 10:11:00,680] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 10: [2023-05-10 10:11:00,680] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 28: [2023-05-10 10:11:00,681] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 18: [2023-05-10 10:11:00,680] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 18: [2023-05-10 10:11:00,680] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 18: [2023-05-10 10:11:00,680] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 9: [2023-05-10 10:11:00,681] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 13: [2023-05-10 10:11:00,680] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 13: [2023-05-10 10:11:00,680] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 13: [2023-05-10 10:11:00,680] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 13: [2023-05-10 10:11:00,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 13: [2023-05-10 10:11:00,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 10: [2023-05-10 10:11:00,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 13: [2023-05-10 10:11:00,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 13: [2023-05-10 10:11:00,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 13: [2023-05-10 10:11:00,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 3: [2023-05-10 10:11:00,681] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 9: [2023-05-10 10:11:00,681] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 5: [2023-05-10 10:11:00,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 5: [2023-05-10 10:11:00,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 14: [2023-05-10 10:11:00,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 14: [2023-05-10 10:11:00,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 27: [2023-05-10 10:11:00,681] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 3: [2023-05-10 10:11:00,682] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 5: [2023-05-10 10:11:00,682] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 5: [2023-05-10 10:11:00,682] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 5: [2023-05-10 10:11:00,682] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 18: [2023-05-10 10:11:00,683] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 18: [2023-05-10 10:11:00,683] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 18: [2023-05-10 10:11:00,683] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 18: [2023-05-10 10:11:00,683] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 9: [2023-05-10 10:11:00,683] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 22: [2023-05-10 10:11:00,683] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 27: [2023-05-10 10:11:00,683] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 27: [2023-05-10 10:11:00,684] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 27: [2023-05-10 10:11:00,684] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 2: [2023-05-10 10:11:00,684] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 9: [2023-05-10 10:11:00,684] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 27: [2023-05-10 10:11:00,684] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 2: [2023-05-10 10:11:00,685] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 27: [2023-05-10 10:11:00,685] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 9: [2023-05-10 10:11:00,685] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 27: [2023-05-10 10:11:00,686] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 14: [2023-05-10 10:11:00,685] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 10: [2023-05-10 10:11:00,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 9: [2023-05-10 10:11:00,686] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 27: [2023-05-10 10:11:00,687] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 9: [2023-05-10 10:11:00,687] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 9: [2023-05-10 10:11:00,687] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 10: [2023-05-10 10:11:00,688] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 0: [2023-05-10 10:11:00,688] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 0: [2023-05-10 10:11:00,688] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 0: [2023-05-10 10:11:00,688] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 0: [2023-05-10 10:11:00,688] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 19: [2023-05-10 10:11:00,688] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 2: [2023-05-10 10:11:00,688] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 28: [2023-05-10 10:11:00,688] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 3: [2023-05-10 10:11:00,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 3: [2023-05-10 10:11:00,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 3: [2023-05-10 10:11:00,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 3: [2023-05-10 10:11:00,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 28: [2023-05-10 10:11:00,689] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 28: [2023-05-10 10:11:00,689] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 19: [2023-05-10 10:11:00,690] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 19: [2023-05-10 10:11:00,690] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 22: [2023-05-10 10:11:00,690] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 28: [2023-05-10 10:11:00,690] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 22: [2023-05-10 10:11:00,690] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 30: [2023-05-10 10:11:00,691] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 30: [2023-05-10 10:11:00,691] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 30: [2023-05-10 10:11:00,691] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 2: [2023-05-10 10:11:00,691] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 19: [2023-05-10 10:11:00,691] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 0: [2023-05-10 10:11:00,691] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 24: [2023-05-10 10:11:00,692] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 28: [2023-05-10 10:11:00,692] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 30: [2023-05-10 10:11:00,692] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 13: [2023-05-10 10:11:00,693] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 24: [2023-05-10 10:11:00,693] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 13: [2023-05-10 10:11:00,693] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 14: [2023-05-10 10:11:00,693] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 22: [2023-05-10 10:11:00,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 22: [2023-05-10 10:11:00,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 13: [2023-05-10 10:11:00,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 13: [2023-05-10 10:11:00,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 19: [2023-05-10 10:11:00,695] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 24: [2023-05-10 10:11:00,695] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 18: [2023-05-10 10:11:00,695] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 13: [2023-05-10 10:11:00,695] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 24: [2023-05-10 10:11:00,695] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 13: [2023-05-10 10:11:00,696] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 3: [2023-05-10 10:11:00,696] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 3: [2023-05-10 10:11:00,696] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 10: [2023-05-10 10:11:00,696] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 10: [2023-05-10 10:11:00,696] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 13: [2023-05-10 10:11:00,696] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 10: [2023-05-10 10:11:00,696] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 5: [2023-05-10 10:11:00,697] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 5: [2023-05-10 10:11:00,697] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 24: [2023-05-10 10:11:00,696] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 0: [2023-05-10 10:11:00,697] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 30: [2023-05-10 10:11:00,697] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 13: [2023-05-10 10:11:00,697] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt... 14: [2023-05-10 10:11:00,697] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 24: [2023-05-10 10:11:00,697] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 14: [2023-05-10 10:11:00,698] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 30: [2023-05-10 10:11:00,698] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 2: [2023-05-10 10:11:00,698] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 30: [2023-05-10 10:11:00,698] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 5: [2023-05-10 10:11:00,699] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 11: [2023-05-10 10:11:00,700] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 11: [2023-05-10 10:11:00,700] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 0: [2023-05-10 10:11:00,701] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 0: [2023-05-10 10:11:00,701] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 30: [2023-05-10 10:11:00,701] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 14: [2023-05-10 10:11:00,702] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 20: [2023-05-10 10:11:00,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 20: [2023-05-10 10:11:00,676] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 11: [2023-05-10 10:11:00,703] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 20: [2023-05-10 10:11:00,685] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 20: [2023-05-10 10:11:00,691] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 20: [2023-05-10 10:11:00,692] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 20: [2023-05-10 10:11:00,695] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 20: [2023-05-10 10:11:00,695] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 11: [2023-05-10 10:11:00,703] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 11: [2023-05-10 10:11:00,703] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 11: [2023-05-10 10:11:00,704] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 11: [2023-05-10 10:11:00,704] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 11: [2023-05-10 10:11:00,704] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 10: [2023-05-10 10:11:00,705] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 14: [2023-05-10 10:11:00,706] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 10: [2023-05-10 10:11:00,707] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 14: [2023-05-10 10:11:00,708] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 14: [2023-05-10 10:11:00,708] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 10: [2023-05-10 10:11:00,709] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 18: [2023-05-10 10:11:00,710] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 18: [2023-05-10 10:11:00,710] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 18: [2023-05-10 10:11:00,710] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 3: [2023-05-10 10:11:00,711] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 5: [2023-05-10 10:11:00,712] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 18: [2023-05-10 10:11:00,712] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 5: [2023-05-10 10:11:00,713] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 14: [2023-05-10 10:11:00,713] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 5: [2023-05-10 10:11:00,713] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 5: [2023-05-10 10:11:00,714] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 18: [2023-05-10 10:11:00,714] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 5: [2023-05-10 10:11:00,714] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 18: [2023-05-10 10:11:00,714] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 10: [2023-05-10 10:11:00,715] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 10: [2023-05-10 10:11:00,716] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 18: [2023-05-10 10:11:00,716] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 3: [2023-05-10 10:11:00,719] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 0: [2023-05-10 10:11:00,720] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 0: [2023-05-10 10:11:00,720] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 30: [2023-05-10 10:11:00,720] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 0: [2023-05-10 10:11:00,721] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 14: [2023-05-10 10:11:00,722] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 3: [2023-05-10 10:11:00,722] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 30: [2023-05-10 10:11:00,723] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 0: [2023-05-10 10:11:00,724] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 3: [2023-05-10 10:11:00,724] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 3: [2023-05-10 10:11:00,724] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 0: [2023-05-10 10:11:00,725] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 30: [2023-05-10 10:11:00,725] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 30: [2023-05-10 10:11:00,726] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 10: [2023-05-10 10:11:00,728] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 14: [2023-05-10 10:11:00,729] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 14: [2023-05-10 10:11:00,729] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 10: [2023-05-10 10:11:00,731] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 10: [2023-05-10 10:11:00,731] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 9: [2023-05-10 10:11:00,731] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 27: [2023-05-10 10:11:00,732] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 14: [2023-05-10 10:11:00,734] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 14: [2023-05-10 10:11:00,734] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 27: [2023-05-10 10:11:00,736] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 11: [2023-05-10 10:11:00,737] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 11: [2023-05-10 10:11:00,740] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 27: [2023-05-10 10:11:00,739] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 11: [2023-05-10 10:11:00,740] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 11: [2023-05-10 10:11:00,740] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 11: [2023-05-10 10:11:00,741] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 11: [2023-05-10 10:11:00,741] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 9: [2023-05-10 10:11:00,743] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 9: [2023-05-10 10:11:00,743] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 9: [2023-05-10 10:11:00,743] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 9: [2023-05-10 10:11:00,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 9: [2023-05-10 10:11:00,748] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 9: [2023-05-10 10:11:00,749] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 9: [2023-05-10 10:11:00,749] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 27: [2023-05-10 10:11:00,752] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 9: [2023-05-10 10:11:00,753] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 27: [2023-05-10 10:11:00,754] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 27: [2023-05-10 10:11:00,754] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 27: [2023-05-10 10:11:00,754] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 27: [2023-05-10 10:11:00,754] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 27: [2023-05-10 10:11:00,755] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 27: [2023-05-10 10:11:00,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 27: [2023-05-10 10:11:00,762] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 9: [2023-05-10 10:11:00,770] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 13: [2023-05-10 10:11:00,771] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 13: [2023-05-10 10:11:00,771] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 13: [2023-05-10 10:11:00,771] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 9: [2023-05-10 10:11:00,774] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 9: [2023-05-10 10:11:00,775] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 9: [2023-05-10 10:11:00,776] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 9: [2023-05-10 10:11:00,781] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 9: [2023-05-10 10:11:00,781] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 27: [2023-05-10 10:11:00,780] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 27: [2023-05-10 10:11:00,781] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 9: [2023-05-10 10:11:00,783] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 27: [2023-05-10 10:11:00,783] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 27: [2023-05-10 10:11:00,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 13: [2023-05-10 10:11:00,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 13: [2023-05-10 10:11:00,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 13: [2023-05-10 10:11:00,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 13: [2023-05-10 10:11:00,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 13: [2023-05-10 10:11:00,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_31-model_00-model_states.pt. 27: [2023-05-10 10:11:00,789] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 13: [2023-05-10 10:11:00,803] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 13: [2023-05-10 10:11:00,804] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 13: [2023-05-10 10:11:00,804] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 13: [2023-05-10 10:11:00,818] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 13: [2023-05-10 10:11:00,820] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 13: [2023-05-10 10:11:00,823] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 13: [2023-05-10 10:11:00,823] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 13: [2023-05-10 10:11:00,823] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 23: [2023-05-10 10:11:00,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 23: [2023-05-10 10:11:00,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 23: [2023-05-10 10:11:00,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 23: [2023-05-10 10:11:00,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 23: [2023-05-10 10:11:00,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 23: [2023-05-10 10:11:00,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 23: [2023-05-10 10:11:00,827] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 23: [2023-05-10 10:11:00,827] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 23: [2023-05-10 10:11:00,831] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 23: [2023-05-10 10:11:00,831] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 23: [2023-05-10 10:11:00,831] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 23: [2023-05-10 10:11:00,835] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 23: [2023-05-10 10:11:00,835] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 23: [2023-05-10 10:11:00,835] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 23: [2023-05-10 10:11:00,836] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 23: [2023-05-10 10:11:00,837] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 15: [2023-05-10 10:11:00,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 15: [2023-05-10 10:11:00,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 15: [2023-05-10 10:11:00,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 15: [2023-05-10 10:11:00,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 15: [2023-05-10 10:11:00,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 15: [2023-05-10 10:11:00,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 15: [2023-05-10 10:11:00,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 15: [2023-05-10 10:11:00,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 15: [2023-05-10 10:11:00,867] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 15: [2023-05-10 10:11:00,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 15: [2023-05-10 10:11:00,874] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 15: [2023-05-10 10:11:00,875] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 12: [2023-05-10 10:11:00,874] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 12: [2023-05-10 10:11:00,874] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 12: [2023-05-10 10:11:00,874] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 12: [2023-05-10 10:11:00,875] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 12: [2023-05-10 10:11:00,875] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 12: [2023-05-10 10:11:00,875] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 12: [2023-05-10 10:11:00,875] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 12: [2023-05-10 10:11:00,875] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 15: [2023-05-10 10:11:00,876] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 15: [2023-05-10 10:11:00,876] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 15: [2023-05-10 10:11:00,876] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 15: [2023-05-10 10:11:00,877] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 12: [2023-05-10 10:11:00,879] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 12: [2023-05-10 10:11:00,885] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 12: [2023-05-10 10:11:00,886] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 12: [2023-05-10 10:11:00,886] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 12: [2023-05-10 10:11:00,886] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 23: [2023-05-10 10:11:00,886] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 23: [2023-05-10 10:11:00,886] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 23: [2023-05-10 10:11:00,886] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 12: [2023-05-10 10:11:00,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 12: [2023-05-10 10:11:00,888] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 12: [2023-05-10 10:11:00,888] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 23: [2023-05-10 10:11:00,899] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 23: [2023-05-10 10:11:00,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 23: [2023-05-10 10:11:00,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 23: [2023-05-10 10:11:00,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 23: [2023-05-10 10:11:00,907] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 23: [2023-05-10 10:11:00,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 23: [2023-05-10 10:11:00,910] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 23: [2023-05-10 10:11:00,910] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 15: [2023-05-10 10:11:00,917] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 21: [2023-05-10 10:11:00,920] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 21: [2023-05-10 10:11:00,920] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 21: [2023-05-10 10:11:00,920] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 21: [2023-05-10 10:11:00,920] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 21: [2023-05-10 10:11:00,920] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 21: [2023-05-10 10:11:00,920] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 21: [2023-05-10 10:11:00,920] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 21: [2023-05-10 10:11:00,921] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 21: [2023-05-10 10:11:00,926] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 21: [2023-05-10 10:11:00,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 23: [2023-05-10 10:11:00,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 21: [2023-05-10 10:11:00,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 21: [2023-05-10 10:11:00,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 23: [2023-05-10 10:11:00,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 21: [2023-05-10 10:11:00,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 21: [2023-05-10 10:11:00,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 21: [2023-05-10 10:11:00,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 21: [2023-05-10 10:11:00,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 12: [2023-05-10 10:11:00,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 23: [2023-05-10 10:11:00,934] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 23: [2023-05-10 10:11:00,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 15: [2023-05-10 10:11:00,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 31: [2023-05-10 10:11:00,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 31: [2023-05-10 10:11:00,937] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 31: [2023-05-10 10:11:00,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 31: [2023-05-10 10:11:00,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 31: [2023-05-10 10:11:00,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 31: [2023-05-10 10:11:00,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 31: [2023-05-10 10:11:00,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 8: [2023-05-10 10:11:00,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 8: [2023-05-10 10:11:00,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 8: [2023-05-10 10:11:00,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 8: [2023-05-10 10:11:00,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 8: [2023-05-10 10:11:00,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 8: [2023-05-10 10:11:00,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 8: [2023-05-10 10:11:00,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 8: [2023-05-10 10:11:00,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 31: [2023-05-10 10:11:00,939] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 23: [2023-05-10 10:11:00,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 31: [2023-05-10 10:11:00,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 31: [2023-05-10 10:11:00,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 8: [2023-05-10 10:11:00,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 8: [2023-05-10 10:11:00,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 8: [2023-05-10 10:11:00,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 8: [2023-05-10 10:11:00,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 8: [2023-05-10 10:11:00,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 8: [2023-05-10 10:11:00,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 8: [2023-05-10 10:11:00,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 8: [2023-05-10 10:11:00,945] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 31: [2023-05-10 10:11:00,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 15: [2023-05-10 10:11:00,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 15: [2023-05-10 10:11:00,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 31: [2023-05-10 10:11:00,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 15: [2023-05-10 10:11:00,947] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 31: [2023-05-10 10:11:00,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 31: [2023-05-10 10:11:00,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 31: [2023-05-10 10:11:00,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 31: [2023-05-10 10:11:00,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 12: [2023-05-10 10:11:00,956] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 15: [2023-05-10 10:11:00,961] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 15: [2023-05-10 10:11:00,962] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 15: [2023-05-10 10:11:00,962] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 12: [2023-05-10 10:11:00,962] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 15: [2023-05-10 10:11:00,964] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 12: [2023-05-10 10:11:00,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 12: [2023-05-10 10:11:00,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 12: [2023-05-10 10:11:00,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 12: [2023-05-10 10:11:00,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 12: [2023-05-10 10:11:00,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 12: [2023-05-10 10:11:00,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 6: [2023-05-10 10:11:00,971] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 6: [2023-05-10 10:11:00,971] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 15: [2023-05-10 10:11:00,973] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 6: [2023-05-10 10:11:00,974] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 6: [2023-05-10 10:11:00,974] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 6: [2023-05-10 10:11:00,974] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 6: [2023-05-10 10:11:00,974] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 6: [2023-05-10 10:11:00,974] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 6: [2023-05-10 10:11:00,974] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 6: [2023-05-10 10:11:00,975] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 6: [2023-05-10 10:11:00,975] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 15: [2023-05-10 10:11:00,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 15: [2023-05-10 10:11:00,977] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 6: [2023-05-10 10:11:00,981] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 21: [2023-05-10 10:11:00,983] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 6: [2023-05-10 10:11:00,983] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 6: [2023-05-10 10:11:00,983] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 21: [2023-05-10 10:11:00,984] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 6: [2023-05-10 10:11:00,984] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 6: [2023-05-10 10:11:00,985] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 6: [2023-05-10 10:11:00,985] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 21: [2023-05-10 10:11:00,985] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 21: [2023-05-10 10:11:00,987] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 21: [2023-05-10 10:11:00,989] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 1: [2023-05-10 10:11:00,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 1: [2023-05-10 10:11:00,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 1: [2023-05-10 10:11:00,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 1: [2023-05-10 10:11:00,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 1: [2023-05-10 10:11:00,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 1: [2023-05-10 10:11:00,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 1: [2023-05-10 10:11:00,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 1: [2023-05-10 10:11:00,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 8: [2023-05-10 10:11:00,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 15: [2023-05-10 10:11:00,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 12: [2023-05-10 10:11:00,992] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 31: [2023-05-10 10:11:00,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 15: [2023-05-10 10:11:00,994] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 31: [2023-05-10 10:11:00,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 15: [2023-05-10 10:11:00,995] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 15: [2023-05-10 10:11:00,996] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 1: [2023-05-10 10:11:00,995] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 1: [2023-05-10 10:11:00,997] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 21: [2023-05-10 10:11:00,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 21: [2023-05-10 10:11:00,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 21: [2023-05-10 10:11:00,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 1: [2023-05-10 10:11:00,997] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 1: [2023-05-10 10:11:00,997] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 1: [2023-05-10 10:11:00,998] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 1: [2023-05-10 10:11:00,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 1: [2023-05-10 10:11:00,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 1: [2023-05-10 10:11:01,000] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 10: [2023-05-10 10:11:01,001] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 8: [2023-05-10 10:11:01,001] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 10: [2023-05-10 10:11:01,001] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 10: [2023-05-10 10:11:01,001] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 10: [2023-05-10 10:11:01,001] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 10: [2023-05-10 10:11:01,001] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 10: [2023-05-10 10:11:01,001] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 10: [2023-05-10 10:11:01,001] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 10: [2023-05-10 10:11:01,002] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 12: [2023-05-10 10:11:01,004] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 12: [2023-05-10 10:11:01,004] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 8: [2023-05-10 10:11:01,005] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 8: [2023-05-10 10:11:01,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 10: [2023-05-10 10:11:01,008] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 12: [2023-05-10 10:11:01,008] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 12: [2023-05-10 10:11:01,009] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 12: [2023-05-10 10:11:01,009] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 12: [2023-05-10 10:11:01,009] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 10: [2023-05-10 10:11:01,009] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 10: [2023-05-10 10:11:01,009] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 10: [2023-05-10 10:11:01,010] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 10: [2023-05-10 10:11:01,010] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 10: [2023-05-10 10:11:01,011] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 10: [2023-05-10 10:11:01,012] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 10: [2023-05-10 10:11:01,013] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 21: [2023-05-10 10:11:01,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 21: [2023-05-10 10:11:01,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 31: [2023-05-10 10:11:01,015] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 21: [2023-05-10 10:11:01,015] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 31: [2023-05-10 10:11:01,015] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 8: [2023-05-10 10:11:01,015] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 31: [2023-05-10 10:11:01,015] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 8: [2023-05-10 10:11:01,016] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 8: [2023-05-10 10:11:01,016] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 8: [2023-05-10 10:11:01,016] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 8: [2023-05-10 10:11:01,016] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 21: [2023-05-10 10:11:01,017] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 31: [2023-05-10 10:11:01,018] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 31: [2023-05-10 10:11:01,018] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 31: [2023-05-10 10:11:01,018] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 31: [2023-05-10 10:11:01,018] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 31: [2023-05-10 10:11:01,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 25: [2023-05-10 10:11:01,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 25: [2023-05-10 10:11:01,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 25: [2023-05-10 10:11:01,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 25: [2023-05-10 10:11:01,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 25: [2023-05-10 10:11:01,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 25: [2023-05-10 10:11:01,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 25: [2023-05-10 10:11:01,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 25: [2023-05-10 10:11:01,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 21: [2023-05-10 10:11:01,021] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 25: [2023-05-10 10:11:01,024] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 25: [2023-05-10 10:11:01,026] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 21: [2023-05-10 10:11:01,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 25: [2023-05-10 10:11:01,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 25: [2023-05-10 10:11:01,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 22: [2023-05-10 10:11:01,027] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 22: [2023-05-10 10:11:01,027] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 25: [2023-05-10 10:11:01,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 6: [2023-05-10 10:11:01,027] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 25: [2023-05-10 10:11:01,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 21: [2023-05-10 10:11:01,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 25: [2023-05-10 10:11:01,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 25: [2023-05-10 10:11:01,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 22: [2023-05-10 10:11:01,028] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 22: [2023-05-10 10:11:01,028] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 22: [2023-05-10 10:11:01,028] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 22: [2023-05-10 10:11:01,028] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 22: [2023-05-10 10:11:01,028] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 22: [2023-05-10 10:11:01,028] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 21: [2023-05-10 10:11:01,029] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 6: [2023-05-10 10:11:01,029] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 8: [2023-05-10 10:11:01,031] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 8: [2023-05-10 10:11:01,032] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 22: [2023-05-10 10:11:01,033] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 22: [2023-05-10 10:11:01,034] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 22: [2023-05-10 10:11:01,037] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 8: [2023-05-10 10:11:01,038] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 22: [2023-05-10 10:11:01,039] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 22: [2023-05-10 10:11:01,039] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 22: [2023-05-10 10:11:01,040] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 22: [2023-05-10 10:11:01,040] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 22: [2023-05-10 10:11:01,040] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 8: [2023-05-10 10:11:01,041] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 8: [2023-05-10 10:11:01,042] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 6: [2023-05-10 10:11:01,042] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 8: [2023-05-10 10:11:01,043] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 9: [2023-05-10 10:11:01,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 9: [2023-05-10 10:11:01,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 8: [2023-05-10 10:11:01,043] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 9: [2023-05-10 10:11:01,044] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 9: [2023-05-10 10:11:01,045] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 9: [2023-05-10 10:11:01,045] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 9: [2023-05-10 10:11:01,045] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 9: [2023-05-10 10:11:01,045] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 9: [2023-05-10 10:11:01,045] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 6: [2023-05-10 10:11:01,049] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 1: [2023-05-10 10:11:01,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 1: [2023-05-10 10:11:01,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 6: [2023-05-10 10:11:01,049] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 9: [2023-05-10 10:11:01,050] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 9: [2023-05-10 10:11:01,051] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 31: [2023-05-10 10:11:01,052] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 9: [2023-05-10 10:11:01,052] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 31: [2023-05-10 10:11:01,053] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 9: [2023-05-10 10:11:01,053] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 9: [2023-05-10 10:11:01,053] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 31: [2023-05-10 10:11:01,053] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 1: [2023-05-10 10:11:01,054] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 6: [2023-05-10 10:11:01,054] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 9: [2023-05-10 10:11:01,054] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 9: [2023-05-10 10:11:01,055] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 31: [2023-05-10 10:11:01,055] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 31: [2023-05-10 10:11:01,055] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 31: [2023-05-10 10:11:01,055] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 9: [2023-05-10 10:11:01,055] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 18: [2023-05-10 10:11:01,056] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 18: [2023-05-10 10:11:01,056] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 18: [2023-05-10 10:11:01,057] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 18: [2023-05-10 10:11:01,057] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 18: [2023-05-10 10:11:01,057] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 18: [2023-05-10 10:11:01,057] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 18: [2023-05-10 10:11:01,057] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 6: [2023-05-10 10:11:01,057] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 18: [2023-05-10 10:11:01,057] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 18: [2023-05-10 10:11:01,060] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 18: [2023-05-10 10:11:01,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 18: [2023-05-10 10:11:01,063] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 10: [2023-05-10 10:11:01,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 18: [2023-05-10 10:11:01,063] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 1: [2023-05-10 10:11:01,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 1: [2023-05-10 10:11:01,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 1: [2023-05-10 10:11:01,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 1: [2023-05-10 10:11:01,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 1: [2023-05-10 10:11:01,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 18: [2023-05-10 10:11:01,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 10: [2023-05-10 10:11:01,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 10: [2023-05-10 10:11:01,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 18: [2023-05-10 10:11:01,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 18: [2023-05-10 10:11:01,065] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 18: [2023-05-10 10:11:01,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 6: [2023-05-10 10:11:01,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 6: [2023-05-10 10:11:01,067] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 6: [2023-05-10 10:11:01,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 6: [2023-05-10 10:11:01,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 10: [2023-05-10 10:11:01,071] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 1: [2023-05-10 10:11:01,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 1: [2023-05-10 10:11:01,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 25: [2023-05-10 10:11:01,076] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 25: [2023-05-10 10:11:01,077] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 7: [2023-05-10 10:11:01,079] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 26: [2023-05-10 10:11:01,079] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 26: [2023-05-10 10:11:01,079] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 10: [2023-05-10 10:11:01,079] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 10: [2023-05-10 10:11:01,079] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 10: [2023-05-10 10:11:01,079] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 7: [2023-05-10 10:11:01,079] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 7: [2023-05-10 10:11:01,079] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 7: [2023-05-10 10:11:01,079] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 7: [2023-05-10 10:11:01,079] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 7: [2023-05-10 10:11:01,079] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 7: [2023-05-10 10:11:01,079] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 7: [2023-05-10 10:11:01,080] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 10: [2023-05-10 10:11:01,079] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 2: [2023-05-10 10:11:01,082] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 2: [2023-05-10 10:11:01,082] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 27: [2023-05-10 10:11:01,082] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 27: [2023-05-10 10:11:01,082] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 27: [2023-05-10 10:11:01,082] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 27: [2023-05-10 10:11:01,082] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 27: [2023-05-10 10:11:01,082] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 27: [2023-05-10 10:11:01,082] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 27: [2023-05-10 10:11:01,082] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 6: [2023-05-10 10:11:01,082] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 27: [2023-05-10 10:11:01,083] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 1: [2023-05-10 10:11:01,084] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 26: [2023-05-10 10:11:01,083] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 26: [2023-05-10 10:11:01,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 26: [2023-05-10 10:11:01,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 26: [2023-05-10 10:11:01,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 26: [2023-05-10 10:11:01,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 26: [2023-05-10 10:11:01,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 26: [2023-05-10 10:11:01,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 26: [2023-05-10 10:11:01,085] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 14: [2023-05-10 10:11:01,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 14: [2023-05-10 10:11:01,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 16: [2023-05-10 10:11:01,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 16: [2023-05-10 10:11:01,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 30: [2023-05-10 10:11:01,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 30: [2023-05-10 10:11:01,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 30: [2023-05-10 10:11:01,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 30: [2023-05-10 10:11:01,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 30: [2023-05-10 10:11:01,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 30: [2023-05-10 10:11:01,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 30: [2023-05-10 10:11:01,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 7: [2023-05-10 10:11:01,085] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 7: [2023-05-10 10:11:01,085] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 30: [2023-05-10 10:11:01,086] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 7: [2023-05-10 10:11:01,086] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 16: [2023-05-10 10:11:01,086] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 16: [2023-05-10 10:11:01,086] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 16: [2023-05-10 10:11:01,086] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 16: [2023-05-10 10:11:01,086] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 2: [2023-05-10 10:11:01,085] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 16: [2023-05-10 10:11:01,086] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 7: [2023-05-10 10:11:01,086] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 27: [2023-05-10 10:11:01,086] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 16: [2023-05-10 10:11:01,086] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 22: [2023-05-10 10:11:01,086] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 27: [2023-05-10 10:11:01,086] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 7: [2023-05-10 10:11:01,087] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 25: [2023-05-10 10:11:01,086] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 2: [2023-05-10 10:11:01,087] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 14: [2023-05-10 10:11:01,087] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 14: [2023-05-10 10:11:01,087] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 6: [2023-05-10 10:11:01,087] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 14: [2023-05-10 10:11:01,087] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 14: [2023-05-10 10:11:01,087] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 14: [2023-05-10 10:11:01,087] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 14: [2023-05-10 10:11:01,087] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 7: [2023-05-10 10:11:01,087] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 7: [2023-05-10 10:11:01,088] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 7: [2023-05-10 10:11:01,088] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 10: [2023-05-10 10:11:01,088] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 25: [2023-05-10 10:11:01,088] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 25: [2023-05-10 10:11:01,088] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 25: [2023-05-10 10:11:01,088] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 27: [2023-05-10 10:11:01,088] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 27: [2023-05-10 10:11:01,088] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 19: [2023-05-10 10:11:01,088] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 19: [2023-05-10 10:11:01,088] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 2: [2023-05-10 10:11:01,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 2: [2023-05-10 10:11:01,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 2: [2023-05-10 10:11:01,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 2: [2023-05-10 10:11:01,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 2: [2023-05-10 10:11:01,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 2: [2023-05-10 10:11:01,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 17: [2023-05-10 10:11:01,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 24: [2023-05-10 10:11:01,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 24: [2023-05-10 10:11:01,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 16: [2023-05-10 10:11:01,090] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 24: [2023-05-10 10:11:01,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 24: [2023-05-10 10:11:01,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 24: [2023-05-10 10:11:01,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 24: [2023-05-10 10:11:01,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 25: [2023-05-10 10:11:01,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 10: [2023-05-10 10:11:01,090] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 24: [2023-05-10 10:11:01,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 17: [2023-05-10 10:11:01,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 17: [2023-05-10 10:11:01,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 24: [2023-05-10 10:11:01,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 17: [2023-05-10 10:11:01,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 17: [2023-05-10 10:11:01,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 17: [2023-05-10 10:11:01,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 17: [2023-05-10 10:11:01,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 17: [2023-05-10 10:11:01,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 27: [2023-05-10 10:11:01,090] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 27: [2023-05-10 10:11:01,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 19: [2023-05-10 10:11:01,091] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 19: [2023-05-10 10:11:01,091] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 22: [2023-05-10 10:11:01,091] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 19: [2023-05-10 10:11:01,091] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 19: [2023-05-10 10:11:01,091] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 19: [2023-05-10 10:11:01,091] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 1: [2023-05-10 10:11:01,092] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 16: [2023-05-10 10:11:01,092] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 19: [2023-05-10 10:11:01,092] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 27: [2023-05-10 10:11:01,092] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 17: [2023-05-10 10:11:01,092] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 29: [2023-05-10 10:11:01,092] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 29: [2023-05-10 10:11:01,092] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 27: [2023-05-10 10:11:01,092] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 19: [2023-05-10 10:11:01,093] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 14: [2023-05-10 10:11:01,092] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 14: [2023-05-10 10:11:01,092] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 14: [2023-05-10 10:11:01,093] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 30: [2023-05-10 10:11:01,093] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 14: [2023-05-10 10:11:01,093] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 19: [2023-05-10 10:11:01,094] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 14: [2023-05-10 10:11:01,094] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 10: [2023-05-10 10:11:01,094] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 26: [2023-05-10 10:11:01,094] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 19: [2023-05-10 10:11:01,094] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 24: [2023-05-10 10:11:01,094] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 24: [2023-05-10 10:11:01,094] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 14: [2023-05-10 10:11:01,095] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 26: [2023-05-10 10:11:01,095] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 2: [2023-05-10 10:11:01,095] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 19: [2023-05-10 10:11:01,095] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 16: [2023-05-10 10:11:01,095] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 16: [2023-05-10 10:11:01,095] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 1: [2023-05-10 10:11:01,095] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 14: [2023-05-10 10:11:01,095] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 25: [2023-05-10 10:11:01,095] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 16: [2023-05-10 10:11:01,095] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 2: [2023-05-10 10:11:01,095] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 29: [2023-05-10 10:11:01,095] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 29: [2023-05-10 10:11:01,095] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 29: [2023-05-10 10:11:01,095] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 29: [2023-05-10 10:11:01,095] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 29: [2023-05-10 10:11:01,095] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 16: [2023-05-10 10:11:01,096] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 26: [2023-05-10 10:11:01,096] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 29: [2023-05-10 10:11:01,096] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 29: [2023-05-10 10:11:01,096] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 1: [2023-05-10 10:11:01,096] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 1: [2023-05-10 10:11:01,096] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 6: [2023-05-10 10:11:01,096] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 26: [2023-05-10 10:11:01,096] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 24: [2023-05-10 10:11:01,096] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 16: [2023-05-10 10:11:01,096] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 30: [2023-05-10 10:11:01,096] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 26: [2023-05-10 10:11:01,096] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 14: [2023-05-10 10:11:01,096] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 26: [2023-05-10 10:11:01,097] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 16: [2023-05-10 10:11:01,097] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 24: [2023-05-10 10:11:01,097] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 29: [2023-05-10 10:11:01,097] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 24: [2023-05-10 10:11:01,098] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 1: [2023-05-10 10:11:01,098] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 24: [2023-05-10 10:11:01,098] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 24: [2023-05-10 10:11:01,098] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 19: [2023-05-10 10:11:01,098] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 30: [2023-05-10 10:11:01,098] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 30: [2023-05-10 10:11:01,099] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 24: [2023-05-10 10:11:01,099] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 30: [2023-05-10 10:11:01,099] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 10: [2023-05-10 10:11:01,099] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 2: [2023-05-10 10:11:01,099] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 2: [2023-05-10 10:11:01,099] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 2: [2023-05-10 10:11:01,100] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 30: [2023-05-10 10:11:01,100] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 19: [2023-05-10 10:11:01,100] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 29: [2023-05-10 10:11:01,100] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 30: [2023-05-10 10:11:01,100] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 29: [2023-05-10 10:11:01,100] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 2: [2023-05-10 10:11:01,101] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 19: [2023-05-10 10:11:01,101] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 19: [2023-05-10 10:11:01,101] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 25: [2023-05-10 10:11:01,101] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 6: [2023-05-10 10:11:01,102] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 6: [2023-05-10 10:11:01,102] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 17: [2023-05-10 10:11:01,102] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 4: [2023-05-10 10:11:01,102] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 4: [2023-05-10 10:11:01,102] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 17: [2023-05-10 10:11:01,103] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 25: [2023-05-10 10:11:01,102] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 17: [2023-05-10 10:11:01,103] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 4: [2023-05-10 10:11:01,103] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 4: [2023-05-10 10:11:01,103] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 4: [2023-05-10 10:11:01,103] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 4: [2023-05-10 10:11:01,103] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 4: [2023-05-10 10:11:01,103] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 17: [2023-05-10 10:11:01,104] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 30: [2023-05-10 10:11:01,103] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 4: [2023-05-10 10:11:01,104] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 17: [2023-05-10 10:11:01,104] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 29: [2023-05-10 10:11:01,104] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 17: [2023-05-10 10:11:01,104] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 29: [2023-05-10 10:11:01,104] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 3: [2023-05-10 10:11:01,105] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 3: [2023-05-10 10:11:01,105] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 29: [2023-05-10 10:11:01,105] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 3: [2023-05-10 10:11:01,106] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 29: [2023-05-10 10:11:01,106] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 5: [2023-05-10 10:11:01,106] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 5: [2023-05-10 10:11:01,106] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 3: [2023-05-10 10:11:01,106] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 3: [2023-05-10 10:11:01,106] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 3: [2023-05-10 10:11:01,106] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 3: [2023-05-10 10:11:01,106] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 3: [2023-05-10 10:11:01,106] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 17: [2023-05-10 10:11:01,106] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 5: [2023-05-10 10:11:01,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 5: [2023-05-10 10:11:01,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 5: [2023-05-10 10:11:01,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 5: [2023-05-10 10:11:01,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 13: [2023-05-10 10:11:01,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 13: [2023-05-10 10:11:01,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 13: [2023-05-10 10:11:01,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 9: [2023-05-10 10:11:01,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 5: [2023-05-10 10:11:01,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 5: [2023-05-10 10:11:01,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 13: [2023-05-10 10:11:01,108] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 13: [2023-05-10 10:11:01,108] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 13: [2023-05-10 10:11:01,108] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 13: [2023-05-10 10:11:01,108] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 13: [2023-05-10 10:11:01,108] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 4: [2023-05-10 10:11:01,108] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 4: [2023-05-10 10:11:01,108] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 9: [2023-05-10 10:11:01,108] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 9: [2023-05-10 10:11:01,109] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 4: [2023-05-10 10:11:01,109] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 4: [2023-05-10 10:11:01,109] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 10: [2023-05-10 10:11:01,109] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 4: [2023-05-10 10:11:01,109] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 4: [2023-05-10 10:11:01,110] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 22: [2023-05-10 10:11:01,109] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 22: [2023-05-10 10:11:01,109] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 4: [2023-05-10 10:11:01,110] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 3: [2023-05-10 10:11:01,109] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 3: [2023-05-10 10:11:01,110] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 3: [2023-05-10 10:11:01,110] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 0: [2023-05-10 10:11:01,111] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 0: [2023-05-10 10:11:01,111] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 4: [2023-05-10 10:11:01,111] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 22: [2023-05-10 10:11:01,111] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 22: [2023-05-10 10:11:01,111] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 22: [2023-05-10 10:11:01,111] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 22: [2023-05-10 10:11:01,111] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 22: [2023-05-10 10:11:01,111] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 9: [2023-05-10 10:11:01,111] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 5: [2023-05-10 10:11:01,111] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 0: [2023-05-10 10:11:01,111] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 11: [2023-05-10 10:11:01,112] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 11: [2023-05-10 10:11:01,112] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 0: [2023-05-10 10:11:01,112] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 0: [2023-05-10 10:11:01,112] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 0: [2023-05-10 10:11:01,112] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 0: [2023-05-10 10:11:01,112] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 0: [2023-05-10 10:11:01,112] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 10: [2023-05-10 10:11:01,112] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 10: [2023-05-10 10:11:01,113] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 5: [2023-05-10 10:11:01,113] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 10: [2023-05-10 10:11:01,113] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 5: [2023-05-10 10:11:01,113] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 18: [2023-05-10 10:11:01,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 5: [2023-05-10 10:11:01,114] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 11: [2023-05-10 10:11:01,114] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 11: [2023-05-10 10:11:01,114] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 11: [2023-05-10 10:11:01,114] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 11: [2023-05-10 10:11:01,114] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 11: [2023-05-10 10:11:01,114] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 11: [2023-05-10 10:11:01,114] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 5: [2023-05-10 10:11:01,114] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 28: [2023-05-10 10:11:01,114] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 28: [2023-05-10 10:11:01,114] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 0: [2023-05-10 10:11:01,115] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 9: [2023-05-10 10:11:01,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 11: [2023-05-10 10:11:01,115] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 5: [2023-05-10 10:11:01,115] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 9: [2023-05-10 10:11:01,115] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 3: [2023-05-10 10:11:01,115] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 3: [2023-05-10 10:11:01,116] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 25: [2023-05-10 10:11:01,116] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 22: [2023-05-10 10:11:01,116] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 3: [2023-05-10 10:11:01,116] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 3: [2023-05-10 10:11:01,116] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 5: [2023-05-10 10:11:01,116] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 0: [2023-05-10 10:11:01,116] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 11: [2023-05-10 10:11:01,117] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 5: [2023-05-10 10:11:01,117] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 3: [2023-05-10 10:11:01,117] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 0: [2023-05-10 10:11:01,117] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 0: [2023-05-10 10:11:01,118] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 0: [2023-05-10 10:11:01,119] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 0: [2023-05-10 10:11:01,119] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 13: [2023-05-10 10:11:01,119] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 28: [2023-05-10 10:11:01,119] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 0: [2023-05-10 10:11:01,119] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 28: [2023-05-10 10:11:01,119] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 25: [2023-05-10 10:11:01,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 0: [2023-05-10 10:11:01,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 13: [2023-05-10 10:11:01,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 28: [2023-05-10 10:11:01,120] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 13: [2023-05-10 10:11:01,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 28: [2023-05-10 10:11:01,120] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 28: [2023-05-10 10:11:01,120] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 28: [2023-05-10 10:11:01,120] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 28: [2023-05-10 10:11:01,120] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 25: [2023-05-10 10:11:01,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 25: [2023-05-10 10:11:01,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 13: [2023-05-10 10:11:01,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 11: [2023-05-10 10:11:01,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 13: [2023-05-10 10:11:01,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 9: [2023-05-10 10:11:01,121] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 11: [2023-05-10 10:11:01,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 18: [2023-05-10 10:11:01,121] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 18: [2023-05-10 10:11:01,121] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 18: [2023-05-10 10:11:01,121] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 18: [2023-05-10 10:11:01,121] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 13: [2023-05-10 10:11:01,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 28: [2023-05-10 10:11:01,122] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 13: [2023-05-10 10:11:01,122] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 28: [2023-05-10 10:11:01,122] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 13: [2023-05-10 10:11:01,123] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 25: [2023-05-10 10:11:01,123] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 9: [2023-05-10 10:11:01,124] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 11: [2023-05-10 10:11:01,124] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 11: [2023-05-10 10:11:01,125] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 11: [2023-05-10 10:11:01,125] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 25: [2023-05-10 10:11:01,125] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 11: [2023-05-10 10:11:01,126] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 18: [2023-05-10 10:11:01,127] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 18: [2023-05-10 10:11:01,127] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 18: [2023-05-10 10:11:01,127] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 26: [2023-05-10 10:11:01,127] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 28: [2023-05-10 10:11:01,128] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 2: [2023-05-10 10:11:01,129] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 28: [2023-05-10 10:11:01,130] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 28: [2023-05-10 10:11:01,130] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 28: [2023-05-10 10:11:01,131] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 28: [2023-05-10 10:11:01,131] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 27: [2023-05-10 10:11:01,131] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 17: [2023-05-10 10:11:01,133] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 9: [2023-05-10 10:11:01,136] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 18: [2023-05-10 10:11:01,137] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 7: [2023-05-10 10:11:01,138] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 26: [2023-05-10 10:11:01,139] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 22: [2023-05-10 10:11:01,139] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 27: [2023-05-10 10:11:01,140] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 9: [2023-05-10 10:11:01,140] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 2: [2023-05-10 10:11:01,141] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 20: [2023-05-10 10:11:01,141] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 20: [2023-05-10 10:11:01,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 20: [2023-05-10 10:11:01,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 9: [2023-05-10 10:11:01,142] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 20: [2023-05-10 10:11:01,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 20: [2023-05-10 10:11:01,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 20: [2023-05-10 10:11:01,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 20: [2023-05-10 10:11:01,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 9: [2023-05-10 10:11:01,142] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 20: [2023-05-10 10:11:01,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 19: [2023-05-10 10:11:01,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 16: [2023-05-10 10:11:01,143] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 7: [2023-05-10 10:11:01,143] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 29: [2023-05-10 10:11:01,144] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 22: [2023-05-10 10:11:01,145] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 20: [2023-05-10 10:11:01,145] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 20: [2023-05-10 10:11:01,145] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 22: [2023-05-10 10:11:01,145] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 22: [2023-05-10 10:11:01,146] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 24: [2023-05-10 10:11:01,146] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 24: [2023-05-10 10:11:01,146] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 7: [2023-05-10 10:11:01,145] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 7: [2023-05-10 10:11:01,145] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 27: [2023-05-10 10:11:01,146] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 9: [2023-05-10 10:11:01,146] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 19: [2023-05-10 10:11:01,146] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 7: [2023-05-10 10:11:01,146] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 9: [2023-05-10 10:11:01,147] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 18: [2023-05-10 10:11:01,147] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 7: [2023-05-10 10:11:01,148] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 7: [2023-05-10 10:11:01,148] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 7: [2023-05-10 10:11:01,148] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 16: [2023-05-10 10:11:01,148] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 19: [2023-05-10 10:11:01,148] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 14: [2023-05-10 10:11:01,148] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 27: [2023-05-10 10:11:01,148] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 14: [2023-05-10 10:11:01,148] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 22: [2023-05-10 10:11:01,149] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 22: [2023-05-10 10:11:01,149] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 18: [2023-05-10 10:11:01,149] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 18: [2023-05-10 10:11:01,149] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 18: [2023-05-10 10:11:01,149] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 20: [2023-05-10 10:11:01,150] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 14: [2023-05-10 10:11:01,150] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 20: [2023-05-10 10:11:01,150] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 2: [2023-05-10 10:11:01,151] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 9: [2023-05-10 10:11:01,151] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 26: [2023-05-10 10:11:01,151] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 9: [2023-05-10 10:11:01,152] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 20: [2023-05-10 10:11:01,152] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 20: [2023-05-10 10:11:01,153] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 27: [2023-05-10 10:11:01,153] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 19: [2023-05-10 10:11:01,153] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 19: [2023-05-10 10:11:01,153] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 17: [2023-05-10 10:11:01,154] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 29: [2023-05-10 10:11:01,154] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 14: [2023-05-10 10:11:01,154] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 18: [2023-05-10 10:11:01,155] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 20: [2023-05-10 10:11:01,155] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 5: [2023-05-10 10:11:01,156] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 30: [2023-05-10 10:11:01,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 18: [2023-05-10 10:11:01,157] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 20: [2023-05-10 10:11:01,157] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt... 29: [2023-05-10 10:11:01,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 18: [2023-05-10 10:11:01,157] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 26: [2023-05-10 10:11:01,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 11: [2023-05-10 10:11:01,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 2: [2023-05-10 10:11:01,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 14: [2023-05-10 10:11:01,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 14: [2023-05-10 10:11:01,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 14: [2023-05-10 10:11:01,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 14: [2023-05-10 10:11:01,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 4: [2023-05-10 10:11:01,160] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 27: [2023-05-10 10:11:01,161] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 27: [2023-05-10 10:11:01,161] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 7: [2023-05-10 10:11:01,161] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 2: [2023-05-10 10:11:01,162] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 26: [2023-05-10 10:11:01,163] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 2: [2023-05-10 10:11:01,163] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 16: [2023-05-10 10:11:01,164] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 16: [2023-05-10 10:11:01,165] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 3: [2023-05-10 10:11:01,165] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 3: [2023-05-10 10:11:01,165] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 3: [2023-05-10 10:11:01,165] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 27: [2023-05-10 10:11:01,165] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 19: [2023-05-10 10:11:01,166] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 29: [2023-05-10 10:11:01,166] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 28: [2023-05-10 10:11:01,167] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 4: [2023-05-10 10:11:01,167] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 4: [2023-05-10 10:11:01,167] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 4: [2023-05-10 10:11:01,167] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 19: [2023-05-10 10:11:01,167] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 27: [2023-05-10 10:11:01,167] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 27: [2023-05-10 10:11:01,167] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 26: [2023-05-10 10:11:01,167] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 26: [2023-05-10 10:11:01,167] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 26: [2023-05-10 10:11:01,167] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 29: [2023-05-10 10:11:01,167] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 19: [2023-05-10 10:11:01,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 19: [2023-05-10 10:11:01,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 0: [2023-05-10 10:11:01,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 16: [2023-05-10 10:11:01,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 19: [2023-05-10 10:11:01,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 24: [2023-05-10 10:11:01,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 24: [2023-05-10 10:11:01,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 24: [2023-05-10 10:11:01,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 24: [2023-05-10 10:11:01,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 24: [2023-05-10 10:11:01,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 24: [2023-05-10 10:11:01,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 26: [2023-05-10 10:11:01,170] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 24: [2023-05-10 10:11:01,170] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 24: [2023-05-10 10:11:01,171] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 29: [2023-05-10 10:11:01,171] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 26: [2023-05-10 10:11:01,171] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 11: [2023-05-10 10:11:01,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 28: [2023-05-10 10:11:01,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 27: [2023-05-10 10:11:01,172] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 7: [2023-05-10 10:11:01,173] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 27: [2023-05-10 10:11:01,173] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 5: [2023-05-10 10:11:01,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 5: [2023-05-10 10:11:01,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 4: [2023-05-10 10:11:01,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 4: [2023-05-10 10:11:01,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 4: [2023-05-10 10:11:01,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 30: [2023-05-10 10:11:01,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 4: [2023-05-10 10:11:01,173] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 14: [2023-05-10 10:11:01,173] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 16: [2023-05-10 10:11:01,174] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 16: [2023-05-10 10:11:01,174] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 16: [2023-05-10 10:11:01,174] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 16: [2023-05-10 10:11:01,174] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 7: [2023-05-10 10:11:01,174] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 7: [2023-05-10 10:11:01,174] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 7: [2023-05-10 10:11:01,176] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 7: [2023-05-10 10:11:01,176] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 16: [2023-05-10 10:11:01,176] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 19: [2023-05-10 10:11:01,176] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 7: [2023-05-10 10:11:01,176] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 19: [2023-05-10 10:11:01,176] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 7: [2023-05-10 10:11:01,177] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 14: [2023-05-10 10:11:01,177] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 29: [2023-05-10 10:11:01,177] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 3: [2023-05-10 10:11:01,177] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 3: [2023-05-10 10:11:01,177] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 17: [2023-05-10 10:11:01,177] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 17: [2023-05-10 10:11:01,177] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 5: [2023-05-10 10:11:01,178] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 29: [2023-05-10 10:11:01,178] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 29: [2023-05-10 10:11:01,178] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 29: [2023-05-10 10:11:01,178] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 14: [2023-05-10 10:11:01,178] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 19: [2023-05-10 10:11:01,179] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 2: [2023-05-10 10:11:01,180] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 2: [2023-05-10 10:11:01,180] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 2: [2023-05-10 10:11:01,180] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 5: [2023-05-10 10:11:01,180] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 5: [2023-05-10 10:11:01,180] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 5: [2023-05-10 10:11:01,180] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 2: [2023-05-10 10:11:01,181] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 29: [2023-05-10 10:11:01,181] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 0: [2023-05-10 10:11:01,181] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 5: [2023-05-10 10:11:01,181] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 17: [2023-05-10 10:11:01,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 30: [2023-05-10 10:11:01,183] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 5: [2023-05-10 10:11:01,183] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 3: [2023-05-10 10:11:01,184] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 3: [2023-05-10 10:11:01,184] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 3: [2023-05-10 10:11:01,184] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 17: [2023-05-10 10:11:01,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 17: [2023-05-10 10:11:01,185] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 11: [2023-05-10 10:11:01,185] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 30: [2023-05-10 10:11:01,185] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 14: [2023-05-10 10:11:01,185] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 0: [2023-05-10 10:11:01,186] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 11: [2023-05-10 10:11:01,186] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 11: [2023-05-10 10:11:01,186] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 2: [2023-05-10 10:11:01,186] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 4: [2023-05-10 10:11:01,187] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 26: [2023-05-10 10:11:01,188] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 28: [2023-05-10 10:11:01,189] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 2: [2023-05-10 10:11:01,189] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 3: [2023-05-10 10:11:01,189] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 3: [2023-05-10 10:11:01,190] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 17: [2023-05-10 10:11:01,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 17: [2023-05-10 10:11:01,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 14: [2023-05-10 10:11:01,190] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 14: [2023-05-10 10:11:01,190] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 4: [2023-05-10 10:11:01,190] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 0: [2023-05-10 10:11:01,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 0: [2023-05-10 10:11:01,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 0: [2023-05-10 10:11:01,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 0: [2023-05-10 10:11:01,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 27: [2023-05-10 10:11:01,190] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 20: [2023-05-10 10:11:01,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 20: [2023-05-10 10:11:01,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 0: [2023-05-10 10:11:01,191] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 0: [2023-05-10 10:11:01,191] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 16: [2023-05-10 10:11:01,192] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 3: [2023-05-10 10:11:01,192] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 28: [2023-05-10 10:11:01,193] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 27: [2023-05-10 10:11:01,193] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 4: [2023-05-10 10:11:01,194] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 14: [2023-05-10 10:11:01,194] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 19: [2023-05-10 10:11:01,194] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 14: [2023-05-10 10:11:01,194] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 11: [2023-05-10 10:11:01,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 27: [2023-05-10 10:11:01,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 26: [2023-05-10 10:11:01,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 27: [2023-05-10 10:11:01,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 19: [2023-05-10 10:11:01,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 19: [2023-05-10 10:11:01,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 4: [2023-05-10 10:11:01,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 13: [2023-05-10 10:11:01,197] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 13: [2023-05-10 10:11:01,197] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 13: [2023-05-10 10:11:01,197] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 30: [2023-05-10 10:11:01,198] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 30: [2023-05-10 10:11:01,198] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 30: [2023-05-10 10:11:01,198] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 29: [2023-05-10 10:11:01,198] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 16: [2023-05-10 10:11:01,199] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 28: [2023-05-10 10:11:01,199] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 30: [2023-05-10 10:11:01,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 30: [2023-05-10 10:11:01,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 26: [2023-05-10 10:11:01,200] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 24: [2023-05-10 10:11:01,201] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 16: [2023-05-10 10:11:01,201] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 24: [2023-05-10 10:11:01,201] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 4: [2023-05-10 10:11:01,201] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 4: [2023-05-10 10:11:01,201] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 4: [2023-05-10 10:11:01,201] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 24: [2023-05-10 10:11:01,201] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 28: [2023-05-10 10:11:01,201] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 29: [2023-05-10 10:11:01,202] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 11: [2023-05-10 10:11:01,203] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 11: [2023-05-10 10:11:01,203] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 11: [2023-05-10 10:11:01,203] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 24: [2023-05-10 10:11:01,203] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 16: [2023-05-10 10:11:01,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 26: [2023-05-10 10:11:01,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 28: [2023-05-10 10:11:01,204] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 26: [2023-05-10 10:11:01,205] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 16: [2023-05-10 10:11:01,205] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 11: [2023-05-10 10:11:01,205] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 24: [2023-05-10 10:11:01,206] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 24: [2023-05-10 10:11:01,206] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 28: [2023-05-10 10:11:01,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 28: [2023-05-10 10:11:01,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 28: [2023-05-10 10:11:01,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 16: [2023-05-10 10:11:01,206] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 4: [2023-05-10 10:11:01,206] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 0: [2023-05-10 10:11:01,207] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 17: [2023-05-10 10:11:01,207] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 29: [2023-05-10 10:11:01,207] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 5: [2023-05-10 10:11:01,207] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 5: [2023-05-10 10:11:01,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 29: [2023-05-10 10:11:01,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 26: [2023-05-10 10:11:01,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 29: [2023-05-10 10:11:01,210] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 2: [2023-05-10 10:11:01,210] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 3: [2023-05-10 10:11:01,210] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 3: [2023-05-10 10:11:01,210] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 30: [2023-05-10 10:11:01,210] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 5: [2023-05-10 10:11:01,211] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 13: [2023-05-10 10:11:01,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 13: [2023-05-10 10:11:01,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 11: [2023-05-10 10:11:01,212] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 13: [2023-05-10 10:11:01,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 13: [2023-05-10 10:11:01,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 13: [2023-05-10 10:11:01,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 20: [2023-05-10 10:11:01,212] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 11: [2023-05-10 10:11:01,212] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 20: [2023-05-10 10:11:01,212] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 5: [2023-05-10 10:11:01,212] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 5: [2023-05-10 10:11:01,213] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 0: [2023-05-10 10:11:01,213] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 30: [2023-05-10 10:11:01,213] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 5: [2023-05-10 10:11:01,213] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 5: [2023-05-10 10:11:01,214] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 2: [2023-05-10 10:11:01,215] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 17: [2023-05-10 10:11:01,215] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 2: [2023-05-10 10:11:01,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 3: [2023-05-10 10:11:01,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 2: [2023-05-10 10:11:01,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 3: [2023-05-10 10:11:01,216] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 3: [2023-05-10 10:11:01,217] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 28: [2023-05-10 10:11:01,217] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 0: [2023-05-10 10:11:01,219] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 0: [2023-05-10 10:11:01,220] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 0: [2023-05-10 10:11:01,220] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 20: [2023-05-10 10:11:01,221] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 20: [2023-05-10 10:11:01,222] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 20: [2023-05-10 10:11:01,222] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 0: [2023-05-10 10:11:01,223] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 0: [2023-05-10 10:11:01,223] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 17: [2023-05-10 10:11:01,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 17: [2023-05-10 10:11:01,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 17: [2023-05-10 10:11:01,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 30: [2023-05-10 10:11:01,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 17: [2023-05-10 10:11:01,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 17: [2023-05-10 10:11:01,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 30: [2023-05-10 10:11:01,227] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 13: [2023-05-10 10:11:01,229] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 30: [2023-05-10 10:11:01,230] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 13: [2023-05-10 10:11:01,231] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 13: [2023-05-10 10:11:01,231] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 30: [2023-05-10 10:11:01,231] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 28: [2023-05-10 10:11:01,231] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 30: [2023-05-10 10:11:01,232] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 11: [2023-05-10 10:11:01,234] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 11: [2023-05-10 10:11:01,234] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 28: [2023-05-10 10:11:01,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 28: [2023-05-10 10:11:01,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 20: [2023-05-10 10:11:01,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 20: [2023-05-10 10:11:01,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 20: [2023-05-10 10:11:01,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_32-model_00-model_states.pt. 11: [2023-05-10 10:11:01,238] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 28: [2023-05-10 10:11:01,239] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 28: [2023-05-10 10:11:01,240] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 11: [2023-05-10 10:11:01,240] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 13: [2023-05-10 10:11:01,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 13: [2023-05-10 10:11:01,246] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 13: [2023-05-10 10:11:01,247] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 13: [2023-05-10 10:11:01,248] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 13: [2023-05-10 10:11:01,251] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 20: [2023-05-10 10:11:01,253] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 20: [2023-05-10 10:11:01,253] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 20: [2023-05-10 10:11:01,254] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 20: [2023-05-10 10:11:01,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 20: [2023-05-10 10:11:01,268] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 20: [2023-05-10 10:11:01,269] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 23: [2023-05-10 10:11:01,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 23: [2023-05-10 10:11:01,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 23: [2023-05-10 10:11:01,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 23: [2023-05-10 10:11:01,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 23: [2023-05-10 10:11:01,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 23: [2023-05-10 10:11:01,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 23: [2023-05-10 10:11:01,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 23: [2023-05-10 10:11:01,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 12: [2023-05-10 10:11:01,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 12: [2023-05-10 10:11:01,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 12: [2023-05-10 10:11:01,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 12: [2023-05-10 10:11:01,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 12: [2023-05-10 10:11:01,296] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 12: [2023-05-10 10:11:01,296] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 12: [2023-05-10 10:11:01,296] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 12: [2023-05-10 10:11:01,296] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 23: [2023-05-10 10:11:01,296] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 23: [2023-05-10 10:11:01,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 23: [2023-05-10 10:11:01,298] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 12: [2023-05-10 10:11:01,298] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 23: [2023-05-10 10:11:01,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 23: [2023-05-10 10:11:01,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 23: [2023-05-10 10:11:01,301] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 23: [2023-05-10 10:11:01,302] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 23: [2023-05-10 10:11:01,302] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 12: [2023-05-10 10:11:01,302] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 12: [2023-05-10 10:11:01,306] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 12: [2023-05-10 10:11:01,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 12: [2023-05-10 10:11:01,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 12: [2023-05-10 10:11:01,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 12: [2023-05-10 10:11:01,308] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 12: [2023-05-10 10:11:01,309] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 15: [2023-05-10 10:11:01,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 15: [2023-05-10 10:11:01,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 15: [2023-05-10 10:11:01,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 15: [2023-05-10 10:11:01,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 15: [2023-05-10 10:11:01,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 15: [2023-05-10 10:11:01,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 15: [2023-05-10 10:11:01,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 15: [2023-05-10 10:11:01,335] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 15: [2023-05-10 10:11:01,338] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 15: [2023-05-10 10:11:01,340] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 12: [2023-05-10 10:11:01,341] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 15: [2023-05-10 10:11:01,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 23: [2023-05-10 10:11:01,345] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 23: [2023-05-10 10:11:01,345] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 15: [2023-05-10 10:11:01,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 15: [2023-05-10 10:11:01,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 15: [2023-05-10 10:11:01,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 15: [2023-05-10 10:11:01,347] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 15: [2023-05-10 10:11:01,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 23: [2023-05-10 10:11:01,361] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 12: [2023-05-10 10:11:01,364] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 23: [2023-05-10 10:11:01,364] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 23: [2023-05-10 10:11:01,364] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 12: [2023-05-10 10:11:01,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 23: [2023-05-10 10:11:01,370] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 23: [2023-05-10 10:11:01,370] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 23: [2023-05-10 10:11:01,374] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 23: [2023-05-10 10:11:01,374] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 23: [2023-05-10 10:11:01,374] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 12: [2023-05-10 10:11:01,376] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 8: [2023-05-10 10:11:01,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 8: [2023-05-10 10:11:01,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 8: [2023-05-10 10:11:01,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 8: [2023-05-10 10:11:01,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 8: [2023-05-10 10:11:01,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 8: [2023-05-10 10:11:01,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 8: [2023-05-10 10:11:01,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 8: [2023-05-10 10:11:01,382] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 8: [2023-05-10 10:11:01,387] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 8: [2023-05-10 10:11:01,387] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 8: [2023-05-10 10:11:01,388] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 8: [2023-05-10 10:11:01,388] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 15: [2023-05-10 10:11:01,388] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 8: [2023-05-10 10:11:01,388] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 8: [2023-05-10 10:11:01,388] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 8: [2023-05-10 10:11:01,389] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 8: [2023-05-10 10:11:01,389] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 23: [2023-05-10 10:11:01,391] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 23: [2023-05-10 10:11:01,392] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 12: [2023-05-10 10:11:01,393] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 12: [2023-05-10 10:11:01,398] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 12: [2023-05-10 10:11:01,398] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 12: [2023-05-10 10:11:01,398] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 12: [2023-05-10 10:11:01,398] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 12: [2023-05-10 10:11:01,398] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 23: [2023-05-10 10:11:01,399] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 15: [2023-05-10 10:11:01,400] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 23: [2023-05-10 10:11:01,402] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 23: [2023-05-10 10:11:01,402] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 12: [2023-05-10 10:11:01,403] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 23: [2023-05-10 10:11:01,405] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 15: [2023-05-10 10:11:01,407] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 15: [2023-05-10 10:11:01,424] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 12: [2023-05-10 10:11:01,428] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 15: [2023-05-10 10:11:01,428] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 12: [2023-05-10 10:11:01,429] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 12: [2023-05-10 10:11:01,429] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 15: [2023-05-10 10:11:01,430] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 15: [2023-05-10 10:11:01,430] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 15: [2023-05-10 10:11:01,430] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 12: [2023-05-10 10:11:01,431] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 15: [2023-05-10 10:11:01,432] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 15: [2023-05-10 10:11:01,432] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 12: [2023-05-10 10:11:01,432] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 8: [2023-05-10 10:11:01,433] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 31: [2023-05-10 10:11:01,435] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 31: [2023-05-10 10:11:01,435] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 31: [2023-05-10 10:11:01,436] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 31: [2023-05-10 10:11:01,436] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 31: [2023-05-10 10:11:01,436] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 31: [2023-05-10 10:11:01,436] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 31: [2023-05-10 10:11:01,436] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 31: [2023-05-10 10:11:01,437] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 31: [2023-05-10 10:11:01,439] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 31: [2023-05-10 10:11:01,441] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 8: [2023-05-10 10:11:01,445] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 21: [2023-05-10 10:11:01,444] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 21: [2023-05-10 10:11:01,444] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 21: [2023-05-10 10:11:01,444] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 21: [2023-05-10 10:11:01,444] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 21: [2023-05-10 10:11:01,444] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 31: [2023-05-10 10:11:01,444] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 31: [2023-05-10 10:11:01,445] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 21: [2023-05-10 10:11:01,444] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 21: [2023-05-10 10:11:01,444] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 21: [2023-05-10 10:11:01,445] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 31: [2023-05-10 10:11:01,445] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 31: [2023-05-10 10:11:01,446] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 31: [2023-05-10 10:11:01,446] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 31: [2023-05-10 10:11:01,447] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 8: [2023-05-10 10:11:01,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 8: [2023-05-10 10:11:01,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 21: [2023-05-10 10:11:01,449] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 21: [2023-05-10 10:11:01,450] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 21: [2023-05-10 10:11:01,451] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 21: [2023-05-10 10:11:01,452] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 21: [2023-05-10 10:11:01,452] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 21: [2023-05-10 10:11:01,454] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 21: [2023-05-10 10:11:01,454] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 21: [2023-05-10 10:11:01,454] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 8: [2023-05-10 10:11:01,457] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 15: [2023-05-10 10:11:01,459] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 8: [2023-05-10 10:11:01,460] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 8: [2023-05-10 10:11:01,460] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 8: [2023-05-10 10:11:01,460] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 8: [2023-05-10 10:11:01,460] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 15: [2023-05-10 10:11:01,464] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 15: [2023-05-10 10:11:01,466] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 15: [2023-05-10 10:11:01,466] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 10: [2023-05-10 10:11:01,466] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 10: [2023-05-10 10:11:01,466] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 15: [2023-05-10 10:11:01,466] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 10: [2023-05-10 10:11:01,467] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 10: [2023-05-10 10:11:01,467] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 10: [2023-05-10 10:11:01,467] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 10: [2023-05-10 10:11:01,467] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 10: [2023-05-10 10:11:01,467] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 10: [2023-05-10 10:11:01,467] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 15: [2023-05-10 10:11:01,467] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 8: [2023-05-10 10:11:01,472] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 8: [2023-05-10 10:11:01,473] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 10: [2023-05-10 10:11:01,473] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 10: [2023-05-10 10:11:01,473] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 10: [2023-05-10 10:11:01,473] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 10: [2023-05-10 10:11:01,474] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 10: [2023-05-10 10:11:01,475] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 10: [2023-05-10 10:11:01,475] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 10: [2023-05-10 10:11:01,476] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 10: [2023-05-10 10:11:01,477] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 8: [2023-05-10 10:11:01,478] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 8: [2023-05-10 10:11:01,489] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 8: [2023-05-10 10:11:01,489] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 8: [2023-05-10 10:11:01,490] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 31: [2023-05-10 10:11:01,491] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 8: [2023-05-10 10:11:01,492] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 31: [2023-05-10 10:11:01,492] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 21: [2023-05-10 10:11:01,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 18: [2023-05-10 10:11:01,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 18: [2023-05-10 10:11:01,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 18: [2023-05-10 10:11:01,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 18: [2023-05-10 10:11:01,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 18: [2023-05-10 10:11:01,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 18: [2023-05-10 10:11:01,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 18: [2023-05-10 10:11:01,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 18: [2023-05-10 10:11:01,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 6: [2023-05-10 10:11:01,508] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 6: [2023-05-10 10:11:01,508] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 21: [2023-05-10 10:11:01,511] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 6: [2023-05-10 10:11:01,511] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 6: [2023-05-10 10:11:01,512] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 31: [2023-05-10 10:11:01,512] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 6: [2023-05-10 10:11:01,512] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 6: [2023-05-10 10:11:01,512] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 18: [2023-05-10 10:11:01,512] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 18: [2023-05-10 10:11:01,512] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 6: [2023-05-10 10:11:01,512] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 18: [2023-05-10 10:11:01,512] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 6: [2023-05-10 10:11:01,513] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 31: [2023-05-10 10:11:01,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 6: [2023-05-10 10:11:01,513] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 6: [2023-05-10 10:11:01,513] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 17: [2023-05-10 10:11:01,513] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 18: [2023-05-10 10:11:01,513] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 31: [2023-05-10 10:11:01,513] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 21: [2023-05-10 10:11:01,513] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 17: [2023-05-10 10:11:01,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 17: [2023-05-10 10:11:01,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 18: [2023-05-10 10:11:01,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 17: [2023-05-10 10:11:01,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 17: [2023-05-10 10:11:01,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 17: [2023-05-10 10:11:01,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 17: [2023-05-10 10:11:01,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 17: [2023-05-10 10:11:01,514] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 18: [2023-05-10 10:11:01,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 18: [2023-05-10 10:11:01,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 18: [2023-05-10 10:11:01,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 31: [2023-05-10 10:11:01,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 17: [2023-05-10 10:11:01,517] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 29: [2023-05-10 10:11:01,517] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 29: [2023-05-10 10:11:01,517] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 21: [2023-05-10 10:11:01,518] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 29: [2023-05-10 10:11:01,518] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 29: [2023-05-10 10:11:01,518] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 6: [2023-05-10 10:11:01,518] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 29: [2023-05-10 10:11:01,518] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 29: [2023-05-10 10:11:01,518] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 29: [2023-05-10 10:11:01,518] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 29: [2023-05-10 10:11:01,519] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 31: [2023-05-10 10:11:01,519] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 31: [2023-05-10 10:11:01,519] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 31: [2023-05-10 10:11:01,519] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 31: [2023-05-10 10:11:01,520] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 21: [2023-05-10 10:11:01,520] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 6: [2023-05-10 10:11:01,520] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 2: [2023-05-10 10:11:01,521] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 2: [2023-05-10 10:11:01,521] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 21: [2023-05-10 10:11:01,521] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 21: [2023-05-10 10:11:01,521] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 21: [2023-05-10 10:11:01,521] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 21: [2023-05-10 10:11:01,521] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 6: [2023-05-10 10:11:01,522] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 29: [2023-05-10 10:11:01,522] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 29: [2023-05-10 10:11:01,522] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 29: [2023-05-10 10:11:01,523] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 6: [2023-05-10 10:11:01,523] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 6: [2023-05-10 10:11:01,523] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 6: [2023-05-10 10:11:01,524] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 17: [2023-05-10 10:11:01,524] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 17: [2023-05-10 10:11:01,524] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 2: [2023-05-10 10:11:01,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 2: [2023-05-10 10:11:01,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 2: [2023-05-10 10:11:01,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 2: [2023-05-10 10:11:01,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 2: [2023-05-10 10:11:01,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 2: [2023-05-10 10:11:01,525] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 2: [2023-05-10 10:11:01,525] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 2: [2023-05-10 10:11:01,526] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 17: [2023-05-10 10:11:01,526] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 17: [2023-05-10 10:11:01,526] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 29: [2023-05-10 10:11:01,527] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 29: [2023-05-10 10:11:01,527] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 29: [2023-05-10 10:11:01,527] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 29: [2023-05-10 10:11:01,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 17: [2023-05-10 10:11:01,527] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 17: [2023-05-10 10:11:01,528] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 29: [2023-05-10 10:11:01,529] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 10: [2023-05-10 10:11:01,528] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 10: [2023-05-10 10:11:01,528] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 10: [2023-05-10 10:11:01,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 17: [2023-05-10 10:11:01,530] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 10: [2023-05-10 10:11:01,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 2: [2023-05-10 10:11:01,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 2: [2023-05-10 10:11:01,535] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 2: [2023-05-10 10:11:01,535] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 2: [2023-05-10 10:11:01,536] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 2: [2023-05-10 10:11:01,536] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 2: [2023-05-10 10:11:01,537] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 10: [2023-05-10 10:11:01,537] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 21: [2023-05-10 10:11:01,542] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 31: [2023-05-10 10:11:01,544] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 21: [2023-05-10 10:11:01,546] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 10: [2023-05-10 10:11:01,546] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 10: [2023-05-10 10:11:01,546] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 10: [2023-05-10 10:11:01,546] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 31: [2023-05-10 10:11:01,547] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 21: [2023-05-10 10:11:01,548] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 31: [2023-05-10 10:11:01,553] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 10: [2023-05-10 10:11:01,554] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 21: [2023-05-10 10:11:01,554] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 31: [2023-05-10 10:11:01,555] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 21: [2023-05-10 10:11:01,555] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 31: [2023-05-10 10:11:01,555] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 31: [2023-05-10 10:11:01,555] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 6: [2023-05-10 10:11:01,555] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 21: [2023-05-10 10:11:01,556] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 21: [2023-05-10 10:11:01,557] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 10: [2023-05-10 10:11:01,557] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 10: [2023-05-10 10:11:01,557] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 10: [2023-05-10 10:11:01,558] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 10: [2023-05-10 10:11:01,565] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 6: [2023-05-10 10:11:01,566] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 7: [2023-05-10 10:11:01,567] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 7: [2023-05-10 10:11:01,567] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 7: [2023-05-10 10:11:01,567] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 7: [2023-05-10 10:11:01,567] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 7: [2023-05-10 10:11:01,567] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 7: [2023-05-10 10:11:01,567] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 7: [2023-05-10 10:11:01,567] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 7: [2023-05-10 10:11:01,568] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 17: [2023-05-10 10:11:01,568] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 18: [2023-05-10 10:11:01,572] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 18: [2023-05-10 10:11:01,572] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 18: [2023-05-10 10:11:01,572] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 18: [2023-05-10 10:11:01,572] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 7: [2023-05-10 10:11:01,572] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 7: [2023-05-10 10:11:01,573] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 7: [2023-05-10 10:11:01,573] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 7: [2023-05-10 10:11:01,573] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 7: [2023-05-10 10:11:01,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 6: [2023-05-10 10:11:01,575] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 7: [2023-05-10 10:11:01,575] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 7: [2023-05-10 10:11:01,576] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 7: [2023-05-10 10:11:01,576] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 2: [2023-05-10 10:11:01,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 2: [2023-05-10 10:11:01,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 18: [2023-05-10 10:11:01,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 18: [2023-05-10 10:11:01,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 18: [2023-05-10 10:11:01,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 18: [2023-05-10 10:11:01,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 27: [2023-05-10 10:11:01,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 27: [2023-05-10 10:11:01,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 27: [2023-05-10 10:11:01,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 29: [2023-05-10 10:11:01,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 29: [2023-05-10 10:11:01,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 29: [2023-05-10 10:11:01,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 27: [2023-05-10 10:11:01,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 27: [2023-05-10 10:11:01,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 27: [2023-05-10 10:11:01,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 27: [2023-05-10 10:11:01,577] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 27: [2023-05-10 10:11:01,578] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 6: [2023-05-10 10:11:01,578] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 10: [2023-05-10 10:11:01,580] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 27: [2023-05-10 10:11:01,581] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 27: [2023-05-10 10:11:01,583] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 10: [2023-05-10 10:11:01,583] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 27: [2023-05-10 10:11:01,584] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 27: [2023-05-10 10:11:01,584] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 10: [2023-05-10 10:11:01,585] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 27: [2023-05-10 10:11:01,586] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 27: [2023-05-10 10:11:01,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 27: [2023-05-10 10:11:01,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 27: [2023-05-10 10:11:01,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 17: [2023-05-10 10:11:01,587] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 6: [2023-05-10 10:11:01,588] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 29: [2023-05-10 10:11:01,590] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 1: [2023-05-10 10:11:01,592] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 1: [2023-05-10 10:11:01,592] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 1: [2023-05-10 10:11:01,593] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 1: [2023-05-10 10:11:01,593] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 1: [2023-05-10 10:11:01,593] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 1: [2023-05-10 10:11:01,593] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 1: [2023-05-10 10:11:01,593] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 1: [2023-05-10 10:11:01,593] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 29: [2023-05-10 10:11:01,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 29: [2023-05-10 10:11:01,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 29: [2023-05-10 10:11:01,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 29: [2023-05-10 10:11:01,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 2: [2023-05-10 10:11:01,597] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 6: [2023-05-10 10:11:01,596] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 6: [2023-05-10 10:11:01,597] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 2: [2023-05-10 10:11:01,598] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 1: [2023-05-10 10:11:01,598] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 1: [2023-05-10 10:11:01,598] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 1: [2023-05-10 10:11:01,599] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 17: [2023-05-10 10:11:01,599] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 2: [2023-05-10 10:11:01,599] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 29: [2023-05-10 10:11:01,599] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 1: [2023-05-10 10:11:01,600] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 17: [2023-05-10 10:11:01,600] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 1: [2023-05-10 10:11:01,600] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 17: [2023-05-10 10:11:01,600] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 1: [2023-05-10 10:11:01,600] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 29: [2023-05-10 10:11:01,600] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 6: [2023-05-10 10:11:01,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 18: [2023-05-10 10:11:01,600] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 18: [2023-05-10 10:11:01,601] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 1: [2023-05-10 10:11:01,601] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 17: [2023-05-10 10:11:01,602] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 1: [2023-05-10 10:11:01,602] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 17: [2023-05-10 10:11:01,602] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 26: [2023-05-10 10:11:01,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 26: [2023-05-10 10:11:01,601] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 26: [2023-05-10 10:11:01,602] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 26: [2023-05-10 10:11:01,602] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 26: [2023-05-10 10:11:01,602] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 26: [2023-05-10 10:11:01,602] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 26: [2023-05-10 10:11:01,602] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 26: [2023-05-10 10:11:01,602] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 29: [2023-05-10 10:11:01,602] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 6: [2023-05-10 10:11:01,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 18: [2023-05-10 10:11:01,604] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 18: [2023-05-10 10:11:01,604] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 6: [2023-05-10 10:11:01,605] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 0: [2023-05-10 10:11:01,606] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 0: [2023-05-10 10:11:01,606] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 0: [2023-05-10 10:11:01,606] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 0: [2023-05-10 10:11:01,606] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 0: [2023-05-10 10:11:01,606] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 0: [2023-05-10 10:11:01,606] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 0: [2023-05-10 10:11:01,606] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 26: [2023-05-10 10:11:01,606] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 0: [2023-05-10 10:11:01,606] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 2: [2023-05-10 10:11:01,606] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 2: [2023-05-10 10:11:01,606] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 2: [2023-05-10 10:11:01,606] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 26: [2023-05-10 10:11:01,607] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 18: [2023-05-10 10:11:01,608] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 18: [2023-05-10 10:11:01,608] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 0: [2023-05-10 10:11:01,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 0: [2023-05-10 10:11:01,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 18: [2023-05-10 10:11:01,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 18: [2023-05-10 10:11:01,610] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 0: [2023-05-10 10:11:01,611] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 26: [2023-05-10 10:11:01,611] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 0: [2023-05-10 10:11:01,612] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 0: [2023-05-10 10:11:01,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 0: [2023-05-10 10:11:01,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 26: [2023-05-10 10:11:01,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 26: [2023-05-10 10:11:01,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 0: [2023-05-10 10:11:01,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 0: [2023-05-10 10:11:01,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 26: [2023-05-10 10:11:01,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 26: [2023-05-10 10:11:01,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 26: [2023-05-10 10:11:01,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 6: [2023-05-10 10:11:01,615] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 19: [2023-05-10 10:11:01,615] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 19: [2023-05-10 10:11:01,615] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 19: [2023-05-10 10:11:01,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 19: [2023-05-10 10:11:01,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 19: [2023-05-10 10:11:01,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 19: [2023-05-10 10:11:01,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 19: [2023-05-10 10:11:01,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 19: [2023-05-10 10:11:01,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 2: [2023-05-10 10:11:01,616] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 19: [2023-05-10 10:11:01,619] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 19: [2023-05-10 10:11:01,620] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 9: [2023-05-10 10:11:01,620] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 9: [2023-05-10 10:11:01,620] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 29: [2023-05-10 10:11:01,620] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 19: [2023-05-10 10:11:01,620] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 19: [2023-05-10 10:11:01,621] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 7: [2023-05-10 10:11:01,621] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 19: [2023-05-10 10:11:01,622] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 19: [2023-05-10 10:11:01,623] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 9: [2023-05-10 10:11:01,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 9: [2023-05-10 10:11:01,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 9: [2023-05-10 10:11:01,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 9: [2023-05-10 10:11:01,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 9: [2023-05-10 10:11:01,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 9: [2023-05-10 10:11:01,624] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 17: [2023-05-10 10:11:01,624] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 19: [2023-05-10 10:11:01,625] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 19: [2023-05-10 10:11:01,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 9: [2023-05-10 10:11:01,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 9: [2023-05-10 10:11:01,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 7: [2023-05-10 10:11:01,626] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 29: [2023-05-10 10:11:01,627] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 2: [2023-05-10 10:11:01,630] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 9: [2023-05-10 10:11:01,630] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 29: [2023-05-10 10:11:01,630] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 29: [2023-05-10 10:11:01,630] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 29: [2023-05-10 10:11:01,630] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 6: [2023-05-10 10:11:01,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 6: [2023-05-10 10:11:01,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 6: [2023-05-10 10:11:01,631] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 9: [2023-05-10 10:11:01,632] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 2: [2023-05-10 10:11:01,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 9: [2023-05-10 10:11:01,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 9: [2023-05-10 10:11:01,633] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 27: [2023-05-10 10:11:01,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 9: [2023-05-10 10:11:01,634] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 9: [2023-05-10 10:11:01,634] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 25: [2023-05-10 10:11:01,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 25: [2023-05-10 10:11:01,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 17: [2023-05-10 10:11:01,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 17: [2023-05-10 10:11:01,636] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 25: [2023-05-10 10:11:01,636] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 25: [2023-05-10 10:11:01,636] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 25: [2023-05-10 10:11:01,636] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 25: [2023-05-10 10:11:01,636] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 25: [2023-05-10 10:11:01,636] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 27: [2023-05-10 10:11:01,636] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 25: [2023-05-10 10:11:01,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 7: [2023-05-10 10:11:01,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 7: [2023-05-10 10:11:01,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 7: [2023-05-10 10:11:01,637] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 6: [2023-05-10 10:11:01,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 2: [2023-05-10 10:11:01,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 17: [2023-05-10 10:11:01,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 2: [2023-05-10 10:11:01,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 17: [2023-05-10 10:11:01,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 17: [2023-05-10 10:11:01,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 7: [2023-05-10 10:11:01,640] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 7: [2023-05-10 10:11:01,640] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 7: [2023-05-10 10:11:01,640] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 27: [2023-05-10 10:11:01,641] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 2: [2023-05-10 10:11:01,641] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 27: [2023-05-10 10:11:01,641] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 25: [2023-05-10 10:11:01,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 25: [2023-05-10 10:11:01,641] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 25: [2023-05-10 10:11:01,641] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 6: [2023-05-10 10:11:01,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 25: [2023-05-10 10:11:01,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 17: [2023-05-10 10:11:01,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 25: [2023-05-10 10:11:01,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 25: [2023-05-10 10:11:01,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 25: [2023-05-10 10:11:01,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 25: [2023-05-10 10:11:01,644] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 7: [2023-05-10 10:11:01,645] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 2: [2023-05-10 10:11:01,647] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 13: [2023-05-10 10:11:01,646] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 13: [2023-05-10 10:11:01,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 13: [2023-05-10 10:11:01,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 13: [2023-05-10 10:11:01,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 13: [2023-05-10 10:11:01,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 13: [2023-05-10 10:11:01,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 13: [2023-05-10 10:11:01,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 13: [2023-05-10 10:11:01,647] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 27: [2023-05-10 10:11:01,653] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 7: [2023-05-10 10:11:01,653] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 20: [2023-05-10 10:11:01,655] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 20: [2023-05-10 10:11:01,655] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 20: [2023-05-10 10:11:01,655] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 1: [2023-05-10 10:11:01,655] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 1: [2023-05-10 10:11:01,655] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 22: [2023-05-10 10:11:01,655] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 22: [2023-05-10 10:11:01,655] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 20: [2023-05-10 10:11:01,655] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 20: [2023-05-10 10:11:01,655] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 20: [2023-05-10 10:11:01,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 20: [2023-05-10 10:11:01,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 22: [2023-05-10 10:11:01,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 22: [2023-05-10 10:11:01,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 20: [2023-05-10 10:11:01,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 17: [2023-05-10 10:11:01,656] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 22: [2023-05-10 10:11:01,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 22: [2023-05-10 10:11:01,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 22: [2023-05-10 10:11:01,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 22: [2023-05-10 10:11:01,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 27: [2023-05-10 10:11:01,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 27: [2023-05-10 10:11:01,658] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 1: [2023-05-10 10:11:01,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 1: [2023-05-10 10:11:01,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 27: [2023-05-10 10:11:01,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 27: [2023-05-10 10:11:01,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 27: [2023-05-10 10:11:01,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 26: [2023-05-10 10:11:01,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 26: [2023-05-10 10:11:01,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 20: [2023-05-10 10:11:01,659] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 2: [2023-05-10 10:11:01,659] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 22: [2023-05-10 10:11:01,659] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 20: [2023-05-10 10:11:01,660] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 13: [2023-05-10 10:11:01,659] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 16: [2023-05-10 10:11:01,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 16: [2023-05-10 10:11:01,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 16: [2023-05-10 10:11:01,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 16: [2023-05-10 10:11:01,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 16: [2023-05-10 10:11:01,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 13: [2023-05-10 10:11:01,660] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 16: [2023-05-10 10:11:01,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 16: [2023-05-10 10:11:01,660] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 13: [2023-05-10 10:11:01,661] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 7: [2023-05-10 10:11:01,661] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 13: [2023-05-10 10:11:01,661] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 16: [2023-05-10 10:11:01,661] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 13: [2023-05-10 10:11:01,662] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 13: [2023-05-10 10:11:01,662] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 22: [2023-05-10 10:11:01,661] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 13: [2023-05-10 10:11:01,662] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 7: [2023-05-10 10:11:01,663] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 19: [2023-05-10 10:11:01,663] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 13: [2023-05-10 10:11:01,664] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 20: [2023-05-10 10:11:01,663] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 0: [2023-05-10 10:11:01,664] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 17: [2023-05-10 10:11:01,664] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 27: [2023-05-10 10:11:01,665] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 20: [2023-05-10 10:11:01,665] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 22: [2023-05-10 10:11:01,664] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 22: [2023-05-10 10:11:01,665] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 20: [2023-05-10 10:11:01,665] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 7: [2023-05-10 10:11:01,664] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 20: [2023-05-10 10:11:01,666] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 27: [2023-05-10 10:11:01,666] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 20: [2023-05-10 10:11:01,666] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 16: [2023-05-10 10:11:01,666] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 22: [2023-05-10 10:11:01,666] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 16: [2023-05-10 10:11:01,667] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 22: [2023-05-10 10:11:01,667] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 22: [2023-05-10 10:11:01,667] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 20: [2023-05-10 10:11:01,668] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 7: [2023-05-10 10:11:01,668] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 22: [2023-05-10 10:11:01,668] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 16: [2023-05-10 10:11:01,668] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 24: [2023-05-10 10:11:01,669] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 16: [2023-05-10 10:11:01,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 24: [2023-05-10 10:11:01,669] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 1: [2023-05-10 10:11:01,669] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 16: [2023-05-10 10:11:01,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 16: [2023-05-10 10:11:01,670] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 7: [2023-05-10 10:11:01,670] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 19: [2023-05-10 10:11:01,670] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 3: [2023-05-10 10:11:01,671] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 3: [2023-05-10 10:11:01,671] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 16: [2023-05-10 10:11:01,671] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 1: [2023-05-10 10:11:01,671] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 1: [2023-05-10 10:11:01,671] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 1: [2023-05-10 10:11:01,671] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 7: [2023-05-10 10:11:01,671] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 0: [2023-05-10 10:11:01,671] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 16: [2023-05-10 10:11:01,671] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 0: [2023-05-10 10:11:01,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 3: [2023-05-10 10:11:01,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 3: [2023-05-10 10:11:01,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 3: [2023-05-10 10:11:01,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 3: [2023-05-10 10:11:01,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 3: [2023-05-10 10:11:01,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 3: [2023-05-10 10:11:01,673] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 26: [2023-05-10 10:11:01,672] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 24: [2023-05-10 10:11:01,674] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 24: [2023-05-10 10:11:01,674] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 3: [2023-05-10 10:11:01,674] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 24: [2023-05-10 10:11:01,675] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 24: [2023-05-10 10:11:01,675] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 24: [2023-05-10 10:11:01,675] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 24: [2023-05-10 10:11:01,675] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 24: [2023-05-10 10:11:01,675] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 24: [2023-05-10 10:11:01,675] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 19: [2023-05-10 10:11:01,675] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 0: [2023-05-10 10:11:01,676] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 3: [2023-05-10 10:11:01,676] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 19: [2023-05-10 10:11:01,676] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 3: [2023-05-10 10:11:01,677] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 14: [2023-05-10 10:11:01,677] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 14: [2023-05-10 10:11:01,677] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 14: [2023-05-10 10:11:01,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 14: [2023-05-10 10:11:01,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 14: [2023-05-10 10:11:01,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 14: [2023-05-10 10:11:01,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 14: [2023-05-10 10:11:01,679] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 14: [2023-05-10 10:11:01,680] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 1: [2023-05-10 10:11:01,681] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 1: [2023-05-10 10:11:01,681] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 19: [2023-05-10 10:11:01,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 19: [2023-05-10 10:11:01,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 24: [2023-05-10 10:11:01,682] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 3: [2023-05-10 10:11:01,681] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 26: [2023-05-10 10:11:01,682] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 3: [2023-05-10 10:11:01,682] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 3: [2023-05-10 10:11:01,682] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 24: [2023-05-10 10:11:01,682] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 26: [2023-05-10 10:11:01,682] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 24: [2023-05-10 10:11:01,682] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 3: [2023-05-10 10:11:01,682] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 19: [2023-05-10 10:11:01,683] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 24: [2023-05-10 10:11:01,683] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 24: [2023-05-10 10:11:01,683] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 3: [2023-05-10 10:11:01,683] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 9: [2023-05-10 10:11:01,683] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 9: [2023-05-10 10:11:01,683] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 9: [2023-05-10 10:11:01,683] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 24: [2023-05-10 10:11:01,683] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 26: [2023-05-10 10:11:01,684] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 26: [2023-05-10 10:11:01,684] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 26: [2023-05-10 10:11:01,684] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 27: [2023-05-10 10:11:01,685] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 14: [2023-05-10 10:11:01,685] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 14: [2023-05-10 10:11:01,685] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 14: [2023-05-10 10:11:01,685] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 27: [2023-05-10 10:11:01,686] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 1: [2023-05-10 10:11:01,686] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 1: [2023-05-10 10:11:01,686] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 9: [2023-05-10 10:11:01,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 14: [2023-05-10 10:11:01,686] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 14: [2023-05-10 10:11:01,687] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 0: [2023-05-10 10:11:01,687] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 0: [2023-05-10 10:11:01,687] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 0: [2023-05-10 10:11:01,687] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 26: [2023-05-10 10:11:01,687] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 26: [2023-05-10 10:11:01,687] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 14: [2023-05-10 10:11:01,688] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 0: [2023-05-10 10:11:01,688] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 27: [2023-05-10 10:11:01,689] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 4: [2023-05-10 10:11:01,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 4: [2023-05-10 10:11:01,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 4: [2023-05-10 10:11:01,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 14: [2023-05-10 10:11:01,689] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 4: [2023-05-10 10:11:01,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 4: [2023-05-10 10:11:01,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 4: [2023-05-10 10:11:01,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 4: [2023-05-10 10:11:01,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 27: [2023-05-10 10:11:01,689] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 25: [2023-05-10 10:11:01,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 14: [2023-05-10 10:11:01,689] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 4: [2023-05-10 10:11:01,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 0: [2023-05-10 10:11:01,691] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 15: [2023-05-10 10:11:01,693] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 15: [2023-05-10 10:11:01,693] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 15: [2023-05-10 10:11:01,693] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 15: [2023-05-10 10:11:01,693] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 15: [2023-05-10 10:11:01,693] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 15: [2023-05-10 10:11:01,693] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 15: [2023-05-10 10:11:01,693] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 15: [2023-05-10 10:11:01,694] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 30: [2023-05-10 10:11:01,694] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 0: [2023-05-10 10:11:01,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 30: [2023-05-10 10:11:01,694] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 30: [2023-05-10 10:11:01,694] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 30: [2023-05-10 10:11:01,694] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 30: [2023-05-10 10:11:01,694] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 30: [2023-05-10 10:11:01,694] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 30: [2023-05-10 10:11:01,694] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 4: [2023-05-10 10:11:01,694] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 30: [2023-05-10 10:11:01,694] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 11: [2023-05-10 10:11:01,695] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 11: [2023-05-10 10:11:01,695] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 4: [2023-05-10 10:11:01,695] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 4: [2023-05-10 10:11:01,695] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 4: [2023-05-10 10:11:01,695] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 4: [2023-05-10 10:11:01,695] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 28: [2023-05-10 10:11:01,695] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 28: [2023-05-10 10:11:01,695] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 28: [2023-05-10 10:11:01,695] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 28: [2023-05-10 10:11:01,695] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 28: [2023-05-10 10:11:01,695] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 11: [2023-05-10 10:11:01,695] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 11: [2023-05-10 10:11:01,695] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 11: [2023-05-10 10:11:01,695] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 11: [2023-05-10 10:11:01,695] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 11: [2023-05-10 10:11:01,695] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 28: [2023-05-10 10:11:01,695] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 28: [2023-05-10 10:11:01,695] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 28: [2023-05-10 10:11:01,695] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 11: [2023-05-10 10:11:01,696] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 9: [2023-05-10 10:11:01,695] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 19: [2023-05-10 10:11:01,696] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 19: [2023-05-10 10:11:01,696] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 15: [2023-05-10 10:11:01,697] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 19: [2023-05-10 10:11:01,697] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 4: [2023-05-10 10:11:01,697] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 4: [2023-05-10 10:11:01,697] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 0: [2023-05-10 10:11:01,698] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 4: [2023-05-10 10:11:01,699] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 28: [2023-05-10 10:11:01,699] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 28: [2023-05-10 10:11:01,699] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 11: [2023-05-10 10:11:01,699] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 11: [2023-05-10 10:11:01,699] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 9: [2023-05-10 10:11:01,701] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 9: [2023-05-10 10:11:01,701] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 1: [2023-05-10 10:11:01,701] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 25: [2023-05-10 10:11:01,701] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 25: [2023-05-10 10:11:01,701] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 25: [2023-05-10 10:11:01,701] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 9: [2023-05-10 10:11:01,702] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 26: [2023-05-10 10:11:01,702] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 15: [2023-05-10 10:11:01,703] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 15: [2023-05-10 10:11:01,703] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 28: [2023-05-10 10:11:01,703] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 1: [2023-05-10 10:11:01,704] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 30: [2023-05-10 10:11:01,703] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 1: [2023-05-10 10:11:01,704] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 0: [2023-05-10 10:11:01,704] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 25: [2023-05-10 10:11:01,704] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 15: [2023-05-10 10:11:01,704] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 19: [2023-05-10 10:11:01,704] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 15: [2023-05-10 10:11:01,704] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 20: [2023-05-10 10:11:01,704] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 28: [2023-05-10 10:11:01,705] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 15: [2023-05-10 10:11:01,705] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 28: [2023-05-10 10:11:01,705] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 28: [2023-05-10 10:11:01,705] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 1: [2023-05-10 10:11:01,706] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 22: [2023-05-10 10:11:01,705] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 30: [2023-05-10 10:11:01,705] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 11: [2023-05-10 10:11:01,706] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 15: [2023-05-10 10:11:01,706] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 28: [2023-05-10 10:11:01,706] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 30: [2023-05-10 10:11:01,707] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 9: [2023-05-10 10:11:01,707] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 19: [2023-05-10 10:11:01,707] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 30: [2023-05-10 10:11:01,707] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 15: [2023-05-10 10:11:01,707] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 11: [2023-05-10 10:11:01,707] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 11: [2023-05-10 10:11:01,707] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 28: [2023-05-10 10:11:01,707] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 19: [2023-05-10 10:11:01,707] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 11: [2023-05-10 10:11:01,708] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 25: [2023-05-10 10:11:01,707] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 25: [2023-05-10 10:11:01,707] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 25: [2023-05-10 10:11:01,707] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 30: [2023-05-10 10:11:01,708] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 11: [2023-05-10 10:11:01,708] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 30: [2023-05-10 10:11:01,708] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 11: [2023-05-10 10:11:01,708] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 9: [2023-05-10 10:11:01,709] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 30: [2023-05-10 10:11:01,709] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 9: [2023-05-10 10:11:01,710] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 19: [2023-05-10 10:11:01,710] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 30: [2023-05-10 10:11:01,711] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 16: [2023-05-10 10:11:01,712] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 25: [2023-05-10 10:11:01,713] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 0: [2023-05-10 10:11:01,714] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 9: [2023-05-10 10:11:01,714] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 0: [2023-05-10 10:11:01,715] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 26: [2023-05-10 10:11:01,716] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 20: [2023-05-10 10:11:01,717] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 22: [2023-05-10 10:11:01,718] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 0: [2023-05-10 10:11:01,718] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 0: [2023-05-10 10:11:01,718] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 26: [2023-05-10 10:11:01,719] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 26: [2023-05-10 10:11:01,720] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 16: [2023-05-10 10:11:01,723] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 26: [2023-05-10 10:11:01,724] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 26: [2023-05-10 10:11:01,724] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 20: [2023-05-10 10:11:01,724] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 19: [2023-05-10 10:11:01,725] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 19: [2023-05-10 10:11:01,725] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 24: [2023-05-10 10:11:01,724] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 9: [2023-05-10 10:11:01,725] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 9: [2023-05-10 10:11:01,726] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 9: [2023-05-10 10:11:01,728] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 25: [2023-05-10 10:11:01,728] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 24: [2023-05-10 10:11:01,728] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 3: [2023-05-10 10:11:01,728] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 3: [2023-05-10 10:11:01,728] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 3: [2023-05-10 10:11:01,728] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 25: [2023-05-10 10:11:01,728] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 22: [2023-05-10 10:11:01,728] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 25: [2023-05-10 10:11:01,729] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 9: [2023-05-10 10:11:01,730] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 22: [2023-05-10 10:11:01,731] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 20: [2023-05-10 10:11:01,731] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 25: [2023-05-10 10:11:01,733] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 14: [2023-05-10 10:11:01,733] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 16: [2023-05-10 10:11:01,734] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 25: [2023-05-10 10:11:01,734] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 20: [2023-05-10 10:11:01,736] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 22: [2023-05-10 10:11:01,737] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 25: [2023-05-10 10:11:01,738] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 25: [2023-05-10 10:11:01,738] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 20: [2023-05-10 10:11:01,738] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 13: [2023-05-10 10:11:01,739] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 13: [2023-05-10 10:11:01,740] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 13: [2023-05-10 10:11:01,740] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 16: [2023-05-10 10:11:01,743] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 16: [2023-05-10 10:11:01,743] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 16: [2023-05-10 10:11:01,743] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 16: [2023-05-10 10:11:01,743] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 3: [2023-05-10 10:11:01,743] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 16: [2023-05-10 10:11:01,743] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 16: [2023-05-10 10:11:01,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 22: [2023-05-10 10:11:01,745] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 22: [2023-05-10 10:11:01,745] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 15: [2023-05-10 10:11:01,745] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 14: [2023-05-10 10:11:01,746] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 14: [2023-05-10 10:11:01,746] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 24: [2023-05-10 10:11:01,747] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 20: [2023-05-10 10:11:01,746] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 20: [2023-05-10 10:11:01,746] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 20: [2023-05-10 10:11:01,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 4: [2023-05-10 10:11:01,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 24: [2023-05-10 10:11:01,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 13: [2023-05-10 10:11:01,748] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 13: [2023-05-10 10:11:01,748] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 13: [2023-05-10 10:11:01,748] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 13: [2023-05-10 10:11:01,748] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 13: [2023-05-10 10:11:01,748] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 16: [2023-05-10 10:11:01,748] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 4: [2023-05-10 10:11:01,749] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 3: [2023-05-10 10:11:01,749] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 3: [2023-05-10 10:11:01,749] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 3: [2023-05-10 10:11:01,749] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 3: [2023-05-10 10:11:01,749] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 22: [2023-05-10 10:11:01,749] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 22: [2023-05-10 10:11:01,750] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 22: [2023-05-10 10:11:01,750] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 28: [2023-05-10 10:11:01,750] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 28: [2023-05-10 10:11:01,750] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 5: [2023-05-10 10:11:01,751] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 5: [2023-05-10 10:11:01,751] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 5: [2023-05-10 10:11:01,751] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 3: [2023-05-10 10:11:01,752] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 5: [2023-05-10 10:11:01,752] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 5: [2023-05-10 10:11:01,752] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 11: [2023-05-10 10:11:01,752] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 24: [2023-05-10 10:11:01,752] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 5: [2023-05-10 10:11:01,752] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 5: [2023-05-10 10:11:01,752] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 3: [2023-05-10 10:11:01,752] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 3: [2023-05-10 10:11:01,752] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 5: [2023-05-10 10:11:01,752] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 11: [2023-05-10 10:11:01,753] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 22: [2023-05-10 10:11:01,753] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 14: [2023-05-10 10:11:01,754] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 14: [2023-05-10 10:11:01,754] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 14: [2023-05-10 10:11:01,754] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 14: [2023-05-10 10:11:01,755] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 14: [2023-05-10 10:11:01,756] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 4: [2023-05-10 10:11:01,757] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 4: [2023-05-10 10:11:01,757] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 4: [2023-05-10 10:11:01,757] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 5: [2023-05-10 10:11:01,757] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 5: [2023-05-10 10:11:01,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 5: [2023-05-10 10:11:01,759] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 24: [2023-05-10 10:11:01,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 24: [2023-05-10 10:11:01,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 24: [2023-05-10 10:11:01,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 24: [2023-05-10 10:11:01,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 20: [2023-05-10 10:11:01,759] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 24: [2023-05-10 10:11:01,759] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 14: [2023-05-10 10:11:01,760] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 5: [2023-05-10 10:11:01,761] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 5: [2023-05-10 10:11:01,761] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 20: [2023-05-10 10:11:01,761] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 5: [2023-05-10 10:11:01,762] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 5: [2023-05-10 10:11:01,762] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 5: [2023-05-10 10:11:01,762] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt... 20: [2023-05-10 10:11:01,764] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 4: [2023-05-10 10:11:01,765] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 4: [2023-05-10 10:11:01,765] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 4: [2023-05-10 10:11:01,765] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 22: [2023-05-10 10:11:01,767] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 15: [2023-05-10 10:11:01,768] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 24: [2023-05-10 10:11:01,771] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 3: [2023-05-10 10:11:01,771] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 28: [2023-05-10 10:11:01,771] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 28: [2023-05-10 10:11:01,772] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 4: [2023-05-10 10:11:01,772] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 15: [2023-05-10 10:11:01,773] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 16: [2023-05-10 10:11:01,773] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 13: [2023-05-10 10:11:01,773] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 13: [2023-05-10 10:11:01,773] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 13: [2023-05-10 10:11:01,774] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 4: [2023-05-10 10:11:01,774] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 11: [2023-05-10 10:11:01,774] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 11: [2023-05-10 10:11:01,774] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 16: [2023-05-10 10:11:01,775] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 22: [2023-05-10 10:11:01,775] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 11: [2023-05-10 10:11:01,775] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 11: [2023-05-10 10:11:01,775] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 28: [2023-05-10 10:11:01,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 28: [2023-05-10 10:11:01,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 28: [2023-05-10 10:11:01,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 28: [2023-05-10 10:11:01,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 16: [2023-05-10 10:11:01,777] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 28: [2023-05-10 10:11:01,777] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 15: [2023-05-10 10:11:01,777] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 15: [2023-05-10 10:11:01,777] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 20: [2023-05-10 10:11:01,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 16: [2023-05-10 10:11:01,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 11: [2023-05-10 10:11:01,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 11: [2023-05-10 10:11:01,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 11: [2023-05-10 10:11:01,778] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 16: [2023-05-10 10:11:01,779] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 3: [2023-05-10 10:11:01,779] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 3: [2023-05-10 10:11:01,779] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 22: [2023-05-10 10:11:01,779] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 14: [2023-05-10 10:11:01,780] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 30: [2023-05-10 10:11:01,780] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 14: [2023-05-10 10:11:01,780] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 28: [2023-05-10 10:11:01,780] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 16: [2023-05-10 10:11:01,781] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 20: [2023-05-10 10:11:01,781] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 20: [2023-05-10 10:11:01,781] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 3: [2023-05-10 10:11:01,781] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 22: [2023-05-10 10:11:01,781] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 22: [2023-05-10 10:11:01,781] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 4: [2023-05-10 10:11:01,781] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 11: [2023-05-10 10:11:01,781] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 3: [2023-05-10 10:11:01,782] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 4: [2023-05-10 10:11:01,783] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 30: [2023-05-10 10:11:01,783] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 4: [2023-05-10 10:11:01,783] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 13: [2023-05-10 10:11:01,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 13: [2023-05-10 10:11:01,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 30: [2023-05-10 10:11:01,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 14: [2023-05-10 10:11:01,785] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 20: [2023-05-10 10:11:01,787] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 14: [2023-05-10 10:11:01,788] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 30: [2023-05-10 10:11:01,788] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 14: [2023-05-10 10:11:01,788] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 15: [2023-05-10 10:11:01,789] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 15: [2023-05-10 10:11:01,789] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 15: [2023-05-10 10:11:01,789] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 24: [2023-05-10 10:11:01,789] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 24: [2023-05-10 10:11:01,789] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 14: [2023-05-10 10:11:01,789] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 13: [2023-05-10 10:11:01,789] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 13: [2023-05-10 10:11:01,790] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 13: [2023-05-10 10:11:01,790] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 14: [2023-05-10 10:11:01,791] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 24: [2023-05-10 10:11:01,793] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 24: [2023-05-10 10:11:01,793] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 4: [2023-05-10 10:11:01,795] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 4: [2023-05-10 10:11:01,795] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 24: [2023-05-10 10:11:01,795] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 4: [2023-05-10 10:11:01,797] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 15: [2023-05-10 10:11:01,798] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 11: [2023-05-10 10:11:01,805] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 15: [2023-05-10 10:11:01,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 28: [2023-05-10 10:11:01,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 11: [2023-05-10 10:11:01,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 11: [2023-05-10 10:11:01,809] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 15: [2023-05-10 10:11:01,810] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 15: [2023-05-10 10:11:01,810] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 11: [2023-05-10 10:11:01,811] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 30: [2023-05-10 10:11:01,811] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 30: [2023-05-10 10:11:01,812] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 30: [2023-05-10 10:11:01,812] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 5: [2023-05-10 10:11:01,812] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 5: [2023-05-10 10:11:01,812] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 11: [2023-05-10 10:11:01,813] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 5: [2023-05-10 10:11:01,813] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 28: [2023-05-10 10:11:01,814] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 11: [2023-05-10 10:11:01,815] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 30: [2023-05-10 10:11:01,815] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 28: [2023-05-10 10:11:01,815] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 28: [2023-05-10 10:11:01,816] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 28: [2023-05-10 10:11:01,816] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 28: [2023-05-10 10:11:01,816] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 30: [2023-05-10 10:11:01,817] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 30: [2023-05-10 10:11:01,819] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 30: [2023-05-10 10:11:01,821] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 30: [2023-05-10 10:11:01,821] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 15: [2023-05-10 10:11:01,823] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 15: [2023-05-10 10:11:01,823] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 5: [2023-05-10 10:11:01,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 5: [2023-05-10 10:11:01,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 12: [2023-05-10 10:11:01,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 12: [2023-05-10 10:11:01,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 12: [2023-05-10 10:11:01,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 12: [2023-05-10 10:11:01,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 12: [2023-05-10 10:11:01,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 12: [2023-05-10 10:11:01,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 12: [2023-05-10 10:11:01,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 12: [2023-05-10 10:11:01,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 5: [2023-05-10 10:11:01,827] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 5: [2023-05-10 10:11:01,827] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 5: [2023-05-10 10:11:01,827] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_33-model_00-model_states.pt. 15: [2023-05-10 10:11:01,828] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 15: [2023-05-10 10:11:01,828] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 12: [2023-05-10 10:11:01,830] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 5: [2023-05-10 10:11:01,837] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 12: [2023-05-10 10:11:01,836] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 12: [2023-05-10 10:11:01,837] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 12: [2023-05-10 10:11:01,838] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 12: [2023-05-10 10:11:01,838] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 5: [2023-05-10 10:11:01,838] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 12: [2023-05-10 10:11:01,838] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 5: [2023-05-10 10:11:01,838] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 12: [2023-05-10 10:11:01,839] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 12: [2023-05-10 10:11:01,840] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 30: [2023-05-10 10:11:01,843] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 30: [2023-05-10 10:11:01,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 30: [2023-05-10 10:11:01,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 30: [2023-05-10 10:11:01,845] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 5: [2023-05-10 10:11:01,851] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 5: [2023-05-10 10:11:01,853] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 5: [2023-05-10 10:11:01,856] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 5: [2023-05-10 10:11:01,857] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 5: [2023-05-10 10:11:01,858] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 23: [2023-05-10 10:11:01,866] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 23: [2023-05-10 10:11:01,866] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 23: [2023-05-10 10:11:01,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 23: [2023-05-10 10:11:01,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 23: [2023-05-10 10:11:01,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 23: [2023-05-10 10:11:01,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 23: [2023-05-10 10:11:01,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 23: [2023-05-10 10:11:01,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 23: [2023-05-10 10:11:01,872] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 23: [2023-05-10 10:11:01,872] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 23: [2023-05-10 10:11:01,872] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 23: [2023-05-10 10:11:01,876] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 23: [2023-05-10 10:11:01,877] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 23: [2023-05-10 10:11:01,877] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 23: [2023-05-10 10:11:01,877] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 23: [2023-05-10 10:11:01,878] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 12: [2023-05-10 10:11:01,881] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 12: [2023-05-10 10:11:01,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 12: [2023-05-10 10:11:01,910] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 12: [2023-05-10 10:11:01,910] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 12: [2023-05-10 10:11:01,910] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 23: [2023-05-10 10:11:01,918] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 12: [2023-05-10 10:11:01,924] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 12: [2023-05-10 10:11:01,924] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 12: [2023-05-10 10:11:01,924] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 12: [2023-05-10 10:11:01,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 23: [2023-05-10 10:11:01,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 23: [2023-05-10 10:11:01,928] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 8: [2023-05-10 10:11:01,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 8: [2023-05-10 10:11:01,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 8: [2023-05-10 10:11:01,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 8: [2023-05-10 10:11:01,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 8: [2023-05-10 10:11:01,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 8: [2023-05-10 10:11:01,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 8: [2023-05-10 10:11:01,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 8: [2023-05-10 10:11:01,935] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 8: [2023-05-10 10:11:01,939] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 8: [2023-05-10 10:11:01,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 12: [2023-05-10 10:11:01,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 29: [2023-05-10 10:11:01,940] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 29: [2023-05-10 10:11:01,940] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 23: [2023-05-10 10:11:01,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 8: [2023-05-10 10:11:01,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 12: [2023-05-10 10:11:01,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 29: [2023-05-10 10:11:01,941] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 29: [2023-05-10 10:11:01,941] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 8: [2023-05-10 10:11:01,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 8: [2023-05-10 10:11:01,942] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 8: [2023-05-10 10:11:01,942] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 29: [2023-05-10 10:11:01,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 29: [2023-05-10 10:11:01,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 29: [2023-05-10 10:11:01,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 8: [2023-05-10 10:11:01,942] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 29: [2023-05-10 10:11:01,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 8: [2023-05-10 10:11:01,942] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 12: [2023-05-10 10:11:01,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 23: [2023-05-10 10:11:01,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 23: [2023-05-10 10:11:01,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 23: [2023-05-10 10:11:01,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 23: [2023-05-10 10:11:01,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 29: [2023-05-10 10:11:01,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 23: [2023-05-10 10:11:01,945] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 29: [2023-05-10 10:11:01,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 29: [2023-05-10 10:11:01,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 29: [2023-05-10 10:11:01,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 29: [2023-05-10 10:11:01,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 29: [2023-05-10 10:11:01,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 23: [2023-05-10 10:11:01,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 29: [2023-05-10 10:11:01,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 23: [2023-05-10 10:11:01,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 29: [2023-05-10 10:11:01,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 12: [2023-05-10 10:11:01,957] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 6: [2023-05-10 10:11:01,958] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 6: [2023-05-10 10:11:01,958] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 6: [2023-05-10 10:11:01,958] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 6: [2023-05-10 10:11:01,959] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 6: [2023-05-10 10:11:01,959] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 6: [2023-05-10 10:11:01,959] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 6: [2023-05-10 10:11:01,959] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 6: [2023-05-10 10:11:01,959] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 12: [2023-05-10 10:11:01,960] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 12: [2023-05-10 10:11:01,960] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 12: [2023-05-10 10:11:01,960] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 6: [2023-05-10 10:11:01,963] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 6: [2023-05-10 10:11:01,963] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 10: [2023-05-10 10:11:01,965] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 10: [2023-05-10 10:11:01,965] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 10: [2023-05-10 10:11:01,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 10: [2023-05-10 10:11:01,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 10: [2023-05-10 10:11:01,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 10: [2023-05-10 10:11:01,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 10: [2023-05-10 10:11:01,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 10: [2023-05-10 10:11:01,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 6: [2023-05-10 10:11:01,966] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 6: [2023-05-10 10:11:01,967] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 31: [2023-05-10 10:11:01,967] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 31: [2023-05-10 10:11:01,967] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 31: [2023-05-10 10:11:01,968] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 31: [2023-05-10 10:11:01,968] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 31: [2023-05-10 10:11:01,968] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 31: [2023-05-10 10:11:01,968] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 31: [2023-05-10 10:11:01,968] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 6: [2023-05-10 10:11:01,969] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 31: [2023-05-10 10:11:01,969] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 6: [2023-05-10 10:11:01,969] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 6: [2023-05-10 10:11:01,970] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 6: [2023-05-10 10:11:01,970] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 23: [2023-05-10 10:11:01,970] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 10: [2023-05-10 10:11:01,972] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 31: [2023-05-10 10:11:01,972] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 10: [2023-05-10 10:11:01,973] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 31: [2023-05-10 10:11:01,973] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 10: [2023-05-10 10:11:01,973] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 23: [2023-05-10 10:11:01,974] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 10: [2023-05-10 10:11:01,974] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 10: [2023-05-10 10:11:01,974] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 23: [2023-05-10 10:11:01,974] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 10: [2023-05-10 10:11:01,975] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 10: [2023-05-10 10:11:01,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 23: [2023-05-10 10:11:01,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 10: [2023-05-10 10:11:01,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 31: [2023-05-10 10:11:01,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 31: [2023-05-10 10:11:01,977] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 31: [2023-05-10 10:11:01,977] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 31: [2023-05-10 10:11:01,977] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 31: [2023-05-10 10:11:01,977] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 23: [2023-05-10 10:11:01,978] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 31: [2023-05-10 10:11:01,978] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 17: [2023-05-10 10:11:01,985] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 17: [2023-05-10 10:11:01,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 17: [2023-05-10 10:11:01,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 17: [2023-05-10 10:11:01,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 17: [2023-05-10 10:11:01,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 17: [2023-05-10 10:11:01,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 17: [2023-05-10 10:11:01,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 17: [2023-05-10 10:11:01,987] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 17: [2023-05-10 10:11:01,988] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 19: [2023-05-10 10:11:01,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 19: [2023-05-10 10:11:01,990] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 8: [2023-05-10 10:11:01,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 19: [2023-05-10 10:11:01,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 29: [2023-05-10 10:11:01,993] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 19: [2023-05-10 10:11:01,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 19: [2023-05-10 10:11:01,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 19: [2023-05-10 10:11:01,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 19: [2023-05-10 10:11:01,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 19: [2023-05-10 10:11:01,994] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 17: [2023-05-10 10:11:01,994] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 19: [2023-05-10 10:11:01,996] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 19: [2023-05-10 10:11:01,996] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 8: [2023-05-10 10:11:01,996] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 8: [2023-05-10 10:11:01,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 19: [2023-05-10 10:11:01,998] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 17: [2023-05-10 10:11:01,998] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 17: [2023-05-10 10:11:01,998] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 17: [2023-05-10 10:11:01,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 19: [2023-05-10 10:11:01,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 19: [2023-05-10 10:11:01,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 17: [2023-05-10 10:11:01,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 17: [2023-05-10 10:11:02,000] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 8: [2023-05-10 10:11:02,001] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 17: [2023-05-10 10:11:02,002] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 19: [2023-05-10 10:11:02,002] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 19: [2023-05-10 10:11:02,002] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 19: [2023-05-10 10:11:02,003] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 29: [2023-05-10 10:11:02,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 29: [2023-05-10 10:11:02,004] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 8: [2023-05-10 10:11:02,011] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 8: [2023-05-10 10:11:02,011] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 8: [2023-05-10 10:11:02,011] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 8: [2023-05-10 10:11:02,011] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 6: [2023-05-10 10:11:02,015] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 29: [2023-05-10 10:11:02,015] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 6: [2023-05-10 10:11:02,016] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 29: [2023-05-10 10:11:02,017] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 29: [2023-05-10 10:11:02,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 29: [2023-05-10 10:11:02,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 29: [2023-05-10 10:11:02,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 29: [2023-05-10 10:11:02,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 8: [2023-05-10 10:11:02,021] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 8: [2023-05-10 10:11:02,024] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 8: [2023-05-10 10:11:02,024] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 31: [2023-05-10 10:11:02,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 31: [2023-05-10 10:11:02,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 6: [2023-05-10 10:11:02,026] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 10: [2023-05-10 10:11:02,027] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 29: [2023-05-10 10:11:02,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 10: [2023-05-10 10:11:02,029] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 10: [2023-05-10 10:11:02,029] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 8: [2023-05-10 10:11:02,029] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 29: [2023-05-10 10:11:02,031] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 17: [2023-05-10 10:11:02,031] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 6: [2023-05-10 10:11:02,034] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 10: [2023-05-10 10:11:02,035] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 6: [2023-05-10 10:11:02,036] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 6: [2023-05-10 10:11:02,037] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 8: [2023-05-10 10:11:02,037] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 8: [2023-05-10 10:11:02,038] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 8: [2023-05-10 10:11:02,038] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 7: [2023-05-10 10:11:02,039] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 7: [2023-05-10 10:11:02,039] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 7: [2023-05-10 10:11:02,039] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 7: [2023-05-10 10:11:02,039] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 7: [2023-05-10 10:11:02,039] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 7: [2023-05-10 10:11:02,039] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 7: [2023-05-10 10:11:02,039] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 8: [2023-05-10 10:11:02,039] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 7: [2023-05-10 10:11:02,039] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 9: [2023-05-10 10:11:02,041] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 9: [2023-05-10 10:11:02,041] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 9: [2023-05-10 10:11:02,042] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 9: [2023-05-10 10:11:02,042] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 6: [2023-05-10 10:11:02,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 9: [2023-05-10 10:11:02,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 9: [2023-05-10 10:11:02,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 9: [2023-05-10 10:11:02,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 1: [2023-05-10 10:11:02,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 1: [2023-05-10 10:11:02,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 9: [2023-05-10 10:11:02,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 10: [2023-05-10 10:11:02,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 10: [2023-05-10 10:11:02,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 10: [2023-05-10 10:11:02,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 10: [2023-05-10 10:11:02,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 1: [2023-05-10 10:11:02,044] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 1: [2023-05-10 10:11:02,044] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 1: [2023-05-10 10:11:02,044] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 1: [2023-05-10 10:11:02,044] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 1: [2023-05-10 10:11:02,044] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 7: [2023-05-10 10:11:02,044] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 1: [2023-05-10 10:11:02,044] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 7: [2023-05-10 10:11:02,045] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 7: [2023-05-10 10:11:02,045] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 2: [2023-05-10 10:11:02,045] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 2: [2023-05-10 10:11:02,045] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 7: [2023-05-10 10:11:02,045] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 7: [2023-05-10 10:11:02,045] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 31: [2023-05-10 10:11:02,046] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 31: [2023-05-10 10:11:02,046] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 31: [2023-05-10 10:11:02,046] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 7: [2023-05-10 10:11:02,047] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 7: [2023-05-10 10:11:02,048] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 19: [2023-05-10 10:11:02,048] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 31: [2023-05-10 10:11:02,048] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 31: [2023-05-10 10:11:02,048] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 31: [2023-05-10 10:11:02,048] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 31: [2023-05-10 10:11:02,048] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 31: [2023-05-10 10:11:02,048] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 7: [2023-05-10 10:11:02,049] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 19: [2023-05-10 10:11:02,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 29: [2023-05-10 10:11:02,049] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 9: [2023-05-10 10:11:02,049] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 9: [2023-05-10 10:11:02,049] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 9: [2023-05-10 10:11:02,049] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 2: [2023-05-10 10:11:02,049] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 2: [2023-05-10 10:11:02,050] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 2: [2023-05-10 10:11:02,050] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 2: [2023-05-10 10:11:02,050] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 2: [2023-05-10 10:11:02,050] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 2: [2023-05-10 10:11:02,050] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 9: [2023-05-10 10:11:02,050] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 2: [2023-05-10 10:11:02,050] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 9: [2023-05-10 10:11:02,050] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 9: [2023-05-10 10:11:02,050] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 2: [2023-05-10 10:11:02,050] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 1: [2023-05-10 10:11:02,050] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 1: [2023-05-10 10:11:02,051] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 29: [2023-05-10 10:11:02,051] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 1: [2023-05-10 10:11:02,051] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 1: [2023-05-10 10:11:02,051] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 17: [2023-05-10 10:11:02,051] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 9: [2023-05-10 10:11:02,051] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 1: [2023-05-10 10:11:02,052] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 9: [2023-05-10 10:11:02,052] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 1: [2023-05-10 10:11:02,052] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 1: [2023-05-10 10:11:02,053] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 1: [2023-05-10 10:11:02,053] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 6: [2023-05-10 10:11:02,053] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 6: [2023-05-10 10:11:02,053] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 6: [2023-05-10 10:11:02,053] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 6: [2023-05-10 10:11:02,053] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 10: [2023-05-10 10:11:02,053] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 10: [2023-05-10 10:11:02,053] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 19: [2023-05-10 10:11:02,053] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 29: [2023-05-10 10:11:02,053] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 29: [2023-05-10 10:11:02,053] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 29: [2023-05-10 10:11:02,054] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 19: [2023-05-10 10:11:02,056] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 21: [2023-05-10 10:11:02,056] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 21: [2023-05-10 10:11:02,056] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 19: [2023-05-10 10:11:02,056] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 21: [2023-05-10 10:11:02,056] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 21: [2023-05-10 10:11:02,057] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 21: [2023-05-10 10:11:02,057] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 21: [2023-05-10 10:11:02,057] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 21: [2023-05-10 10:11:02,057] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 21: [2023-05-10 10:11:02,057] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 10: [2023-05-10 10:11:02,057] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 17: [2023-05-10 10:11:02,057] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 2: [2023-05-10 10:11:02,058] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 2: [2023-05-10 10:11:02,058] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 2: [2023-05-10 10:11:02,060] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 2: [2023-05-10 10:11:02,060] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 2: [2023-05-10 10:11:02,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 2: [2023-05-10 10:11:02,061] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 10: [2023-05-10 10:11:02,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 21: [2023-05-10 10:11:02,063] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 21: [2023-05-10 10:11:02,063] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 21: [2023-05-10 10:11:02,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 21: [2023-05-10 10:11:02,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 21: [2023-05-10 10:11:02,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 21: [2023-05-10 10:11:02,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 21: [2023-05-10 10:11:02,064] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 21: [2023-05-10 10:11:02,066] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 19: [2023-05-10 10:11:02,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 19: [2023-05-10 10:11:02,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 6: [2023-05-10 10:11:02,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 19: [2023-05-10 10:11:02,069] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 19: [2023-05-10 10:11:02,070] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 6: [2023-05-10 10:11:02,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 15: [2023-05-10 10:11:02,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 15: [2023-05-10 10:11:02,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 15: [2023-05-10 10:11:02,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 15: [2023-05-10 10:11:02,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 15: [2023-05-10 10:11:02,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 15: [2023-05-10 10:11:02,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 15: [2023-05-10 10:11:02,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 15: [2023-05-10 10:11:02,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 10: [2023-05-10 10:11:02,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 10: [2023-05-10 10:11:02,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 26: [2023-05-10 10:11:02,074] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 26: [2023-05-10 10:11:02,074] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 10: [2023-05-10 10:11:02,075] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 26: [2023-05-10 10:11:02,075] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 26: [2023-05-10 10:11:02,075] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 26: [2023-05-10 10:11:02,075] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 26: [2023-05-10 10:11:02,075] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 26: [2023-05-10 10:11:02,075] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 10: [2023-05-10 10:11:02,075] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 26: [2023-05-10 10:11:02,075] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 15: [2023-05-10 10:11:02,076] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 17: [2023-05-10 10:11:02,077] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 19: [2023-05-10 10:11:02,078] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 26: [2023-05-10 10:11:02,079] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 17: [2023-05-10 10:11:02,079] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 26: [2023-05-10 10:11:02,079] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 15: [2023-05-10 10:11:02,080] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 19: [2023-05-10 10:11:02,081] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 31: [2023-05-10 10:11:02,082] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 31: [2023-05-10 10:11:02,082] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 26: [2023-05-10 10:11:02,082] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 31: [2023-05-10 10:11:02,083] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 15: [2023-05-10 10:11:02,083] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 15: [2023-05-10 10:11:02,083] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 19: [2023-05-10 10:11:02,083] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 6: [2023-05-10 10:11:02,083] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 17: [2023-05-10 10:11:02,085] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 15: [2023-05-10 10:11:02,084] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 15: [2023-05-10 10:11:02,085] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 15: [2023-05-10 10:11:02,085] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 26: [2023-05-10 10:11:02,085] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 26: [2023-05-10 10:11:02,085] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 6: [2023-05-10 10:11:02,085] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 26: [2023-05-10 10:11:02,085] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 31: [2023-05-10 10:11:02,086] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 31: [2023-05-10 10:11:02,086] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 31: [2023-05-10 10:11:02,086] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 19: [2023-05-10 10:11:02,086] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 26: [2023-05-10 10:11:02,086] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 26: [2023-05-10 10:11:02,087] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 15: [2023-05-10 10:11:02,087] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 6: [2023-05-10 10:11:02,087] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 17: [2023-05-10 10:11:02,087] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 17: [2023-05-10 10:11:02,092] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 17: [2023-05-10 10:11:02,092] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 2: [2023-05-10 10:11:02,093] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 19: [2023-05-10 10:11:02,094] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 7: [2023-05-10 10:11:02,096] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 19: [2023-05-10 10:11:02,097] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 19: [2023-05-10 10:11:02,098] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 17: [2023-05-10 10:11:02,100] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 7: [2023-05-10 10:11:02,102] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 2: [2023-05-10 10:11:02,104] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 7: [2023-05-10 10:11:02,104] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 7: [2023-05-10 10:11:02,104] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 7: [2023-05-10 10:11:02,105] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 1: [2023-05-10 10:11:02,105] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 1: [2023-05-10 10:11:02,105] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 1: [2023-05-10 10:11:02,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 7: [2023-05-10 10:11:02,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 7: [2023-05-10 10:11:02,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 7: [2023-05-10 10:11:02,107] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 1: [2023-05-10 10:11:02,108] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 9: [2023-05-10 10:11:02,108] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 9: [2023-05-10 10:11:02,108] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 9: [2023-05-10 10:11:02,108] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 9: [2023-05-10 10:11:02,108] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 17: [2023-05-10 10:11:02,111] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 9: [2023-05-10 10:11:02,110] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 9: [2023-05-10 10:11:02,110] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 9: [2023-05-10 10:11:02,110] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 17: [2023-05-10 10:11:02,111] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 2: [2023-05-10 10:11:02,113] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 9: [2023-05-10 10:11:02,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 21: [2023-05-10 10:11:02,114] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 1: [2023-05-10 10:11:02,117] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 17: [2023-05-10 10:11:02,119] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 7: [2023-05-10 10:11:02,120] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 21: [2023-05-10 10:11:02,121] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 21: [2023-05-10 10:11:02,121] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 1: [2023-05-10 10:11:02,122] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 1: [2023-05-10 10:11:02,122] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 17: [2023-05-10 10:11:02,122] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 2: [2023-05-10 10:11:02,124] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 17: [2023-05-10 10:11:02,125] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 15: [2023-05-10 10:11:02,124] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 1: [2023-05-10 10:11:02,125] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 21: [2023-05-10 10:11:02,125] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 2: [2023-05-10 10:11:02,126] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 2: [2023-05-10 10:11:02,128] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 21: [2023-05-10 10:11:02,128] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 17: [2023-05-10 10:11:02,129] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 7: [2023-05-10 10:11:02,129] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 1: [2023-05-10 10:11:02,131] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 26: [2023-05-10 10:11:02,131] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 26: [2023-05-10 10:11:02,131] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 1: [2023-05-10 10:11:02,131] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 21: [2023-05-10 10:11:02,131] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 1: [2023-05-10 10:11:02,135] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 7: [2023-05-10 10:11:02,135] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 7: [2023-05-10 10:11:02,136] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 7: [2023-05-10 10:11:02,136] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 7: [2023-05-10 10:11:02,137] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 1: [2023-05-10 10:11:02,137] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 21: [2023-05-10 10:11:02,137] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 21: [2023-05-10 10:11:02,137] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 9: [2023-05-10 10:11:02,138] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 7: [2023-05-10 10:11:02,138] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 2: [2023-05-10 10:11:02,138] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 2: [2023-05-10 10:11:02,138] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 2: [2023-05-10 10:11:02,138] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 2: [2023-05-10 10:11:02,138] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 9: [2023-05-10 10:11:02,138] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 9: [2023-05-10 10:11:02,139] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 9: [2023-05-10 10:11:02,139] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 9: [2023-05-10 10:11:02,139] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 7: [2023-05-10 10:11:02,139] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 9: [2023-05-10 10:11:02,140] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 21: [2023-05-10 10:11:02,140] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 27: [2023-05-10 10:11:02,141] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 27: [2023-05-10 10:11:02,141] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 27: [2023-05-10 10:11:02,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 27: [2023-05-10 10:11:02,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 15: [2023-05-10 10:11:02,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 27: [2023-05-10 10:11:02,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 27: [2023-05-10 10:11:02,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 27: [2023-05-10 10:11:02,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 27: [2023-05-10 10:11:02,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 9: [2023-05-10 10:11:02,144] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 9: [2023-05-10 10:11:02,144] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 26: [2023-05-10 10:11:02,145] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 27: [2023-05-10 10:11:02,146] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 27: [2023-05-10 10:11:02,146] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 18: [2023-05-10 10:11:02,146] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 18: [2023-05-10 10:11:02,146] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 18: [2023-05-10 10:11:02,146] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 18: [2023-05-10 10:11:02,146] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 1: [2023-05-10 10:11:02,146] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 18: [2023-05-10 10:11:02,146] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 18: [2023-05-10 10:11:02,146] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 18: [2023-05-10 10:11:02,146] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 18: [2023-05-10 10:11:02,146] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 27: [2023-05-10 10:11:02,146] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 15: [2023-05-10 10:11:02,147] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 21: [2023-05-10 10:11:02,147] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 1: [2023-05-10 10:11:02,150] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 27: [2023-05-10 10:11:02,150] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 27: [2023-05-10 10:11:02,150] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 2: [2023-05-10 10:11:02,151] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 27: [2023-05-10 10:11:02,151] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 27: [2023-05-10 10:11:02,151] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 1: [2023-05-10 10:11:02,152] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 18: [2023-05-10 10:11:02,151] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 18: [2023-05-10 10:11:02,152] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 27: [2023-05-10 10:11:02,152] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 18: [2023-05-10 10:11:02,152] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 18: [2023-05-10 10:11:02,152] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 1: [2023-05-10 10:11:02,152] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 21: [2023-05-10 10:11:02,153] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 21: [2023-05-10 10:11:02,153] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 18: [2023-05-10 10:11:02,153] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 18: [2023-05-10 10:11:02,154] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 26: [2023-05-10 10:11:02,154] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 18: [2023-05-10 10:11:02,155] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 18: [2023-05-10 10:11:02,155] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 26: [2023-05-10 10:11:02,155] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 15: [2023-05-10 10:11:02,156] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 20: [2023-05-10 10:11:02,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 20: [2023-05-10 10:11:02,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 20: [2023-05-10 10:11:02,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 26: [2023-05-10 10:11:02,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 26: [2023-05-10 10:11:02,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 26: [2023-05-10 10:11:02,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 26: [2023-05-10 10:11:02,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 26: [2023-05-10 10:11:02,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 21: [2023-05-10 10:11:02,157] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 20: [2023-05-10 10:11:02,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 20: [2023-05-10 10:11:02,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 20: [2023-05-10 10:11:02,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 20: [2023-05-10 10:11:02,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 20: [2023-05-10 10:11:02,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 2: [2023-05-10 10:11:02,159] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 20: [2023-05-10 10:11:02,160] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 20: [2023-05-10 10:11:02,160] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 21: [2023-05-10 10:11:02,165] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 15: [2023-05-10 10:11:02,165] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 20: [2023-05-10 10:11:02,165] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 0: [2023-05-10 10:11:02,167] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 0: [2023-05-10 10:11:02,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 0: [2023-05-10 10:11:02,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 20: [2023-05-10 10:11:02,167] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 20: [2023-05-10 10:11:02,168] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 0: [2023-05-10 10:11:02,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 0: [2023-05-10 10:11:02,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 0: [2023-05-10 10:11:02,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 0: [2023-05-10 10:11:02,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 0: [2023-05-10 10:11:02,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 20: [2023-05-10 10:11:02,168] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 20: [2023-05-10 10:11:02,169] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 21: [2023-05-10 10:11:02,169] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 21: [2023-05-10 10:11:02,170] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 15: [2023-05-10 10:11:02,170] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 20: [2023-05-10 10:11:02,170] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 26: [2023-05-10 10:11:02,170] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 2: [2023-05-10 10:11:02,171] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 2: [2023-05-10 10:11:02,172] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 0: [2023-05-10 10:11:02,172] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 15: [2023-05-10 10:11:02,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 15: [2023-05-10 10:11:02,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 15: [2023-05-10 10:11:02,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 15: [2023-05-10 10:11:02,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 2: [2023-05-10 10:11:02,173] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 0: [2023-05-10 10:11:02,173] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 0: [2023-05-10 10:11:02,173] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 0: [2023-05-10 10:11:02,174] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 0: [2023-05-10 10:11:02,174] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 0: [2023-05-10 10:11:02,175] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 0: [2023-05-10 10:11:02,175] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 2: [2023-05-10 10:11:02,175] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 0: [2023-05-10 10:11:02,175] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 15: [2023-05-10 10:11:02,189] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 26: [2023-05-10 10:11:02,192] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 26: [2023-05-10 10:11:02,192] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 27: [2023-05-10 10:11:02,192] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 27: [2023-05-10 10:11:02,194] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 26: [2023-05-10 10:11:02,194] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 26: [2023-05-10 10:11:02,195] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 26: [2023-05-10 10:11:02,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 15: [2023-05-10 10:11:02,198] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 27: [2023-05-10 10:11:02,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 24: [2023-05-10 10:11:02,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 24: [2023-05-10 10:11:02,199] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 24: [2023-05-10 10:11:02,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 24: [2023-05-10 10:11:02,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 24: [2023-05-10 10:11:02,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 24: [2023-05-10 10:11:02,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 24: [2023-05-10 10:11:02,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 24: [2023-05-10 10:11:02,200] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 18: [2023-05-10 10:11:02,201] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 15: [2023-05-10 10:11:02,203] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 24: [2023-05-10 10:11:02,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 15: [2023-05-10 10:11:02,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 24: [2023-05-10 10:11:02,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 13: [2023-05-10 10:11:02,205] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 13: [2023-05-10 10:11:02,205] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 13: [2023-05-10 10:11:02,205] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 13: [2023-05-10 10:11:02,205] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 13: [2023-05-10 10:11:02,205] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 13: [2023-05-10 10:11:02,205] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 13: [2023-05-10 10:11:02,205] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 13: [2023-05-10 10:11:02,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 15: [2023-05-10 10:11:02,206] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 15: [2023-05-10 10:11:02,206] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 20: [2023-05-10 10:11:02,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 24: [2023-05-10 10:11:02,207] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 24: [2023-05-10 10:11:02,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 18: [2023-05-10 10:11:02,208] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 24: [2023-05-10 10:11:02,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 24: [2023-05-10 10:11:02,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 24: [2023-05-10 10:11:02,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 24: [2023-05-10 10:11:02,210] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 20: [2023-05-10 10:11:02,212] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 18: [2023-05-10 10:11:02,213] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 18: [2023-05-10 10:11:02,213] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 18: [2023-05-10 10:11:02,213] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 18: [2023-05-10 10:11:02,213] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 25: [2023-05-10 10:11:02,216] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 25: [2023-05-10 10:11:02,217] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 27: [2023-05-10 10:11:02,216] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 25: [2023-05-10 10:11:02,217] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 25: [2023-05-10 10:11:02,217] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 25: [2023-05-10 10:11:02,217] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 25: [2023-05-10 10:11:02,217] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 25: [2023-05-10 10:11:02,217] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 25: [2023-05-10 10:11:02,217] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 13: [2023-05-10 10:11:02,218] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 13: [2023-05-10 10:11:02,218] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 13: [2023-05-10 10:11:02,218] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 13: [2023-05-10 10:11:02,218] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 13: [2023-05-10 10:11:02,218] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 27: [2023-05-10 10:11:02,219] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 27: [2023-05-10 10:11:02,219] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 13: [2023-05-10 10:11:02,219] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 13: [2023-05-10 10:11:02,220] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 13: [2023-05-10 10:11:02,220] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 18: [2023-05-10 10:11:02,221] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 18: [2023-05-10 10:11:02,221] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 25: [2023-05-10 10:11:02,221] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 25: [2023-05-10 10:11:02,221] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 25: [2023-05-10 10:11:02,222] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 27: [2023-05-10 10:11:02,223] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 27: [2023-05-10 10:11:02,223] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 27: [2023-05-10 10:11:02,223] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 27: [2023-05-10 10:11:02,223] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 27: [2023-05-10 10:11:02,224] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 25: [2023-05-10 10:11:02,224] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 25: [2023-05-10 10:11:02,224] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 25: [2023-05-10 10:11:02,224] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 25: [2023-05-10 10:11:02,224] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 25: [2023-05-10 10:11:02,225] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 0: [2023-05-10 10:11:02,226] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 18: [2023-05-10 10:11:02,228] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 20: [2023-05-10 10:11:02,229] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 20: [2023-05-10 10:11:02,232] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 28: [2023-05-10 10:11:02,233] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 28: [2023-05-10 10:11:02,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 28: [2023-05-10 10:11:02,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 28: [2023-05-10 10:11:02,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 28: [2023-05-10 10:11:02,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 28: [2023-05-10 10:11:02,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 28: [2023-05-10 10:11:02,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 28: [2023-05-10 10:11:02,235] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 0: [2023-05-10 10:11:02,235] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 0: [2023-05-10 10:11:02,235] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 0: [2023-05-10 10:11:02,236] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 0: [2023-05-10 10:11:02,236] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 28: [2023-05-10 10:11:02,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 20: [2023-05-10 10:11:02,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 28: [2023-05-10 10:11:02,238] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 20: [2023-05-10 10:11:02,238] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 18: [2023-05-10 10:11:02,240] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 22: [2023-05-10 10:11:02,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 22: [2023-05-10 10:11:02,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 18: [2023-05-10 10:11:02,240] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 27: [2023-05-10 10:11:02,240] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 22: [2023-05-10 10:11:02,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 22: [2023-05-10 10:11:02,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 22: [2023-05-10 10:11:02,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 22: [2023-05-10 10:11:02,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 22: [2023-05-10 10:11:02,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 22: [2023-05-10 10:11:02,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 3: [2023-05-10 10:11:02,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 3: [2023-05-10 10:11:02,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 3: [2023-05-10 10:11:02,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 3: [2023-05-10 10:11:02,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 3: [2023-05-10 10:11:02,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 3: [2023-05-10 10:11:02,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 3: [2023-05-10 10:11:02,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 3: [2023-05-10 10:11:02,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 28: [2023-05-10 10:11:02,243] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 28: [2023-05-10 10:11:02,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 28: [2023-05-10 10:11:02,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 20: [2023-05-10 10:11:02,245] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 3: [2023-05-10 10:11:02,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 18: [2023-05-10 10:11:02,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 18: [2023-05-10 10:11:02,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 18: [2023-05-10 10:11:02,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 28: [2023-05-10 10:11:02,246] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 3: [2023-05-10 10:11:02,246] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 22: [2023-05-10 10:11:02,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 22: [2023-05-10 10:11:02,246] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 28: [2023-05-10 10:11:02,247] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 28: [2023-05-10 10:11:02,247] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 3: [2023-05-10 10:11:02,248] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 0: [2023-05-10 10:11:02,249] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 0: [2023-05-10 10:11:02,249] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 3: [2023-05-10 10:11:02,249] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 22: [2023-05-10 10:11:02,249] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 18: [2023-05-10 10:11:02,249] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 18: [2023-05-10 10:11:02,249] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 0: [2023-05-10 10:11:02,249] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 3: [2023-05-10 10:11:02,250] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 22: [2023-05-10 10:11:02,250] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 0: [2023-05-10 10:11:02,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 3: [2023-05-10 10:11:02,251] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 20: [2023-05-10 10:11:02,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 20: [2023-05-10 10:11:02,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 27: [2023-05-10 10:11:02,251] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 3: [2023-05-10 10:11:02,251] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 20: [2023-05-10 10:11:02,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 22: [2023-05-10 10:11:02,252] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 3: [2023-05-10 10:11:02,252] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 22: [2023-05-10 10:11:02,252] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 22: [2023-05-10 10:11:02,252] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 22: [2023-05-10 10:11:02,253] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 24: [2023-05-10 10:11:02,253] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 27: [2023-05-10 10:11:02,255] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 24: [2023-05-10 10:11:02,255] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 27: [2023-05-10 10:11:02,256] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 27: [2023-05-10 10:11:02,258] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 20: [2023-05-10 10:11:02,264] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 16: [2023-05-10 10:11:02,264] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 16: [2023-05-10 10:11:02,264] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 16: [2023-05-10 10:11:02,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 16: [2023-05-10 10:11:02,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 16: [2023-05-10 10:11:02,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 16: [2023-05-10 10:11:02,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 16: [2023-05-10 10:11:02,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 0: [2023-05-10 10:11:02,265] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 0: [2023-05-10 10:11:02,265] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 16: [2023-05-10 10:11:02,265] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 0: [2023-05-10 10:11:02,266] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 0: [2023-05-10 10:11:02,267] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 4: [2023-05-10 10:11:02,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 4: [2023-05-10 10:11:02,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 14: [2023-05-10 10:11:02,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 14: [2023-05-10 10:11:02,270] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 16: [2023-05-10 10:11:02,270] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 16: [2023-05-10 10:11:02,270] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 25: [2023-05-10 10:11:02,271] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 16: [2023-05-10 10:11:02,271] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 4: [2023-05-10 10:11:02,271] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 4: [2023-05-10 10:11:02,271] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 4: [2023-05-10 10:11:02,271] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 4: [2023-05-10 10:11:02,272] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 4: [2023-05-10 10:11:02,272] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 4: [2023-05-10 10:11:02,272] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 14: [2023-05-10 10:11:02,272] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 14: [2023-05-10 10:11:02,272] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 14: [2023-05-10 10:11:02,272] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 14: [2023-05-10 10:11:02,272] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 14: [2023-05-10 10:11:02,272] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 20: [2023-05-10 10:11:02,272] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 14: [2023-05-10 10:11:02,272] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 25: [2023-05-10 10:11:02,273] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 16: [2023-05-10 10:11:02,273] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 16: [2023-05-10 10:11:02,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 11: [2023-05-10 10:11:02,274] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 11: [2023-05-10 10:11:02,274] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 16: [2023-05-10 10:11:02,275] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 16: [2023-05-10 10:11:02,275] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 11: [2023-05-10 10:11:02,276] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 11: [2023-05-10 10:11:02,276] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 11: [2023-05-10 10:11:02,276] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 16: [2023-05-10 10:11:02,276] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 11: [2023-05-10 10:11:02,276] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 11: [2023-05-10 10:11:02,276] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 11: [2023-05-10 10:11:02,276] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 24: [2023-05-10 10:11:02,276] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 4: [2023-05-10 10:11:02,276] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 4: [2023-05-10 10:11:02,276] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 4: [2023-05-10 10:11:02,276] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 0: [2023-05-10 10:11:02,277] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 0: [2023-05-10 10:11:02,277] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 4: [2023-05-10 10:11:02,277] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 11: [2023-05-10 10:11:02,277] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 24: [2023-05-10 10:11:02,278] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 0: [2023-05-10 10:11:02,278] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 20: [2023-05-10 10:11:02,278] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 11: [2023-05-10 10:11:02,279] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 14: [2023-05-10 10:11:02,279] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 14: [2023-05-10 10:11:02,279] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 14: [2023-05-10 10:11:02,279] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 14: [2023-05-10 10:11:02,279] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 14: [2023-05-10 10:11:02,279] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 4: [2023-05-10 10:11:02,279] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 4: [2023-05-10 10:11:02,279] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 4: [2023-05-10 10:11:02,280] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 4: [2023-05-10 10:11:02,280] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 14: [2023-05-10 10:11:02,281] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 28: [2023-05-10 10:11:02,281] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 24: [2023-05-10 10:11:02,281] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 24: [2023-05-10 10:11:02,281] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 24: [2023-05-10 10:11:02,281] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 24: [2023-05-10 10:11:02,281] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 24: [2023-05-10 10:11:02,281] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 24: [2023-05-10 10:11:02,281] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 14: [2023-05-10 10:11:02,281] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 20: [2023-05-10 10:11:02,282] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 20: [2023-05-10 10:11:02,282] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 14: [2023-05-10 10:11:02,282] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 11: [2023-05-10 10:11:02,283] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 11: [2023-05-10 10:11:02,284] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 20: [2023-05-10 10:11:02,284] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 30: [2023-05-10 10:11:02,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 30: [2023-05-10 10:11:02,284] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 30: [2023-05-10 10:11:02,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 30: [2023-05-10 10:11:02,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 30: [2023-05-10 10:11:02,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 30: [2023-05-10 10:11:02,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 30: [2023-05-10 10:11:02,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 30: [2023-05-10 10:11:02,285] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 25: [2023-05-10 10:11:02,286] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 11: [2023-05-10 10:11:02,286] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 11: [2023-05-10 10:11:02,287] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 11: [2023-05-10 10:11:02,287] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 11: [2023-05-10 10:11:02,288] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 25: [2023-05-10 10:11:02,291] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 25: [2023-05-10 10:11:02,291] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 25: [2023-05-10 10:11:02,292] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 30: [2023-05-10 10:11:02,292] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 3: [2023-05-10 10:11:02,293] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 25: [2023-05-10 10:11:02,293] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 25: [2023-05-10 10:11:02,293] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 25: [2023-05-10 10:11:02,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 13: [2023-05-10 10:11:02,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 28: [2023-05-10 10:11:02,296] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 13: [2023-05-10 10:11:02,296] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 13: [2023-05-10 10:11:02,296] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 25: [2023-05-10 10:11:02,296] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 30: [2023-05-10 10:11:02,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 30: [2023-05-10 10:11:02,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 30: [2023-05-10 10:11:02,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 30: [2023-05-10 10:11:02,298] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 30: [2023-05-10 10:11:02,298] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 22: [2023-05-10 10:11:02,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 22: [2023-05-10 10:11:02,299] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 30: [2023-05-10 10:11:02,299] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 30: [2023-05-10 10:11:02,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 28: [2023-05-10 10:11:02,302] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 13: [2023-05-10 10:11:02,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 13: [2023-05-10 10:11:02,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 13: [2023-05-10 10:11:02,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 13: [2023-05-10 10:11:02,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 13: [2023-05-10 10:11:02,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 3: [2023-05-10 10:11:02,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 28: [2023-05-10 10:11:02,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 3: [2023-05-10 10:11:02,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 3: [2023-05-10 10:11:02,308] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 24: [2023-05-10 10:11:02,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 24: [2023-05-10 10:11:02,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 3: [2023-05-10 10:11:02,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 25: [2023-05-10 10:11:02,314] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 24: [2023-05-10 10:11:02,314] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 3: [2023-05-10 10:11:02,314] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 24: [2023-05-10 10:11:02,314] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 24: [2023-05-10 10:11:02,315] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 24: [2023-05-10 10:11:02,316] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 28: [2023-05-10 10:11:02,316] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 25: [2023-05-10 10:11:02,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 28: [2023-05-10 10:11:02,319] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 28: [2023-05-10 10:11:02,319] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 28: [2023-05-10 10:11:02,319] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 28: [2023-05-10 10:11:02,320] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 28: [2023-05-10 10:11:02,320] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 25: [2023-05-10 10:11:02,320] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 22: [2023-05-10 10:11:02,322] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 22: [2023-05-10 10:11:02,322] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 11: [2023-05-10 10:11:02,322] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 25: [2023-05-10 10:11:02,322] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 25: [2023-05-10 10:11:02,322] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 25: [2023-05-10 10:11:02,323] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 3: [2023-05-10 10:11:02,323] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 3: [2023-05-10 10:11:02,323] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 3: [2023-05-10 10:11:02,323] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 16: [2023-05-10 10:11:02,323] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 22: [2023-05-10 10:11:02,324] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 22: [2023-05-10 10:11:02,324] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 22: [2023-05-10 10:11:02,324] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 22: [2023-05-10 10:11:02,324] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 22: [2023-05-10 10:11:02,324] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 16: [2023-05-10 10:11:02,324] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 13: [2023-05-10 10:11:02,325] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 13: [2023-05-10 10:11:02,325] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 22: [2023-05-10 10:11:02,325] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 13: [2023-05-10 10:11:02,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 4: [2023-05-10 10:11:02,328] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 3: [2023-05-10 10:11:02,332] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 3: [2023-05-10 10:11:02,332] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 4: [2023-05-10 10:11:02,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 3: [2023-05-10 10:11:02,334] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 14: [2023-05-10 10:11:02,335] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 14: [2023-05-10 10:11:02,335] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 14: [2023-05-10 10:11:02,335] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 14: [2023-05-10 10:11:02,335] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 11: [2023-05-10 10:11:02,336] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 28: [2023-05-10 10:11:02,338] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 4: [2023-05-10 10:11:02,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 4: [2023-05-10 10:11:02,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 4: [2023-05-10 10:11:02,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 13: [2023-05-10 10:11:02,342] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 13: [2023-05-10 10:11:02,342] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 4: [2023-05-10 10:11:02,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 4: [2023-05-10 10:11:02,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 4: [2023-05-10 10:11:02,343] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 13: [2023-05-10 10:11:02,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 13: [2023-05-10 10:11:02,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 16: [2023-05-10 10:11:02,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 3: [2023-05-10 10:11:02,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 11: [2023-05-10 10:11:02,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 13: [2023-05-10 10:11:02,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 16: [2023-05-10 10:11:02,345] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 28: [2023-05-10 10:11:02,347] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 16: [2023-05-10 10:11:02,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 16: [2023-05-10 10:11:02,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 16: [2023-05-10 10:11:02,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 16: [2023-05-10 10:11:02,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 16: [2023-05-10 10:11:02,347] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 14: [2023-05-10 10:11:02,349] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 11: [2023-05-10 10:11:02,349] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 14: [2023-05-10 10:11:02,349] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 14: [2023-05-10 10:11:02,349] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 14: [2023-05-10 10:11:02,349] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 16: [2023-05-10 10:11:02,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 28: [2023-05-10 10:11:02,351] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 3: [2023-05-10 10:11:02,352] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 4: [2023-05-10 10:11:02,355] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 28: [2023-05-10 10:11:02,356] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 28: [2023-05-10 10:11:02,356] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 28: [2023-05-10 10:11:02,356] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 3: [2023-05-10 10:11:02,356] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 3: [2023-05-10 10:11:02,357] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 30: [2023-05-10 10:11:02,357] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 22: [2023-05-10 10:11:02,358] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 11: [2023-05-10 10:11:02,360] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 22: [2023-05-10 10:11:02,360] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 14: [2023-05-10 10:11:02,360] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 14: [2023-05-10 10:11:02,360] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 4: [2023-05-10 10:11:02,360] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 11: [2023-05-10 10:11:02,360] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 22: [2023-05-10 10:11:02,362] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 22: [2023-05-10 10:11:02,362] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 22: [2023-05-10 10:11:02,362] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 4: [2023-05-10 10:11:02,364] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 11: [2023-05-10 10:11:02,364] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 11: [2023-05-10 10:11:02,364] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 11: [2023-05-10 10:11:02,364] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 14: [2023-05-10 10:11:02,364] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 14: [2023-05-10 10:11:02,364] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 11: [2023-05-10 10:11:02,364] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 4: [2023-05-10 10:11:02,367] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 22: [2023-05-10 10:11:02,367] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 4: [2023-05-10 10:11:02,368] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 4: [2023-05-10 10:11:02,370] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 4: [2023-05-10 10:11:02,371] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 5: [2023-05-10 10:11:02,373] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 5: [2023-05-10 10:11:02,373] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 11: [2023-05-10 10:11:02,375] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 5: [2023-05-10 10:11:02,376] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 4: [2023-05-10 10:11:02,376] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 5: [2023-05-10 10:11:02,376] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 5: [2023-05-10 10:11:02,376] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 5: [2023-05-10 10:11:02,376] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 5: [2023-05-10 10:11:02,376] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 5: [2023-05-10 10:11:02,377] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 16: [2023-05-10 10:11:02,377] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 16: [2023-05-10 10:11:02,378] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 5: [2023-05-10 10:11:02,380] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 5: [2023-05-10 10:11:02,380] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 14: [2023-05-10 10:11:02,380] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 5: [2023-05-10 10:11:02,381] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 14: [2023-05-10 10:11:02,380] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 30: [2023-05-10 10:11:02,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 30: [2023-05-10 10:11:02,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 14: [2023-05-10 10:11:02,381] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 14: [2023-05-10 10:11:02,381] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 30: [2023-05-10 10:11:02,382] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 5: [2023-05-10 10:11:02,382] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 5: [2023-05-10 10:11:02,383] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 5: [2023-05-10 10:11:02,384] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 16: [2023-05-10 10:11:02,384] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 16: [2023-05-10 10:11:02,384] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 16: [2023-05-10 10:11:02,384] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 16: [2023-05-10 10:11:02,384] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 5: [2023-05-10 10:11:02,385] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 5: [2023-05-10 10:11:02,385] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt... 30: [2023-05-10 10:11:02,387] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 11: [2023-05-10 10:11:02,396] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 11: [2023-05-10 10:11:02,396] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 11: [2023-05-10 10:11:02,397] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 30: [2023-05-10 10:11:02,397] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 30: [2023-05-10 10:11:02,397] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 11: [2023-05-10 10:11:02,397] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 11: [2023-05-10 10:11:02,399] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 30: [2023-05-10 10:11:02,400] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 30: [2023-05-10 10:11:02,404] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 30: [2023-05-10 10:11:02,413] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 30: [2023-05-10 10:11:02,419] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 30: [2023-05-10 10:11:02,420] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 5: [2023-05-10 10:11:02,425] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 30: [2023-05-10 10:11:02,429] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 30: [2023-05-10 10:11:02,434] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 30: [2023-05-10 10:11:02,434] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 30: [2023-05-10 10:11:02,434] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 5: [2023-05-10 10:11:02,442] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 5: [2023-05-10 10:11:02,442] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 5: [2023-05-10 10:11:02,447] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 5: [2023-05-10 10:11:02,450] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 5: [2023-05-10 10:11:02,450] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 5: [2023-05-10 10:11:02,450] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 5: [2023-05-10 10:11:02,451] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 5: [2023-05-10 10:11:02,451] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_34-model_00-model_states.pt. 10: [2023-05-10 10:11:02,456] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 10: [2023-05-10 10:11:02,456] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 10: [2023-05-10 10:11:02,457] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 10: [2023-05-10 10:11:02,457] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 10: [2023-05-10 10:11:02,457] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 10: [2023-05-10 10:11:02,457] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 10: [2023-05-10 10:11:02,457] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 10: [2023-05-10 10:11:02,457] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 10: [2023-05-10 10:11:02,463] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 10: [2023-05-10 10:11:02,463] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 10: [2023-05-10 10:11:02,463] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 10: [2023-05-10 10:11:02,464] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 10: [2023-05-10 10:11:02,465] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 10: [2023-05-10 10:11:02,465] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 10: [2023-05-10 10:11:02,466] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 10: [2023-05-10 10:11:02,467] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 5: [2023-05-10 10:11:02,472] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 5: [2023-05-10 10:11:02,473] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 5: [2023-05-10 10:11:02,485] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 5: [2023-05-10 10:11:02,486] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 5: [2023-05-10 10:11:02,486] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 5: [2023-05-10 10:11:02,488] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 5: [2023-05-10 10:11:02,488] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 23: [2023-05-10 10:11:02,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 23: [2023-05-10 10:11:02,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 23: [2023-05-10 10:11:02,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 8: [2023-05-10 10:11:02,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 8: [2023-05-10 10:11:02,509] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 8: [2023-05-10 10:11:02,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 8: [2023-05-10 10:11:02,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 8: [2023-05-10 10:11:02,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 23: [2023-05-10 10:11:02,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 8: [2023-05-10 10:11:02,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 8: [2023-05-10 10:11:02,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 23: [2023-05-10 10:11:02,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 23: [2023-05-10 10:11:02,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 23: [2023-05-10 10:11:02,510] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 8: [2023-05-10 10:11:02,511] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 23: [2023-05-10 10:11:02,511] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 23: [2023-05-10 10:11:02,514] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 23: [2023-05-10 10:11:02,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 8: [2023-05-10 10:11:02,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 8: [2023-05-10 10:11:02,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 8: [2023-05-10 10:11:02,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 23: [2023-05-10 10:11:02,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 8: [2023-05-10 10:11:02,515] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 8: [2023-05-10 10:11:02,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 8: [2023-05-10 10:11:02,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 8: [2023-05-10 10:11:02,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 8: [2023-05-10 10:11:02,516] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 23: [2023-05-10 10:11:02,517] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 23: [2023-05-10 10:11:02,517] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 10: [2023-05-10 10:11:02,519] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 10: [2023-05-10 10:11:02,519] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 23: [2023-05-10 10:11:02,519] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 10: [2023-05-10 10:11:02,519] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 23: [2023-05-10 10:11:02,519] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 23: [2023-05-10 10:11:02,520] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 9: [2023-05-10 10:11:02,520] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 9: [2023-05-10 10:11:02,520] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 10: [2023-05-10 10:11:02,521] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 9: [2023-05-10 10:11:02,523] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 9: [2023-05-10 10:11:02,523] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 9: [2023-05-10 10:11:02,523] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 9: [2023-05-10 10:11:02,523] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 9: [2023-05-10 10:11:02,523] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 9: [2023-05-10 10:11:02,523] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 9: [2023-05-10 10:11:02,527] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 9: [2023-05-10 10:11:02,527] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 10: [2023-05-10 10:11:02,528] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 9: [2023-05-10 10:11:02,529] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 9: [2023-05-10 10:11:02,532] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 9: [2023-05-10 10:11:02,533] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 9: [2023-05-10 10:11:02,533] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 9: [2023-05-10 10:11:02,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 9: [2023-05-10 10:11:02,534] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 17: [2023-05-10 10:11:02,540] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 17: [2023-05-10 10:11:02,540] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 17: [2023-05-10 10:11:02,540] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 17: [2023-05-10 10:11:02,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 17: [2023-05-10 10:11:02,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 17: [2023-05-10 10:11:02,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 17: [2023-05-10 10:11:02,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 10: [2023-05-10 10:11:02,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 10: [2023-05-10 10:11:02,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 17: [2023-05-10 10:11:02,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 10: [2023-05-10 10:11:02,541] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 10: [2023-05-10 10:11:02,543] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 17: [2023-05-10 10:11:02,544] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 10: [2023-05-10 10:11:02,545] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 10: [2023-05-10 10:11:02,549] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 10: [2023-05-10 10:11:02,551] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 17: [2023-05-10 10:11:02,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 17: [2023-05-10 10:11:02,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 17: [2023-05-10 10:11:02,553] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 17: [2023-05-10 10:11:02,553] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 17: [2023-05-10 10:11:02,554] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 17: [2023-05-10 10:11:02,554] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 17: [2023-05-10 10:11:02,556] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 10: [2023-05-10 10:11:02,558] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 8: [2023-05-10 10:11:02,562] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 8: [2023-05-10 10:11:02,563] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 10: [2023-05-10 10:11:02,570] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 15: [2023-05-10 10:11:02,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 15: [2023-05-10 10:11:02,570] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 15: [2023-05-10 10:11:02,571] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 10: [2023-05-10 10:11:02,571] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 15: [2023-05-10 10:11:02,571] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 15: [2023-05-10 10:11:02,571] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 15: [2023-05-10 10:11:02,571] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 15: [2023-05-10 10:11:02,571] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 15: [2023-05-10 10:11:02,571] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 10: [2023-05-10 10:11:02,572] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 23: [2023-05-10 10:11:02,573] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 23: [2023-05-10 10:11:02,573] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 23: [2023-05-10 10:11:02,573] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 15: [2023-05-10 10:11:02,574] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 8: [2023-05-10 10:11:02,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 23: [2023-05-10 10:11:02,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 23: [2023-05-10 10:11:02,576] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 8: [2023-05-10 10:11:02,579] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 15: [2023-05-10 10:11:02,580] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 15: [2023-05-10 10:11:02,582] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 15: [2023-05-10 10:11:02,582] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 15: [2023-05-10 10:11:02,583] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 15: [2023-05-10 10:11:02,583] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 15: [2023-05-10 10:11:02,584] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 9: [2023-05-10 10:11:02,584] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 15: [2023-05-10 10:11:02,585] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 9: [2023-05-10 10:11:02,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 9: [2023-05-10 10:11:02,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 9: [2023-05-10 10:11:02,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 8: [2023-05-10 10:11:02,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 8: [2023-05-10 10:11:02,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 8: [2023-05-10 10:11:02,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 8: [2023-05-10 10:11:02,585] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 23: [2023-05-10 10:11:02,587] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 8: [2023-05-10 10:11:02,589] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 8: [2023-05-10 10:11:02,590] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 17: [2023-05-10 10:11:02,592] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 23: [2023-05-10 10:11:02,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 23: [2023-05-10 10:11:02,595] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 9: [2023-05-10 10:11:02,597] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 9: [2023-05-10 10:11:02,597] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 23: [2023-05-10 10:11:02,597] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 23: [2023-05-10 10:11:02,598] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 9: [2023-05-10 10:11:02,598] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 9: [2023-05-10 10:11:02,598] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 23: [2023-05-10 10:11:02,603] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 23: [2023-05-10 10:11:02,604] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 23: [2023-05-10 10:11:02,604] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 8: [2023-05-10 10:11:02,606] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 8: [2023-05-10 10:11:02,607] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 17: [2023-05-10 10:11:02,611] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 8: [2023-05-10 10:11:02,613] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 8: [2023-05-10 10:11:02,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 8: [2023-05-10 10:11:02,614] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 8: [2023-05-10 10:11:02,615] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 9: [2023-05-10 10:11:02,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 9: [2023-05-10 10:11:02,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 9: [2023-05-10 10:11:02,616] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 23: [2023-05-10 10:11:02,617] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 9: [2023-05-10 10:11:02,618] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 23: [2023-05-10 10:11:02,622] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 15: [2023-05-10 10:11:02,623] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 23: [2023-05-10 10:11:02,623] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 17: [2023-05-10 10:11:02,624] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 17: [2023-05-10 10:11:02,626] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 9: [2023-05-10 10:11:02,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 9: [2023-05-10 10:11:02,626] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 17: [2023-05-10 10:11:02,627] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 9: [2023-05-10 10:11:02,628] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 9: [2023-05-10 10:11:02,629] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 7: [2023-05-10 10:11:02,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 7: [2023-05-10 10:11:02,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 7: [2023-05-10 10:11:02,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 7: [2023-05-10 10:11:02,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 7: [2023-05-10 10:11:02,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 7: [2023-05-10 10:11:02,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 7: [2023-05-10 10:11:02,632] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 7: [2023-05-10 10:11:02,633] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 19: [2023-05-10 10:11:02,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 19: [2023-05-10 10:11:02,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 19: [2023-05-10 10:11:02,634] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 19: [2023-05-10 10:11:02,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 19: [2023-05-10 10:11:02,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 19: [2023-05-10 10:11:02,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 19: [2023-05-10 10:11:02,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 19: [2023-05-10 10:11:02,635] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 7: [2023-05-10 10:11:02,637] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 7: [2023-05-10 10:11:02,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 7: [2023-05-10 10:11:02,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 19: [2023-05-10 10:11:02,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 7: [2023-05-10 10:11:02,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 7: [2023-05-10 10:11:02,638] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 19: [2023-05-10 10:11:02,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 19: [2023-05-10 10:11:02,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 7: [2023-05-10 10:11:02,639] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 7: [2023-05-10 10:11:02,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 7: [2023-05-10 10:11:02,640] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 17: [2023-05-10 10:11:02,641] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 17: [2023-05-10 10:11:02,641] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 17: [2023-05-10 10:11:02,641] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 19: [2023-05-10 10:11:02,641] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 19: [2023-05-10 10:11:02,641] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 19: [2023-05-10 10:11:02,641] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 17: [2023-05-10 10:11:02,642] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 15: [2023-05-10 10:11:02,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 19: [2023-05-10 10:11:02,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 19: [2023-05-10 10:11:02,643] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 15: [2023-05-10 10:11:02,653] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 15: [2023-05-10 10:11:02,654] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 15: [2023-05-10 10:11:02,654] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 17: [2023-05-10 10:11:02,656] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 15: [2023-05-10 10:11:02,656] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 15: [2023-05-10 10:11:02,656] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 17: [2023-05-10 10:11:02,657] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 15: [2023-05-10 10:11:02,657] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 29: [2023-05-10 10:11:02,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 29: [2023-05-10 10:11:02,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 29: [2023-05-10 10:11:02,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 29: [2023-05-10 10:11:02,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 2: [2023-05-10 10:11:02,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 2: [2023-05-10 10:11:02,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 2: [2023-05-10 10:11:02,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 29: [2023-05-10 10:11:02,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 29: [2023-05-10 10:11:02,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 29: [2023-05-10 10:11:02,658] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 2: [2023-05-10 10:11:02,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 2: [2023-05-10 10:11:02,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 29: [2023-05-10 10:11:02,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 2: [2023-05-10 10:11:02,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 2: [2023-05-10 10:11:02,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 2: [2023-05-10 10:11:02,659] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 17: [2023-05-10 10:11:02,661] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 29: [2023-05-10 10:11:02,662] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 29: [2023-05-10 10:11:02,663] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 2: [2023-05-10 10:11:02,663] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 2: [2023-05-10 10:11:02,663] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 29: [2023-05-10 10:11:02,664] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 1: [2023-05-10 10:11:02,666] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 1: [2023-05-10 10:11:02,666] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 1: [2023-05-10 10:11:02,667] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 1: [2023-05-10 10:11:02,667] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 1: [2023-05-10 10:11:02,667] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 1: [2023-05-10 10:11:02,667] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 1: [2023-05-10 10:11:02,667] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 29: [2023-05-10 10:11:02,667] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 15: [2023-05-10 10:11:02,667] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 15: [2023-05-10 10:11:02,667] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 15: [2023-05-10 10:11:02,667] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 15: [2023-05-10 10:11:02,667] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 1: [2023-05-10 10:11:02,667] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 29: [2023-05-10 10:11:02,667] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 29: [2023-05-10 10:11:02,668] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 29: [2023-05-10 10:11:02,668] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 29: [2023-05-10 10:11:02,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 2: [2023-05-10 10:11:02,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 2: [2023-05-10 10:11:02,669] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 2: [2023-05-10 10:11:02,670] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 2: [2023-05-10 10:11:02,671] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 2: [2023-05-10 10:11:02,671] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 2: [2023-05-10 10:11:02,671] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 1: [2023-05-10 10:11:02,672] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 1: [2023-05-10 10:11:02,673] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 1: [2023-05-10 10:11:02,673] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 1: [2023-05-10 10:11:02,673] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 1: [2023-05-10 10:11:02,674] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 1: [2023-05-10 10:11:02,675] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 1: [2023-05-10 10:11:02,675] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 17: [2023-05-10 10:11:02,676] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 1: [2023-05-10 10:11:02,676] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 17: [2023-05-10 10:11:02,678] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 17: [2023-05-10 10:11:02,679] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 17: [2023-05-10 10:11:02,679] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 19: [2023-05-10 10:11:02,681] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 31: [2023-05-10 10:11:02,682] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 31: [2023-05-10 10:11:02,682] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 31: [2023-05-10 10:11:02,682] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 31: [2023-05-10 10:11:02,682] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 31: [2023-05-10 10:11:02,682] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 31: [2023-05-10 10:11:02,682] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 31: [2023-05-10 10:11:02,682] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 15: [2023-05-10 10:11:02,683] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 31: [2023-05-10 10:11:02,683] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 15: [2023-05-10 10:11:02,683] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 15: [2023-05-10 10:11:02,683] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 15: [2023-05-10 10:11:02,684] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 31: [2023-05-10 10:11:02,685] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 15: [2023-05-10 10:11:02,686] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 15: [2023-05-10 10:11:02,686] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 15: [2023-05-10 10:11:02,686] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 15: [2023-05-10 10:11:02,686] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 15: [2023-05-10 10:11:02,687] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 15: [2023-05-10 10:11:02,687] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 15: [2023-05-10 10:11:02,687] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 15: [2023-05-10 10:11:02,687] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 31: [2023-05-10 10:11:02,687] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 7: [2023-05-10 10:11:02,689] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 31: [2023-05-10 10:11:02,690] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 7: [2023-05-10 10:11:02,690] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 19: [2023-05-10 10:11:02,691] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 31: [2023-05-10 10:11:02,691] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 31: [2023-05-10 10:11:02,691] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 31: [2023-05-10 10:11:02,692] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 31: [2023-05-10 10:11:02,692] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 31: [2023-05-10 10:11:02,693] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 19: [2023-05-10 10:11:02,693] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 19: [2023-05-10 10:11:02,693] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 15: [2023-05-10 10:11:02,698] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 15: [2023-05-10 10:11:02,698] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 15: [2023-05-10 10:11:02,698] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 15: [2023-05-10 10:11:02,699] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 15: [2023-05-10 10:11:02,699] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 15: [2023-05-10 10:11:02,700] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 15: [2023-05-10 10:11:02,700] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 19: [2023-05-10 10:11:02,700] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 19: [2023-05-10 10:11:02,700] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 19: [2023-05-10 10:11:02,700] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 15: [2023-05-10 10:11:02,700] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 15: [2023-05-10 10:11:02,700] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 15: [2023-05-10 10:11:02,701] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 15: [2023-05-10 10:11:02,701] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 15: [2023-05-10 10:11:02,701] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 15: [2023-05-10 10:11:02,701] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 15: [2023-05-10 10:11:02,702] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 15: [2023-05-10 10:11:02,702] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 15: [2023-05-10 10:11:02,702] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 7: [2023-05-10 10:11:02,702] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 7: [2023-05-10 10:11:02,702] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 7: [2023-05-10 10:11:02,702] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 7: [2023-05-10 10:11:02,706] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 7: [2023-05-10 10:11:02,707] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 7: [2023-05-10 10:11:02,707] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 29: [2023-05-10 10:11:02,709] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 19: [2023-05-10 10:11:02,712] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 19: [2023-05-10 10:11:02,714] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 2: [2023-05-10 10:11:02,715] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 2: [2023-05-10 10:11:02,715] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 7: [2023-05-10 10:11:02,716] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 7: [2023-05-10 10:11:02,716] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 19: [2023-05-10 10:11:02,718] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 19: [2023-05-10 10:11:02,719] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 29: [2023-05-10 10:11:02,720] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 29: [2023-05-10 10:11:02,720] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 19: [2023-05-10 10:11:02,723] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 1: [2023-05-10 10:11:02,728] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 31: [2023-05-10 10:11:02,728] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 1: [2023-05-10 10:11:02,728] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 19: [2023-05-10 10:11:02,729] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 7: [2023-05-10 10:11:02,729] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 1: [2023-05-10 10:11:02,731] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 19: [2023-05-10 10:11:02,731] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 29: [2023-05-10 10:11:02,731] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 7: [2023-05-10 10:11:02,732] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 1: [2023-05-10 10:11:02,732] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 1: [2023-05-10 10:11:02,732] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 29: [2023-05-10 10:11:02,732] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 7: [2023-05-10 10:11:02,733] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 2: [2023-05-10 10:11:02,735] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 29: [2023-05-10 10:11:02,736] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 29: [2023-05-10 10:11:02,736] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 29: [2023-05-10 10:11:02,736] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 29: [2023-05-10 10:11:02,736] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 2: [2023-05-10 10:11:02,737] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 2: [2023-05-10 10:11:02,737] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 7: [2023-05-10 10:11:02,739] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 18: [2023-05-10 10:11:02,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 18: [2023-05-10 10:11:02,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 18: [2023-05-10 10:11:02,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 18: [2023-05-10 10:11:02,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 18: [2023-05-10 10:11:02,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 18: [2023-05-10 10:11:02,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 18: [2023-05-10 10:11:02,738] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 7: [2023-05-10 10:11:02,739] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 18: [2023-05-10 10:11:02,739] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 7: [2023-05-10 10:11:02,739] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 15: [2023-05-10 10:11:02,740] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_121_mp_rank_00_optim_states.pt... 15: [2023-05-10 10:11:02,740] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_127_mp_rank_00_optim_states.pt... 15: [2023-05-10 10:11:02,740] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_120_mp_rank_00_optim_states.pt... 15: [2023-05-10 10:11:02,740] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_126_mp_rank_00_optim_states.pt... 15: [2023-05-10 10:11:02,740] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_124_mp_rank_00_optim_states.pt... 15: [2023-05-10 10:11:02,740] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_125_mp_rank_00_optim_states.pt... 15: [2023-05-10 10:11:02,740] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_122_mp_rank_00_optim_states.pt... 15: [2023-05-10 10:11:02,740] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_123_mp_rank_00_optim_states.pt... 19: [2023-05-10 10:11:02,742] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 29: [2023-05-10 10:11:02,742] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 2: [2023-05-10 10:11:02,743] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 2: [2023-05-10 10:11:02,743] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 19: [2023-05-10 10:11:02,743] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 29: [2023-05-10 10:11:02,743] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 18: [2023-05-10 10:11:02,744] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 31: [2023-05-10 10:11:02,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 2: [2023-05-10 10:11:02,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 2: [2023-05-10 10:11:02,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 18: [2023-05-10 10:11:02,744] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 2: [2023-05-10 10:11:02,744] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 18: [2023-05-10 10:11:02,744] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 18: [2023-05-10 10:11:02,745] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 1: [2023-05-10 10:11:02,745] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 18: [2023-05-10 10:11:02,746] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 18: [2023-05-10 10:11:02,746] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 18: [2023-05-10 10:11:02,747] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 18: [2023-05-10 10:11:02,747] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 1: [2023-05-10 10:11:02,750] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 1: [2023-05-10 10:11:02,750] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 31: [2023-05-10 10:11:02,750] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 10: [2023-05-10 10:11:02,752] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 10: [2023-05-10 10:11:02,752] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 10: [2023-05-10 10:11:02,753] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 10: [2023-05-10 10:11:02,753] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 10: [2023-05-10 10:11:02,753] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 10: [2023-05-10 10:11:02,753] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 10: [2023-05-10 10:11:02,753] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 10: [2023-05-10 10:11:02,753] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 31: [2023-05-10 10:11:02,754] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 1: [2023-05-10 10:11:02,756] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 1: [2023-05-10 10:11:02,756] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 10: [2023-05-10 10:11:02,758] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 10: [2023-05-10 10:11:02,759] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 10: [2023-05-10 10:11:02,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 29: [2023-05-10 10:11:02,760] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 1: [2023-05-10 10:11:02,761] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 10: [2023-05-10 10:11:02,761] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 10: [2023-05-10 10:11:02,761] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 1: [2023-05-10 10:11:02,761] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 10: [2023-05-10 10:11:02,761] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 10: [2023-05-10 10:11:02,762] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 1: [2023-05-10 10:11:02,762] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 10: [2023-05-10 10:11:02,762] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 27: [2023-05-10 10:11:02,764] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 27: [2023-05-10 10:11:02,764] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 27: [2023-05-10 10:11:02,764] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 27: [2023-05-10 10:11:02,764] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 27: [2023-05-10 10:11:02,764] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 27: [2023-05-10 10:11:02,764] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 27: [2023-05-10 10:11:02,764] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 27: [2023-05-10 10:11:02,764] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 31: [2023-05-10 10:11:02,765] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 31: [2023-05-10 10:11:02,766] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 29: [2023-05-10 10:11:02,766] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 29: [2023-05-10 10:11:02,766] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 2: [2023-05-10 10:11:02,767] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 31: [2023-05-10 10:11:02,767] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 31: [2023-05-10 10:11:02,767] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 31: [2023-05-10 10:11:02,767] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 27: [2023-05-10 10:11:02,768] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 29: [2023-05-10 10:11:02,768] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 29: [2023-05-10 10:11:02,768] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 27: [2023-05-10 10:11:02,769] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 27: [2023-05-10 10:11:02,770] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 27: [2023-05-10 10:11:02,770] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 27: [2023-05-10 10:11:02,771] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 31: [2023-05-10 10:11:02,771] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 27: [2023-05-10 10:11:02,772] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 27: [2023-05-10 10:11:02,773] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 27: [2023-05-10 10:11:02,774] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 1: [2023-05-10 10:11:02,775] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 2: [2023-05-10 10:11:02,775] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 1: [2023-05-10 10:11:02,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 2: [2023-05-10 10:11:02,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 2: [2023-05-10 10:11:02,778] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 31: [2023-05-10 10:11:02,779] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 1: [2023-05-10 10:11:02,780] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 2: [2023-05-10 10:11:02,782] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 2: [2023-05-10 10:11:02,784] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 21: [2023-05-10 10:11:02,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 21: [2023-05-10 10:11:02,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 21: [2023-05-10 10:11:02,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 21: [2023-05-10 10:11:02,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 21: [2023-05-10 10:11:02,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 21: [2023-05-10 10:11:02,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 21: [2023-05-10 10:11:02,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 21: [2023-05-10 10:11:02,785] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 25: [2023-05-10 10:11:02,787] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 25: [2023-05-10 10:11:02,787] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 25: [2023-05-10 10:11:02,788] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 25: [2023-05-10 10:11:02,788] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 25: [2023-05-10 10:11:02,788] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 25: [2023-05-10 10:11:02,788] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 25: [2023-05-10 10:11:02,788] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 25: [2023-05-10 10:11:02,788] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 21: [2023-05-10 10:11:02,790] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 13: [2023-05-10 10:11:02,790] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 13: [2023-05-10 10:11:02,790] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 13: [2023-05-10 10:11:02,790] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 13: [2023-05-10 10:11:02,791] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 13: [2023-05-10 10:11:02,791] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 13: [2023-05-10 10:11:02,791] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 13: [2023-05-10 10:11:02,791] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 13: [2023-05-10 10:11:02,791] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 25: [2023-05-10 10:11:02,792] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 25: [2023-05-10 10:11:02,792] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 21: [2023-05-10 10:11:02,792] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 21: [2023-05-10 10:11:02,793] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 21: [2023-05-10 10:11:02,793] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 25: [2023-05-10 10:11:02,793] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 21: [2023-05-10 10:11:02,793] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 21: [2023-05-10 10:11:02,793] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 25: [2023-05-10 10:11:02,794] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 21: [2023-05-10 10:11:02,795] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 25: [2023-05-10 10:11:02,794] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 25: [2023-05-10 10:11:02,795] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 21: [2023-05-10 10:11:02,795] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 25: [2023-05-10 10:11:02,795] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 25: [2023-05-10 10:11:02,796] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 18: [2023-05-10 10:11:02,799] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 23: [2023-05-10 10:11:02,799] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 23: [2023-05-10 10:11:02,799] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 23: [2023-05-10 10:11:02,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 23: [2023-05-10 10:11:02,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 23: [2023-05-10 10:11:02,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 23: [2023-05-10 10:11:02,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 23: [2023-05-10 10:11:02,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 23: [2023-05-10 10:11:02,802] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 31: [2023-05-10 10:11:02,803] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 31: [2023-05-10 10:11:02,803] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 31: [2023-05-10 10:11:02,803] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 31: [2023-05-10 10:11:02,803] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 13: [2023-05-10 10:11:02,802] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 13: [2023-05-10 10:11:02,804] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 13: [2023-05-10 10:11:02,804] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 18: [2023-05-10 10:11:02,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 31: [2023-05-10 10:11:02,805] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 13: [2023-05-10 10:11:02,805] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 13: [2023-05-10 10:11:02,805] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 23: [2023-05-10 10:11:02,805] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 23: [2023-05-10 10:11:02,805] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 23: [2023-05-10 10:11:02,805] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 13: [2023-05-10 10:11:02,805] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 10: [2023-05-10 10:11:02,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 13: [2023-05-10 10:11:02,806] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 18: [2023-05-10 10:11:02,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 18: [2023-05-10 10:11:02,806] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 18: [2023-05-10 10:11:02,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 18: [2023-05-10 10:11:02,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 13: [2023-05-10 10:11:02,807] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 18: [2023-05-10 10:11:02,809] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 18: [2023-05-10 10:11:02,809] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 6: [2023-05-10 10:11:02,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 6: [2023-05-10 10:11:02,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 6: [2023-05-10 10:11:02,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 23: [2023-05-10 10:11:02,810] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 23: [2023-05-10 10:11:02,810] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 0: [2023-05-10 10:11:02,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 0: [2023-05-10 10:11:02,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 0: [2023-05-10 10:11:02,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 6: [2023-05-10 10:11:02,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 0: [2023-05-10 10:11:02,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 0: [2023-05-10 10:11:02,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 0: [2023-05-10 10:11:02,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 0: [2023-05-10 10:11:02,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 23: [2023-05-10 10:11:02,810] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 0: [2023-05-10 10:11:02,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 6: [2023-05-10 10:11:02,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 6: [2023-05-10 10:11:02,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 6: [2023-05-10 10:11:02,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 6: [2023-05-10 10:11:02,811] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 23: [2023-05-10 10:11:02,811] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 23: [2023-05-10 10:11:02,811] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 12: [2023-05-10 10:11:02,811] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 12: [2023-05-10 10:11:02,811] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 12: [2023-05-10 10:11:02,812] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 12: [2023-05-10 10:11:02,812] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 12: [2023-05-10 10:11:02,812] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 12: [2023-05-10 10:11:02,812] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 12: [2023-05-10 10:11:02,812] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 12: [2023-05-10 10:11:02,812] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 6: [2023-05-10 10:11:02,813] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 6: [2023-05-10 10:11:02,814] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 0: [2023-05-10 10:11:02,815] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 0: [2023-05-10 10:11:02,815] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 12: [2023-05-10 10:11:02,815] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 0: [2023-05-10 10:11:02,816] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 0: [2023-05-10 10:11:02,816] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 6: [2023-05-10 10:11:02,816] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 10: [2023-05-10 10:11:02,816] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 10: [2023-05-10 10:11:02,817] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 0: [2023-05-10 10:11:02,817] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 0: [2023-05-10 10:11:02,817] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 0: [2023-05-10 10:11:02,818] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 0: [2023-05-10 10:11:02,818] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 12: [2023-05-10 10:11:02,818] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 20: [2023-05-10 10:11:02,819] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 20: [2023-05-10 10:11:02,819] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 20: [2023-05-10 10:11:02,819] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 20: [2023-05-10 10:11:02,819] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 20: [2023-05-10 10:11:02,819] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 20: [2023-05-10 10:11:02,819] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 20: [2023-05-10 10:11:02,819] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 6: [2023-05-10 10:11:02,819] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 20: [2023-05-10 10:11:02,820] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 27: [2023-05-10 10:11:02,820] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 6: [2023-05-10 10:11:02,821] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 6: [2023-05-10 10:11:02,821] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 6: [2023-05-10 10:11:02,822] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 6: [2023-05-10 10:11:02,822] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 27: [2023-05-10 10:11:02,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 20: [2023-05-10 10:11:02,823] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 20: [2023-05-10 10:11:02,824] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 12: [2023-05-10 10:11:02,823] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 12: [2023-05-10 10:11:02,824] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 12: [2023-05-10 10:11:02,824] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 18: [2023-05-10 10:11:02,824] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 12: [2023-05-10 10:11:02,824] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 12: [2023-05-10 10:11:02,825] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 10: [2023-05-10 10:11:02,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 24: [2023-05-10 10:11:02,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 24: [2023-05-10 10:11:02,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 24: [2023-05-10 10:11:02,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 24: [2023-05-10 10:11:02,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 24: [2023-05-10 10:11:02,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 24: [2023-05-10 10:11:02,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 24: [2023-05-10 10:11:02,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 24: [2023-05-10 10:11:02,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 12: [2023-05-10 10:11:02,826] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 27: [2023-05-10 10:11:02,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 20: [2023-05-10 10:11:02,828] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 10: [2023-05-10 10:11:02,828] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 20: [2023-05-10 10:11:02,829] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 24: [2023-05-10 10:11:02,829] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 24: [2023-05-10 10:11:02,830] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 20: [2023-05-10 10:11:02,830] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 20: [2023-05-10 10:11:02,830] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 27: [2023-05-10 10:11:02,830] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 20: [2023-05-10 10:11:02,831] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 10: [2023-05-10 10:11:02,831] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 10: [2023-05-10 10:11:02,831] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 10: [2023-05-10 10:11:02,831] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 10: [2023-05-10 10:11:02,831] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 20: [2023-05-10 10:11:02,831] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 24: [2023-05-10 10:11:02,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 10: [2023-05-10 10:11:02,832] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 10: [2023-05-10 10:11:02,832] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 24: [2023-05-10 10:11:02,833] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 10: [2023-05-10 10:11:02,833] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 24: [2023-05-10 10:11:02,833] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 24: [2023-05-10 10:11:02,833] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 24: [2023-05-10 10:11:02,834] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 24: [2023-05-10 10:11:02,834] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 21: [2023-05-10 10:11:02,836] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 18: [2023-05-10 10:11:02,837] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 18: [2023-05-10 10:11:02,837] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 9: [2023-05-10 10:11:02,839] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 9: [2023-05-10 10:11:02,839] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 9: [2023-05-10 10:11:02,839] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 9: [2023-05-10 10:11:02,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 18: [2023-05-10 10:11:02,840] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 9: [2023-05-10 10:11:02,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 9: [2023-05-10 10:11:02,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 9: [2023-05-10 10:11:02,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 9: [2023-05-10 10:11:02,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 18: [2023-05-10 10:11:02,842] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 18: [2023-05-10 10:11:02,842] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 18: [2023-05-10 10:11:02,842] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 18: [2023-05-10 10:11:02,842] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 9: [2023-05-10 10:11:02,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 25: [2023-05-10 10:11:02,844] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 25: [2023-05-10 10:11:02,844] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 27: [2023-05-10 10:11:02,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 10: [2023-05-10 10:11:02,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 10: [2023-05-10 10:11:02,844] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 10: [2023-05-10 10:11:02,844] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 27: [2023-05-10 10:11:02,844] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 8: [2023-05-10 10:11:02,845] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 10: [2023-05-10 10:11:02,845] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 8: [2023-05-10 10:11:02,845] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 8: [2023-05-10 10:11:02,845] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 8: [2023-05-10 10:11:02,845] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 8: [2023-05-10 10:11:02,845] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 8: [2023-05-10 10:11:02,845] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 8: [2023-05-10 10:11:02,845] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 8: [2023-05-10 10:11:02,845] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 27: [2023-05-10 10:11:02,846] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 25: [2023-05-10 10:11:02,846] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 27: [2023-05-10 10:11:02,846] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 9: [2023-05-10 10:11:02,846] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 10: [2023-05-10 10:11:02,846] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 27: [2023-05-10 10:11:02,847] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 27: [2023-05-10 10:11:02,847] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 9: [2023-05-10 10:11:02,847] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 10: [2023-05-10 10:11:02,847] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 10: [2023-05-10 10:11:02,847] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 10: [2023-05-10 10:11:02,848] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 25: [2023-05-10 10:11:02,847] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 25: [2023-05-10 10:11:02,847] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 21: [2023-05-10 10:11:02,849] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 9: [2023-05-10 10:11:02,849] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 9: [2023-05-10 10:11:02,849] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 9: [2023-05-10 10:11:02,850] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 8: [2023-05-10 10:11:02,850] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 9: [2023-05-10 10:11:02,851] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 9: [2023-05-10 10:11:02,851] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 8: [2023-05-10 10:11:02,851] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 8: [2023-05-10 10:11:02,851] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 27: [2023-05-10 10:11:02,851] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 8: [2023-05-10 10:11:02,852] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 8: [2023-05-10 10:11:02,852] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 8: [2023-05-10 10:11:02,852] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 8: [2023-05-10 10:11:02,853] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 8: [2023-05-10 10:11:02,853] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 27: [2023-05-10 10:11:02,853] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 10: [2023-05-10 10:11:02,853] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 21: [2023-05-10 10:11:02,854] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 10: [2023-05-10 10:11:02,854] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 10: [2023-05-10 10:11:02,854] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 10: [2023-05-10 10:11:02,854] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 23: [2023-05-10 10:11:02,854] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 21: [2023-05-10 10:11:02,859] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 10: [2023-05-10 10:11:02,859] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 25: [2023-05-10 10:11:02,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 10: [2023-05-10 10:11:02,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 10: [2023-05-10 10:11:02,860] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 10: [2023-05-10 10:11:02,861] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 12: [2023-05-10 10:11:02,861] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 21: [2023-05-10 10:11:02,862] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 21: [2023-05-10 10:11:02,862] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 21: [2023-05-10 10:11:02,862] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 21: [2023-05-10 10:11:02,862] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 21: [2023-05-10 10:11:02,862] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 10: [2023-05-10 10:11:02,863] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 10: [2023-05-10 10:11:02,864] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 10: [2023-05-10 10:11:02,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 10: [2023-05-10 10:11:02,864] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 23: [2023-05-10 10:11:02,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 10: [2023-05-10 10:11:02,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 10: [2023-05-10 10:11:02,864] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 10: [2023-05-10 10:11:02,864] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 10: [2023-05-10 10:11:02,864] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 10: [2023-05-10 10:11:02,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 10: [2023-05-10 10:11:02,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 10: [2023-05-10 10:11:02,865] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 10: [2023-05-10 10:11:02,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 6: [2023-05-10 10:11:02,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 6: [2023-05-10 10:11:02,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 17: [2023-05-10 10:11:02,867] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 17: [2023-05-10 10:11:02,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 17: [2023-05-10 10:11:02,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 17: [2023-05-10 10:11:02,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 17: [2023-05-10 10:11:02,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 20: [2023-05-10 10:11:02,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 17: [2023-05-10 10:11:02,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 17: [2023-05-10 10:11:02,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 17: [2023-05-10 10:11:02,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 25: [2023-05-10 10:11:02,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 23: [2023-05-10 10:11:02,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 25: [2023-05-10 10:11:02,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 21: [2023-05-10 10:11:02,872] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 17: [2023-05-10 10:11:02,872] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 6: [2023-05-10 10:11:02,872] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 25: [2023-05-10 10:11:02,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 25: [2023-05-10 10:11:02,873] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 23: [2023-05-10 10:11:02,875] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 0: [2023-05-10 10:11:02,875] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 27: [2023-05-10 10:11:02,875] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 25: [2023-05-10 10:11:02,876] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 23: [2023-05-10 10:11:02,876] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 27: [2023-05-10 10:11:02,877] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 0: [2023-05-10 10:11:02,877] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 0: [2023-05-10 10:11:02,877] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 0: [2023-05-10 10:11:02,877] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 27: [2023-05-10 10:11:02,877] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 27: [2023-05-10 10:11:02,877] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 23: [2023-05-10 10:11:02,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 23: [2023-05-10 10:11:02,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 23: [2023-05-10 10:11:02,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 23: [2023-05-10 10:11:02,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 23: [2023-05-10 10:11:02,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 25: [2023-05-10 10:11:02,878] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 23: [2023-05-10 10:11:02,878] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 12: [2023-05-10 10:11:02,878] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 25: [2023-05-10 10:11:02,878] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 23: [2023-05-10 10:11:02,879] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 17: [2023-05-10 10:11:02,879] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 17: [2023-05-10 10:11:02,880] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 24: [2023-05-10 10:11:02,881] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 17: [2023-05-10 10:11:02,881] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 24: [2023-05-10 10:11:02,881] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 17: [2023-05-10 10:11:02,881] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 17: [2023-05-10 10:11:02,882] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 13: [2023-05-10 10:11:02,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 13: [2023-05-10 10:11:02,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 13: [2023-05-10 10:11:02,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 17: [2023-05-10 10:11:02,883] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 17: [2023-05-10 10:11:02,884] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 12: [2023-05-10 10:11:02,884] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 20: [2023-05-10 10:11:02,884] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 21: [2023-05-10 10:11:02,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 6: [2023-05-10 10:11:02,887] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 6: [2023-05-10 10:11:02,888] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 6: [2023-05-10 10:11:02,888] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 0: [2023-05-10 10:11:02,888] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 0: [2023-05-10 10:11:02,888] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 0: [2023-05-10 10:11:02,888] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 0: [2023-05-10 10:11:02,888] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 20: [2023-05-10 10:11:02,891] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 25: [2023-05-10 10:11:02,892] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 23: [2023-05-10 10:11:02,893] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 9: [2023-05-10 10:11:02,893] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 23: [2023-05-10 10:11:02,893] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 23: [2023-05-10 10:11:02,894] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 20: [2023-05-10 10:11:02,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 23: [2023-05-10 10:11:02,894] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 21: [2023-05-10 10:11:02,894] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 23: [2023-05-10 10:11:02,894] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 21: [2023-05-10 10:11:02,894] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 23: [2023-05-10 10:11:02,895] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 23: [2023-05-10 10:11:02,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 23: [2023-05-10 10:11:02,895] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 21: [2023-05-10 10:11:02,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 25: [2023-05-10 10:11:02,895] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 13: [2023-05-10 10:11:02,896] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 13: [2023-05-10 10:11:02,896] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 13: [2023-05-10 10:11:02,896] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 13: [2023-05-10 10:11:02,896] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 13: [2023-05-10 10:11:02,896] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 21: [2023-05-10 10:11:02,896] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 21: [2023-05-10 10:11:02,897] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 25: [2023-05-10 10:11:02,897] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 20: [2023-05-10 10:11:02,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 0: [2023-05-10 10:11:02,900] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 24: [2023-05-10 10:11:02,900] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 6: [2023-05-10 10:11:02,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 6: [2023-05-10 10:11:02,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 6: [2023-05-10 10:11:02,902] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 0: [2023-05-10 10:11:02,902] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 24: [2023-05-10 10:11:02,903] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 6: [2023-05-10 10:11:02,903] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 6: [2023-05-10 10:11:02,903] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 24: [2023-05-10 10:11:02,904] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 0: [2023-05-10 10:11:02,904] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 20: [2023-05-10 10:11:02,905] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 12: [2023-05-10 10:11:02,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 3: [2023-05-10 10:11:02,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 3: [2023-05-10 10:11:02,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 3: [2023-05-10 10:11:02,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 3: [2023-05-10 10:11:02,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 3: [2023-05-10 10:11:02,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 3: [2023-05-10 10:11:02,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 3: [2023-05-10 10:11:02,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 3: [2023-05-10 10:11:02,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 24: [2023-05-10 10:11:02,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 8: [2023-05-10 10:11:02,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 8: [2023-05-10 10:11:02,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 9: [2023-05-10 10:11:02,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 12: [2023-05-10 10:11:02,907] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 0: [2023-05-10 10:11:02,907] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 23: [2023-05-10 10:11:02,907] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 23: [2023-05-10 10:11:02,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 23: [2023-05-10 10:11:02,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 23: [2023-05-10 10:11:02,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 23: [2023-05-10 10:11:02,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 23: [2023-05-10 10:11:02,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 23: [2023-05-10 10:11:02,908] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 23: [2023-05-10 10:11:02,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 23: [2023-05-10 10:11:02,909] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 23: [2023-05-10 10:11:02,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 23: [2023-05-10 10:11:02,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 23: [2023-05-10 10:11:02,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 12: [2023-05-10 10:11:02,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 12: [2023-05-10 10:11:02,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 12: [2023-05-10 10:11:02,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 12: [2023-05-10 10:11:02,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 12: [2023-05-10 10:11:02,909] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 3: [2023-05-10 10:11:02,909] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 3: [2023-05-10 10:11:02,910] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 3: [2023-05-10 10:11:02,910] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 9: [2023-05-10 10:11:02,911] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 23: [2023-05-10 10:11:02,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 23: [2023-05-10 10:11:02,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 23: [2023-05-10 10:11:02,911] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 20: [2023-05-10 10:11:02,911] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 23: [2023-05-10 10:11:02,911] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 23: [2023-05-10 10:11:02,912] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 23: [2023-05-10 10:11:02,912] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 13: [2023-05-10 10:11:02,912] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 20: [2023-05-10 10:11:02,912] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 23: [2023-05-10 10:11:02,912] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 20: [2023-05-10 10:11:02,912] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 23: [2023-05-10 10:11:02,912] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 24: [2023-05-10 10:11:02,912] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 24: [2023-05-10 10:11:02,912] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 24: [2023-05-10 10:11:02,912] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 24: [2023-05-10 10:11:02,912] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 8: [2023-05-10 10:11:02,912] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 9: [2023-05-10 10:11:02,912] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 9: [2023-05-10 10:11:02,912] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 9: [2023-05-10 10:11:02,913] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 9: [2023-05-10 10:11:02,913] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 9: [2023-05-10 10:11:02,913] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 13: [2023-05-10 10:11:02,914] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 13: [2023-05-10 10:11:02,914] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 3: [2023-05-10 10:11:02,914] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 3: [2023-05-10 10:11:02,915] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 3: [2023-05-10 10:11:02,915] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 30: [2023-05-10 10:11:02,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 30: [2023-05-10 10:11:02,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 30: [2023-05-10 10:11:02,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 30: [2023-05-10 10:11:02,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 30: [2023-05-10 10:11:02,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 30: [2023-05-10 10:11:02,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 30: [2023-05-10 10:11:02,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 30: [2023-05-10 10:11:02,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 17: [2023-05-10 10:11:02,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 20: [2023-05-10 10:11:02,915] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 3: [2023-05-10 10:11:02,916] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 3: [2023-05-10 10:11:02,916] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 0: [2023-05-10 10:11:02,917] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 0: [2023-05-10 10:11:02,917] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 6: [2023-05-10 10:11:02,918] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 9: [2023-05-10 10:11:02,919] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 8: [2023-05-10 10:11:02,919] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 8: [2023-05-10 10:11:02,919] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 8: [2023-05-10 10:11:02,919] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 8: [2023-05-10 10:11:02,919] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 10: [2023-05-10 10:11:02,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_84_mp_rank_00_optim_states.pt... 10: [2023-05-10 10:11:02,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_85_mp_rank_00_optim_states.pt... 10: [2023-05-10 10:11:02,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_86_mp_rank_00_optim_states.pt... 10: [2023-05-10 10:11:02,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_81_mp_rank_00_optim_states.pt... 10: [2023-05-10 10:11:02,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_87_mp_rank_00_optim_states.pt... 8: [2023-05-10 10:11:02,920] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 10: [2023-05-10 10:11:02,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_83_mp_rank_00_optim_states.pt... 10: [2023-05-10 10:11:02,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_80_mp_rank_00_optim_states.pt... 10: [2023-05-10 10:11:02,920] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_82_mp_rank_00_optim_states.pt... 20: [2023-05-10 10:11:02,921] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 0: [2023-05-10 10:11:02,922] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 0: [2023-05-10 10:11:02,922] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 9: [2023-05-10 10:11:02,924] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 9: [2023-05-10 10:11:02,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 11: [2023-05-10 10:11:02,924] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 11: [2023-05-10 10:11:02,924] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 24: [2023-05-10 10:11:02,924] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 9: [2023-05-10 10:11:02,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 11: [2023-05-10 10:11:02,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 11: [2023-05-10 10:11:02,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 11: [2023-05-10 10:11:02,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 11: [2023-05-10 10:11:02,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 11: [2023-05-10 10:11:02,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 11: [2023-05-10 10:11:02,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 22: [2023-05-10 10:11:02,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 22: [2023-05-10 10:11:02,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 22: [2023-05-10 10:11:02,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 22: [2023-05-10 10:11:02,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 22: [2023-05-10 10:11:02,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 22: [2023-05-10 10:11:02,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 22: [2023-05-10 10:11:02,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 19: [2023-05-10 10:11:02,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 19: [2023-05-10 10:11:02,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 22: [2023-05-10 10:11:02,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 30: [2023-05-10 10:11:02,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 30: [2023-05-10 10:11:02,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 30: [2023-05-10 10:11:02,927] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 30: [2023-05-10 10:11:02,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 30: [2023-05-10 10:11:02,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 30: [2023-05-10 10:11:02,928] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 20: [2023-05-10 10:11:02,929] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 30: [2023-05-10 10:11:02,929] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 11: [2023-05-10 10:11:02,929] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 11: [2023-05-10 10:11:02,929] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 30: [2023-05-10 10:11:02,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 19: [2023-05-10 10:11:02,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 19: [2023-05-10 10:11:02,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 19: [2023-05-10 10:11:02,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 19: [2023-05-10 10:11:02,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 19: [2023-05-10 10:11:02,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 6: [2023-05-10 10:11:02,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 13: [2023-05-10 10:11:02,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 13: [2023-05-10 10:11:02,930] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 13: [2023-05-10 10:11:02,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 22: [2023-05-10 10:11:02,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 22: [2023-05-10 10:11:02,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 8: [2023-05-10 10:11:02,932] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 24: [2023-05-10 10:11:02,932] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 13: [2023-05-10 10:11:02,932] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 6: [2023-05-10 10:11:02,932] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 6: [2023-05-10 10:11:02,933] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 9: [2023-05-10 10:11:02,933] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 13: [2023-05-10 10:11:02,933] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 6: [2023-05-10 10:11:02,934] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 9: [2023-05-10 10:11:02,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 9: [2023-05-10 10:11:02,934] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 9: [2023-05-10 10:11:02,934] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 17: [2023-05-10 10:11:02,935] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 22: [2023-05-10 10:11:02,935] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 11: [2023-05-10 10:11:02,935] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 8: [2023-05-10 10:11:02,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 11: [2023-05-10 10:11:02,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 22: [2023-05-10 10:11:02,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 11: [2023-05-10 10:11:02,936] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 11: [2023-05-10 10:11:02,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 22: [2023-05-10 10:11:02,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 22: [2023-05-10 10:11:02,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 22: [2023-05-10 10:11:02,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 11: [2023-05-10 10:11:02,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 17: [2023-05-10 10:11:02,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 11: [2023-05-10 10:11:02,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 17: [2023-05-10 10:11:02,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 17: [2023-05-10 10:11:02,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 8: [2023-05-10 10:11:02,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 22: [2023-05-10 10:11:02,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 20: [2023-05-10 10:11:02,940] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 9: [2023-05-10 10:11:02,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 9: [2023-05-10 10:11:02,941] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 29: [2023-05-10 10:11:02,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 24: [2023-05-10 10:11:02,942] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 29: [2023-05-10 10:11:02,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 9: [2023-05-10 10:11:02,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 29: [2023-05-10 10:11:02,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 29: [2023-05-10 10:11:02,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 9: [2023-05-10 10:11:02,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 9: [2023-05-10 10:11:02,942] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 9: [2023-05-10 10:11:02,942] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 9: [2023-05-10 10:11:02,942] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 9: [2023-05-10 10:11:02,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 9: [2023-05-10 10:11:02,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 20: [2023-05-10 10:11:02,942] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 29: [2023-05-10 10:11:02,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 29: [2023-05-10 10:11:02,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 29: [2023-05-10 10:11:02,942] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 29: [2023-05-10 10:11:02,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 9: [2023-05-10 10:11:02,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 12: [2023-05-10 10:11:02,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 9: [2023-05-10 10:11:02,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 9: [2023-05-10 10:11:02,943] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 9: [2023-05-10 10:11:02,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 9: [2023-05-10 10:11:02,944] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 9: [2023-05-10 10:11:02,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 20: [2023-05-10 10:11:02,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 9: [2023-05-10 10:11:02,944] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 12: [2023-05-10 10:11:02,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 24: [2023-05-10 10:11:02,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 12: [2023-05-10 10:11:02,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 24: [2023-05-10 10:11:02,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 24: [2023-05-10 10:11:02,944] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 12: [2023-05-10 10:11:02,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 9: [2023-05-10 10:11:02,946] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 8: [2023-05-10 10:11:02,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 8: [2023-05-10 10:11:02,946] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 8: [2023-05-10 10:11:02,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 8: [2023-05-10 10:11:02,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 29: [2023-05-10 10:11:02,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 9: [2023-05-10 10:11:02,947] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 29: [2023-05-10 10:11:02,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 9: [2023-05-10 10:11:02,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 20: [2023-05-10 10:11:02,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 9: [2023-05-10 10:11:02,947] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 29: [2023-05-10 10:11:02,947] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 9: [2023-05-10 10:11:02,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 8: [2023-05-10 10:11:02,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 9: [2023-05-10 10:11:02,948] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 9: [2023-05-10 10:11:02,948] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 8: [2023-05-10 10:11:02,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 9: [2023-05-10 10:11:02,949] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 8: [2023-05-10 10:11:02,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 8: [2023-05-10 10:11:02,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 12: [2023-05-10 10:11:02,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 12: [2023-05-10 10:11:02,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 29: [2023-05-10 10:11:02,949] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 17: [2023-05-10 10:11:02,950] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 8: [2023-05-10 10:11:02,950] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 29: [2023-05-10 10:11:02,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 29: [2023-05-10 10:11:02,951] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 29: [2023-05-10 10:11:02,952] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 29: [2023-05-10 10:11:02,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 8: [2023-05-10 10:11:02,953] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 8: [2023-05-10 10:11:02,953] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 8: [2023-05-10 10:11:02,953] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 8: [2023-05-10 10:11:02,953] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 8: [2023-05-10 10:11:02,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 8: [2023-05-10 10:11:02,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 8: [2023-05-10 10:11:02,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 8: [2023-05-10 10:11:02,953] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 8: [2023-05-10 10:11:02,953] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 8: [2023-05-10 10:11:02,953] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 8: [2023-05-10 10:11:02,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 8: [2023-05-10 10:11:02,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 8: [2023-05-10 10:11:02,954] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 8: [2023-05-10 10:11:02,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 8: [2023-05-10 10:11:02,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 8: [2023-05-10 10:11:02,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 8: [2023-05-10 10:11:02,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 8: [2023-05-10 10:11:02,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 8: [2023-05-10 10:11:02,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 8: [2023-05-10 10:11:02,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 3: [2023-05-10 10:11:02,955] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 17: [2023-05-10 10:11:02,956] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 17: [2023-05-10 10:11:02,956] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 23: [2023-05-10 10:11:02,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_190_mp_rank_00_optim_states.pt... 23: [2023-05-10 10:11:02,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_184_mp_rank_00_optim_states.pt... 23: [2023-05-10 10:11:02,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_185_mp_rank_00_optim_states.pt... 23: [2023-05-10 10:11:02,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_188_mp_rank_00_optim_states.pt... 23: [2023-05-10 10:11:02,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_187_mp_rank_00_optim_states.pt... 23: [2023-05-10 10:11:02,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_191_mp_rank_00_optim_states.pt... 23: [2023-05-10 10:11:02,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_186_mp_rank_00_optim_states.pt... 23: [2023-05-10 10:11:02,965] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_189_mp_rank_00_optim_states.pt... 14: [2023-05-10 10:11:02,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 14: [2023-05-10 10:11:02,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 14: [2023-05-10 10:11:02,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 14: [2023-05-10 10:11:02,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 14: [2023-05-10 10:11:02,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 14: [2023-05-10 10:11:02,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 14: [2023-05-10 10:11:02,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 14: [2023-05-10 10:11:02,967] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 3: [2023-05-10 10:11:02,967] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 26: [2023-05-10 10:11:02,968] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 26: [2023-05-10 10:11:02,968] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 26: [2023-05-10 10:11:02,969] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 26: [2023-05-10 10:11:02,969] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 26: [2023-05-10 10:11:02,969] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 26: [2023-05-10 10:11:02,969] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 26: [2023-05-10 10:11:02,969] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 26: [2023-05-10 10:11:02,969] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 19: [2023-05-10 10:11:02,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 19: [2023-05-10 10:11:02,931] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 19: [2023-05-10 10:11:02,932] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 19: [2023-05-10 10:11:02,933] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 19: [2023-05-10 10:11:02,933] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 19: [2023-05-10 10:11:02,935] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 19: [2023-05-10 10:11:02,937] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 19: [2023-05-10 10:11:02,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 19: [2023-05-10 10:11:02,938] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 14: [2023-05-10 10:11:02,971] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 26: [2023-05-10 10:11:02,972] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 3: [2023-05-10 10:11:02,972] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 26: [2023-05-10 10:11:02,973] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 17: [2023-05-10 10:11:02,973] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 17: [2023-05-10 10:11:02,973] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 17: [2023-05-10 10:11:02,973] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 14: [2023-05-10 10:11:02,973] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 17: [2023-05-10 10:11:02,973] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 14: [2023-05-10 10:11:02,974] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 14: [2023-05-10 10:11:02,975] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 3: [2023-05-10 10:11:02,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 14: [2023-05-10 10:11:02,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 14: [2023-05-10 10:11:02,976] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 3: [2023-05-10 10:11:02,977] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 26: [2023-05-10 10:11:02,977] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 14: [2023-05-10 10:11:02,977] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 26: [2023-05-10 10:11:02,978] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 14: [2023-05-10 10:11:02,978] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 26: [2023-05-10 10:11:02,979] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 26: [2023-05-10 10:11:02,979] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 26: [2023-05-10 10:11:02,980] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 11: [2023-05-10 10:11:02,981] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 11: [2023-05-10 10:11:02,981] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 26: [2023-05-10 10:11:02,981] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 3: [2023-05-10 10:11:02,982] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 3: [2023-05-10 10:11:02,982] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 3: [2023-05-10 10:11:02,982] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 3: [2023-05-10 10:11:02,982] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 17: [2023-05-10 10:11:02,982] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 17: [2023-05-10 10:11:02,983] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 17: [2023-05-10 10:11:02,983] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 17: [2023-05-10 10:11:02,984] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 7: [2023-05-10 10:11:02,985] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 7: [2023-05-10 10:11:02,985] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 7: [2023-05-10 10:11:02,985] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 7: [2023-05-10 10:11:02,985] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 7: [2023-05-10 10:11:02,985] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 7: [2023-05-10 10:11:02,985] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 7: [2023-05-10 10:11:02,985] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 5: [2023-05-10 10:11:02,985] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 7: [2023-05-10 10:11:02,985] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 5: [2023-05-10 10:11:02,985] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 5: [2023-05-10 10:11:02,985] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 17: [2023-05-10 10:11:02,986] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 17: [2023-05-10 10:11:02,986] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 5: [2023-05-10 10:11:02,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 5: [2023-05-10 10:11:02,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 5: [2023-05-10 10:11:02,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 5: [2023-05-10 10:11:02,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 5: [2023-05-10 10:11:02,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 17: [2023-05-10 10:11:02,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 17: [2023-05-10 10:11:02,987] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 17: [2023-05-10 10:11:02,987] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 17: [2023-05-10 10:11:02,987] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 17: [2023-05-10 10:11:02,987] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 17: [2023-05-10 10:11:02,987] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 22: [2023-05-10 10:11:02,988] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 22: [2023-05-10 10:11:02,988] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 7: [2023-05-10 10:11:02,989] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 7: [2023-05-10 10:11:02,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 7: [2023-05-10 10:11:02,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 7: [2023-05-10 10:11:02,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 7: [2023-05-10 10:11:02,991] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 4: [2023-05-10 10:11:02,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 4: [2023-05-10 10:11:02,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 4: [2023-05-10 10:11:02,991] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 4: [2023-05-10 10:11:02,992] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 4: [2023-05-10 10:11:02,992] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 4: [2023-05-10 10:11:02,992] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 4: [2023-05-10 10:11:02,992] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 4: [2023-05-10 10:11:02,992] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 5: [2023-05-10 10:11:02,992] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 5: [2023-05-10 10:11:02,992] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 5: [2023-05-10 10:11:02,992] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 7: [2023-05-10 10:11:02,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 7: [2023-05-10 10:11:02,993] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 7: [2023-05-10 10:11:02,994] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 5: [2023-05-10 10:11:02,995] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 5: [2023-05-10 10:11:02,995] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 5: [2023-05-10 10:11:02,996] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 5: [2023-05-10 10:11:02,996] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 5: [2023-05-10 10:11:02,997] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 28: [2023-05-10 10:11:02,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 28: [2023-05-10 10:11:02,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 4: [2023-05-10 10:11:02,997] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 4: [2023-05-10 10:11:02,997] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 4: [2023-05-10 10:11:02,998] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 4: [2023-05-10 10:11:02,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 4: [2023-05-10 10:11:02,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 4: [2023-05-10 10:11:02,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 4: [2023-05-10 10:11:02,999] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 28: [2023-05-10 10:11:03,000] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 28: [2023-05-10 10:11:03,000] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 28: [2023-05-10 10:11:03,000] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 28: [2023-05-10 10:11:03,000] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 28: [2023-05-10 10:11:03,000] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 28: [2023-05-10 10:11:03,000] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 4: [2023-05-10 10:11:03,000] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 28: [2023-05-10 10:11:03,001] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 29: [2023-05-10 10:11:03,001] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 29: [2023-05-10 10:11:03,001] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 29: [2023-05-10 10:11:03,001] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 28: [2023-05-10 10:11:03,002] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 22: [2023-05-10 10:11:03,002] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 11: [2023-05-10 10:11:03,002] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 30: [2023-05-10 10:11:03,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 30: [2023-05-10 10:11:03,003] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 11: [2023-05-10 10:11:03,004] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 17: [2023-05-10 10:11:03,005] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 30: [2023-05-10 10:11:03,005] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 17: [2023-05-10 10:11:03,006] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 17: [2023-05-10 10:11:03,006] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 17: [2023-05-10 10:11:03,006] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 17: [2023-05-10 10:11:03,007] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 28: [2023-05-10 10:11:03,007] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 17: [2023-05-10 10:11:03,008] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 17: [2023-05-10 10:11:03,008] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 17: [2023-05-10 10:11:03,008] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 11: [2023-05-10 10:11:03,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 11: [2023-05-10 10:11:03,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 11: [2023-05-10 10:11:03,007] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 11: [2023-05-10 10:11:03,008] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 11: [2023-05-10 10:11:03,008] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 11: [2023-05-10 10:11:03,008] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 17: [2023-05-10 10:11:03,008] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 17: [2023-05-10 10:11:03,008] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 17: [2023-05-10 10:11:03,008] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 17: [2023-05-10 10:11:03,008] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 17: [2023-05-10 10:11:03,008] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 28: [2023-05-10 10:11:03,008] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 17: [2023-05-10 10:11:03,008] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 17: [2023-05-10 10:11:03,008] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 17: [2023-05-10 10:11:03,009] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 22: [2023-05-10 10:11:03,009] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 22: [2023-05-10 10:11:03,009] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 28: [2023-05-10 10:11:03,010] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 28: [2023-05-10 10:11:03,011] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 28: [2023-05-10 10:11:03,011] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 28: [2023-05-10 10:11:03,011] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 22: [2023-05-10 10:11:03,011] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 22: [2023-05-10 10:11:03,011] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 22: [2023-05-10 10:11:03,011] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 22: [2023-05-10 10:11:03,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 19: [2023-05-10 10:11:02,984] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 19: [2023-05-10 10:11:02,988] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 19: [2023-05-10 10:11:02,988] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 19: [2023-05-10 10:11:02,988] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 19: [2023-05-10 10:11:02,988] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 19: [2023-05-10 10:11:03,000] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 19: [2023-05-10 10:11:03,000] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 19: [2023-05-10 10:11:03,004] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 19: [2023-05-10 10:11:03,011] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 9: [2023-05-10 10:11:03,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_77_mp_rank_00_optim_states.pt... 9: [2023-05-10 10:11:03,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_73_mp_rank_00_optim_states.pt... 9: [2023-05-10 10:11:03,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_76_mp_rank_00_optim_states.pt... 9: [2023-05-10 10:11:03,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_74_mp_rank_00_optim_states.pt... 9: [2023-05-10 10:11:03,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_75_mp_rank_00_optim_states.pt... 9: [2023-05-10 10:11:03,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_72_mp_rank_00_optim_states.pt... 9: [2023-05-10 10:11:03,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_78_mp_rank_00_optim_states.pt... 9: [2023-05-10 10:11:03,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_79_mp_rank_00_optim_states.pt... 29: [2023-05-10 10:11:03,014] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 19: [2023-05-10 10:11:03,015] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 16: [2023-05-10 10:11:03,015] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 16: [2023-05-10 10:11:03,016] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 22: [2023-05-10 10:11:03,016] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 16: [2023-05-10 10:11:03,016] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 16: [2023-05-10 10:11:03,016] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 16: [2023-05-10 10:11:03,016] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 16: [2023-05-10 10:11:03,016] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 16: [2023-05-10 10:11:03,016] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 19: [2023-05-10 10:11:03,016] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 16: [2023-05-10 10:11:03,017] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 19: [2023-05-10 10:11:03,017] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 19: [2023-05-10 10:11:03,017] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 26: [2023-05-10 10:11:03,017] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 29: [2023-05-10 10:11:03,018] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 29: [2023-05-10 10:11:03,018] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 29: [2023-05-10 10:11:03,018] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 29: [2023-05-10 10:11:03,018] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 14: [2023-05-10 10:11:03,018] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 2: [2023-05-10 10:11:03,018] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 2: [2023-05-10 10:11:03,018] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 16: [2023-05-10 10:11:03,019] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 17: [2023-05-10 10:11:03,019] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_138_mp_rank_00_optim_states.pt... 17: [2023-05-10 10:11:03,019] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_141_mp_rank_00_optim_states.pt... 17: [2023-05-10 10:11:03,019] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_137_mp_rank_00_optim_states.pt... 17: [2023-05-10 10:11:03,019] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_140_mp_rank_00_optim_states.pt... 17: [2023-05-10 10:11:03,019] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_142_mp_rank_00_optim_states.pt... 17: [2023-05-10 10:11:03,019] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_139_mp_rank_00_optim_states.pt... 17: [2023-05-10 10:11:03,020] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_143_mp_rank_00_optim_states.pt... 30: [2023-05-10 10:11:03,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 30: [2023-05-10 10:11:03,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 30: [2023-05-10 10:11:03,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 30: [2023-05-10 10:11:03,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 30: [2023-05-10 10:11:03,020] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 2: [2023-05-10 10:11:03,022] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 19: [2023-05-10 10:11:03,022] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 19: [2023-05-10 10:11:03,022] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 19: [2023-05-10 10:11:03,022] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 19: [2023-05-10 10:11:03,022] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 19: [2023-05-10 10:11:03,022] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 19: [2023-05-10 10:11:03,022] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 19: [2023-05-10 10:11:03,022] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 19: [2023-05-10 10:11:03,022] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 16: [2023-05-10 10:11:03,022] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 19: [2023-05-10 10:11:03,022] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 19: [2023-05-10 10:11:03,023] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 17: [2023-05-10 10:11:03,023] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_136_mp_rank_00_optim_states.pt... 19: [2023-05-10 10:11:03,023] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 19: [2023-05-10 10:11:03,023] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 19: [2023-05-10 10:11:03,023] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 19: [2023-05-10 10:11:03,023] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 19: [2023-05-10 10:11:03,023] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 2: [2023-05-10 10:11:03,023] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 29: [2023-05-10 10:11:03,023] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 16: [2023-05-10 10:11:03,024] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 16: [2023-05-10 10:11:03,024] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 2: [2023-05-10 10:11:03,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 2: [2023-05-10 10:11:03,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 2: [2023-05-10 10:11:03,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 2: [2023-05-10 10:11:03,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 2: [2023-05-10 10:11:03,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 2: [2023-05-10 10:11:03,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 29: [2023-05-10 10:11:03,025] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 16: [2023-05-10 10:11:03,025] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 16: [2023-05-10 10:11:03,026] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 3: [2023-05-10 10:11:02,990] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 3: [2023-05-10 10:11:02,996] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 3: [2023-05-10 10:11:03,005] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 3: [2023-05-10 10:11:03,011] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 3: [2023-05-10 10:11:03,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 3: [2023-05-10 10:11:03,014] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 3: [2023-05-10 10:11:03,016] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 16: [2023-05-10 10:11:03,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 19: [2023-05-10 10:11:03,027] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 16: [2023-05-10 10:11:03,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt... 19: [2023-05-10 10:11:03,028] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 19: [2023-05-10 10:11:03,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 29: [2023-05-10 10:11:03,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 8: [2023-05-10 10:11:03,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_67_mp_rank_00_optim_states.pt... 8: [2023-05-10 10:11:03,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_70_mp_rank_00_optim_states.pt... 8: [2023-05-10 10:11:03,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_69_mp_rank_00_optim_states.pt... 8: [2023-05-10 10:11:03,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_64_mp_rank_00_optim_states.pt... 8: [2023-05-10 10:11:03,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_71_mp_rank_00_optim_states.pt... 8: [2023-05-10 10:11:03,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_68_mp_rank_00_optim_states.pt... 8: [2023-05-10 10:11:03,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_65_mp_rank_00_optim_states.pt... 8: [2023-05-10 10:11:03,028] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_66_mp_rank_00_optim_states.pt... 19: [2023-05-10 10:11:03,029] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 26: [2023-05-10 10:11:03,028] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 29: [2023-05-10 10:11:03,029] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 29: [2023-05-10 10:11:03,029] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 29: [2023-05-10 10:11:03,029] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 29: [2023-05-10 10:11:03,029] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 29: [2023-05-10 10:11:03,029] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 29: [2023-05-10 10:11:03,029] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 29: [2023-05-10 10:11:03,029] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 29: [2023-05-10 10:11:03,029] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 29: [2023-05-10 10:11:03,030] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 22: [2023-05-10 10:11:03,030] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 14: [2023-05-10 10:11:03,031] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 19: [2023-05-10 10:11:03,031] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 19: [2023-05-10 10:11:03,031] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 19: [2023-05-10 10:11:03,032] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 19: [2023-05-10 10:11:03,032] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 19: [2023-05-10 10:11:03,032] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 19: [2023-05-10 10:11:03,032] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 19: [2023-05-10 10:11:03,032] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 19: [2023-05-10 10:11:03,032] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 30: [2023-05-10 10:11:03,032] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 30: [2023-05-10 10:11:03,032] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 2: [2023-05-10 10:11:03,032] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 14: [2023-05-10 10:11:03,033] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 2: [2023-05-10 10:11:03,034] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 11: [2023-05-10 10:11:03,035] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 2: [2023-05-10 10:11:03,035] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 2: [2023-05-10 10:11:03,036] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 2: [2023-05-10 10:11:03,037] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 1: [2023-05-10 10:11:03,037] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 1: [2023-05-10 10:11:03,037] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 2: [2023-05-10 10:11:03,037] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 1: [2023-05-10 10:11:03,038] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 1: [2023-05-10 10:11:03,038] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 1: [2023-05-10 10:11:03,038] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 1: [2023-05-10 10:11:03,038] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 1: [2023-05-10 10:11:03,038] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 1: [2023-05-10 10:11:03,038] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 30: [2023-05-10 10:11:03,040] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 22: [2023-05-10 10:11:03,041] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 26: [2023-05-10 10:11:03,041] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 26: [2023-05-10 10:11:03,042] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 11: [2023-05-10 10:11:03,042] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 26: [2023-05-10 10:11:03,042] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 14: [2023-05-10 10:11:03,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 14: [2023-05-10 10:11:03,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 14: [2023-05-10 10:11:03,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 7: [2023-05-10 10:11:03,042] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 29: [2023-05-10 10:11:03,043] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 14: [2023-05-10 10:11:03,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 22: [2023-05-10 10:11:03,043] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 14: [2023-05-10 10:11:03,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 1: [2023-05-10 10:11:03,044] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 1: [2023-05-10 10:11:03,045] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 11: [2023-05-10 10:11:03,045] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 11: [2023-05-10 10:11:03,045] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 29: [2023-05-10 10:11:03,045] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 29: [2023-05-10 10:11:03,045] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 14: [2023-05-10 10:11:03,045] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 1: [2023-05-10 10:11:03,045] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 29: [2023-05-10 10:11:03,045] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 1: [2023-05-10 10:11:03,045] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 1: [2023-05-10 10:11:03,046] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 1: [2023-05-10 10:11:03,046] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 11: [2023-05-10 10:11:03,046] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 5: [2023-05-10 10:11:03,046] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 29: [2023-05-10 10:11:03,047] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 11: [2023-05-10 10:11:03,047] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 22: [2023-05-10 10:11:03,047] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 1: [2023-05-10 10:11:03,047] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 5: [2023-05-10 10:11:03,047] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 5: [2023-05-10 10:11:03,047] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 1: [2023-05-10 10:11:03,047] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 29: [2023-05-10 10:11:03,047] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 29: [2023-05-10 10:11:03,047] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 29: [2023-05-10 10:11:03,048] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 28: [2023-05-10 10:11:03,048] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 22: [2023-05-10 10:11:03,048] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 7: [2023-05-10 10:11:03,048] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 22: [2023-05-10 10:11:03,048] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 29: [2023-05-10 10:11:03,049] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 29: [2023-05-10 10:11:03,049] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 29: [2023-05-10 10:11:03,049] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 29: [2023-05-10 10:11:03,050] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 29: [2023-05-10 10:11:03,050] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 29: [2023-05-10 10:11:03,050] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 29: [2023-05-10 10:11:03,050] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 29: [2023-05-10 10:11:03,050] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 29: [2023-05-10 10:11:03,050] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 29: [2023-05-10 10:11:03,051] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 29: [2023-05-10 10:11:03,051] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 29: [2023-05-10 10:11:03,051] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 30: [2023-05-10 10:11:03,052] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 7: [2023-05-10 10:11:03,052] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 7: [2023-05-10 10:11:03,052] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 7: [2023-05-10 10:11:03,052] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 26: [2023-05-10 10:11:03,053] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 30: [2023-05-10 10:11:03,054] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 7: [2023-05-10 10:11:03,055] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 7: [2023-05-10 10:11:03,055] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 7: [2023-05-10 10:11:03,055] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 28: [2023-05-10 10:11:03,056] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 30: [2023-05-10 10:11:03,057] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 4: [2023-05-10 10:11:03,057] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 4: [2023-05-10 10:11:03,057] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 4: [2023-05-10 10:11:03,057] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 4: [2023-05-10 10:11:03,057] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 26: [2023-05-10 10:11:03,058] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 26: [2023-05-10 10:11:03,058] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 30: [2023-05-10 10:11:03,058] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 26: [2023-05-10 10:11:03,058] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 26: [2023-05-10 10:11:03,058] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 30: [2023-05-10 10:11:03,059] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 5: [2023-05-10 10:11:03,059] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 5: [2023-05-10 10:11:03,059] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 5: [2023-05-10 10:11:03,061] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 5: [2023-05-10 10:11:03,061] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 5: [2023-05-10 10:11:03,061] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 14: [2023-05-10 10:11:03,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 4: [2023-05-10 10:11:03,061] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 4: [2023-05-10 10:11:03,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 4: [2023-05-10 10:11:03,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 4: [2023-05-10 10:11:03,062] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 14: [2023-05-10 10:11:03,062] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 31: [2023-05-10 10:11:03,063] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 31: [2023-05-10 10:11:03,063] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 31: [2023-05-10 10:11:03,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 31: [2023-05-10 10:11:03,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 31: [2023-05-10 10:11:03,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 31: [2023-05-10 10:11:03,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 31: [2023-05-10 10:11:03,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 16: [2023-05-10 10:11:03,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 31: [2023-05-10 10:11:03,065] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 28: [2023-05-10 10:11:03,066] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 26: [2023-05-10 10:11:03,067] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 2: [2023-05-10 10:11:03,066] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 31: [2023-05-10 10:11:03,068] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 31: [2023-05-10 10:11:03,069] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 5: [2023-05-10 10:11:03,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 31: [2023-05-10 10:11:03,070] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 28: [2023-05-10 10:11:03,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 31: [2023-05-10 10:11:03,071] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 31: [2023-05-10 10:11:03,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 31: [2023-05-10 10:11:03,072] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 31: [2023-05-10 10:11:03,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 26: [2023-05-10 10:11:03,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 31: [2023-05-10 10:11:03,073] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 5: [2023-05-10 10:11:03,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 14: [2023-05-10 10:11:03,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 14: [2023-05-10 10:11:03,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 7: [2023-05-10 10:11:03,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 5: [2023-05-10 10:11:03,074] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 14: [2023-05-10 10:11:03,075] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 14: [2023-05-10 10:11:03,077] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 14: [2023-05-10 10:11:03,078] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 28: [2023-05-10 10:11:03,079] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 2: [2023-05-10 10:11:03,079] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 16: [2023-05-10 10:11:03,080] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 7: [2023-05-10 10:11:03,081] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 28: [2023-05-10 10:11:03,081] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 7: [2023-05-10 10:11:03,083] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 7: [2023-05-10 10:11:03,083] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 7: [2023-05-10 10:11:03,083] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 7: [2023-05-10 10:11:03,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 7: [2023-05-10 10:11:03,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 7: [2023-05-10 10:11:03,084] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 28: [2023-05-10 10:11:03,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 7: [2023-05-10 10:11:03,084] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 7: [2023-05-10 10:11:03,085] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 7: [2023-05-10 10:11:03,085] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 7: [2023-05-10 10:11:03,085] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 7: [2023-05-10 10:11:03,086] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 16: [2023-05-10 10:11:03,086] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 7: [2023-05-10 10:11:03,086] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 7: [2023-05-10 10:11:03,086] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 7: [2023-05-10 10:11:03,086] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 2: [2023-05-10 10:11:03,087] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 4: [2023-05-10 10:11:03,087] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 7: [2023-05-10 10:11:03,088] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 7: [2023-05-10 10:11:03,089] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 5: [2023-05-10 10:11:03,089] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 5: [2023-05-10 10:11:03,089] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 7: [2023-05-10 10:11:03,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 7: [2023-05-10 10:11:03,089] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 26: [2023-05-10 10:11:03,089] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 7: [2023-05-10 10:11:03,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 7: [2023-05-10 10:11:03,089] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 7: [2023-05-10 10:11:03,089] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 7: [2023-05-10 10:11:03,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 4: [2023-05-10 10:11:03,090] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 2: [2023-05-10 10:11:03,090] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 2: [2023-05-10 10:11:03,090] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 4: [2023-05-10 10:11:03,090] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 4: [2023-05-10 10:11:03,090] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 28: [2023-05-10 10:11:03,091] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 7: [2023-05-10 10:11:03,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 7: [2023-05-10 10:11:03,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 26: [2023-05-10 10:11:03,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 2: [2023-05-10 10:11:03,091] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 7: [2023-05-10 10:11:03,091] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 7: [2023-05-10 10:11:03,091] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 28: [2023-05-10 10:11:03,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 7: [2023-05-10 10:11:03,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 7: [2023-05-10 10:11:03,091] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 28: [2023-05-10 10:11:03,091] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 28: [2023-05-10 10:11:03,091] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 7: [2023-05-10 10:11:03,091] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 7: [2023-05-10 10:11:03,092] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 16: [2023-05-10 10:11:03,092] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 5: [2023-05-10 10:11:03,092] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 26: [2023-05-10 10:11:03,092] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 26: [2023-05-10 10:11:03,093] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 5: [2023-05-10 10:11:03,094] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 5: [2023-05-10 10:11:03,094] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 18: [2023-05-10 10:11:03,095] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 18: [2023-05-10 10:11:03,095] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 18: [2023-05-10 10:11:03,095] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 18: [2023-05-10 10:11:03,095] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 18: [2023-05-10 10:11:03,095] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 18: [2023-05-10 10:11:03,095] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 18: [2023-05-10 10:11:03,095] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 18: [2023-05-10 10:11:03,096] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 16: [2023-05-10 10:11:03,097] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 2: [2023-05-10 10:11:03,097] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 1: [2023-05-10 10:11:03,097] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 1: [2023-05-10 10:11:03,097] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 4: [2023-05-10 10:11:03,099] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 18: [2023-05-10 10:11:03,100] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 18: [2023-05-10 10:11:03,100] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 4: [2023-05-10 10:11:03,101] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 4: [2023-05-10 10:11:03,101] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 18: [2023-05-10 10:11:03,101] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 2: [2023-05-10 10:11:03,101] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 18: [2023-05-10 10:11:03,101] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 2: [2023-05-10 10:11:03,102] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 2: [2023-05-10 10:11:03,102] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 18: [2023-05-10 10:11:03,102] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 18: [2023-05-10 10:11:03,102] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 2: [2023-05-10 10:11:03,102] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 4: [2023-05-10 10:11:03,102] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 2: [2023-05-10 10:11:03,102] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 1: [2023-05-10 10:11:03,104] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 18: [2023-05-10 10:11:03,104] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 16: [2023-05-10 10:11:03,105] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 16: [2023-05-10 10:11:03,105] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 16: [2023-05-10 10:11:03,105] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 16: [2023-05-10 10:11:03,105] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_35-model_00-model_states.pt. 16: [2023-05-10 10:11:03,105] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 18: [2023-05-10 10:11:03,105] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 19: [2023-05-10 10:11:03,106] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_156_mp_rank_00_optim_states.pt... 19: [2023-05-10 10:11:03,106] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_155_mp_rank_00_optim_states.pt... 19: [2023-05-10 10:11:03,106] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_159_mp_rank_00_optim_states.pt... 19: [2023-05-10 10:11:03,106] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_157_mp_rank_00_optim_states.pt... 19: [2023-05-10 10:11:03,106] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_154_mp_rank_00_optim_states.pt... 19: [2023-05-10 10:11:03,106] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_152_mp_rank_00_optim_states.pt... 19: [2023-05-10 10:11:03,106] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_153_mp_rank_00_optim_states.pt... 19: [2023-05-10 10:11:03,106] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_158_mp_rank_00_optim_states.pt... 28: [2023-05-10 10:11:03,109] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 28: [2023-05-10 10:11:03,112] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 1: [2023-05-10 10:11:03,112] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 1: [2023-05-10 10:11:03,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 1: [2023-05-10 10:11:03,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 1: [2023-05-10 10:11:03,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 1: [2023-05-10 10:11:03,113] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 2: [2023-05-10 10:11:03,116] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 2: [2023-05-10 10:11:03,116] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 29: [2023-05-10 10:11:03,118] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_232_mp_rank_00_optim_states.pt... 29: [2023-05-10 10:11:03,118] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_237_mp_rank_00_optim_states.pt... 29: [2023-05-10 10:11:03,118] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_239_mp_rank_00_optim_states.pt... 29: [2023-05-10 10:11:03,118] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_233_mp_rank_00_optim_states.pt... 29: [2023-05-10 10:11:03,118] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_236_mp_rank_00_optim_states.pt... 29: [2023-05-10 10:11:03,118] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_238_mp_rank_00_optim_states.pt... 29: [2023-05-10 10:11:03,118] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_235_mp_rank_00_optim_states.pt... 29: [2023-05-10 10:11:03,118] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_234_mp_rank_00_optim_states.pt... 2: [2023-05-10 10:11:03,119] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 2: [2023-05-10 10:11:03,119] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 16: [2023-05-10 10:11:03,119] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 28: [2023-05-10 10:11:03,119] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 1: [2023-05-10 10:11:03,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 31: [2023-05-10 10:11:03,120] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 1: [2023-05-10 10:11:03,121] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 31: [2023-05-10 10:11:03,121] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 28: [2023-05-10 10:11:03,122] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 28: [2023-05-10 10:11:03,125] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 2: [2023-05-10 10:11:03,125] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 2: [2023-05-10 10:11:03,126] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 2: [2023-05-10 10:11:03,126] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 2: [2023-05-10 10:11:03,127] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 16: [2023-05-10 10:11:03,127] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 2: [2023-05-10 10:11:03,132] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 1: [2023-05-10 10:11:03,132] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 2: [2023-05-10 10:11:03,133] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 2: [2023-05-10 10:11:03,133] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 2: [2023-05-10 10:11:03,133] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 16: [2023-05-10 10:11:03,135] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 16: [2023-05-10 10:11:03,137] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 16: [2023-05-10 10:11:03,138] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 16: [2023-05-10 10:11:03,138] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 31: [2023-05-10 10:11:03,138] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 31: [2023-05-10 10:11:03,140] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 31: [2023-05-10 10:11:03,141] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 31: [2023-05-10 10:11:03,142] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 21: [2023-05-10 10:11:03,143] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 21: [2023-05-10 10:11:03,143] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 21: [2023-05-10 10:11:03,143] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 21: [2023-05-10 10:11:03,143] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 21: [2023-05-10 10:11:03,143] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 21: [2023-05-10 10:11:03,144] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 21: [2023-05-10 10:11:03,144] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 21: [2023-05-10 10:11:03,144] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 1: [2023-05-10 10:11:03,144] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 1: [2023-05-10 10:11:03,145] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 31: [2023-05-10 10:11:03,146] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 31: [2023-05-10 10:11:03,146] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 31: [2023-05-10 10:11:03,146] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 1: [2023-05-10 10:11:03,147] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 1: [2023-05-10 10:11:03,148] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 2: [2023-05-10 10:11:03,149] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 1: [2023-05-10 10:11:03,149] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 2: [2023-05-10 10:11:03,149] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 2: [2023-05-10 10:11:03,149] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 2: [2023-05-10 10:11:03,150] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 2: [2023-05-10 10:11:03,150] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 2: [2023-05-10 10:11:03,150] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 2: [2023-05-10 10:11:03,150] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 21: [2023-05-10 10:11:03,150] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 2: [2023-05-10 10:11:03,150] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 21: [2023-05-10 10:11:03,151] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 21: [2023-05-10 10:11:03,151] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 21: [2023-05-10 10:11:03,151] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 21: [2023-05-10 10:11:03,151] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 2: [2023-05-10 10:11:03,152] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 21: [2023-05-10 10:11:03,152] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 21: [2023-05-10 10:11:03,152] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 2: [2023-05-10 10:11:03,152] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 2: [2023-05-10 10:11:03,152] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 2: [2023-05-10 10:11:03,152] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 2: [2023-05-10 10:11:03,153] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 21: [2023-05-10 10:11:03,153] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 18: [2023-05-10 10:11:03,152] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 2: [2023-05-10 10:11:03,153] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 2: [2023-05-10 10:11:03,153] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 2: [2023-05-10 10:11:03,154] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 18: [2023-05-10 10:11:03,155] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 18: [2023-05-10 10:11:03,155] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 18: [2023-05-10 10:11:03,156] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 1: [2023-05-10 10:11:03,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 1: [2023-05-10 10:11:03,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 1: [2023-05-10 10:11:03,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 1: [2023-05-10 10:11:03,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 1: [2023-05-10 10:11:03,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 1: [2023-05-10 10:11:03,157] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 1: [2023-05-10 10:11:03,157] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 1: [2023-05-10 10:11:03,157] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 1: [2023-05-10 10:11:03,157] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 1: [2023-05-10 10:11:03,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 1: [2023-05-10 10:11:03,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 1: [2023-05-10 10:11:03,157] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 1: [2023-05-10 10:11:03,157] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 1: [2023-05-10 10:11:03,157] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 1: [2023-05-10 10:11:03,157] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 1: [2023-05-10 10:11:03,158] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 1: [2023-05-10 10:11:03,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 1: [2023-05-10 10:11:03,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 1: [2023-05-10 10:11:03,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 1: [2023-05-10 10:11:03,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 1: [2023-05-10 10:11:03,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 1: [2023-05-10 10:11:03,158] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 1: [2023-05-10 10:11:03,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 1: [2023-05-10 10:11:03,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 31: [2023-05-10 10:11:03,159] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 12: [2023-05-10 10:11:03,160] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 12: [2023-05-10 10:11:03,160] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 12: [2023-05-10 10:11:03,160] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 12: [2023-05-10 10:11:03,160] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 12: [2023-05-10 10:11:03,160] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 12: [2023-05-10 10:11:03,160] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 12: [2023-05-10 10:11:03,160] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 12: [2023-05-10 10:11:03,161] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 31: [2023-05-10 10:11:03,162] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 31: [2023-05-10 10:11:03,162] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 31: [2023-05-10 10:11:03,164] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 31: [2023-05-10 10:11:03,164] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 31: [2023-05-10 10:11:03,165] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 12: [2023-05-10 10:11:03,165] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 31: [2023-05-10 10:11:03,166] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 31: [2023-05-10 10:11:03,167] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 31: [2023-05-10 10:11:03,167] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 31: [2023-05-10 10:11:03,167] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 31: [2023-05-10 10:11:03,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 15: [2023-05-10 10:11:03,168] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_120_mp_rank_00_optim_states.pt. 15: [2023-05-10 10:11:03,169] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 120 18: [2023-05-10 10:11:03,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 18: [2023-05-10 10:11:03,169] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 31: [2023-05-10 10:11:03,170] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 12: [2023-05-10 10:11:03,170] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 12: [2023-05-10 10:11:03,170] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 31: [2023-05-10 10:11:03,171] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 31: [2023-05-10 10:11:03,171] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 12: [2023-05-10 10:11:03,171] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 31: [2023-05-10 10:11:03,171] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 12: [2023-05-10 10:11:03,172] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 18: [2023-05-10 10:11:03,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 12: [2023-05-10 10:11:03,173] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 12: [2023-05-10 10:11:03,173] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 12: [2023-05-10 10:11:03,174] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 18: [2023-05-10 10:11:03,174] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 15: [2023-05-10 10:11:03,174] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 120 7: [2023-05-10 10:11:03,175] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_57_mp_rank_00_optim_states.pt... 7: [2023-05-10 10:11:03,175] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_60_mp_rank_00_optim_states.pt... 7: [2023-05-10 10:11:03,175] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_59_mp_rank_00_optim_states.pt... 7: [2023-05-10 10:11:03,175] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_63_mp_rank_00_optim_states.pt... 7: [2023-05-10 10:11:03,175] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_58_mp_rank_00_optim_states.pt... 7: [2023-05-10 10:11:03,175] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_56_mp_rank_00_optim_states.pt... 7: [2023-05-10 10:11:03,175] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_62_mp_rank_00_optim_states.pt... 7: [2023-05-10 10:11:03,175] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_61_mp_rank_00_optim_states.pt... 31: [2023-05-10 10:11:03,178] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 31: [2023-05-10 10:11:03,178] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 31: [2023-05-10 10:11:03,179] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 31: [2023-05-10 10:11:03,179] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 31: [2023-05-10 10:11:03,179] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 31: [2023-05-10 10:11:03,179] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 31: [2023-05-10 10:11:03,179] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 31: [2023-05-10 10:11:03,179] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 31: [2023-05-10 10:11:03,179] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 18: [2023-05-10 10:11:03,179] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 31: [2023-05-10 10:11:03,180] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 31: [2023-05-10 10:11:03,180] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 31: [2023-05-10 10:11:03,180] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 27: [2023-05-10 10:11:03,179] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 27: [2023-05-10 10:11:03,179] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 27: [2023-05-10 10:11:03,179] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 27: [2023-05-10 10:11:03,180] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 27: [2023-05-10 10:11:03,180] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 27: [2023-05-10 10:11:03,180] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 27: [2023-05-10 10:11:03,180] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 27: [2023-05-10 10:11:03,180] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 18: [2023-05-10 10:11:03,180] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 18: [2023-05-10 10:11:03,181] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 6: [2023-05-10 10:11:03,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 6: [2023-05-10 10:11:03,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 6: [2023-05-10 10:11:03,182] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 6: [2023-05-10 10:11:03,183] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 6: [2023-05-10 10:11:03,183] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 6: [2023-05-10 10:11:03,183] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 6: [2023-05-10 10:11:03,183] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 6: [2023-05-10 10:11:03,183] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 27: [2023-05-10 10:11:03,184] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 27: [2023-05-10 10:11:03,184] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 27: [2023-05-10 10:11:03,184] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 18: [2023-05-10 10:11:03,185] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 15: [2023-05-10 10:11:03,186] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_126_mp_rank_00_optim_states.pt. 6: [2023-05-10 10:11:03,186] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 15: [2023-05-10 10:11:03,186] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 126 6: [2023-05-10 10:11:03,186] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 27: [2023-05-10 10:11:03,187] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 31: [2023-05-10 10:11:03,188] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 27: [2023-05-10 10:11:03,188] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 31: [2023-05-10 10:11:03,188] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 31: [2023-05-10 10:11:03,188] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 31: [2023-05-10 10:11:03,189] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 18: [2023-05-10 10:11:03,189] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 18: [2023-05-10 10:11:03,189] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 18: [2023-05-10 10:11:03,189] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 18: [2023-05-10 10:11:03,189] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 18: [2023-05-10 10:11:03,189] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 18: [2023-05-10 10:11:03,189] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 18: [2023-05-10 10:11:03,189] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 18: [2023-05-10 10:11:03,189] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 27: [2023-05-10 10:11:03,189] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 27: [2023-05-10 10:11:03,190] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 18: [2023-05-10 10:11:03,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 6: [2023-05-10 10:11:03,189] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 18: [2023-05-10 10:11:03,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 18: [2023-05-10 10:11:03,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 18: [2023-05-10 10:11:03,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 27: [2023-05-10 10:11:03,190] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 15: [2023-05-10 10:11:03,190] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_123_mp_rank_00_optim_states.pt. 15: [2023-05-10 10:11:03,191] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 123 6: [2023-05-10 10:11:03,191] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 6: [2023-05-10 10:11:03,193] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 6: [2023-05-10 10:11:03,193] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 6: [2023-05-10 10:11:03,193] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 15: [2023-05-10 10:11:03,194] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 126 6: [2023-05-10 10:11:03,195] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 18: [2023-05-10 10:11:03,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 18: [2023-05-10 10:11:03,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 18: [2023-05-10 10:11:03,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 18: [2023-05-10 10:11:03,196] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 18: [2023-05-10 10:11:03,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 18: [2023-05-10 10:11:03,196] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 18: [2023-05-10 10:11:03,197] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 18: [2023-05-10 10:11:03,197] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 2: [2023-05-10 10:11:03,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_18_mp_rank_00_optim_states.pt... 2: [2023-05-10 10:11:03,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_21_mp_rank_00_optim_states.pt... 2: [2023-05-10 10:11:03,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_17_mp_rank_00_optim_states.pt... 2: [2023-05-10 10:11:03,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_16_mp_rank_00_optim_states.pt... 15: [2023-05-10 10:11:03,197] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 123 2: [2023-05-10 10:11:03,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_19_mp_rank_00_optim_states.pt... 2: [2023-05-10 10:11:03,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_20_mp_rank_00_optim_states.pt... 2: [2023-05-10 10:11:03,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_23_mp_rank_00_optim_states.pt... 2: [2023-05-10 10:11:03,197] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_22_mp_rank_00_optim_states.pt... 18: [2023-05-10 10:11:03,203] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 15: [2023-05-10 10:11:03,203] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_122_mp_rank_00_optim_states.pt. 18: [2023-05-10 10:11:03,203] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 18: [2023-05-10 10:11:03,203] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 15: [2023-05-10 10:11:03,203] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 122 18: [2023-05-10 10:11:03,203] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 0: [2023-05-10 10:11:03,204] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 0: [2023-05-10 10:11:03,204] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 0: [2023-05-10 10:11:03,204] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 0: [2023-05-10 10:11:03,204] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 0: [2023-05-10 10:11:03,204] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 0: [2023-05-10 10:11:03,204] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 0: [2023-05-10 10:11:03,204] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 0: [2023-05-10 10:11:03,204] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 18: [2023-05-10 10:11:03,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 18: [2023-05-10 10:11:03,204] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 18: [2023-05-10 10:11:03,204] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 18: [2023-05-10 10:11:03,205] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 21: [2023-05-10 10:11:03,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 21: [2023-05-10 10:11:03,206] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 0: [2023-05-10 10:11:03,208] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 21: [2023-05-10 10:11:03,208] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 15: [2023-05-10 10:11:03,209] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 122 0: [2023-05-10 10:11:03,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 0: [2023-05-10 10:11:03,209] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 0: [2023-05-10 10:11:03,210] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 0: [2023-05-10 10:11:03,211] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 0: [2023-05-10 10:11:03,211] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 0: [2023-05-10 10:11:03,211] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 0: [2023-05-10 10:11:03,212] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 12: [2023-05-10 10:11:03,217] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 21: [2023-05-10 10:11:03,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 21: [2023-05-10 10:11:03,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 21: [2023-05-10 10:11:03,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 21: [2023-05-10 10:11:03,218] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 21: [2023-05-10 10:11:03,219] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 15: [2023-05-10 10:11:03,222] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_125_mp_rank_00_optim_states.pt. 15: [2023-05-10 10:11:03,222] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 125 15: [2023-05-10 10:11:03,228] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 125 21: [2023-05-10 10:11:03,229] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 6: [2023-05-10 10:11:03,230] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 25: [2023-05-10 10:11:03,230] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 25: [2023-05-10 10:11:03,230] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 21: [2023-05-10 10:11:03,231] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 25: [2023-05-10 10:11:03,231] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 25: [2023-05-10 10:11:03,231] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 25: [2023-05-10 10:11:03,231] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 25: [2023-05-10 10:11:03,232] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 25: [2023-05-10 10:11:03,232] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 25: [2023-05-10 10:11:03,232] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 27: [2023-05-10 10:11:03,232] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 27: [2023-05-10 10:11:03,232] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 22: [2023-05-10 10:11:03,233] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 22: [2023-05-10 10:11:03,233] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 22: [2023-05-10 10:11:03,233] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 22: [2023-05-10 10:11:03,233] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 22: [2023-05-10 10:11:03,233] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 22: [2023-05-10 10:11:03,233] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 22: [2023-05-10 10:11:03,233] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 22: [2023-05-10 10:11:03,234] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 1: [2023-05-10 10:11:03,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_8_mp_rank_00_optim_states.pt... 1: [2023-05-10 10:11:03,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_10_mp_rank_00_optim_states.pt... 1: [2023-05-10 10:11:03,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_12_mp_rank_00_optim_states.pt... 25: [2023-05-10 10:11:03,236] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 1: [2023-05-10 10:11:03,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_9_mp_rank_00_optim_states.pt... 1: [2023-05-10 10:11:03,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_14_mp_rank_00_optim_states.pt... 1: [2023-05-10 10:11:03,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_13_mp_rank_00_optim_states.pt... 1: [2023-05-10 10:11:03,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_11_mp_rank_00_optim_states.pt... 1: [2023-05-10 10:11:03,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_15_mp_rank_00_optim_states.pt... 25: [2023-05-10 10:11:03,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 21: [2023-05-10 10:11:03,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 25: [2023-05-10 10:11:03,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 21: [2023-05-10 10:11:03,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 21: [2023-05-10 10:11:03,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 27: [2023-05-10 10:11:03,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 25: [2023-05-10 10:11:03,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 21: [2023-05-10 10:11:03,237] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 13: [2023-05-10 10:11:03,237] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 13: [2023-05-10 10:11:03,238] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 13: [2023-05-10 10:11:03,238] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 21: [2023-05-10 10:11:03,238] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 21: [2023-05-10 10:11:03,238] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 13: [2023-05-10 10:11:03,238] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 13: [2023-05-10 10:11:03,238] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 13: [2023-05-10 10:11:03,238] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 13: [2023-05-10 10:11:03,238] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 22: [2023-05-10 10:11:03,238] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 13: [2023-05-10 10:11:03,238] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 25: [2023-05-10 10:11:03,238] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 25: [2023-05-10 10:11:03,239] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 12: [2023-05-10 10:11:03,239] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 22: [2023-05-10 10:11:03,239] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 25: [2023-05-10 10:11:03,239] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 25: [2023-05-10 10:11:03,239] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 6: [2023-05-10 10:11:03,240] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 12: [2023-05-10 10:11:03,241] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 12: [2023-05-10 10:11:03,242] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 21: [2023-05-10 10:11:03,242] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 12: [2023-05-10 10:11:03,242] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 21: [2023-05-10 10:11:03,242] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 21: [2023-05-10 10:11:03,242] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 21: [2023-05-10 10:11:03,243] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 22: [2023-05-10 10:11:03,243] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 22: [2023-05-10 10:11:03,243] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 22: [2023-05-10 10:11:03,244] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 22: [2023-05-10 10:11:03,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 22: [2023-05-10 10:11:03,245] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 22: [2023-05-10 10:11:03,246] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 12: [2023-05-10 10:11:03,245] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 12: [2023-05-10 10:11:03,245] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 12: [2023-05-10 10:11:03,246] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 21: [2023-05-10 10:11:03,248] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 21: [2023-05-10 10:11:03,248] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 21: [2023-05-10 10:11:03,248] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 21: [2023-05-10 10:11:03,249] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 15: [2023-05-10 10:11:03,249] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_121_mp_rank_00_optim_states.pt. 15: [2023-05-10 10:11:03,249] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 121 12: [2023-05-10 10:11:03,250] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 13: [2023-05-10 10:11:03,249] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 6: [2023-05-10 10:11:03,250] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 13: [2023-05-10 10:11:03,251] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 24: [2023-05-10 10:11:03,250] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 24: [2023-05-10 10:11:03,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 6: [2023-05-10 10:11:03,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 27: [2023-05-10 10:11:03,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 27: [2023-05-10 10:11:03,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 13: [2023-05-10 10:11:03,251] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 24: [2023-05-10 10:11:03,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 24: [2023-05-10 10:11:03,251] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 24: [2023-05-10 10:11:03,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 24: [2023-05-10 10:11:03,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 24: [2023-05-10 10:11:03,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 13: [2023-05-10 10:11:03,252] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 24: [2023-05-10 10:11:03,252] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 21: [2023-05-10 10:11:03,252] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 13: [2023-05-10 10:11:03,252] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 13: [2023-05-10 10:11:03,252] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 31: [2023-05-10 10:11:03,252] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_249_mp_rank_00_optim_states.pt... 31: [2023-05-10 10:11:03,252] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_254_mp_rank_00_optim_states.pt... 31: [2023-05-10 10:11:03,252] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_252_mp_rank_00_optim_states.pt... 21: [2023-05-10 10:11:03,253] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 31: [2023-05-10 10:11:03,252] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_248_mp_rank_00_optim_states.pt... 31: [2023-05-10 10:11:03,252] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_251_mp_rank_00_optim_states.pt... 13: [2023-05-10 10:11:03,253] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 31: [2023-05-10 10:11:03,253] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_253_mp_rank_00_optim_states.pt... 31: [2023-05-10 10:11:03,253] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_250_mp_rank_00_optim_states.pt... 31: [2023-05-10 10:11:03,253] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_255_mp_rank_00_optim_states.pt... 21: [2023-05-10 10:11:03,253] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 21: [2023-05-10 10:11:03,253] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 21: [2023-05-10 10:11:03,253] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 6: [2023-05-10 10:11:03,253] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 6: [2023-05-10 10:11:03,253] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 21: [2023-05-10 10:11:03,253] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 21: [2023-05-10 10:11:03,253] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 21: [2023-05-10 10:11:03,254] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 21: [2023-05-10 10:11:03,254] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 24: [2023-05-10 10:11:03,254] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 13: [2023-05-10 10:11:03,254] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 21: [2023-05-10 10:11:03,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 6: [2023-05-10 10:11:03,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 15: [2023-05-10 10:11:03,254] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 121 21: [2023-05-10 10:11:03,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 21: [2023-05-10 10:11:03,254] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 21: [2023-05-10 10:11:03,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 21: [2023-05-10 10:11:03,254] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 21: [2023-05-10 10:11:03,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 21: [2023-05-10 10:11:03,254] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 24: [2023-05-10 10:11:03,255] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 27: [2023-05-10 10:11:03,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 24: [2023-05-10 10:11:03,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 24: [2023-05-10 10:11:03,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 6: [2023-05-10 10:11:03,259] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 27: [2023-05-10 10:11:03,259] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 24: [2023-05-10 10:11:03,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 24: [2023-05-10 10:11:03,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 24: [2023-05-10 10:11:03,260] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 24: [2023-05-10 10:11:03,261] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 30: [2023-05-10 10:11:03,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 30: [2023-05-10 10:11:03,261] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 27: [2023-05-10 10:11:03,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 27: [2023-05-10 10:11:03,262] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 30: [2023-05-10 10:11:03,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 30: [2023-05-10 10:11:03,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 30: [2023-05-10 10:11:03,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 30: [2023-05-10 10:11:03,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 30: [2023-05-10 10:11:03,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 30: [2023-05-10 10:11:03,262] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 6: [2023-05-10 10:11:03,263] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 27: [2023-05-10 10:11:03,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 27: [2023-05-10 10:11:03,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 6: [2023-05-10 10:11:03,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 6: [2023-05-10 10:11:03,263] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 27: [2023-05-10 10:11:03,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 27: [2023-05-10 10:11:03,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 27: [2023-05-10 10:11:03,263] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 27: [2023-05-10 10:11:03,263] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 27: [2023-05-10 10:11:03,263] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 27: [2023-05-10 10:11:03,263] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 6: [2023-05-10 10:11:03,264] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 27: [2023-05-10 10:11:03,264] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 27: [2023-05-10 10:11:03,264] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 27: [2023-05-10 10:11:03,264] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 6: [2023-05-10 10:11:03,266] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 0: [2023-05-10 10:11:03,267] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 0: [2023-05-10 10:11:03,267] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 0: [2023-05-10 10:11:03,267] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 30: [2023-05-10 10:11:03,270] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 15: [2023-05-10 10:11:03,271] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_127_mp_rank_00_optim_states.pt. 15: [2023-05-10 10:11:03,272] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 127 12: [2023-05-10 10:11:03,273] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 12: [2023-05-10 10:11:03,273] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 12: [2023-05-10 10:11:03,273] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 30: [2023-05-10 10:11:03,273] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 12: [2023-05-10 10:11:03,274] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 12: [2023-05-10 10:11:03,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 12: [2023-05-10 10:11:03,275] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 30: [2023-05-10 10:11:03,275] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 27: [2023-05-10 10:11:03,275] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 12: [2023-05-10 10:11:03,275] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 30: [2023-05-10 10:11:03,275] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 27: [2023-05-10 10:11:03,276] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 30: [2023-05-10 10:11:03,276] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 27: [2023-05-10 10:11:03,276] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 27: [2023-05-10 10:11:03,276] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 30: [2023-05-10 10:11:03,277] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 12: [2023-05-10 10:11:03,278] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 15: [2023-05-10 10:11:03,278] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 127 6: [2023-05-10 10:11:03,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 6: [2023-05-10 10:11:03,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 6: [2023-05-10 10:11:03,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 30: [2023-05-10 10:11:03,278] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 12: [2023-05-10 10:11:03,278] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 6: [2023-05-10 10:11:03,278] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 12: [2023-05-10 10:11:03,279] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 12: [2023-05-10 10:11:03,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 30: [2023-05-10 10:11:03,279] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 27: [2023-05-10 10:11:03,279] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 6: [2023-05-10 10:11:03,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 6: [2023-05-10 10:11:03,279] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 6: [2023-05-10 10:11:03,279] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 12: [2023-05-10 10:11:03,279] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 27: [2023-05-10 10:11:03,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 0: [2023-05-10 10:11:03,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 0: [2023-05-10 10:11:03,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 0: [2023-05-10 10:11:03,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 0: [2023-05-10 10:11:03,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 0: [2023-05-10 10:11:03,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 27: [2023-05-10 10:11:03,280] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 12: [2023-05-10 10:11:03,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 12: [2023-05-10 10:11:03,280] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 27: [2023-05-10 10:11:03,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 12: [2023-05-10 10:11:03,280] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 12: [2023-05-10 10:11:03,282] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 12: [2023-05-10 10:11:03,283] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 12: [2023-05-10 10:11:03,283] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 12: [2023-05-10 10:11:03,283] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 25: [2023-05-10 10:11:03,286] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 25: [2023-05-10 10:11:03,289] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 18: [2023-05-10 10:11:03,289] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_144_mp_rank_00_optim_states.pt... 18: [2023-05-10 10:11:03,289] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_147_mp_rank_00_optim_states.pt... 18: [2023-05-10 10:11:03,289] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_148_mp_rank_00_optim_states.pt... 18: [2023-05-10 10:11:03,289] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_146_mp_rank_00_optim_states.pt... 18: [2023-05-10 10:11:03,289] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_149_mp_rank_00_optim_states.pt... 18: [2023-05-10 10:11:03,289] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_145_mp_rank_00_optim_states.pt... 18: [2023-05-10 10:11:03,289] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_151_mp_rank_00_optim_states.pt... 18: [2023-05-10 10:11:03,289] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_150_mp_rank_00_optim_states.pt... 22: [2023-05-10 10:11:03,291] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 27: [2023-05-10 10:11:03,293] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 27: [2023-05-10 10:11:03,293] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 27: [2023-05-10 10:11:03,293] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 27: [2023-05-10 10:11:03,294] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 27: [2023-05-10 10:11:03,294] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 27: [2023-05-10 10:11:03,294] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 27: [2023-05-10 10:11:03,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 27: [2023-05-10 10:11:03,295] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 27: [2023-05-10 10:11:03,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 27: [2023-05-10 10:11:03,295] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 6: [2023-05-10 10:11:03,295] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 0: [2023-05-10 10:11:03,295] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 27: [2023-05-10 10:11:03,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 27: [2023-05-10 10:11:03,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 6: [2023-05-10 10:11:03,295] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 6: [2023-05-10 10:11:03,296] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 6: [2023-05-10 10:11:03,296] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 22: [2023-05-10 10:11:03,296] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 6: [2023-05-10 10:11:03,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 0: [2023-05-10 10:11:03,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 0: [2023-05-10 10:11:03,297] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 6: [2023-05-10 10:11:03,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 6: [2023-05-10 10:11:03,298] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 6: [2023-05-10 10:11:03,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 25: [2023-05-10 10:11:03,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 25: [2023-05-10 10:11:03,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 25: [2023-05-10 10:11:03,298] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 25: [2023-05-10 10:11:03,299] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 0: [2023-05-10 10:11:03,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 0: [2023-05-10 10:11:03,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 0: [2023-05-10 10:11:03,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 0: [2023-05-10 10:11:03,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 0: [2023-05-10 10:11:03,300] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 0: [2023-05-10 10:11:03,300] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 0: [2023-05-10 10:11:03,301] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 0: [2023-05-10 10:11:03,301] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 0: [2023-05-10 10:11:03,301] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 12: [2023-05-10 10:11:03,304] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 12: [2023-05-10 10:11:03,304] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 12: [2023-05-10 10:11:03,305] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 12: [2023-05-10 10:11:03,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 24: [2023-05-10 10:11:03,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 24: [2023-05-10 10:11:03,305] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 0: > overriding learning rate value to 0.0002 12: [2023-05-10 10:11:03,306] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 0: > overriding minimum learning rate value to 2e-05 0: > overriding warmup iterations value to 0 0: > overriding total number of iterations value to 1 0: > overriding decay style value to cosine 12: [2023-05-10 10:11:03,306] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 12: [2023-05-10 10:11:03,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 12: [2023-05-10 10:11:03,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 12: [2023-05-10 10:11:03,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 12: [2023-05-10 10:11:03,307] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 12: [2023-05-10 10:11:03,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 12: [2023-05-10 10:11:03,307] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 25: [2023-05-10 10:11:03,310] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 25: [2023-05-10 10:11:03,310] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 0: [2023-05-10 10:11:03,311] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 0: [2023-05-10 10:11:03,311] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 0: [2023-05-10 10:11:03,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 0: [2023-05-10 10:11:03,312] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 25: [2023-05-10 10:11:03,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 0: [2023-05-10 10:11:03,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 0: [2023-05-10 10:11:03,312] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 0: [2023-05-10 10:11:03,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 6: [2023-05-10 10:11:03,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 0: [2023-05-10 10:11:03,312] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 0: [2023-05-10 10:11:03,312] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 0: [2023-05-10 10:11:03,312] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 0: [2023-05-10 10:11:03,313] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 6: [2023-05-10 10:11:03,313] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 0: [2023-05-10 10:11:03,313] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 6: [2023-05-10 10:11:03,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 6: [2023-05-10 10:11:03,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 6: [2023-05-10 10:11:03,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 0: [2023-05-10 10:11:03,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 0: [2023-05-10 10:11:03,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 6: [2023-05-10 10:11:03,313] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 6: [2023-05-10 10:11:03,313] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 6: [2023-05-10 10:11:03,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 0: [2023-05-10 10:11:03,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 0: [2023-05-10 10:11:03,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 6: [2023-05-10 10:11:03,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 0: [2023-05-10 10:11:03,314] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 0: [2023-05-10 10:11:03,314] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 6: [2023-05-10 10:11:03,314] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 6: [2023-05-10 10:11:03,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 0: [2023-05-10 10:11:03,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 0: [2023-05-10 10:11:03,314] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 22: [2023-05-10 10:11:03,316] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 6: [2023-05-10 10:11:03,316] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 25: [2023-05-10 10:11:03,317] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 22: [2023-05-10 10:11:03,317] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 22: [2023-05-10 10:11:03,317] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 22: [2023-05-10 10:11:03,317] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 22: [2023-05-10 10:11:03,317] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 22: [2023-05-10 10:11:03,317] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 22: [2023-05-10 10:11:03,318] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 22: [2023-05-10 10:11:03,319] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 22: [2023-05-10 10:11:03,319] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 22: [2023-05-10 10:11:03,320] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 25: [2023-05-10 10:11:03,321] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 25: [2023-05-10 10:11:03,321] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 22: [2023-05-10 10:11:03,321] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 22: [2023-05-10 10:11:03,322] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 22: [2023-05-10 10:11:03,322] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 22: [2023-05-10 10:11:03,322] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 25: [2023-05-10 10:11:03,322] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 25: [2023-05-10 10:11:03,322] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 25: [2023-05-10 10:11:03,324] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 25: [2023-05-10 10:11:03,325] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 25: [2023-05-10 10:11:03,326] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 25: [2023-05-10 10:11:03,326] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 25: [2023-05-10 10:11:03,326] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 25: [2023-05-10 10:11:03,327] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 25: [2023-05-10 10:11:03,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 25: [2023-05-10 10:11:03,327] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 25: [2023-05-10 10:11:03,327] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 25: [2023-05-10 10:11:03,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 25: [2023-05-10 10:11:03,327] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 25: [2023-05-10 10:11:03,327] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 25: [2023-05-10 10:11:03,327] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 25: [2023-05-10 10:11:03,327] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 24: [2023-05-10 10:11:03,328] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 13: [2023-05-10 10:11:03,328] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 13: [2023-05-10 10:11:03,328] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 13: [2023-05-10 10:11:03,328] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 24: [2023-05-10 10:11:03,329] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 25: [2023-05-10 10:11:03,330] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 25: [2023-05-10 10:11:03,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 24: [2023-05-10 10:11:03,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 24: [2023-05-10 10:11:03,330] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 25: [2023-05-10 10:11:03,330] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 24: [2023-05-10 10:11:03,330] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 24: [2023-05-10 10:11:03,331] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 25: [2023-05-10 10:11:03,331] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 24: [2023-05-10 10:11:03,331] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 24: [2023-05-10 10:11:03,331] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 3: [2023-05-10 10:11:03,332] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 3: [2023-05-10 10:11:03,332] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 3: [2023-05-10 10:11:03,332] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 3: [2023-05-10 10:11:03,332] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 3: [2023-05-10 10:11:03,332] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 3: [2023-05-10 10:11:03,332] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 3: [2023-05-10 10:11:03,332] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 3: [2023-05-10 10:11:03,332] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 24: [2023-05-10 10:11:03,332] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 24: [2023-05-10 10:11:03,332] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 24: [2023-05-10 10:11:03,332] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 24: [2023-05-10 10:11:03,332] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 24: [2023-05-10 10:11:03,332] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 24: [2023-05-10 10:11:03,332] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 30: [2023-05-10 10:11:03,334] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 25: [2023-05-10 10:11:03,336] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 25: [2023-05-10 10:11:03,336] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 25: [2023-05-10 10:11:03,337] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 25: [2023-05-10 10:11:03,337] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 25: [2023-05-10 10:11:03,337] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 25: [2023-05-10 10:11:03,337] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 25: [2023-05-10 10:11:03,337] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 25: [2023-05-10 10:11:03,337] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 3: [2023-05-10 10:11:03,337] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 3: [2023-05-10 10:11:03,337] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 3: [2023-05-10 10:11:03,337] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 21: [2023-05-10 10:11:03,338] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_169_mp_rank_00_optim_states.pt... 21: [2023-05-10 10:11:03,338] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_168_mp_rank_00_optim_states.pt... 21: [2023-05-10 10:11:03,338] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_175_mp_rank_00_optim_states.pt... 21: [2023-05-10 10:11:03,338] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_174_mp_rank_00_optim_states.pt... 21: [2023-05-10 10:11:03,338] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_171_mp_rank_00_optim_states.pt... 21: [2023-05-10 10:11:03,338] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_170_mp_rank_00_optim_states.pt... 21: [2023-05-10 10:11:03,338] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_173_mp_rank_00_optim_states.pt... 21: [2023-05-10 10:11:03,338] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_172_mp_rank_00_optim_states.pt... 20: [2023-05-10 10:11:03,337] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 20: [2023-05-10 10:11:03,337] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 20: [2023-05-10 10:11:03,337] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 20: [2023-05-10 10:11:03,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 20: [2023-05-10 10:11:03,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 20: [2023-05-10 10:11:03,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 20: [2023-05-10 10:11:03,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 20: [2023-05-10 10:11:03,338] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 16: [2023-05-10 10:11:03,339] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 16: [2023-05-10 10:11:03,339] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 28: [2023-05-10 10:11:03,339] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 28: [2023-05-10 10:11:03,339] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 28: [2023-05-10 10:11:03,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 16: [2023-05-10 10:11:03,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 16: [2023-05-10 10:11:03,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 16: [2023-05-10 10:11:03,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 16: [2023-05-10 10:11:03,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 28: [2023-05-10 10:11:03,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 28: [2023-05-10 10:11:03,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 28: [2023-05-10 10:11:03,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 28: [2023-05-10 10:11:03,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 16: [2023-05-10 10:11:03,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 5: [2023-05-10 10:11:03,339] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 28: [2023-05-10 10:11:03,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 5: [2023-05-10 10:11:03,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 5: [2023-05-10 10:11:03,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 5: [2023-05-10 10:11:03,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 5: [2023-05-10 10:11:03,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 5: [2023-05-10 10:11:03,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 5: [2023-05-10 10:11:03,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 3: [2023-05-10 10:11:03,340] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 16: [2023-05-10 10:11:03,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 5: [2023-05-10 10:11:03,340] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 20: [2023-05-10 10:11:03,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 12: [2023-05-10 10:11:03,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_99_mp_rank_00_optim_states.pt... 12: [2023-05-10 10:11:03,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_97_mp_rank_00_optim_states.pt... 12: [2023-05-10 10:11:03,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_100_mp_rank_00_optim_states.pt... 12: [2023-05-10 10:11:03,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_102_mp_rank_00_optim_states.pt... 12: [2023-05-10 10:11:03,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_103_mp_rank_00_optim_states.pt... 12: [2023-05-10 10:11:03,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_96_mp_rank_00_optim_states.pt... 12: [2023-05-10 10:11:03,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_98_mp_rank_00_optim_states.pt... 12: [2023-05-10 10:11:03,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_101_mp_rank_00_optim_states.pt... 3: [2023-05-10 10:11:03,341] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 3: [2023-05-10 10:11:03,342] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 20: [2023-05-10 10:11:03,342] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 3: [2023-05-10 10:11:03,342] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 13: [2023-05-10 10:11:03,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 13: [2023-05-10 10:11:03,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 13: [2023-05-10 10:11:03,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 13: [2023-05-10 10:11:03,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 13: [2023-05-10 10:11:03,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 15: [2023-05-10 10:11:03,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_124_mp_rank_00_optim_states.pt. 10: [2023-05-10 10:11:03,342] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_83_mp_rank_00_optim_states.pt. 15: [2023-05-10 10:11:03,343] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 124 3: [2023-05-10 10:11:03,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 10: [2023-05-10 10:11:03,343] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 83 28: [2023-05-10 10:11:03,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 28: [2023-05-10 10:11:03,344] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 5: [2023-05-10 10:11:03,343] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 16: [2023-05-10 10:11:03,345] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 20: [2023-05-10 10:11:03,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 16: [2023-05-10 10:11:03,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 5: [2023-05-10 10:11:03,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 5: [2023-05-10 10:11:03,346] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 20: [2023-05-10 10:11:03,347] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 20: [2023-05-10 10:11:03,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 20: [2023-05-10 10:11:03,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 20: [2023-05-10 10:11:03,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 10: [2023-05-10 10:11:03,349] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 83 5: [2023-05-10 10:11:03,348] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 5: [2023-05-10 10:11:03,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 16: [2023-05-10 10:11:03,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 16: [2023-05-10 10:11:03,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 16: [2023-05-10 10:11:03,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 5: [2023-05-10 10:11:03,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 22: [2023-05-10 10:11:03,349] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 15: [2023-05-10 10:11:03,350] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 124 28: [2023-05-10 10:11:03,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 28: [2023-05-10 10:11:03,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 22: [2023-05-10 10:11:03,350] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 5: [2023-05-10 10:11:03,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 28: [2023-05-10 10:11:03,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 22: [2023-05-10 10:11:03,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 16: [2023-05-10 10:11:03,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 16: [2023-05-10 10:11:03,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 28: [2023-05-10 10:11:03,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 5: [2023-05-10 10:11:03,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 22: [2023-05-10 10:11:03,351] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 20: [2023-05-10 10:11:03,350] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 22: [2023-05-10 10:11:03,351] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 16: [2023-05-10 10:11:03,351] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 28: [2023-05-10 10:11:03,352] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 28: [2023-05-10 10:11:03,352] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 22: [2023-05-10 10:11:03,352] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 22: [2023-05-10 10:11:03,352] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 22: [2023-05-10 10:11:03,352] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 22: [2023-05-10 10:11:03,353] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 22: [2023-05-10 10:11:03,353] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 22: [2023-05-10 10:11:03,353] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 22: [2023-05-10 10:11:03,353] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 22: [2023-05-10 10:11:03,354] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 6: [2023-05-10 10:11:03,354] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_48_mp_rank_00_optim_states.pt... 6: [2023-05-10 10:11:03,354] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_52_mp_rank_00_optim_states.pt... 6: [2023-05-10 10:11:03,354] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_55_mp_rank_00_optim_states.pt... 6: [2023-05-10 10:11:03,354] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_51_mp_rank_00_optim_states.pt... 6: [2023-05-10 10:11:03,354] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_53_mp_rank_00_optim_states.pt... 6: [2023-05-10 10:11:03,354] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_50_mp_rank_00_optim_states.pt... 6: [2023-05-10 10:11:03,354] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_49_mp_rank_00_optim_states.pt... 30: [2023-05-10 10:11:03,354] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 6: [2023-05-10 10:11:03,354] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_54_mp_rank_00_optim_states.pt... 22: [2023-05-10 10:11:03,354] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 22: [2023-05-10 10:11:03,354] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 22: [2023-05-10 10:11:03,354] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 22: [2023-05-10 10:11:03,356] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 22: [2023-05-10 10:11:03,356] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 30: [2023-05-10 10:11:03,356] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 22: [2023-05-10 10:11:03,356] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 22: [2023-05-10 10:11:03,356] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 22: [2023-05-10 10:11:03,356] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 22: [2023-05-10 10:11:03,356] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 22: [2023-05-10 10:11:03,357] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 22: [2023-05-10 10:11:03,357] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 13: [2023-05-10 10:11:03,362] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 27: [2023-05-10 10:11:03,362] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_217_mp_rank_00_optim_states.pt... 27: [2023-05-10 10:11:03,362] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_218_mp_rank_00_optim_states.pt... 27: [2023-05-10 10:11:03,362] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_221_mp_rank_00_optim_states.pt... 27: [2023-05-10 10:11:03,362] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_220_mp_rank_00_optim_states.pt... 27: [2023-05-10 10:11:03,362] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_219_mp_rank_00_optim_states.pt... 27: [2023-05-10 10:11:03,362] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_216_mp_rank_00_optim_states.pt... 27: [2023-05-10 10:11:03,362] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_222_mp_rank_00_optim_states.pt... 27: [2023-05-10 10:11:03,362] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_223_mp_rank_00_optim_states.pt... 30: [2023-05-10 10:11:03,362] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 10: [2023-05-10 10:11:03,363] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_80_mp_rank_00_optim_states.pt. 30: [2023-05-10 10:11:03,364] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 24: [2023-05-10 10:11:03,364] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 10: [2023-05-10 10:11:03,364] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 80 24: [2023-05-10 10:11:03,364] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 24: [2023-05-10 10:11:03,365] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 24: [2023-05-10 10:11:03,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 24: [2023-05-10 10:11:03,365] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 24: [2023-05-10 10:11:03,365] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 24: [2023-05-10 10:11:03,365] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 30: [2023-05-10 10:11:03,365] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 30: [2023-05-10 10:11:03,366] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 24: [2023-05-10 10:11:03,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 24: [2023-05-10 10:11:03,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 24: [2023-05-10 10:11:03,366] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 24: [2023-05-10 10:11:03,366] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 24: [2023-05-10 10:11:03,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 24: [2023-05-10 10:11:03,366] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 30: [2023-05-10 10:11:03,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 24: [2023-05-10 10:11:03,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 24: [2023-05-10 10:11:03,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 24: [2023-05-10 10:11:03,366] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 24: [2023-05-10 10:11:03,368] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 24: [2023-05-10 10:11:03,368] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 24: [2023-05-10 10:11:03,368] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 24: [2023-05-10 10:11:03,368] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 13: [2023-05-10 10:11:03,368] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 24: [2023-05-10 10:11:03,369] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 24: [2023-05-10 10:11:03,369] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 13: [2023-05-10 10:11:03,368] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 24: [2023-05-10 10:11:03,369] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 13: [2023-05-10 10:11:03,369] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 13: [2023-05-10 10:11:03,369] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 24: [2023-05-10 10:11:03,369] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 13: [2023-05-10 10:11:03,369] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 13: [2023-05-10 10:11:03,369] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 13: [2023-05-10 10:11:03,369] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 10: [2023-05-10 10:11:03,369] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 80 13: [2023-05-10 10:11:03,369] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 13: [2023-05-10 10:11:03,370] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 13: [2023-05-10 10:11:03,370] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 13: [2023-05-10 10:11:03,370] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 4: [2023-05-10 10:11:03,373] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 4: [2023-05-10 10:11:03,373] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 4: [2023-05-10 10:11:03,373] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 4: [2023-05-10 10:11:03,373] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 4: [2023-05-10 10:11:03,373] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 4: [2023-05-10 10:11:03,373] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 4: [2023-05-10 10:11:03,373] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 4: [2023-05-10 10:11:03,374] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 11: [2023-05-10 10:11:03,374] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 11: [2023-05-10 10:11:03,374] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 26: [2023-05-10 10:11:03,375] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 26: [2023-05-10 10:11:03,375] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 11: [2023-05-10 10:11:03,376] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 11: [2023-05-10 10:11:03,376] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 11: [2023-05-10 10:11:03,376] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 11: [2023-05-10 10:11:03,376] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 11: [2023-05-10 10:11:03,376] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 11: [2023-05-10 10:11:03,377] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 14: [2023-05-10 10:11:03,377] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 14: [2023-05-10 10:11:03,377] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 13: [2023-05-10 10:11:03,377] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 11: [2023-05-10 10:11:03,378] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 14: [2023-05-10 10:11:03,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 14: [2023-05-10 10:11:03,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 14: [2023-05-10 10:11:03,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 14: [2023-05-10 10:11:03,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 14: [2023-05-10 10:11:03,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 13: [2023-05-10 10:11:03,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 13: [2023-05-10 10:11:03,378] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 14: [2023-05-10 10:11:03,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 13: [2023-05-10 10:11:03,378] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 4: [2023-05-10 10:11:03,379] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 4: [2023-05-10 10:11:03,379] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 4: [2023-05-10 10:11:03,379] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 4: [2023-05-10 10:11:03,379] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 11: [2023-05-10 10:11:03,379] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 26: [2023-05-10 10:11:03,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 26: [2023-05-10 10:11:03,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 26: [2023-05-10 10:11:03,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 26: [2023-05-10 10:11:03,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 26: [2023-05-10 10:11:03,379] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 26: [2023-05-10 10:11:03,380] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 13: [2023-05-10 10:11:03,380] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 26: [2023-05-10 10:11:03,380] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 4: [2023-05-10 10:11:03,380] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 26: [2023-05-10 10:11:03,380] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 13: [2023-05-10 10:11:03,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 13: [2023-05-10 10:11:03,381] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 4: [2023-05-10 10:11:03,381] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 13: [2023-05-10 10:11:03,381] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 13: [2023-05-10 10:11:03,382] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 4: [2023-05-10 10:11:03,382] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 13: [2023-05-10 10:11:03,382] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 13: [2023-05-10 10:11:03,382] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 13: [2023-05-10 10:11:03,382] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 4: [2023-05-10 10:11:03,382] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 13: [2023-05-10 10:11:03,382] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 13: [2023-05-10 10:11:03,382] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 13: [2023-05-10 10:11:03,382] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 13: [2023-05-10 10:11:03,383] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 11: [2023-05-10 10:11:03,384] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 14: [2023-05-10 10:11:03,384] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 14: [2023-05-10 10:11:03,385] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 14: [2023-05-10 10:11:03,385] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 30: [2023-05-10 10:11:03,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 30: [2023-05-10 10:11:03,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 30: [2023-05-10 10:11:03,385] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 13: [2023-05-10 10:11:03,385] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 14: [2023-05-10 10:11:03,385] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 11: [2023-05-10 10:11:03,386] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 13: [2023-05-10 10:11:03,386] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 13: [2023-05-10 10:11:03,386] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 13: [2023-05-10 10:11:03,386] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 14: [2023-05-10 10:11:03,387] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 30: [2023-05-10 10:11:03,387] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 14: [2023-05-10 10:11:03,387] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 14: [2023-05-10 10:11:03,387] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 11: [2023-05-10 10:11:03,387] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 20: [2023-05-10 10:11:03,387] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 30: [2023-05-10 10:11:03,388] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 30: [2023-05-10 10:11:03,388] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 23: [2023-05-10 10:11:03,388] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_187_mp_rank_00_optim_states.pt. 30: [2023-05-10 10:11:03,388] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 23: [2023-05-10 10:11:03,388] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 187 11: [2023-05-10 10:11:03,388] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 30: [2023-05-10 10:11:03,388] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 14: [2023-05-10 10:11:03,388] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 0: [2023-05-10 10:11:03,389] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_2_mp_rank_00_optim_states.pt... 0: [2023-05-10 10:11:03,389] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_7_mp_rank_00_optim_states.pt... 5: [2023-05-10 10:11:03,388] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 11: [2023-05-10 10:11:03,389] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 0: [2023-05-10 10:11:03,389] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_0_mp_rank_00_optim_states.pt... 30: [2023-05-10 10:11:03,389] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 0: [2023-05-10 10:11:03,389] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_4_mp_rank_00_optim_states.pt... 0: [2023-05-10 10:11:03,389] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_5_mp_rank_00_optim_states.pt... 0: [2023-05-10 10:11:03,389] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_3_mp_rank_00_optim_states.pt... 0: [2023-05-10 10:11:03,389] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_1_mp_rank_00_optim_states.pt... 0: [2023-05-10 10:11:03,389] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_6_mp_rank_00_optim_states.pt... 30: [2023-05-10 10:11:03,389] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 26: [2023-05-10 10:11:03,388] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 11: [2023-05-10 10:11:03,389] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 30: [2023-05-10 10:11:03,389] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 26: [2023-05-10 10:11:03,390] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 10: [2023-05-10 10:11:03,390] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_81_mp_rank_00_optim_states.pt. 26: [2023-05-10 10:11:03,390] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 26: [2023-05-10 10:11:03,390] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 10: [2023-05-10 10:11:03,390] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 81 26: [2023-05-10 10:11:03,391] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 30: [2023-05-10 10:11:03,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 26: [2023-05-10 10:11:03,392] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt... 10: [2023-05-10 10:11:03,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_86_mp_rank_00_optim_states.pt. 10: [2023-05-10 10:11:03,393] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 86 23: [2023-05-10 10:11:03,393] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 187 3: [2023-05-10 10:11:03,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 3: [2023-05-10 10:11:03,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 3: [2023-05-10 10:11:03,392] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 20: [2023-05-10 10:11:03,394] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 30: [2023-05-10 10:11:03,394] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 30: [2023-05-10 10:11:03,395] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 30: [2023-05-10 10:11:03,395] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 30: [2023-05-10 10:11:03,395] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 10: [2023-05-10 10:11:03,396] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 81 28: [2023-05-10 10:11:03,396] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 28: [2023-05-10 10:11:03,396] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 10: [2023-05-10 10:11:03,397] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 86 16: [2023-05-10 10:11:03,397] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 3: [2023-05-10 10:11:03,398] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 16: [2023-05-10 10:11:03,400] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 3: [2023-05-10 10:11:03,404] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 10: [2023-05-10 10:11:03,404] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_82_mp_rank_00_optim_states.pt. 10: [2023-05-10 10:11:03,404] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 82 5: [2023-05-10 10:11:03,404] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 5: [2023-05-10 10:11:03,404] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 10: [2023-05-10 10:11:03,405] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_84_mp_rank_00_optim_states.pt. 10: [2023-05-10 10:11:03,405] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 84 23: [2023-05-10 10:11:03,407] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_190_mp_rank_00_optim_states.pt. 23: [2023-05-10 10:11:03,407] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 190 20: [2023-05-10 10:11:03,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 22: [2023-05-10 10:11:03,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_177_mp_rank_00_optim_states.pt... 22: [2023-05-10 10:11:03,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_176_mp_rank_00_optim_states.pt... 22: [2023-05-10 10:11:03,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_181_mp_rank_00_optim_states.pt... 22: [2023-05-10 10:11:03,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_180_mp_rank_00_optim_states.pt... 22: [2023-05-10 10:11:03,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_182_mp_rank_00_optim_states.pt... 22: [2023-05-10 10:11:03,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_183_mp_rank_00_optim_states.pt... 22: [2023-05-10 10:11:03,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_178_mp_rank_00_optim_states.pt... 22: [2023-05-10 10:11:03,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_179_mp_rank_00_optim_states.pt... 25: [2023-05-10 10:11:03,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_206_mp_rank_00_optim_states.pt... 25: [2023-05-10 10:11:03,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_204_mp_rank_00_optim_states.pt... 25: [2023-05-10 10:11:03,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_203_mp_rank_00_optim_states.pt... 5: [2023-05-10 10:11:03,410] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 10: [2023-05-10 10:11:03,410] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 84 10: [2023-05-10 10:11:03,411] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 82 20: [2023-05-10 10:11:03,412] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 20: [2023-05-10 10:11:03,412] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 23: [2023-05-10 10:11:03,412] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 190 23: [2023-05-10 10:11:03,412] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_185_mp_rank_00_optim_states.pt. 20: [2023-05-10 10:11:03,413] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 23: [2023-05-10 10:11:03,413] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 185 5: [2023-05-10 10:11:03,413] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 5: [2023-05-10 10:11:03,413] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 3: [2023-05-10 10:11:03,413] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 3: [2023-05-10 10:11:03,413] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 3: [2023-05-10 10:11:03,413] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 5: [2023-05-10 10:11:03,413] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 20: [2023-05-10 10:11:03,414] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 20: [2023-05-10 10:11:03,415] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 3: [2023-05-10 10:11:03,415] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 24: [2023-05-10 10:11:03,415] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_195_mp_rank_00_optim_states.pt... 24: [2023-05-10 10:11:03,415] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_193_mp_rank_00_optim_states.pt... 24: [2023-05-10 10:11:03,415] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_192_mp_rank_00_optim_states.pt... 24: [2023-05-10 10:11:03,415] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_198_mp_rank_00_optim_states.pt... 24: [2023-05-10 10:11:03,415] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_194_mp_rank_00_optim_states.pt... 20: [2023-05-10 10:11:03,415] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 24: [2023-05-10 10:11:03,415] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_197_mp_rank_00_optim_states.pt... 24: [2023-05-10 10:11:03,415] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_196_mp_rank_00_optim_states.pt... 24: [2023-05-10 10:11:03,415] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_199_mp_rank_00_optim_states.pt... 3: [2023-05-10 10:11:03,415] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 20: [2023-05-10 10:11:03,415] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 5: [2023-05-10 10:11:03,415] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 5: [2023-05-10 10:11:03,415] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 5: [2023-05-10 10:11:03,415] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 5: [2023-05-10 10:11:03,416] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 30: [2023-05-10 10:11:03,416] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 5: [2023-05-10 10:11:03,416] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 30: [2023-05-10 10:11:03,416] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 30: [2023-05-10 10:11:03,416] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 28: [2023-05-10 10:11:03,417] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 30: [2023-05-10 10:11:03,417] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 3: [2023-05-10 10:11:03,417] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 28: [2023-05-10 10:11:03,417] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 20: [2023-05-10 10:11:03,417] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 23: [2023-05-10 10:11:03,418] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 185 20: [2023-05-10 10:11:03,418] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 16: [2023-05-10 10:11:03,418] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 28: [2023-05-10 10:11:03,419] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 30: [2023-05-10 10:11:03,420] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 28: [2023-05-10 10:11:03,420] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 28: [2023-05-10 10:11:03,420] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 30: [2023-05-10 10:11:03,420] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 28: [2023-05-10 10:11:03,420] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 28: [2023-05-10 10:11:03,420] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 30: [2023-05-10 10:11:03,420] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 30: [2023-05-10 10:11:03,420] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 30: [2023-05-10 10:11:03,420] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 28: [2023-05-10 10:11:03,420] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 30: [2023-05-10 10:11:03,420] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 28: [2023-05-10 10:11:03,420] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 30: [2023-05-10 10:11:03,421] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 3: [2023-05-10 10:11:03,421] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 3: [2023-05-10 10:11:03,421] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 3: [2023-05-10 10:11:03,421] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 30: [2023-05-10 10:11:03,421] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 3: [2023-05-10 10:11:03,421] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 3: [2023-05-10 10:11:03,421] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 3: [2023-05-10 10:11:03,421] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 28: [2023-05-10 10:11:03,421] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 28: [2023-05-10 10:11:03,421] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 3: [2023-05-10 10:11:03,421] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 3: [2023-05-10 10:11:03,421] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 3: [2023-05-10 10:11:03,421] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 16: [2023-05-10 10:11:03,422] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 30: [2023-05-10 10:11:03,422] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 16: [2023-05-10 10:11:03,422] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 16: [2023-05-10 10:11:03,422] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 11: [2023-05-10 10:11:03,422] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 30: [2023-05-10 10:11:03,422] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 30: [2023-05-10 10:11:03,423] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 16: [2023-05-10 10:11:03,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 30: [2023-05-10 10:11:03,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 16: [2023-05-10 10:11:03,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 16: [2023-05-10 10:11:03,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 16: [2023-05-10 10:11:03,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 16: [2023-05-10 10:11:03,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 16: [2023-05-10 10:11:03,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 28: [2023-05-10 10:11:03,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 28: [2023-05-10 10:11:03,423] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 9: [2023-05-10 10:11:03,425] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_72_mp_rank_00_optim_states.pt. 9: [2023-05-10 10:11:03,425] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 72 16: [2023-05-10 10:11:03,426] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 16: [2023-05-10 10:11:03,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 16: [2023-05-10 10:11:03,426] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 16: [2023-05-10 10:11:03,426] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 3: [2023-05-10 10:11:03,428] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 3: [2023-05-10 10:11:03,429] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 3: [2023-05-10 10:11:03,429] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 20: [2023-05-10 10:11:03,429] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 3: [2023-05-10 10:11:03,429] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 20: [2023-05-10 10:11:03,430] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 20: [2023-05-10 10:11:03,430] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 20: [2023-05-10 10:11:03,430] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 9: [2023-05-10 10:11:03,430] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 72 4: [2023-05-10 10:11:03,430] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 28: [2023-05-10 10:11:03,431] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 3: [2023-05-10 10:11:03,432] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 3: [2023-05-10 10:11:03,432] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 3: [2023-05-10 10:11:03,432] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 3: [2023-05-10 10:11:03,433] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 26: [2023-05-10 10:11:03,432] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 26: [2023-05-10 10:11:03,432] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 23: [2023-05-10 10:11:03,434] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_191_mp_rank_00_optim_states.pt. 23: [2023-05-10 10:11:03,434] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 191 5: [2023-05-10 10:11:03,434] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 5: [2023-05-10 10:11:03,435] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 5: [2023-05-10 10:11:03,435] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 5: [2023-05-10 10:11:03,435] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 5: [2023-05-10 10:11:03,435] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 5: [2023-05-10 10:11:03,435] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 5: [2023-05-10 10:11:03,435] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 5: [2023-05-10 10:11:03,436] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 17: [2023-05-10 10:11:03,436] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_137_mp_rank_00_optim_states.pt. 17: [2023-05-10 10:11:03,437] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 137 11: [2023-05-10 10:11:03,437] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 4: [2023-05-10 10:11:03,436] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 23: [2023-05-10 10:11:03,439] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 191 4: [2023-05-10 10:11:03,440] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 4: [2023-05-10 10:11:03,440] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 4: [2023-05-10 10:11:03,440] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 4: [2023-05-10 10:11:03,440] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 14: [2023-05-10 10:11:03,440] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 14: [2023-05-10 10:11:03,440] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 14: [2023-05-10 10:11:03,441] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 28: [2023-05-10 10:11:03,442] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 28: [2023-05-10 10:11:03,443] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 28: [2023-05-10 10:11:03,443] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 17: [2023-05-10 10:11:03,443] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 137 28: [2023-05-10 10:11:03,444] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 3: [2023-05-10 10:11:03,444] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 3: [2023-05-10 10:11:03,444] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 3: [2023-05-10 10:11:03,445] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 3: [2023-05-10 10:11:03,445] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 3: [2023-05-10 10:11:03,445] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 3: [2023-05-10 10:11:03,445] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 3: [2023-05-10 10:11:03,445] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 3: [2023-05-10 10:11:03,445] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 3: [2023-05-10 10:11:03,445] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 3: [2023-05-10 10:11:03,445] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 3: [2023-05-10 10:11:03,445] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 14: [2023-05-10 10:11:03,445] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 3: [2023-05-10 10:11:03,446] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 4: [2023-05-10 10:11:03,446] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 4: [2023-05-10 10:11:03,446] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 14: [2023-05-10 10:11:03,447] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 20: [2023-05-10 10:11:03,447] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 20: [2023-05-10 10:11:03,448] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 5: [2023-05-10 10:11:03,448] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 20: [2023-05-10 10:11:03,448] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 20: [2023-05-10 10:11:03,448] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 11: [2023-05-10 10:11:03,448] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 5: [2023-05-10 10:11:03,448] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 20: [2023-05-10 10:11:03,448] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 5: [2023-05-10 10:11:03,448] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 5: [2023-05-10 10:11:03,448] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 5: [2023-05-10 10:11:03,448] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 5: [2023-05-10 10:11:03,449] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 5: [2023-05-10 10:11:03,449] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 20: [2023-05-10 10:11:03,449] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 5: [2023-05-10 10:11:03,449] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 20: [2023-05-10 10:11:03,449] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 5: [2023-05-10 10:11:03,449] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 5: [2023-05-10 10:11:03,449] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 5: [2023-05-10 10:11:03,449] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 5: [2023-05-10 10:11:03,449] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 5: [2023-05-10 10:11:03,449] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 5: [2023-05-10 10:11:03,449] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 20: [2023-05-10 10:11:03,449] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 5: [2023-05-10 10:11:03,449] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 5: [2023-05-10 10:11:03,449] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 5: [2023-05-10 10:11:03,449] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 5: [2023-05-10 10:11:03,449] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 5: [2023-05-10 10:11:03,449] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 5: [2023-05-10 10:11:03,450] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 25: [2023-05-10 10:11:03,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_202_mp_rank_00_optim_states.pt... 25: [2023-05-10 10:11:03,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_207_mp_rank_00_optim_states.pt... 25: [2023-05-10 10:11:03,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_200_mp_rank_00_optim_states.pt... 25: [2023-05-10 10:11:03,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_205_mp_rank_00_optim_states.pt... 25: [2023-05-10 10:11:03,409] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_201_mp_rank_00_optim_states.pt... 11: [2023-05-10 10:11:03,451] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 28: [2023-05-10 10:11:03,451] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 11: [2023-05-10 10:11:03,451] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 11: [2023-05-10 10:11:03,451] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 11: [2023-05-10 10:11:03,451] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 28: [2023-05-10 10:11:03,452] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 28: [2023-05-10 10:11:03,452] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 28: [2023-05-10 10:11:03,452] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 11: [2023-05-10 10:11:03,452] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 4: [2023-05-10 10:11:03,454] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 17: [2023-05-10 10:11:03,454] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_139_mp_rank_00_optim_states.pt. 17: [2023-05-10 10:11:03,455] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 139 28: [2023-05-10 10:11:03,455] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 28: [2023-05-10 10:11:03,455] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 26: [2023-05-10 10:11:03,455] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 28: [2023-05-10 10:11:03,456] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 28: [2023-05-10 10:11:03,456] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 28: [2023-05-10 10:11:03,456] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 28: [2023-05-10 10:11:03,456] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 9: [2023-05-10 10:11:03,456] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_75_mp_rank_00_optim_states.pt. 28: [2023-05-10 10:11:03,456] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 28: [2023-05-10 10:11:03,456] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 28: [2023-05-10 10:11:03,456] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 9: [2023-05-10 10:11:03,457] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 75 28: [2023-05-10 10:11:03,457] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 28: [2023-05-10 10:11:03,457] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 28: [2023-05-10 10:11:03,457] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 14: [2023-05-10 10:11:03,457] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 10: [2023-05-10 10:11:03,457] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_87_mp_rank_00_optim_states.pt. 10: [2023-05-10 10:11:03,458] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 87 26: [2023-05-10 10:11:03,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 26: [2023-05-10 10:11:03,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 26: [2023-05-10 10:11:03,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 14: [2023-05-10 10:11:03,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 14: [2023-05-10 10:11:03,458] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 26: [2023-05-10 10:11:03,458] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 16: [2023-05-10 10:11:03,458] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 16: [2023-05-10 10:11:03,458] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 26: [2023-05-10 10:11:03,459] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 20: [2023-05-10 10:11:03,459] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 26: [2023-05-10 10:11:03,459] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 17: [2023-05-10 10:11:03,459] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_140_mp_rank_00_optim_states.pt. 26: [2023-05-10 10:11:03,459] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 26: [2023-05-10 10:11:03,459] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 16: [2023-05-10 10:11:03,459] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 16: [2023-05-10 10:11:03,459] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 16: [2023-05-10 10:11:03,459] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 16: [2023-05-10 10:11:03,459] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 17: [2023-05-10 10:11:03,459] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 140 26: [2023-05-10 10:11:03,459] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 20: [2023-05-10 10:11:03,459] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 16: [2023-05-10 10:11:03,459] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 20: [2023-05-10 10:11:03,459] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 16: [2023-05-10 10:11:03,459] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 26: [2023-05-10 10:11:03,460] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 20: [2023-05-10 10:11:03,460] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 11: [2023-05-10 10:11:03,460] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 20: [2023-05-10 10:11:03,460] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 17: [2023-05-10 10:11:03,461] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 139 20: [2023-05-10 10:11:03,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 11: [2023-05-10 10:11:03,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 11: [2023-05-10 10:11:03,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 16: [2023-05-10 10:11:03,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 20: [2023-05-10 10:11:03,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 16: [2023-05-10 10:11:03,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 20: [2023-05-10 10:11:03,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 26: [2023-05-10 10:11:03,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 26: [2023-05-10 10:11:03,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 26: [2023-05-10 10:11:03,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 11: [2023-05-10 10:11:03,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 20: [2023-05-10 10:11:03,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 16: [2023-05-10 10:11:03,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 20: [2023-05-10 10:11:03,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 30: [2023-05-10 10:11:03,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_245_mp_rank_00_optim_states.pt... 30: [2023-05-10 10:11:03,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_244_mp_rank_00_optim_states.pt... 30: [2023-05-10 10:11:03,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_243_mp_rank_00_optim_states.pt... 16: [2023-05-10 10:11:03,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 16: [2023-05-10 10:11:03,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 30: [2023-05-10 10:11:03,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_240_mp_rank_00_optim_states.pt... 30: [2023-05-10 10:11:03,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_241_mp_rank_00_optim_states.pt... 20: [2023-05-10 10:11:03,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 30: [2023-05-10 10:11:03,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_246_mp_rank_00_optim_states.pt... 30: [2023-05-10 10:11:03,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_242_mp_rank_00_optim_states.pt... 16: [2023-05-10 10:11:03,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 30: [2023-05-10 10:11:03,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_247_mp_rank_00_optim_states.pt... 16: [2023-05-10 10:11:03,461] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 13: [2023-05-10 10:11:03,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_104_mp_rank_00_optim_states.pt... 13: [2023-05-10 10:11:03,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_108_mp_rank_00_optim_states.pt... 16: [2023-05-10 10:11:03,462] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 13: [2023-05-10 10:11:03,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_110_mp_rank_00_optim_states.pt... 13: [2023-05-10 10:11:03,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_107_mp_rank_00_optim_states.pt... 20: [2023-05-10 10:11:03,462] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 13: [2023-05-10 10:11:03,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_109_mp_rank_00_optim_states.pt... 13: [2023-05-10 10:11:03,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_106_mp_rank_00_optim_states.pt... 13: [2023-05-10 10:11:03,461] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_105_mp_rank_00_optim_states.pt... 16: [2023-05-10 10:11:03,462] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 16: [2023-05-10 10:11:03,462] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 9: [2023-05-10 10:11:03,462] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 75 16: [2023-05-10 10:11:03,462] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 16: [2023-05-10 10:11:03,462] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 4: [2023-05-10 10:11:03,462] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 16: [2023-05-10 10:11:03,462] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 4: [2023-05-10 10:11:03,462] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 16: [2023-05-10 10:11:03,462] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 8: [2023-05-10 10:11:03,462] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_71_mp_rank_00_optim_states.pt. 16: [2023-05-10 10:11:03,462] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 16: [2023-05-10 10:11:03,462] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 4: [2023-05-10 10:11:03,462] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 8: [2023-05-10 10:11:03,463] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 71 20: [2023-05-10 10:11:03,463] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 28: [2023-05-10 10:11:03,463] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 20: [2023-05-10 10:11:03,464] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 20: [2023-05-10 10:11:03,464] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 28: [2023-05-10 10:11:03,464] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 28: [2023-05-10 10:11:03,464] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 20: [2023-05-10 10:11:03,464] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 28: [2023-05-10 10:11:03,464] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 10: [2023-05-10 10:11:03,464] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 87 10: [2023-05-10 10:11:03,465] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_85_mp_rank_00_optim_states.pt. 23: [2023-05-10 10:11:03,465] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_189_mp_rank_00_optim_states.pt. 23: [2023-05-10 10:11:03,465] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 189 17: [2023-05-10 10:11:03,465] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 140 10: [2023-05-10 10:11:03,465] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 85 14: [2023-05-10 10:11:03,465] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 14: [2023-05-10 10:11:03,465] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 23: [2023-05-10 10:11:03,465] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_186_mp_rank_00_optim_states.pt. 23: [2023-05-10 10:11:03,466] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 186 4: [2023-05-10 10:11:03,466] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 4: [2023-05-10 10:11:03,466] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 4: [2023-05-10 10:11:03,467] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 4: [2023-05-10 10:11:03,467] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 4: [2023-05-10 10:11:03,467] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 4: [2023-05-10 10:11:03,467] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 8: [2023-05-10 10:11:03,468] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 71 4: [2023-05-10 10:11:03,468] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 4: [2023-05-10 10:11:03,468] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 8: [2023-05-10 10:11:03,469] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_65_mp_rank_00_optim_states.pt. 11: [2023-05-10 10:11:03,469] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 11: [2023-05-10 10:11:03,469] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 11: [2023-05-10 10:11:03,469] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 4: [2023-05-10 10:11:03,469] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 11: [2023-05-10 10:11:03,469] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_36-model_00-model_states.pt. 4: [2023-05-10 10:11:03,470] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 8: [2023-05-10 10:11:03,470] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 65 4: [2023-05-10 10:11:03,470] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 10: [2023-05-10 10:11:03,470] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 85 4: [2023-05-10 10:11:03,470] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 4: [2023-05-10 10:11:03,470] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 4: [2023-05-10 10:11:03,470] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 4: [2023-05-10 10:11:03,470] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 23: [2023-05-10 10:11:03,471] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 189 4: [2023-05-10 10:11:03,471] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 14: [2023-05-10 10:11:03,470] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 14: [2023-05-10 10:11:03,471] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 4: [2023-05-10 10:11:03,471] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 14: [2023-05-10 10:11:03,471] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 14: [2023-05-10 10:11:03,471] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 14: [2023-05-10 10:11:03,471] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 23: [2023-05-10 10:11:03,472] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 186 14: [2023-05-10 10:11:03,473] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 4: [2023-05-10 10:11:03,473] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 4: [2023-05-10 10:11:03,473] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 4: [2023-05-10 10:11:03,474] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 4: [2023-05-10 10:11:03,474] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 14: [2023-05-10 10:11:03,474] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 4: [2023-05-10 10:11:03,474] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 14: [2023-05-10 10:11:03,474] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 4: [2023-05-10 10:11:03,474] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 8: [2023-05-10 10:11:03,475] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 65 14: [2023-05-10 10:11:03,475] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 14: [2023-05-10 10:11:03,475] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 14: [2023-05-10 10:11:03,475] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 14: [2023-05-10 10:11:03,475] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 4: [2023-05-10 10:11:03,475] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 14: [2023-05-10 10:11:03,475] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 14: [2023-05-10 10:11:03,475] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 4: [2023-05-10 10:11:03,477] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 11: [2023-05-10 10:11:03,477] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 4: [2023-05-10 10:11:03,478] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 4: [2023-05-10 10:11:03,478] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 11: [2023-05-10 10:11:03,478] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 4: [2023-05-10 10:11:03,478] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 11: [2023-05-10 10:11:03,478] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 11: [2023-05-10 10:11:03,478] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 11: [2023-05-10 10:11:03,478] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 11: [2023-05-10 10:11:03,479] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 11: [2023-05-10 10:11:03,479] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 11: [2023-05-10 10:11:03,479] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 14: [2023-05-10 10:11:03,480] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 14: [2023-05-10 10:11:03,481] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 14: [2023-05-10 10:11:03,481] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 14: [2023-05-10 10:11:03,482] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 8: [2023-05-10 10:11:03,484] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_69_mp_rank_00_optim_states.pt. 8: [2023-05-10 10:11:03,484] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 69 17: [2023-05-10 10:11:03,484] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_143_mp_rank_00_optim_states.pt. 17: [2023-05-10 10:11:03,485] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 143 14: [2023-05-10 10:11:03,487] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 14: [2023-05-10 10:11:03,488] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 14: [2023-05-10 10:11:03,488] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 14: [2023-05-10 10:11:03,488] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 14: [2023-05-10 10:11:03,488] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 14: [2023-05-10 10:11:03,488] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 14: [2023-05-10 10:11:03,488] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 26: [2023-05-10 10:11:03,488] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 14: [2023-05-10 10:11:03,488] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 14: [2023-05-10 10:11:03,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 14: [2023-05-10 10:11:03,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 14: [2023-05-10 10:11:03,489] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 26: [2023-05-10 10:11:03,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 26: [2023-05-10 10:11:03,489] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 14: [2023-05-10 10:11:03,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 8: [2023-05-10 10:11:03,489] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 69 26: [2023-05-10 10:11:03,489] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 17: [2023-05-10 10:11:03,490] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 143 17: [2023-05-10 10:11:03,491] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_141_mp_rank_00_optim_states.pt. 17: [2023-05-10 10:11:03,492] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 141 26: [2023-05-10 10:11:03,492] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 26: [2023-05-10 10:11:03,492] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 26: [2023-05-10 10:11:03,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 26: [2023-05-10 10:11:03,493] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 26: [2023-05-10 10:11:03,493] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 26: [2023-05-10 10:11:03,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 26: [2023-05-10 10:11:03,493] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 26: [2023-05-10 10:11:03,493] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 26: [2023-05-10 10:11:03,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 26: [2023-05-10 10:11:03,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 26: [2023-05-10 10:11:03,494] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 26: [2023-05-10 10:11:03,494] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 8: [2023-05-10 10:11:03,495] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_67_mp_rank_00_optim_states.pt. 20: [2023-05-10 10:11:03,495] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_167_mp_rank_00_optim_states.pt... 20: [2023-05-10 10:11:03,495] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_160_mp_rank_00_optim_states.pt... 20: [2023-05-10 10:11:03,495] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_162_mp_rank_00_optim_states.pt... 20: [2023-05-10 10:11:03,495] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_161_mp_rank_00_optim_states.pt... 20: [2023-05-10 10:11:03,495] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_164_mp_rank_00_optim_states.pt... 20: [2023-05-10 10:11:03,495] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_166_mp_rank_00_optim_states.pt... 20: [2023-05-10 10:11:03,495] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_165_mp_rank_00_optim_states.pt... 20: [2023-05-10 10:11:03,495] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_163_mp_rank_00_optim_states.pt... 8: [2023-05-10 10:11:03,496] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 67 26: [2023-05-10 10:11:03,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 17: [2023-05-10 10:11:03,496] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 141 17: [2023-05-10 10:11:03,496] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_138_mp_rank_00_optim_states.pt. 26: [2023-05-10 10:11:03,496] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 26: [2023-05-10 10:11:03,496] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 17: [2023-05-10 10:11:03,497] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 138 26: [2023-05-10 10:11:03,497] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 26: [2023-05-10 10:11:03,497] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 26: [2023-05-10 10:11:03,497] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 26: [2023-05-10 10:11:03,497] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 9: [2023-05-10 10:11:03,497] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_79_mp_rank_00_optim_states.pt. 26: [2023-05-10 10:11:03,498] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 9: [2023-05-10 10:11:03,498] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 79 11: [2023-05-10 10:11:03,498] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 11: [2023-05-10 10:11:03,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 11: [2023-05-10 10:11:03,499] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 11: [2023-05-10 10:11:03,499] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 5: [2023-05-10 10:11:03,500] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_40_mp_rank_00_optim_states.pt... 5: [2023-05-10 10:11:03,500] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_46_mp_rank_00_optim_states.pt... 5: [2023-05-10 10:11:03,500] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_42_mp_rank_00_optim_states.pt... 5: [2023-05-10 10:11:03,500] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_43_mp_rank_00_optim_states.pt... 5: [2023-05-10 10:11:03,500] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_45_mp_rank_00_optim_states.pt... 5: [2023-05-10 10:11:03,500] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_41_mp_rank_00_optim_states.pt... 5: [2023-05-10 10:11:03,500] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_44_mp_rank_00_optim_states.pt... 5: [2023-05-10 10:11:03,500] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_47_mp_rank_00_optim_states.pt... 8: [2023-05-10 10:11:03,501] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 67 3: [2023-05-10 10:11:03,501] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_28_mp_rank_00_optim_states.pt... 3: [2023-05-10 10:11:03,501] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_25_mp_rank_00_optim_states.pt... 3: [2023-05-10 10:11:03,501] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_30_mp_rank_00_optim_states.pt... 3: [2023-05-10 10:11:03,501] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_24_mp_rank_00_optim_states.pt... 3: [2023-05-10 10:11:03,501] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_26_mp_rank_00_optim_states.pt... 3: [2023-05-10 10:11:03,501] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_31_mp_rank_00_optim_states.pt... 3: [2023-05-10 10:11:03,501] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_29_mp_rank_00_optim_states.pt... 3: [2023-05-10 10:11:03,501] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_27_mp_rank_00_optim_states.pt... 11: [2023-05-10 10:11:03,502] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 11: [2023-05-10 10:11:03,502] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 11: [2023-05-10 10:11:03,502] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 17: [2023-05-10 10:11:03,502] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 138 28: [2023-05-10 10:11:03,503] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_226_mp_rank_00_optim_states.pt... 13: [2023-05-10 10:11:03,462] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_111_mp_rank_00_optim_states.pt... 11: [2023-05-10 10:11:03,503] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 28: [2023-05-10 10:11:03,503] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_228_mp_rank_00_optim_states.pt... 28: [2023-05-10 10:11:03,503] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_230_mp_rank_00_optim_states.pt... 28: [2023-05-10 10:11:03,503] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_224_mp_rank_00_optim_states.pt... 28: [2023-05-10 10:11:03,503] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_225_mp_rank_00_optim_states.pt... 28: [2023-05-10 10:11:03,503] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_227_mp_rank_00_optim_states.pt... 28: [2023-05-10 10:11:03,503] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_229_mp_rank_00_optim_states.pt... 28: [2023-05-10 10:11:03,503] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_231_mp_rank_00_optim_states.pt... 9: [2023-05-10 10:11:03,503] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 79 11: [2023-05-10 10:11:03,506] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 11: [2023-05-10 10:11:03,506] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 11: [2023-05-10 10:11:03,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 11: [2023-05-10 10:11:03,506] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 11: [2023-05-10 10:11:03,506] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 11: [2023-05-10 10:11:03,506] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt... 11: [2023-05-10 10:11:03,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 11: [2023-05-10 10:11:03,507] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/layer_38-model_00-model_states.pt. 9: [2023-05-10 10:11:03,516] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_78_mp_rank_00_optim_states.pt. 9: [2023-05-10 10:11:03,517] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 78 16: [2023-05-10 10:11:03,521] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_134_mp_rank_00_optim_states.pt... 16: [2023-05-10 10:11:03,521] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_132_mp_rank_00_optim_states.pt... 16: [2023-05-10 10:11:03,521] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_133_mp_rank_00_optim_states.pt... 16: [2023-05-10 10:11:03,521] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_131_mp_rank_00_optim_states.pt... 16: [2023-05-10 10:11:03,521] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_130_mp_rank_00_optim_states.pt... 16: [2023-05-10 10:11:03,521] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_129_mp_rank_00_optim_states.pt... 16: [2023-05-10 10:11:03,521] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_128_mp_rank_00_optim_states.pt... 16: [2023-05-10 10:11:03,521] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_135_mp_rank_00_optim_states.pt... 17: [2023-05-10 10:11:03,522] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_136_mp_rank_00_optim_states.pt. 9: [2023-05-10 10:11:03,522] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 78 17: [2023-05-10 10:11:03,523] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 136 9: [2023-05-10 10:11:03,524] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_73_mp_rank_00_optim_states.pt. 9: [2023-05-10 10:11:03,525] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 73 17: [2023-05-10 10:11:03,528] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 136 23: [2023-05-10 10:11:03,529] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_184_mp_rank_00_optim_states.pt. 23: [2023-05-10 10:11:03,530] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 184 9: [2023-05-10 10:11:03,530] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 73 17: [2023-05-10 10:11:03,530] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_142_mp_rank_00_optim_states.pt. 17: [2023-05-10 10:11:03,530] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 142 9: [2023-05-10 10:11:03,534] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_74_mp_rank_00_optim_states.pt. 9: [2023-05-10 10:11:03,535] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 74 23: [2023-05-10 10:11:03,535] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 184 17: [2023-05-10 10:11:03,536] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 142 9: [2023-05-10 10:11:03,540] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 74 29: [2023-05-10 10:11:03,543] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_232_mp_rank_00_optim_states.pt. 29: [2023-05-10 10:11:03,544] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 232 29: [2023-05-10 10:11:03,549] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 232 8: [2023-05-10 10:11:03,551] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_66_mp_rank_00_optim_states.pt. 8: [2023-05-10 10:11:03,552] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 66 11: [2023-05-10 10:11:03,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_94_mp_rank_00_optim_states.pt... 11: [2023-05-10 10:11:03,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_92_mp_rank_00_optim_states.pt... 11: [2023-05-10 10:11:03,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_90_mp_rank_00_optim_states.pt... 11: [2023-05-10 10:11:03,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_93_mp_rank_00_optim_states.pt... 11: [2023-05-10 10:11:03,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_89_mp_rank_00_optim_states.pt... 11: [2023-05-10 10:11:03,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_91_mp_rank_00_optim_states.pt... 11: [2023-05-10 10:11:03,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_88_mp_rank_00_optim_states.pt... 11: [2023-05-10 10:11:03,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_95_mp_rank_00_optim_states.pt... 26: [2023-05-10 10:11:03,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_213_mp_rank_00_optim_states.pt... 26: [2023-05-10 10:11:03,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_214_mp_rank_00_optim_states.pt... 26: [2023-05-10 10:11:03,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_210_mp_rank_00_optim_states.pt... 26: [2023-05-10 10:11:03,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_208_mp_rank_00_optim_states.pt... 26: [2023-05-10 10:11:03,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_209_mp_rank_00_optim_states.pt... 26: [2023-05-10 10:11:03,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_211_mp_rank_00_optim_states.pt... 26: [2023-05-10 10:11:03,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_215_mp_rank_00_optim_states.pt... 26: [2023-05-10 10:11:03,552] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_212_mp_rank_00_optim_states.pt... 4: [2023-05-10 10:11:03,553] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_32_mp_rank_00_optim_states.pt... 4: [2023-05-10 10:11:03,553] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_33_mp_rank_00_optim_states.pt... 4: [2023-05-10 10:11:03,553] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_39_mp_rank_00_optim_states.pt... 4: [2023-05-10 10:11:03,553] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_35_mp_rank_00_optim_states.pt... 8: [2023-05-10 10:11:03,557] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 66 14: [2023-05-10 10:11:03,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_113_mp_rank_00_optim_states.pt... 14: [2023-05-10 10:11:03,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_114_mp_rank_00_optim_states.pt... 14: [2023-05-10 10:11:03,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_112_mp_rank_00_optim_states.pt... 14: [2023-05-10 10:11:03,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_117_mp_rank_00_optim_states.pt... 14: [2023-05-10 10:11:03,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_119_mp_rank_00_optim_states.pt... 14: [2023-05-10 10:11:03,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_118_mp_rank_00_optim_states.pt... 14: [2023-05-10 10:11:03,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_116_mp_rank_00_optim_states.pt... 14: [2023-05-10 10:11:03,563] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_115_mp_rank_00_optim_states.pt... 19: [2023-05-10 10:11:03,563] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_153_mp_rank_00_optim_states.pt. 19: [2023-05-10 10:11:03,563] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_155_mp_rank_00_optim_states.pt. 19: [2023-05-10 10:11:03,563] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 155 19: [2023-05-10 10:11:03,563] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 153 29: [2023-05-10 10:11:03,566] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_234_mp_rank_00_optim_states.pt. 29: [2023-05-10 10:11:03,566] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 234 9: [2023-05-10 10:11:03,567] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_76_mp_rank_00_optim_states.pt. 9: [2023-05-10 10:11:03,567] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 76 8: [2023-05-10 10:11:03,569] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_64_mp_rank_00_optim_states.pt. 19: [2023-05-10 10:11:03,569] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 153 8: [2023-05-10 10:11:03,569] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 64 19: [2023-05-10 10:11:03,570] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 155 19: [2023-05-10 10:11:03,573] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_156_mp_rank_00_optim_states.pt. 9: [2023-05-10 10:11:03,573] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 76 19: [2023-05-10 10:11:03,573] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 156 29: [2023-05-10 10:11:03,574] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 234 29: [2023-05-10 10:11:03,574] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_237_mp_rank_00_optim_states.pt. 29: [2023-05-10 10:11:03,575] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 237 8: [2023-05-10 10:11:03,575] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 64 19: [2023-05-10 10:11:03,578] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 156 7: [2023-05-10 10:11:03,580] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_58_mp_rank_00_optim_states.pt. 29: [2023-05-10 10:11:03,580] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 237 7: [2023-05-10 10:11:03,580] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 58 29: [2023-05-10 10:11:03,582] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_238_mp_rank_00_optim_states.pt. 29: [2023-05-10 10:11:03,583] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 238 7: [2023-05-10 10:11:03,586] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 58 29: [2023-05-10 10:11:03,587] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 238 19: [2023-05-10 10:11:03,592] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_157_mp_rank_00_optim_states.pt. 19: [2023-05-10 10:11:03,592] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 157 4: [2023-05-10 10:11:03,553] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_38_mp_rank_00_optim_states.pt... 29: [2023-05-10 10:11:03,594] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_239_mp_rank_00_optim_states.pt. 4: [2023-05-10 10:11:03,553] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_36_mp_rank_00_optim_states.pt... 29: [2023-05-10 10:11:03,594] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 239 4: [2023-05-10 10:11:03,553] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_37_mp_rank_00_optim_states.pt... 4: [2023-05-10 10:11:03,553] [INFO] [torch_checkpoint_engine.py:21:load] [Torch] Loading checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_34_mp_rank_00_optim_states.pt... 19: [2023-05-10 10:11:03,596] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_154_mp_rank_00_optim_states.pt. 19: [2023-05-10 10:11:03,596] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 157 19: [2023-05-10 10:11:03,597] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 154 29: [2023-05-10 10:11:03,600] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 239 19: [2023-05-10 10:11:03,602] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 154 9: [2023-05-10 10:11:03,602] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_77_mp_rank_00_optim_states.pt. 9: [2023-05-10 10:11:03,603] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 77 7: [2023-05-10 10:11:03,604] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_63_mp_rank_00_optim_states.pt. 7: [2023-05-10 10:11:03,604] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 63 9: [2023-05-10 10:11:03,609] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 77 7: [2023-05-10 10:11:03,609] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 63 2: [2023-05-10 10:11:03,624] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_21_mp_rank_00_optim_states.pt. 2: [2023-05-10 10:11:03,625] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 21 2: [2023-05-10 10:11:03,631] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 21 8: [2023-05-10 10:11:03,636] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_68_mp_rank_00_optim_states.pt. 2: [2023-05-10 10:11:03,636] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_18_mp_rank_00_optim_states.pt. 8: [2023-05-10 10:11:03,637] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 68 2: [2023-05-10 10:11:03,637] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 18 7: [2023-05-10 10:11:03,641] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_61_mp_rank_00_optim_states.pt. 2: [2023-05-10 10:11:03,642] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 18 7: [2023-05-10 10:11:03,642] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 61 8: [2023-05-10 10:11:03,642] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 68 7: [2023-05-10 10:11:03,646] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_59_mp_rank_00_optim_states.pt. 7: [2023-05-10 10:11:03,647] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 59 7: [2023-05-10 10:11:03,648] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 61 19: [2023-05-10 10:11:03,651] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_152_mp_rank_00_optim_states.pt. 19: [2023-05-10 10:11:03,652] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 152 7: [2023-05-10 10:11:03,652] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 59 7: [2023-05-10 10:11:03,653] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_60_mp_rank_00_optim_states.pt. 7: [2023-05-10 10:11:03,654] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 60 19: [2023-05-10 10:11:03,659] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 152 7: [2023-05-10 10:11:03,659] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 60 1: [2023-05-10 10:11:03,662] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_9_mp_rank_00_optim_states.pt. 29: [2023-05-10 10:11:03,662] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_233_mp_rank_00_optim_states.pt. 29: [2023-05-10 10:11:03,662] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 233 1: [2023-05-10 10:11:03,663] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 9 18: [2023-05-10 10:11:03,664] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_148_mp_rank_00_optim_states.pt. 18: [2023-05-10 10:11:03,664] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 148 29: [2023-05-10 10:11:03,667] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 233 1: [2023-05-10 10:11:03,668] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 9 19: [2023-05-10 10:11:03,668] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_158_mp_rank_00_optim_states.pt. 19: [2023-05-10 10:11:03,669] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 158 18: [2023-05-10 10:11:03,671] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 148 19: [2023-05-10 10:11:03,674] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 158 23: [2023-05-10 10:11:03,680] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_188_mp_rank_00_optim_states.pt. 23: [2023-05-10 10:11:03,680] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 188 29: [2023-05-10 10:11:03,682] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_235_mp_rank_00_optim_states.pt. 29: [2023-05-10 10:11:03,682] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 235 23: [2023-05-10 10:11:03,685] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 188 29: [2023-05-10 10:11:03,687] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 235 8: [2023-05-10 10:11:03,688] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_70_mp_rank_00_optim_states.pt. 8: [2023-05-10 10:11:03,689] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 70 7: [2023-05-10 10:11:03,693] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_57_mp_rank_00_optim_states.pt. 7: [2023-05-10 10:11:03,693] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 57 8: [2023-05-10 10:11:03,694] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 70 7: [2023-05-10 10:11:03,699] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 57 29: [2023-05-10 10:11:03,705] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_236_mp_rank_00_optim_states.pt. 29: [2023-05-10 10:11:03,705] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 236 1: [2023-05-10 10:11:03,705] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_15_mp_rank_00_optim_states.pt. 1: [2023-05-10 10:11:03,706] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 15 2: [2023-05-10 10:11:03,707] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_17_mp_rank_00_optim_states.pt. 31: [2023-05-10 10:11:03,708] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_248_mp_rank_00_optim_states.pt. 2: [2023-05-10 10:11:03,708] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 17 31: [2023-05-10 10:11:03,708] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 248 27: [2023-05-10 10:11:03,709] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_217_mp_rank_00_optim_states.pt. 27: [2023-05-10 10:11:03,709] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 217 29: [2023-05-10 10:11:03,711] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 236 1: [2023-05-10 10:11:03,711] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 15 2: [2023-05-10 10:11:03,712] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_22_mp_rank_00_optim_states.pt. 7: [2023-05-10 10:11:03,712] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_62_mp_rank_00_optim_states.pt. 2: [2023-05-10 10:11:03,712] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 22 6: [2023-05-10 10:11:03,712] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_48_mp_rank_00_optim_states.pt. 7: [2023-05-10 10:11:03,713] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 62 6: [2023-05-10 10:11:03,713] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 48 31: [2023-05-10 10:11:03,713] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 248 27: [2023-05-10 10:11:03,713] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 217 2: [2023-05-10 10:11:03,714] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 17 2: [2023-05-10 10:11:03,715] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_20_mp_rank_00_optim_states.pt. 2: [2023-05-10 10:11:03,716] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 20 6: [2023-05-10 10:11:03,718] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 48 2: [2023-05-10 10:11:03,719] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 22 7: [2023-05-10 10:11:03,719] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 62 1: [2023-05-10 10:11:03,720] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_8_mp_rank_00_optim_states.pt. 1: [2023-05-10 10:11:03,721] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 8 2: [2023-05-10 10:11:03,721] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 20 7: [2023-05-10 10:11:03,723] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_56_mp_rank_00_optim_states.pt. 7: [2023-05-10 10:11:03,724] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 56 19: [2023-05-10 10:11:03,724] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_159_mp_rank_00_optim_states.pt. 19: [2023-05-10 10:11:03,724] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 159 31: [2023-05-10 10:11:03,724] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_251_mp_rank_00_optim_states.pt. 31: [2023-05-10 10:11:03,724] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 251 1: [2023-05-10 10:11:03,726] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 8 7: [2023-05-10 10:11:03,729] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 56 31: [2023-05-10 10:11:03,730] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 251 19: [2023-05-10 10:11:03,730] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 159 1: [2023-05-10 10:11:03,732] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_11_mp_rank_00_optim_states.pt. 1: [2023-05-10 10:11:03,733] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 11 1: [2023-05-10 10:11:03,738] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 11 2: [2023-05-10 10:11:03,747] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_16_mp_rank_00_optim_states.pt. 2: [2023-05-10 10:11:03,747] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 16 2: [2023-05-10 10:11:03,753] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 16 18: [2023-05-10 10:11:03,755] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_149_mp_rank_00_optim_states.pt. 2: [2023-05-10 10:11:03,756] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_23_mp_rank_00_optim_states.pt. 18: [2023-05-10 10:11:03,756] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 149 2: [2023-05-10 10:11:03,757] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 23 12: [2023-05-10 10:11:03,758] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_103_mp_rank_00_optim_states.pt. 12: [2023-05-10 10:11:03,759] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 103 18: [2023-05-10 10:11:03,761] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 149 27: [2023-05-10 10:11:03,762] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_222_mp_rank_00_optim_states.pt. 27: [2023-05-10 10:11:03,762] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 222 18: [2023-05-10 10:11:03,762] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_150_mp_rank_00_optim_states.pt. 2: [2023-05-10 10:11:03,763] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 23 18: [2023-05-10 10:11:03,763] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 150 12: [2023-05-10 10:11:03,764] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_102_mp_rank_00_optim_states.pt. 12: [2023-05-10 10:11:03,764] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 102 12: [2023-05-10 10:11:03,765] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 103 1: [2023-05-10 10:11:03,766] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_14_mp_rank_00_optim_states.pt. 1: [2023-05-10 10:11:03,766] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 14 27: [2023-05-10 10:11:03,768] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 222 18: [2023-05-10 10:11:03,768] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 150 31: [2023-05-10 10:11:03,769] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_250_mp_rank_00_optim_states.pt. 31: [2023-05-10 10:11:03,769] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 250 12: [2023-05-10 10:11:03,770] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 102 1: [2023-05-10 10:11:03,772] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 14 31: [2023-05-10 10:11:03,774] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 250 21: [2023-05-10 10:11:03,774] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_170_mp_rank_00_optim_states.pt. 21: [2023-05-10 10:11:03,774] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 170 18: [2023-05-10 10:11:03,775] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_145_mp_rank_00_optim_states.pt. 18: [2023-05-10 10:11:03,775] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 145 2: [2023-05-10 10:11:03,776] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_19_mp_rank_00_optim_states.pt. 2: [2023-05-10 10:11:03,777] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 19 12: [2023-05-10 10:11:03,779] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_98_mp_rank_00_optim_states.pt. 12: [2023-05-10 10:11:03,779] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 98 21: [2023-05-10 10:11:03,779] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 170 18: [2023-05-10 10:11:03,780] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 145 31: [2023-05-10 10:11:03,782] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_254_mp_rank_00_optim_states.pt. 31: [2023-05-10 10:11:03,782] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 254 2: [2023-05-10 10:11:03,783] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 19 12: [2023-05-10 10:11:03,785] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 98 31: [2023-05-10 10:11:03,788] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 254 21: [2023-05-10 10:11:03,789] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_169_mp_rank_00_optim_states.pt. 21: [2023-05-10 10:11:03,789] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 169 22: [2023-05-10 10:11:03,792] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_180_mp_rank_00_optim_states.pt. 18: [2023-05-10 10:11:03,793] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_146_mp_rank_00_optim_states.pt. 22: [2023-05-10 10:11:03,793] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 180 18: [2023-05-10 10:11:03,793] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 146 21: [2023-05-10 10:11:03,794] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 169 18: [2023-05-10 10:11:03,798] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 146 22: [2023-05-10 10:11:03,799] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 180 31: [2023-05-10 10:11:03,801] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_252_mp_rank_00_optim_states.pt. 31: [2023-05-10 10:11:03,801] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 252 31: [2023-05-10 10:11:03,804] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_253_mp_rank_00_optim_states.pt. 31: [2023-05-10 10:11:03,805] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 253 31: [2023-05-10 10:11:03,805] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_255_mp_rank_00_optim_states.pt. 31: [2023-05-10 10:11:03,805] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 255 1: [2023-05-10 10:11:03,807] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_10_mp_rank_00_optim_states.pt. 1: [2023-05-10 10:11:03,808] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 10 31: [2023-05-10 10:11:03,808] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 252 31: [2023-05-10 10:11:03,810] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 255 0: [2023-05-10 10:11:03,810] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_7_mp_rank_00_optim_states.pt. 0: [2023-05-10 10:11:03,811] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 7 31: [2023-05-10 10:11:03,811] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 253 1: [2023-05-10 10:11:03,813] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 10 24: [2023-05-10 10:11:03,814] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_199_mp_rank_00_optim_states.pt. 24: [2023-05-10 10:11:03,815] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 199 0: [2023-05-10 10:11:03,816] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 7 21: [2023-05-10 10:11:03,816] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_174_mp_rank_00_optim_states.pt. 21: [2023-05-10 10:11:03,817] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 174 27: [2023-05-10 10:11:03,817] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_220_mp_rank_00_optim_states.pt. 27: [2023-05-10 10:11:03,817] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 220 25: [2023-05-10 10:11:03,817] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_205_mp_rank_00_optim_states.pt. 25: [2023-05-10 10:11:03,817] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 205 24: [2023-05-10 10:11:03,819] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 199 21: [2023-05-10 10:11:03,821] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 174 1: [2023-05-10 10:11:03,822] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_13_mp_rank_00_optim_states.pt. 1: [2023-05-10 10:11:03,822] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 13 27: [2023-05-10 10:11:03,822] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 220 25: [2023-05-10 10:11:03,823] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 205 25: [2023-05-10 10:11:03,825] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_201_mp_rank_00_optim_states.pt. 25: [2023-05-10 10:11:03,825] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 201 24: [2023-05-10 10:11:03,826] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_198_mp_rank_00_optim_states.pt. 24: [2023-05-10 10:11:03,826] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 198 13: [2023-05-10 10:11:03,827] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_108_mp_rank_00_optim_states.pt. 0: [2023-05-10 10:11:03,827] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_5_mp_rank_00_optim_states.pt. 13: [2023-05-10 10:11:03,827] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 108 0: [2023-05-10 10:11:03,828] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 5 1: [2023-05-10 10:11:03,828] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 13 12: [2023-05-10 10:11:03,829] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_96_mp_rank_00_optim_states.pt. 12: [2023-05-10 10:11:03,829] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 96 25: [2023-05-10 10:11:03,830] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 201 1: [2023-05-10 10:11:03,831] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_12_mp_rank_00_optim_states.pt. 1: [2023-05-10 10:11:03,831] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 12 6: [2023-05-10 10:11:03,832] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_54_mp_rank_00_optim_states.pt. 24: [2023-05-10 10:11:03,832] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 198 6: [2023-05-10 10:11:03,833] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 54 0: [2023-05-10 10:11:03,833] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 5 13: [2023-05-10 10:11:03,834] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 108 12: [2023-05-10 10:11:03,836] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 96 1: [2023-05-10 10:11:03,837] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 12 6: [2023-05-10 10:11:03,838] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 54 18: [2023-05-10 10:11:03,840] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_151_mp_rank_00_optim_states.pt. 18: [2023-05-10 10:11:03,841] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 151 0: [2023-05-10 10:11:03,844] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_2_mp_rank_00_optim_states.pt. 18: [2023-05-10 10:11:03,844] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_144_mp_rank_00_optim_states.pt. 0: [2023-05-10 10:11:03,844] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 2 18: [2023-05-10 10:11:03,845] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 144 18: [2023-05-10 10:11:03,846] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 151 22: [2023-05-10 10:11:03,848] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_178_mp_rank_00_optim_states.pt. 22: [2023-05-10 10:11:03,848] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 178 0: [2023-05-10 10:11:03,850] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 2 18: [2023-05-10 10:11:03,850] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 144 6: [2023-05-10 10:11:03,850] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_52_mp_rank_00_optim_states.pt. 6: [2023-05-10 10:11:03,851] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 52 27: [2023-05-10 10:11:03,852] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_219_mp_rank_00_optim_states.pt. 27: [2023-05-10 10:11:03,852] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 219 22: [2023-05-10 10:11:03,854] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 178 21: [2023-05-10 10:11:03,855] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_172_mp_rank_00_optim_states.pt. 21: [2023-05-10 10:11:03,855] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 172 6: [2023-05-10 10:11:03,855] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 52 31: [2023-05-10 10:11:03,856] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_249_mp_rank_00_optim_states.pt. 31: [2023-05-10 10:11:03,856] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 249 30: [2023-05-10 10:11:03,856] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_244_mp_rank_00_optim_states.pt. 30: [2023-05-10 10:11:03,857] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 244 27: [2023-05-10 10:11:03,858] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 219 16: [2023-05-10 10:11:03,859] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_131_mp_rank_00_optim_states.pt. 16: [2023-05-10 10:11:03,860] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 131 25: [2023-05-10 10:11:03,860] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_200_mp_rank_00_optim_states.pt. 25: [2023-05-10 10:11:03,860] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 200 24: [2023-05-10 10:11:03,861] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_193_mp_rank_00_optim_states.pt. 24: [2023-05-10 10:11:03,861] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 193 12: [2023-05-10 10:11:03,862] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_97_mp_rank_00_optim_states.pt. 31: [2023-05-10 10:11:03,862] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 249 30: [2023-05-10 10:11:03,863] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 244 24: [2023-05-10 10:11:03,863] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_192_mp_rank_00_optim_states.pt. 12: [2023-05-10 10:11:03,863] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 97 21: [2023-05-10 10:11:03,863] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 172 24: [2023-05-10 10:11:03,863] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 192 16: [2023-05-10 10:11:03,865] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 131 25: [2023-05-10 10:11:03,865] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 200 27: [2023-05-10 10:11:03,865] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_216_mp_rank_00_optim_states.pt. 27: [2023-05-10 10:11:03,865] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 216 24: [2023-05-10 10:11:03,866] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 193 0: [2023-05-10 10:11:03,867] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_6_mp_rank_00_optim_states.pt. 0: [2023-05-10 10:11:03,868] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 6 18: [2023-05-10 10:11:03,868] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_147_mp_rank_00_optim_states.pt. 24: [2023-05-10 10:11:03,868] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 192 12: [2023-05-10 10:11:03,869] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 97 18: [2023-05-10 10:11:03,869] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 147 27: [2023-05-10 10:11:03,871] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 216 0: [2023-05-10 10:11:03,873] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 6 18: [2023-05-10 10:11:03,874] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 147 21: [2023-05-10 10:11:03,874] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_171_mp_rank_00_optim_states.pt. 21: [2023-05-10 10:11:03,874] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 171 21: [2023-05-10 10:11:03,880] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 171 20: [2023-05-10 10:11:03,880] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_161_mp_rank_00_optim_states.pt. 20: [2023-05-10 10:11:03,881] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 161 6: [2023-05-10 10:11:03,882] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_50_mp_rank_00_optim_states.pt. 6: [2023-05-10 10:11:03,883] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 50 6: [2023-05-10 10:11:03,883] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_49_mp_rank_00_optim_states.pt. 6: [2023-05-10 10:11:03,883] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 49 30: [2023-05-10 10:11:03,885] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_247_mp_rank_00_optim_states.pt. 30: [2023-05-10 10:11:03,885] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 247 5: [2023-05-10 10:11:03,888] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_43_mp_rank_00_optim_states.pt. 5: [2023-05-10 10:11:03,889] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 43 6: [2023-05-10 10:11:03,889] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 50 6: [2023-05-10 10:11:03,889] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 49 20: [2023-05-10 10:11:03,889] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 161 12: [2023-05-10 10:11:03,890] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_100_mp_rank_00_optim_states.pt. 12: [2023-05-10 10:11:03,890] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 100 21: [2023-05-10 10:11:03,891] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_173_mp_rank_00_optim_states.pt. 21: [2023-05-10 10:11:03,891] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 173 30: [2023-05-10 10:11:03,892] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 247 28: [2023-05-10 10:11:03,892] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_227_mp_rank_00_optim_states.pt. 28: [2023-05-10 10:11:03,892] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 227 5: [2023-05-10 10:11:03,893] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 43 25: [2023-05-10 10:11:03,895] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_202_mp_rank_00_optim_states.pt. 0: [2023-05-10 10:11:03,895] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_4_mp_rank_00_optim_states.pt. 25: [2023-05-10 10:11:03,895] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 202 12: [2023-05-10 10:11:03,896] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 100 0: [2023-05-10 10:11:03,896] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 4 21: [2023-05-10 10:11:03,897] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 173 25: [2023-05-10 10:11:03,897] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_203_mp_rank_00_optim_states.pt. 25: [2023-05-10 10:11:03,897] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 203 28: [2023-05-10 10:11:03,898] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 227 25: [2023-05-10 10:11:03,901] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 202 0: [2023-05-10 10:11:03,902] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 4 25: [2023-05-10 10:11:03,902] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 203 27: [2023-05-10 10:11:03,904] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_221_mp_rank_00_optim_states.pt. 27: [2023-05-10 10:11:03,904] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 221 27: [2023-05-10 10:11:03,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_218_mp_rank_00_optim_states.pt. 0: [2023-05-10 10:11:03,905] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_1_mp_rank_00_optim_states.pt. 27: [2023-05-10 10:11:03,905] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 218 13: [2023-05-10 10:11:03,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_109_mp_rank_00_optim_states.pt. 0: [2023-05-10 10:11:03,906] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 1 13: [2023-05-10 10:11:03,906] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 109 30: [2023-05-10 10:11:03,906] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_246_mp_rank_00_optim_states.pt. 30: [2023-05-10 10:11:03,907] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 246 16: [2023-05-10 10:11:03,907] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_135_mp_rank_00_optim_states.pt. 16: [2023-05-10 10:11:03,907] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 135 22: [2023-05-10 10:11:03,908] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_183_mp_rank_00_optim_states.pt. 27: [2023-05-10 10:11:03,909] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 221 22: [2023-05-10 10:11:03,909] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 183 22: [2023-05-10 10:11:03,910] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_176_mp_rank_00_optim_states.pt. 22: [2023-05-10 10:11:03,911] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 176 0: [2023-05-10 10:11:03,911] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 1 27: [2023-05-10 10:11:03,911] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 218 30: [2023-05-10 10:11:03,911] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_243_mp_rank_00_optim_states.pt. 30: [2023-05-10 10:11:03,912] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 243 13: [2023-05-10 10:11:03,912] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 109 16: [2023-05-10 10:11:03,913] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 135 30: [2023-05-10 10:11:03,913] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 246 13: [2023-05-10 10:11:03,913] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_106_mp_rank_00_optim_states.pt. 6: [2023-05-10 10:11:03,913] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_55_mp_rank_00_optim_states.pt. 13: [2023-05-10 10:11:03,913] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 106 16: [2023-05-10 10:11:03,913] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_130_mp_rank_00_optim_states.pt. 6: [2023-05-10 10:11:03,914] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 55 16: [2023-05-10 10:11:03,914] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 130 22: [2023-05-10 10:11:03,914] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 183 27: [2023-05-10 10:11:03,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_223_mp_rank_00_optim_states.pt. 27: [2023-05-10 10:11:03,916] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 223 22: [2023-05-10 10:11:03,916] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 176 5: [2023-05-10 10:11:03,916] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_41_mp_rank_00_optim_states.pt. 5: [2023-05-10 10:11:03,917] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 41 30: [2023-05-10 10:11:03,918] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 243 16: [2023-05-10 10:11:03,919] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 130 13: [2023-05-10 10:11:03,919] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 106 3: [2023-05-10 10:11:03,919] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_24_mp_rank_00_optim_states.pt. 6: [2023-05-10 10:11:03,920] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 55 3: [2023-05-10 10:11:03,920] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 24 27: [2023-05-10 10:11:03,921] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 223 6: [2023-05-10 10:11:03,921] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_53_mp_rank_00_optim_states.pt. 5: [2023-05-10 10:11:03,922] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 41 6: [2023-05-10 10:11:03,922] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 53 12: [2023-05-10 10:11:03,924] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_101_mp_rank_00_optim_states.pt. 0: [2023-05-10 10:11:03,924] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_3_mp_rank_00_optim_states.pt. 12: [2023-05-10 10:11:03,924] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 101 0: [2023-05-10 10:11:03,925] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_0_mp_rank_00_optim_states.pt. 0: [2023-05-10 10:11:03,925] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 3 0: [2023-05-10 10:11:03,925] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 0 3: [2023-05-10 10:11:03,926] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_31_mp_rank_00_optim_states.pt. 3: [2023-05-10 10:11:03,927] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 24 22: [2023-05-10 10:11:03,927] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_182_mp_rank_00_optim_states.pt. 3: [2023-05-10 10:11:03,927] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 31 22: [2023-05-10 10:11:03,927] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 182 6: [2023-05-10 10:11:03,927] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 53 20: [2023-05-10 10:11:03,929] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_165_mp_rank_00_optim_states.pt. 20: [2023-05-10 10:11:03,929] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 165 13: [2023-05-10 10:11:03,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_105_mp_rank_00_optim_states.pt. 13: [2023-05-10 10:11:03,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_111_mp_rank_00_optim_states.pt. 13: [2023-05-10 10:11:03,930] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 111 13: [2023-05-10 10:11:03,930] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 105 12: [2023-05-10 10:11:03,930] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 101 25: [2023-05-10 10:11:03,930] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_207_mp_rank_00_optim_states.pt. 0: [2023-05-10 10:11:03,931] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 0 0: [2023-05-10 10:11:03,931] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 3 25: [2023-05-10 10:11:03,931] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 207 3: [2023-05-10 10:11:03,932] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 31 6: [2023-05-10 10:11:03,932] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_51_mp_rank_00_optim_states.pt. 6: [2023-05-10 10:11:03,932] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 51 22: [2023-05-10 10:11:03,933] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 182 0: could not find arguments in the checkpoint ... 0: checkpoint version 3.0 20: [2023-05-10 10:11:03,935] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 165 20: [2023-05-10 10:11:03,936] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_160_mp_rank_00_optim_states.pt. 13: [2023-05-10 10:11:03,936] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 111 13: [2023-05-10 10:11:03,936] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 105 20: [2023-05-10 10:11:03,936] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 160 25: [2023-05-10 10:11:03,936] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 207 6: [2023-05-10 10:11:03,938] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 51 21: [2023-05-10 10:11:03,938] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_175_mp_rank_00_optim_states.pt. 21: [2023-05-10 10:11:03,939] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 175 20: [2023-05-10 10:11:03,941] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 160 16: [2023-05-10 10:11:03,943] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_132_mp_rank_00_optim_states.pt. 16: [2023-05-10 10:11:03,944] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 132 22: [2023-05-10 10:11:03,944] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_181_mp_rank_00_optim_states.pt. 30: [2023-05-10 10:11:03,944] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_241_mp_rank_00_optim_states.pt. 21: [2023-05-10 10:11:03,944] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 175 30: [2023-05-10 10:11:03,944] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 241 22: [2023-05-10 10:11:03,944] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 181 16: [2023-05-10 10:11:03,948] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 132 25: [2023-05-10 10:11:03,948] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_206_mp_rank_00_optim_states.pt. 25: [2023-05-10 10:11:03,949] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 206 3: [2023-05-10 10:11:03,949] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_26_mp_rank_00_optim_states.pt. 3: [2023-05-10 10:11:03,950] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 26 22: [2023-05-10 10:11:03,950] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 181 30: [2023-05-10 10:11:03,950] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 241 5: [2023-05-10 10:11:03,951] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_46_mp_rank_00_optim_states.pt. 5: [2023-05-10 10:11:03,951] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 46 24: [2023-05-10 10:11:03,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_194_mp_rank_00_optim_states.pt. 21: [2023-05-10 10:11:03,954] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_168_mp_rank_00_optim_states.pt. 24: [2023-05-10 10:11:03,954] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 194 25: [2023-05-10 10:11:03,954] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 206 21: [2023-05-10 10:11:03,954] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 168 3: [2023-05-10 10:11:03,955] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 26 5: [2023-05-10 10:11:03,956] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 46 24: [2023-05-10 10:11:03,958] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_195_mp_rank_00_optim_states.pt. 24: [2023-05-10 10:11:03,958] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 195 24: [2023-05-10 10:11:03,959] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 194 4: [2023-05-10 10:11:03,959] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_36_mp_rank_00_optim_states.pt. 4: [2023-05-10 10:11:03,960] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 36 21: [2023-05-10 10:11:03,960] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 168 12: [2023-05-10 10:11:03,960] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_99_mp_rank_00_optim_states.pt. 12: [2023-05-10 10:11:03,961] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 99 24: [2023-05-10 10:11:03,963] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 195 30: [2023-05-10 10:11:03,964] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_245_mp_rank_00_optim_states.pt. 30: [2023-05-10 10:11:03,965] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 245 4: [2023-05-10 10:11:03,965] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 36 5: [2023-05-10 10:11:03,965] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_42_mp_rank_00_optim_states.pt. 5: [2023-05-10 10:11:03,966] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 42 28: [2023-05-10 10:11:03,966] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_225_mp_rank_00_optim_states.pt. 28: [2023-05-10 10:11:03,967] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 225 12: [2023-05-10 10:11:03,967] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 99 5: [2023-05-10 10:11:03,971] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 42 30: [2023-05-10 10:11:03,972] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 245 28: [2023-05-10 10:11:03,972] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 225 22: [2023-05-10 10:11:03,974] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_177_mp_rank_00_optim_states.pt. 22: [2023-05-10 10:11:03,975] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 177 22: [2023-05-10 10:11:03,976] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_179_mp_rank_00_optim_states.pt. 22: [2023-05-10 10:11:03,977] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 179 26: [2023-05-10 10:11:03,978] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_212_mp_rank_00_optim_states.pt. 26: [2023-05-10 10:11:03,979] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 212 28: [2023-05-10 10:11:03,981] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_224_mp_rank_00_optim_states.pt. 28: [2023-05-10 10:11:03,981] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 224 22: [2023-05-10 10:11:03,982] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 177 22: [2023-05-10 10:11:03,984] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 179 26: [2023-05-10 10:11:03,984] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 212 24: [2023-05-10 10:11:03,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_197_mp_rank_00_optim_states.pt. 24: [2023-05-10 10:11:03,986] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 197 25: [2023-05-10 10:11:03,986] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_204_mp_rank_00_optim_states.pt. 28: [2023-05-10 10:11:03,987] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 224 25: [2023-05-10 10:11:03,987] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 204 25: [2023-05-10 10:11:03,993] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 204 24: [2023-05-10 10:11:03,993] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 197 13: [2023-05-10 10:11:03,996] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_107_mp_rank_00_optim_states.pt. 13: [2023-05-10 10:11:03,996] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 107 20: [2023-05-10 10:11:03,997] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_167_mp_rank_00_optim_states.pt. 20: [2023-05-10 10:11:03,997] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 167 28: [2023-05-10 10:11:03,998] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_229_mp_rank_00_optim_states.pt. 28: [2023-05-10 10:11:03,999] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 229 20: [2023-05-10 10:11:04,000] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_164_mp_rank_00_optim_states.pt. 20: [2023-05-10 10:11:04,000] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 164 14: [2023-05-10 10:11:04,000] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_117_mp_rank_00_optim_states.pt. 14: [2023-05-10 10:11:04,001] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 117 13: [2023-05-10 10:11:04,002] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 107 5: [2023-05-10 10:11:04,002] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_40_mp_rank_00_optim_states.pt. 5: [2023-05-10 10:11:04,003] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 40 28: [2023-05-10 10:11:04,003] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 229 20: [2023-05-10 10:11:04,004] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 167 14: [2023-05-10 10:11:04,006] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 117 20: [2023-05-10 10:11:04,006] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 164 5: [2023-05-10 10:11:04,008] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 40 13: [2023-05-10 10:11:04,008] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_110_mp_rank_00_optim_states.pt. 13: [2023-05-10 10:11:04,008] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 110 30: [2023-05-10 10:11:04,011] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_242_mp_rank_00_optim_states.pt. 30: [2023-05-10 10:11:04,012] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 242 13: [2023-05-10 10:11:04,014] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 110 28: [2023-05-10 10:11:04,015] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_230_mp_rank_00_optim_states.pt. 28: [2023-05-10 10:11:04,015] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 230 26: [2023-05-10 10:11:04,016] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_214_mp_rank_00_optim_states.pt. 26: [2023-05-10 10:11:04,016] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 214 30: [2023-05-10 10:11:04,017] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 242 16: [2023-05-10 10:11:04,018] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_133_mp_rank_00_optim_states.pt. 16: [2023-05-10 10:11:04,018] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 133 20: [2023-05-10 10:11:04,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_162_mp_rank_00_optim_states.pt. 20: [2023-05-10 10:11:04,019] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 162 24: [2023-05-10 10:11:04,019] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_196_mp_rank_00_optim_states.pt. 24: [2023-05-10 10:11:04,020] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 196 28: [2023-05-10 10:11:04,021] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 230 26: [2023-05-10 10:11:04,021] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 214 3: [2023-05-10 10:11:04,022] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_28_mp_rank_00_optim_states.pt. 3: [2023-05-10 10:11:04,023] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 28 16: [2023-05-10 10:11:04,023] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 133 20: [2023-05-10 10:11:04,024] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 162 28: [2023-05-10 10:11:04,024] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_228_mp_rank_00_optim_states.pt. 28: [2023-05-10 10:11:04,024] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 228 24: [2023-05-10 10:11:04,024] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 196 26: [2023-05-10 10:11:04,025] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_209_mp_rank_00_optim_states.pt. 26: [2023-05-10 10:11:04,026] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 209 28: [2023-05-10 10:11:04,027] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_231_mp_rank_00_optim_states.pt. 28: [2023-05-10 10:11:04,027] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 231 3: [2023-05-10 10:11:04,028] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 28 28: [2023-05-10 10:11:04,029] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 228 11: [2023-05-10 10:11:04,030] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_92_mp_rank_00_optim_states.pt. 11: [2023-05-10 10:11:04,030] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 92 26: [2023-05-10 10:11:04,031] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 209 28: [2023-05-10 10:11:04,032] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 231 20: [2023-05-10 10:11:04,033] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_166_mp_rank_00_optim_states.pt. 20: [2023-05-10 10:11:04,033] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 166 5: [2023-05-10 10:11:04,034] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_44_mp_rank_00_optim_states.pt. 5: [2023-05-10 10:11:04,035] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 44 11: [2023-05-10 10:11:04,036] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 92 26: [2023-05-10 10:11:04,036] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_215_mp_rank_00_optim_states.pt. 26: [2023-05-10 10:11:04,037] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 215 14: [2023-05-10 10:11:04,039] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_112_mp_rank_00_optim_states.pt. 14: [2023-05-10 10:11:04,039] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 112 20: [2023-05-10 10:11:04,039] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 166 11: [2023-05-10 10:11:04,040] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_88_mp_rank_00_optim_states.pt. 5: [2023-05-10 10:11:04,040] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 44 11: [2023-05-10 10:11:04,040] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 88 11: [2023-05-10 10:11:04,041] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_90_mp_rank_00_optim_states.pt. 11: [2023-05-10 10:11:04,042] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 90 26: [2023-05-10 10:11:04,042] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 215 4: [2023-05-10 10:11:04,042] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_34_mp_rank_00_optim_states.pt. 4: [2023-05-10 10:11:04,043] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 34 11: [2023-05-10 10:11:04,043] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_95_mp_rank_00_optim_states.pt. 11: [2023-05-10 10:11:04,044] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 95 11: [2023-05-10 10:11:04,045] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 88 11: [2023-05-10 10:11:04,046] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_91_mp_rank_00_optim_states.pt. 14: [2023-05-10 10:11:04,046] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 112 11: [2023-05-10 10:11:04,047] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 91 11: [2023-05-10 10:11:04,047] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 90 11: [2023-05-10 10:11:04,048] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_89_mp_rank_00_optim_states.pt. 11: [2023-05-10 10:11:04,048] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 89 4: [2023-05-10 10:11:04,048] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 34 26: [2023-05-10 10:11:04,050] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_213_mp_rank_00_optim_states.pt. 11: [2023-05-10 10:11:04,050] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 95 26: [2023-05-10 10:11:04,050] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 213 20: [2023-05-10 10:11:04,050] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_163_mp_rank_00_optim_states.pt. 20: [2023-05-10 10:11:04,050] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 163 11: [2023-05-10 10:11:04,055] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 91 26: [2023-05-10 10:11:04,055] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 213 20: [2023-05-10 10:11:04,056] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 163 14: [2023-05-10 10:11:04,057] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_119_mp_rank_00_optim_states.pt. 11: [2023-05-10 10:11:04,057] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 89 4: [2023-05-10 10:11:04,057] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_38_mp_rank_00_optim_states.pt. 14: [2023-05-10 10:11:04,057] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 119 4: [2023-05-10 10:11:04,058] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 38 16: [2023-05-10 10:11:04,058] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_134_mp_rank_00_optim_states.pt. 3: [2023-05-10 10:11:04,058] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_25_mp_rank_00_optim_states.pt. 16: [2023-05-10 10:11:04,058] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 134 3: [2023-05-10 10:11:04,059] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 25 4: [2023-05-10 10:11:04,063] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 38 14: [2023-05-10 10:11:04,063] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 119 16: [2023-05-10 10:11:04,063] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 134 14: [2023-05-10 10:11:04,064] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_113_mp_rank_00_optim_states.pt. 3: [2023-05-10 10:11:04,064] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 25 5: [2023-05-10 10:11:04,065] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_45_mp_rank_00_optim_states.pt. 14: [2023-05-10 10:11:04,065] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 113 5: [2023-05-10 10:11:04,065] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 45 28: [2023-05-10 10:11:04,066] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_226_mp_rank_00_optim_states.pt. 28: [2023-05-10 10:11:04,066] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 226 4: [2023-05-10 10:11:04,066] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_35_mp_rank_00_optim_states.pt. 4: [2023-05-10 10:11:04,067] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 35 26: [2023-05-10 10:11:04,068] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_210_mp_rank_00_optim_states.pt. 26: [2023-05-10 10:11:04,068] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 210 14: [2023-05-10 10:11:04,070] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 113 5: [2023-05-10 10:11:04,070] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 45 4: [2023-05-10 10:11:04,072] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 35 4: [2023-05-10 10:11:04,072] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_37_mp_rank_00_optim_states.pt. 28: [2023-05-10 10:11:04,072] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 226 4: [2023-05-10 10:11:04,073] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 37 26: [2023-05-10 10:11:04,074] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 210 3: [2023-05-10 10:11:04,074] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_29_mp_rank_00_optim_states.pt. 3: [2023-05-10 10:11:04,075] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 29 4: [2023-05-10 10:11:04,078] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 37 3: [2023-05-10 10:11:04,080] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 29 26: [2023-05-10 10:11:04,082] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_208_mp_rank_00_optim_states.pt. 26: [2023-05-10 10:11:04,082] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 208 26: [2023-05-10 10:11:04,089] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 208 13: [2023-05-10 10:11:04,094] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_104_mp_rank_00_optim_states.pt. 13: [2023-05-10 10:11:04,095] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 104 3: [2023-05-10 10:11:04,095] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_27_mp_rank_00_optim_states.pt. 3: [2023-05-10 10:11:04,096] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 27 4: [2023-05-10 10:11:04,096] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_39_mp_rank_00_optim_states.pt. 14: [2023-05-10 10:11:04,096] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_118_mp_rank_00_optim_states.pt. 14: [2023-05-10 10:11:04,096] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 118 4: [2023-05-10 10:11:04,096] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 39 13: [2023-05-10 10:11:04,100] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 104 3: [2023-05-10 10:11:04,101] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 27 14: [2023-05-10 10:11:04,101] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 118 4: [2023-05-10 10:11:04,102] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 39 26: [2023-05-10 10:11:04,104] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_211_mp_rank_00_optim_states.pt. 26: [2023-05-10 10:11:04,104] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 211 11: [2023-05-10 10:11:04,109] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_93_mp_rank_00_optim_states.pt. 11: [2023-05-10 10:11:04,109] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 93 26: [2023-05-10 10:11:04,111] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 211 14: [2023-05-10 10:11:04,114] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_114_mp_rank_00_optim_states.pt. 14: [2023-05-10 10:11:04,114] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 114 11: [2023-05-10 10:11:04,115] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 93 4: [2023-05-10 10:11:04,118] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_33_mp_rank_00_optim_states.pt. 4: [2023-05-10 10:11:04,119] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 33 14: [2023-05-10 10:11:04,120] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 114 4: [2023-05-10 10:11:04,124] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 33 14: [2023-05-10 10:11:04,126] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_116_mp_rank_00_optim_states.pt. 14: [2023-05-10 10:11:04,126] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 116 3: [2023-05-10 10:11:04,126] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_30_mp_rank_00_optim_states.pt. 3: [2023-05-10 10:11:04,127] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 30 4: [2023-05-10 10:11:04,129] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_32_mp_rank_00_optim_states.pt. 4: [2023-05-10 10:11:04,130] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 32 14: [2023-05-10 10:11:04,132] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 116 3: [2023-05-10 10:11:04,133] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 30 4: [2023-05-10 10:11:04,135] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 32 14: [2023-05-10 10:11:04,137] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_115_mp_rank_00_optim_states.pt. 14: [2023-05-10 10:11:04,138] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 115 5: [2023-05-10 10:11:04,139] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_47_mp_rank_00_optim_states.pt. 5: [2023-05-10 10:11:04,139] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 47 11: [2023-05-10 10:11:04,142] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_94_mp_rank_00_optim_states.pt. 11: [2023-05-10 10:11:04,143] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 94 14: [2023-05-10 10:11:04,143] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 115 5: [2023-05-10 10:11:04,145] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 47 11: [2023-05-10 10:11:04,149] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 94 30: [2023-05-10 10:11:04,149] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_240_mp_rank_00_optim_states.pt. 30: [2023-05-10 10:11:04,150] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 240 30: [2023-05-10 10:11:04,156] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 240 16: [2023-05-10 10:11:04,172] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_129_mp_rank_00_optim_states.pt. 16: [2023-05-10 10:11:04,173] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 129 16: [2023-05-10 10:11:04,178] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 129 16: [2023-05-10 10:11:04,228] [INFO] [torch_checkpoint_engine.py:23:load] [Torch] Loaded checkpoint from lm1-2b8-55b-c4-perplexity/global_step52452/bf16_zero_pp_rank_128_mp_rank_00_optim_states.pt. 16: [2023-05-10 10:11:04,228] [INFO] [engine.py:2844:_get_all_zero_checkpoint_state_dicts] successfully read 256 ZeRO state_dicts for rank 128 16: [2023-05-10 10:11:04,234] [INFO] [engine.py:2784:_load_zero_checkpoint] loading 256 zero partition checkpoints for rank 128 0: successfully loaded checkpoint from lm1-2b8-55b-c4-perplexity at iteration 0 31: time (ms) | load-checkpoint: 18327.30 0: estimated model parameters: 2.80902656 0: estimated model parameters without embeddings: 2.67500544 0: [after model, optimizer, and learning rate scheduler are built] datetime: 2023-05-10 10:11:05 0: > building train, validation, and test datasets ... 0: > datasets target sizes (minimum size): 0: train: 1 0: validation: 51200 0: test: 51200 0: > building train, validation, and test datasets for GPT ... 0: > building dataset index ... 0: reading sizes... 0: reading pointers... 0: reading document index... 0: creating numpy buffer of mmap... 0: creating memory view of numpy buffer... 0: > finished creating indexed dataset in 0.038050 seconds 0: number of documents: 3133972 0: > dataset split: 0: train: 0: document indices in [0, 3133972) total of 3133972 documents 0: > loading doc-idx mapping from /scratch/project_462000119/data/c4_subsampled/gpt2tok_c4_en_1B5_text_document_train_indexmap_1ns_2048sl_1234s_doc_idx.npy 0: > loading sample-idx mapping from /scratch/project_462000119/data/c4_subsampled/gpt2tok_c4_en_1B5_text_document_train_indexmap_1ns_2048sl_1234s_sample_idx.npy 0: > loading shuffle-idx mapping from /scratch/project_462000119/data/c4_subsampled/gpt2tok_c4_en_1B5_text_document_train_indexmap_1ns_2048sl_1234s_shuffle_idx.npy 0: loaded indexed file in 0.100 seconds 0: total number of samples: 731002 0: total number of epochs: 1 0: > building dataset index ... 0: reading sizes... 0: reading pointers... 0: reading document index... 0: creating numpy buffer of mmap... 0: creating memory view of numpy buffer... 0: > finished creating indexed dataset in 0.071802 seconds 0: number of documents: 364608 0: > dataset split: 0: validation: 0: document indices in [0, 364608) total of 364608 documents 0: > loading doc-idx mapping from /scratch/project_462000119/data/c4_validation/gpt2tok_c4validation_rerun_text_document_validation_indexmap_51200ns_2048sl_1234s_doc_idx.npy 0: > loading sample-idx mapping from /scratch/project_462000119/data/c4_validation/gpt2tok_c4validation_rerun_text_document_validation_indexmap_51200ns_2048sl_1234s_sample_idx.npy 0: > loading shuffle-idx mapping from /scratch/project_462000119/data/c4_validation/gpt2tok_c4validation_rerun_text_document_validation_indexmap_51200ns_2048sl_1234s_shuffle_idx.npy 0: loaded indexed file in 0.099 seconds 0: total number of samples: 84978 0: total number of epochs: 1 0: > finished creating GPT datasets ... 0: [after dataloaders are built] datetime: 2023-05-10 10:11:27 0: done with setup ... 0: training ... 31: time (ms) | model-and-optimizer-setup: 51831.53 | train/valid/test-data-iterators-setup: 21703.75 0: [after training is done] datetime: 2023-05-10 10:11:27 31: ----------------------------------------------------------------------------------------------------------------- 31: validation loss at the end of training for val data | lm loss value: 2.604075E+00 | lm loss PPL: 1.351871E+01 | 31: ----------------------------------------------------------------------------------------------------------------- END 3489889: Wed 10 May 2023 10:12:20 AM EEST