beautifulsoup4 pandas pyarrow safetensors