MinerU / docs /how_to_download_models_en.md
derful's picture
Upload folder using huggingface_hub
240e0a0 verified

A newer version of the Gradio SDK is available: 5.7.1

Upgrade

Install Git LFS

Before you begin, make sure Git Large File Storage (Git LFS) is installed on your system. Install it using the following command:

git lfs install

Download the Model from Hugging Face

To download the PDF-Extract-Kit model from Hugging Face, use the following command:

git lfs clone https://huggingface.co/wanderkid/PDF-Extract-Kit

Ensure that Git LFS is enabled during the clone to properly download all large files.

Download the Model from ModelScope

SDK Download

# First, install the ModelScope library using pip:
pip install modelscope
# Use the following Python code to download the model using the ModelScope SDK:
from modelscope import snapshot_download
model_dir = snapshot_download('wanderkid/PDF-Extract-Kit')

Git Download

Alternatively, you can use Git to clone the model repository from ModelScope:

git clone https://www.modelscope.cn/wanderkid/PDF-Extract-Kit.git

Put model files here:

./
β”œβ”€β”€ Layout
β”‚   β”œβ”€β”€ config.json
β”‚   └── weights.pth
β”œβ”€β”€ MFD
β”‚   └── weights.pt
β”œβ”€β”€ MFR
β”‚   └── UniMERNet
β”‚       β”œβ”€β”€ config.json
β”‚       β”œβ”€β”€ preprocessor_config.json
β”‚       β”œβ”€β”€ pytorch_model.bin
β”‚       β”œβ”€β”€ README.md
β”‚       β”œβ”€β”€ tokenizer_config.json
β”‚       └── tokenizer.json
└── README.md