How much GPU memory is needed to finetune MPT-7B Instruct model?

#31
by skshreyas714 - opened

I benchmarked this model for sentiment classification, but the performance was very poor, so I want to finetune it on a multilingual sentiment classification dataset. I wanted to know the GPU memory requirements for finetuning it in FP16 mode.

Closing as stale. As noted above, finetuning with FP32 weights, FP32 gradients, and FP32 LionW optimizer state requires roughly 7B parameters * 4 bytes * 3 copies (weights + gradients + one LionW momentum buffer per parameter) = 84GB of memory, before accounting for activations and allocator overhead.
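For reference, a minimal back-of-the-envelope sketch of that arithmetic, assuming 7B parameters and one optimizer state per parameter for LionW; the function name is hypothetical and the estimate ignores activation memory and allocator overhead:

```python
def finetune_memory_gb(n_params_billion: float,
                       bytes_per_param: int = 4,
                       optimizer_states_per_param: int = 1) -> float:
    """Rough memory estimate for full finetuning, in GB (1 GB = 1e9 bytes).

    Counts one copy each of weights and gradients, plus
    `optimizer_states_per_param` copies of optimizer state
    (LionW keeps a single momentum buffer per parameter).
    Activations, CUDA context, and fragmentation are not included.
    """
    copies = 2 + optimizer_states_per_param  # weights + gradients + optimizer state
    return n_params_billion * bytes_per_param * copies


# All-FP32 case from the answer above: 7 * 4 * 3 = 84 GB
print(finetune_memory_gb(7, bytes_per_param=4))  # 84.0

# A naive all-FP16 estimate would halve this (7 * 2 * 3 = 42 GB), but
# mixed-precision training typically keeps an FP32 master copy of the
# weights and optimizer state, so the real savings are smaller.
print(finetune_memory_gb(7, bytes_per_param=2))  # 42.0
```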

abhi-mosaic changed discussion status to closed
