FP32 or FP16

#20 opened by timbmg

Hi,

I am wondering whether I should use fp32 or fp16 for inference. In the config.json, the dtype is set to fp32.
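For context, here is a minimal sketch of how the shipped dtype can be inspected with transformers (the repo id below is a placeholder, not necessarily this exact model, and trust_remote_code only matters for checkpoints that ship custom code):

```python
from transformers import AutoConfig

# Placeholder repo id for illustration; substitute the actual model id.
config = AutoConfig.from_pretrained(
    "Alibaba-NLP/gte-large-en-v1.5",
    trust_remote_code=True,  # only needed for repos with custom model code
)
print(config.torch_dtype)  # fp32 (float32) per the shipped config.json
```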

However, in the paper (page 5), it says:

After training, we directly take the last checkpoint for evaluation. We run model training on up to 8 NVIDIA A100 GPUs with 80GB memory and model evaluation on up to 8 NVIDIA Tesla V100 GPUs with 32GB memory. Models are trained with mixed precision using fp16 and evaluated with half precision fp16 as well.

This also matches the configuration used for the MTEB evaluation in the README.

Alibaba-NLP org

We recommend using fp16 for model inference.
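As an illustration, a minimal sketch of loading the weights directly in half precision with transformers, overriding the fp32 default from config.json (the repo id is again a placeholder; trust_remote_code applies only to checkpoints with custom code):

```python
import torch
from transformers import AutoModel, AutoTokenizer

model_id = "Alibaba-NLP/gte-large-en-v1.5"  # placeholder repo id

tokenizer = AutoTokenizer.from_pretrained(model_id)
# torch_dtype=torch.float16 overrides the float32 default in config.json,
# so the weights are loaded in half precision from the start.
model = AutoModel.from_pretrained(
    model_id,
    torch_dtype=torch.float16,
    trust_remote_code=True,  # only needed for checkpoints with custom code
).eval()

if torch.cuda.is_available():
    model = model.to("cuda")

texts = ["what is the capital of France?"]
inputs = tokenizer(texts, return_tensors="pt", padding=True).to(model.device)
with torch.no_grad():
    outputs = model(**inputs)
print(outputs.last_hidden_state.dtype)  # torch.float16
```

Loading in fp16 roughly halves the memory footprint relative to fp32, consistent with the half-precision evaluation setup described in the paper.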
