OutOfMemoryError when building the Space

#20
by spark12x - opened

torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 108.00 MiB. GPU 0 has a total capacty of 44.32 GiB of which 43.25 MiB is free. Including non-PyTorch memory, this process has 0 bytes memory in use. Of the allocated memory 43.77 GiB is allocated by PyTorch, and 19.15 MiB is reserved by PyTorch but unallocated. If reserved but unallocated memory is large try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF
/home/user/.pyenv/versions/3.10.15/lib/python3.10/site-packages/torchvision/transforms/functional_tensor.py:5: UserWarning: The torchvision.transforms.functional_tensor module is deprecated in 0.15 and will be removed in 0.17. Please don't rely on it. You probably just need to use APIs in torchvision.transforms.functional or in torchvision.transforms.v2.functional.
warnings.warn(
Please 'pip install apex'
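The numbers in the error message can be checked directly. The sketch below parses an excerpt of the message quoted above with the standard library and compares the requested allocation against the memory that is free or cached. The helper names (`parse_oom`, `to_mib`) are hypothetical, and treating "free plus reserved-but-unallocated" as the upper bound on what tuning `max_split_size_mb` could recover is a simplifying assumption, not a statement about PyTorch internals.

```python
import re

# Excerpt of the error message quoted above (typo "capacty" kept as logged).
message = (
    "Tried to allocate 108.00 MiB. GPU 0 has a total capacty of 44.32 GiB "
    "of which 43.25 MiB is free. Of the allocated memory 43.77 GiB is "
    "allocated by PyTorch, and 19.15 MiB is reserved by PyTorch but unallocated."
)

UNIT_TO_MIB = {"MiB": 1.0, "GiB": 1024.0}


def to_mib(value, unit):
    """Convert a (number, unit) pair from the message into MiB."""
    return float(value) * UNIT_TO_MIB[unit]


def parse_oom(msg):
    """Extract the memory figures from a CUDA OOM message, in MiB."""
    patterns = {
        "requested": r"Tried to allocate ([\d.]+) (MiB|GiB)",
        "total": r"total capac\w* of ([\d.]+) (MiB|GiB)",
        "free": r"of which ([\d.]+) (MiB|GiB) is free",
        "allocated": r"allocated memory ([\d.]+) (MiB|GiB) is allocated",
        "reserved_unallocated": r"and ([\d.]+) (MiB|GiB) is reserved",
    }
    stats = {}
    for key, pattern in patterns.items():
        match = re.search(pattern, msg)
        if match:
            stats[key] = to_mib(match.group(1), match.group(2))
    return stats


stats = parse_oom(message)

# Assumption: memory that is free or cached-but-unused is the most that
# defragmentation could make available for the failed request.
recoverable = stats["free"] + stats["reserved_unallocated"]
fits = recoverable >= stats["requested"]

print(f"requested: {stats['requested']:.2f} MiB")
print(f"free + reserved-but-unallocated: {recoverable:.2f} MiB")
print(f"request could fit after defragmentation: {fits}")
```

Under that assumption, the request does not fit even if the cache were perfectly defragmented, so the device is most likely simply full and `max_split_size_mb` alone is unlikely to help here.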

You need to apply for an A100.

I am using 4x Nvidia A10G large (48 vCPU, 184 GB RAM, 96 GB VRAM), which is much more than an A100, and I still get CUDA out of memory.
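One thing worth checking: the 96 GB figure is the sum over four GPUs, and the error message reports the capacity of GPU 0 alone. A process that places the whole model on one device can only use that device's memory unless the code explicitly spreads the model across GPUs. The sketch below is a minimal way to list what each visible device offers. It assumes PyTorch is installed and returns an empty list otherwise; `gpu_memory_report` is a hypothetical helper name.

```python
def gpu_memory_report():
    """Return one (index, name, free_GiB, total_GiB) tuple per visible CUDA device."""
    try:
        import torch
    except ImportError:
        # PyTorch is not installed, so there is nothing to report.
        return []
    if not torch.cuda.is_available():
        return []
    gib = 1024 ** 3
    report = []
    for index in range(torch.cuda.device_count()):
        # mem_get_info returns (free_bytes, total_bytes) for the given device.
        free_bytes, total_bytes = torch.cuda.mem_get_info(index)
        name = torch.cuda.get_device_name(index)
        report.append((index, name, free_bytes / gib, total_bytes / gib))
    return report


report = gpu_memory_report()
for index, name, free_gib, total_gib in report:
    print(f"GPU {index} ({name}): {free_gib:.2f} GiB free of {total_gib:.2f} GiB")

# Per-device share of the pooled figure quoted above: 96 GB over 4 GPUs.
per_gpu_gb = 96 / 4
print(f"per-GPU share of 96 GB across 4 devices: {per_gpu_gb:.0f} GB")
```

If the per-device total is smaller than what the model needs, adding more GPUs of the same size does not help unless the model is sharded across them.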
