Anyone succeeded in running this model?

#4
by code-runner - opened

I get the following error running AutoTokenizer.from_pretrained():
ValueError: Unrecognized configuration class <class 'transformers_modules.iGeniusAI.Italia-9B-Instruct-v0.1.e821f1462547cca2a6ff4f7af102d37d9a79fafd.configuration_italia.ItaliaConfig'> to build an AutoTokenizer.

The model has been correctly downloaded.

This morning I cleared the cache (~/.cache/huggingface/hub/models--iGeniusAI--Italia-9B-Instruct-v0.1) and repeated the same operations.
The error disappeared, so the first download was probably corrupted somehow.
However, on my MacBook Pro M1 with 32GB of RAM, the sample test application has been running for 30 minutes and is using 35GB of RAM. I'm using device='mps'.
My guess is that it is swapping a lot and there is not enough RAM.

So my new question is: was anybody able to run this model locally on Apple Silicon?

Hello code-runner,
Unfortunately, the model cannot be executed entirely on the MPS backend because the aten::isin.Tensor_Tensor_out operator is not yet supported on MPS. However, you can still run the model either by setting device = "cpu", which runs everything on the CPU, or by exporting the environment variable PYTORCH_ENABLE_MPS_FALLBACK=1, which keeps device = "mps" and falls back to the CPU only for the unsupported operators.
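
For illustration, here is a minimal sketch of the two options in Python. The helper names choose_device and fallback_enabled are hypothetical and not part of PyTorch or transformers. As far as I know, the environment variable has to be set before torch is imported for the fallback to take effect, so the sketch sets it at the very top.

```python
import os

# Assumption: the variable is read when PyTorch loads, so set it before
# "import torch" anywhere in the process. setdefault keeps a value the
# user has already exported in the shell.
os.environ.setdefault("PYTORCH_ENABLE_MPS_FALLBACK", "1")


def fallback_enabled() -> bool:
    """Return True if the MPS-to-CPU fallback has been requested."""
    return os.environ.get("PYTORCH_ENABLE_MPS_FALLBACK") == "1"


def choose_device(mps_available: bool, fallback: bool) -> str:
    """Pick a device string for a model that uses an op unsupported on MPS.

    "mps" is only safe when the CPU fallback is enabled; otherwise the
    unsupported operator would raise an error, so plain "cpu" is used.
    """
    if mps_available and fallback:
        return "mps"
    return "cpu"
```

With PyTorch installed, the result of choose_device(torch.backends.mps.is_available(), fallback_enabled()) can be used as the device value in the sample application.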

We tested the model on a MacBook Pro M3 Max with 36GB of memory, although the model itself requires significantly less. When loaded with torch_dtype = torch.float16, the process utilized approximately 16GB of RAM.
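
As a rough sanity check on these numbers, the memory needed for the weights alone can be estimated from the parameter count and the bytes per parameter. The sketch below assumes roughly 9 billion parameters, taken only from the "9B" in the model name, and ignores activations and other overhead. The helper weight_memory_gib is hypothetical.

```python
# Bytes used to store one parameter in each dtype.
BYTES_PER_PARAM = {"float32": 4, "float16": 2}

# Assumption: about 9 billion parameters, inferred from the model name only.
N_PARAMS = 9_000_000_000


def weight_memory_gib(n_params: int, dtype: str) -> float:
    """Estimate the memory taken by the weights alone, in GiB."""
    return n_params * BYTES_PER_PARAM[dtype] / 2**30


fp16_gib = weight_memory_gib(N_PARAMS, "float16")  # roughly 16.8 GiB
fp32_gib = weight_memory_gib(N_PARAMS, "float32")  # roughly 33.5 GiB
print(f"float16: {fp16_gib:.1f} GiB, float32: {fp32_gib:.1f} GiB")
```

Under these assumptions, the float16 estimate is in line with the roughly 16GB reported above. The float32 estimate is close to the 35GB seen on the M1, which would fit a load without torch_dtype = torch.float16, although the thread does not say which dtype was used there.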
