Error in loading falcon-7b-instruct

#48
by gtmanuj - opened

Hi, I am getting the following ValueError when loading the Falcon-7b-instruct model. Can someone please help me solve this issue?

[Screenshot of the ValueError traceback]

Can you share the line of code where you load the model?

from transformers import AutoTokenizer
import transformers
import torch

model = "tiiuae/falcon-7b-instruct"
tokenizer = AutoTokenizer.from_pretrained(model)

# Build a text-generation pipeline. trust_remote_code=True is needed
# because the Falcon checkpoint ships its own modeling code.
pipeline = transformers.pipeline(
    "text-generation",
    model=model,
    tokenizer=tokenizer,
    torch_dtype=torch.bfloat16,
    trust_remote_code=True,
    device_map="auto",
)

# Sample a single completion of up to 200 tokens.
sequences = pipeline(
    "Write a poem about Valencia",
    max_length=200,
    do_sample=True,
    top_k=10,
    num_return_sequences=1,
    eos_token_id=tokenizer.eos_token_id,
)

for seq in sequences:
    print(f"Result: {seq['generated_text']}")

Strange! It works in my environment. Try loading the model as:

model = AutoModelForCasualLM("tiiuae/falcon-7b-instruct", torch_dtype=torch.bfloat16, trust_remote_code=True, device_map="auto", )

How much RAM does your device have?

And can you please share the full code that worked?

Only 16 GB. If you feel the issue is with RAM, try increasing the size of the swap file (the page file on Windows).

However, if RAM were the problem, the Python process would usually just get terminated. The error you posted is related to identifying the model type while loading it. Just a wild guess, but try removing the downloaded model from the cache and running the script again.
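If it helps, one way to clear just this model from the cache is through the huggingface_hub API (a sketch, assuming the default cache location and a reasonably recent huggingface_hub):

from huggingface_hub import scan_cache_dir

# Scan the local Hugging Face cache (defaults to ~/.cache/huggingface/hub)
# and delete every cached revision of falcon-7b-instruct.
cache_info = scan_cache_dir()
for repo in cache_info.repos:
    if repo.repo_id == "tiiuae/falcon-7b-instruct":
        strategy = cache_info.delete_revisions(
            *[rev.commit_hash for rev in repo.revisions]
        )
        print(f"Will free {strategy.expected_freed_size_str}")
        strategy.execute()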

And can you please share the full code that worked?

Sorry, I may not be able to share the code. :(
However, I was able to run the code snippet you've shared.

I also have 16 GB of RAM. The code you suggested above should use AutoModelForCausalLM.from_pretrained, right?
And I have already passed the following to the pipeline:
torch_dtype=torch.bfloat16, trust_remote_code=True, device_map="auto"
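In other words, something like this (a sketch of what I understood the corrected call to be, reusing the same model name as above):

import torch
from transformers import AutoModelForCausalLM

# Corrected version of the suggested one-liner: note the class name
# (Causal, not Casual) and the explicit .from_pretrained call.
model = AutoModelForCausalLM.from_pretrained(
    "tiiuae/falcon-7b-instruct",
    torch_dtype=torch.bfloat16,
    trust_remote_code=True,
    device_map="auto",
)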

Could you run the code snippet as shared, or did you have to make some modifications?

I made the modification you suggested, and I got the following error:

ValueError: The current device_map had weights offloaded to the disk. Please provide an offload_folder for them. Alternatively, make sure you have safetensors installed if the model you are using offers the weights in this format.

Could you run the code snippet as shared, or did you have to make some modifications?

I ran the one you shared. Well, I had to fix the indentation (in the last line); other than that I did not make any changes.

Ok, did it give the expected output of a poem?

Yep, it gave a nice poem.

Which environment did you use? I am using VS Code.

Well, mine was on a cloud instance, accessed through the terminal. I do not think the issue is related to the environment.

Let's see if someone else who has faced the same issue replies to this.

Okay, thanks.

I made the modification you suggested, and I got the following error:

ValueError: The current device_map had weights offloaded to the disk. Please provide an offload_folder for them. Alternatively, make sure you have safetensors installed if the model you are using offers the weights in this format.

I am getting the same error.

The model is over 16 GB, so you will need to figure out where the remainder can be stored. If you don't have a GPU, it must be stored on the hard drive, so the error you are getting is saying that you need to specify a folder where the spillover from the model can be stored.
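Concretely, that means passing offload_folder when loading (a sketch; "offload" is just an arbitrary writable directory, and the rest mirrors the snippet above):

import torch
from transformers import AutoModelForCausalLM

# With device_map="auto", accelerate spills weights that don't fit in
# RAM/VRAM to disk; offload_folder tells it where to put them.
model = AutoModelForCausalLM.from_pretrained(
    "tiiuae/falcon-7b-instruct",
    torch_dtype=torch.bfloat16,
    trust_remote_code=True,
    device_map="auto",
    offload_folder="offload",
)

Alternatively, as the error message itself suggests, installing the safetensors package may let the weights load in that format instead.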
