Model loading issue
#1
by
murphy
- opened
Every call to compute_score reloads the model, which is very time-consuming.
0%| | 0/8 [00:00<?, ?it/s]You're using a LlamaTokenizerFast tokenizer. Please note that with a fast tokenizer, using the `__call__` method is faster than using a method to encode the text followed by a call to the `pad` method to get a padded encoding.
100%|██████████| 8/8 [00:12<00:00, 1.53s/it]
Thanks for the feedback. What is being loaded here is the dataloader, not the model.
We will try to optimize this later.
The code has now been updated: you can pass use_dataloader=False to compute_score to avoid this problem.
Where has this code update been released? I passed the parameter and got an error.
You can update like this:
git clone https://github.com/FlagOpen/FlagEmbedding.git
cd FlagEmbedding
pip install .
I hit the same problem when deploying with xinference, and the overhead is severe. Will this fix work there too?
You can update the code with:
git clone https://github.com/FlagOpen/FlagEmbedding.git
cd FlagEmbedding
pip install .
Then you can set use_dataloader=False when calling compute_score, which avoids the extra overhead caused by building a dataloader.
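The overhead pattern discussed in this thread (paying a fixed setup cost inside every compute_score call) can be sketched with a toy example. Note that load_model, compute_score_reload, and compute_score_cached below are hypothetical stand-ins to illustrate the load-once pattern, not FlagEmbedding's actual API:

```python
import time

def load_model():
    """Stand-in for an expensive one-time setup (weights, tokenizer, dataloader)."""
    time.sleep(0.05)  # simulate load latency
    # Toy "reranker": score = number of words shared by query and passage.
    return lambda query, passage: float(len(set(query.split()) & set(passage.split())))

def compute_score_reload(pair):
    """Slow pattern: pays the load cost on every call, as reported above."""
    model = load_model()
    return model(*pair)

# Fast pattern: load once, reuse across calls.
_MODEL = load_model()

def compute_score_cached(pair):
    return _MODEL(*pair)

pairs = [("deep learning", "deep nets learn features")] * 4

t0 = time.perf_counter()
slow = [compute_score_reload(p) for p in pairs]
t_slow = time.perf_counter() - t0

t0 = time.perf_counter()
fast = [compute_score_cached(p) for p in pairs]
t_fast = time.perf_counter() - t0

assert slow == fast      # identical scores
assert t_fast < t_slow   # but no repeated load cost
```

Passing use_dataloader=False to the real compute_score plays the same role: it skips rebuilding the dataloader on each scoring call, so only the batching path changes, not the scores.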