How to run on Colab's CPU?
Can someone suggest, or show me through a piece of code, how to run this model (i.e. MPT-30B-CHAT) on Colab's CPU?
Colab has only 12.7 GB of RAM, and the MPT-30B-CHAT files are almost 60 GB, so it's not possible.
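You can confirm the runtime's limits yourself from a notebook cell; here is a quick check using psutil, which comes preinstalled on Colab:

import psutil
# Total RAM of the Colab runtime, in GiB (about 12.7 on the free tier)
print(psutil.virtual_memory().total / 1024**3)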
@beoswindvip Can you suggest which other models I can use?
You can run 7B models (4-bit or 8-bit quantization) on the Colab Free Plan GPU,
such as https://huggingface.co/TheBloke/vicuna-7B-v1.3-GPTQ .
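Not an official recipe, but here is a minimal sketch of loading that GPTQ-quantized 7B model on the free Colab GPU with auto-gptq; the Vicuna prompt format and generation settings below are assumptions, so check the model card for the exact usage:

# pip install auto-gptq transformers
from transformers import AutoTokenizer
from auto_gptq import AutoGPTQForCausalLM

model_id = "TheBloke/vicuna-7B-v1.3-GPTQ"
tokenizer = AutoTokenizer.from_pretrained(model_id, use_fast=True)
# Load the 4-bit quantized weights directly onto the Colab GPU
model = AutoGPTQForCausalLM.from_quantized(model_id, device="cuda:0", use_safetensors=True)

# Vicuna-style prompt (assumed format; see the model card)
prompt = "USER: Write a job description for a Data Scientist.\nASSISTANT:"
inputs = tokenizer(prompt, return_tensors="pt").to("cuda:0")
output_ids = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))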
@swulling Can this or other 7B models also run easily on CPU?
@swulling Firstly, thank you so much, and one last question:
from text_generation import InferenceAPIClient

client = InferenceAPIClient("OpenAssistant/oasst-sft-4-pythia-12b-epoch-3.5")
complete_answer = ""
# Stream tokens as the model generates them
for response in client.generate_stream("<|prompter|>Write Job Description for Data Scientist<|endoftext|><|assistant|>"):
    # Skip special tokens such as <|endoftext|>
    if not response.token.special:
        print(response.token.text, end="", flush=True)
        complete_answer += response.token.text
print(complete_answer)
Apart from the OpenAssistant/oasst-sft-4-pythia-12b-epoch-3.5 model used in the code above, which other models can I use?
I suggest choosing a Chat model with a higher ranking to achieve better results.
Ref: https://chat.lmsys.org/ Leaderboard
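Whatever model you pick has to be hosted on the Hugging Face Inference API with streaming support, and the prompt template differs per model. A rough sketch of swapping the model id in the same client (the id below is just the original one as a placeholder):

from text_generation import InferenceAPIClient

# Replace with the model id you picked from the leaderboard; the
# <|prompter|>/<|assistant|> tags are OpenAssistant-specific, so adjust
# the prompt template to whatever the new model expects.
model_id = "OpenAssistant/oasst-sft-4-pythia-12b-epoch-3.5"
client = InferenceAPIClient(model_id)
response = client.generate("<|prompter|>Write Job Description for Data Scientist<|endoftext|><|assistant|>")
print(response.generated_text)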