A Python equiv
#1 by Pythonic456 · opened
How would I use and run the 4-bit quantized model on my local machine? Sorry, I am not very experienced with this side of Python/Torch etc. Any help is much appreciated!
You can use Candle to run inference on your quantized model.
That way you get fast model loading as well as fast inference. From Python, you just call it via a REST API.
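If you go that route, the Python side is just an HTTP client. A minimal sketch, assuming you have wrapped the Candle model in a local HTTP server — the URL, route, and JSON field names (`prompt`, `max_tokens`, `text`) here are placeholders, not a documented API, so adjust them to whatever your server actually expects:

```python
import json
import urllib.request

def build_request(prompt: str, max_tokens: int = 128) -> dict:
    # Hypothetical payload shape; match it to your server's expected JSON.
    return {"prompt": prompt, "max_tokens": max_tokens}

def generate(prompt: str, url: str = "http://localhost:8080/generate") -> str:
    # POST the prompt to the local inference server and return the generated text.
    data = json.dumps(build_request(prompt)).encode("utf-8")
    req = urllib.request.Request(
        url, data=data, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["text"]
```

Then `generate("Hello")` blocks until the server responds, so the quantized model stays loaded in the Rust process between calls instead of being reloaded from Python each time.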
@Pythonic456