A 8bit version of Model

#12
by varun500 - opened
No description provided.

A 8bit version of the model would be helpful which can be loaded in 16GB of GPU VRAM

TheBloke changed pull request status to closed

Please just use load_in_8bit=Truewith an HF model like I've told you!

Sure will do that

Sign up or log in to comment