An 8-bit version of the model
#12
by
varun500
- opened
No description provided.
An 8-bit version of the model that can be loaded in 16 GB of GPU VRAM would be helpful.
TheBloke
changed pull request status to
closed
Please just use load_in_8bit=True with an HF model like I've told you!
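For anyone landing here, a minimal sketch of what that suggestion could look like with the transformers library, assuming bitsandbytes and accelerate are installed and a CUDA GPU is available. The model ID is a hypothetical placeholder, and the 13-billion parameter count in the estimate is only an illustrative assumption, not a statement about this repository's model. Newer transformers releases may prefer passing a BitsAndBytesConfig via quantization_config instead of the bare flag.

```python
def weight_gib(n_params: int, bits: int) -> float:
    """Rough size of the weights alone in GiB (ignores activations and overhead)."""
    return n_params * bits / 8 / 2**30


def eight_bit_kwargs(device_map: str = "auto") -> dict:
    """Keyword arguments for from_pretrained that request 8-bit loading."""
    return {"load_in_8bit": True, "device_map": device_map}


def load_8bit(model_id: str):
    """Load tokenizer and model in 8-bit. Requires transformers, accelerate, bitsandbytes."""
    # Imported lazily so the helpers above work without the heavy dependencies.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id, **eight_bit_kwargs())
    return tokenizer, model


# Usage (hypothetical model ID, replace with the unquantized HF repository):
# tokenizer, model = load_8bit("your-org/your-model")
```

As a rough guide, under the assumed 13 billion parameters weight_gib gives about 12 GiB at 8 bits versus about 24 GiB at 16 bits, which is why 8-bit loading can bring such a model under a 16 GB card. Actual usage will be higher once activations and the KV cache are included.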
Sure, will do that.