Thanks! (#1, opened by smcleod)
Just wanted to say a quick thanks for getting the torrent and converting it, really appreciate it :)
smcleod changed discussion status to closed
uwo
How much VRAM does it require? Are any GPTQ versions available?
@RageshAntony
It's a 140B model, so in FP16 it would require around 280GB of VRAM just to load the weights (140B parameters × 2 bytes each), plus a few more GB for the context and KV cache.
And AFAIK there's currently no GPTQ version available, but someone will probably make one soon.
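As a rough sanity check on that 280GB figure, the weight memory is just parameter count times bytes per parameter; a small back-of-the-envelope helper (function and variable names here are illustrative, not from any library):

```python
def estimate_weight_vram_gb(num_params_billions: float, bytes_per_param: float) -> float:
    """Rough VRAM needed for model weights alone, in GB (1 GB = 1e9 bytes).

    Excludes KV cache, activations, and framework overhead, which add
    a few more GB depending on context length and batch size.
    """
    return num_params_billions * bytes_per_param


# 140B parameters at common precisions:
print(estimate_weight_vram_gb(140, 2))    # FP16/BF16 -> 280.0 GB
print(estimate_weight_vram_gb(140, 1))    # INT8      -> 140.0 GB
print(estimate_weight_vram_gb(140, 0.5))  # 4-bit     -> 70.0 GB
```

This is why quantization (e.g. a GPTQ 4-bit version) matters: it cuts the weight footprint from ~280GB down to roughly 70GB, which is still more than a single 80GB H100 but feasible across a few GPUs.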
@v2ray 280GB??
Even an H100 only has 80GB. How can it be loaded then?