Post
923
neuralmagic/Meta-Llama-3.1-405B-Instruct-FP8
Requant of the big llama, using 20% less memory
neuralmagic/Meta-Llama-3.1-405B-Instruct-FP8
Requant of the big llama, using 20% less memory
neuralmagic/Meta-Llama-3.1-405B-Instruct-FP8