SeaLLM-7B-Chat quantized to run locally with modest GPU
Model Description
- This is a quantized model of SeaLLM-7B-Chat.
- SeaLLMs is a family of LLMs pre-trained from Meta's LLaMA 2 and optimized for numerous Southeast Asian languages, including Vietnamese ๐ป๐ณ, Indonesian ๐ฎ๐ฉ, Thai ๐น๐ญ, Malay ๐ฒ๐พ, Khmer ๐ฐ๐ญ, Lao ๐ฑ๐ฆ, Tagalog ๐ต๐ญ and Burmese ๐ฒ๐ฒ.
- The quantization has been done with ExLlamaV2, a fast LLM inference library.
Citation
- SeaLLMs: https://huggingface.co/SeaLLMs
- ExLlamaV2: https://github.com/turboderp/exllamav2