Model Depot
Collection
Leading generative models packaged in OpenVino format optimized for use on AI PCs
•
50 items
•
Updated
•
2
llama-2-13b-chat-ov is an OpenVino int4 quantized version of Llama-2-13B-Chat, providing a fast inference implementation, optimized for AI PCs using Intel GPU, CPU and NPU.
llama-2-13b-chat is the official 13b chat finetuned version of Llama2, and is one of the classic and best all-around chat models from 2023.
Base model
meta-llama/Llama-2-13b-chat-hf