Model Depot - ONNX
Collection
Leading Models packaged in ONNX format optimized for use with AI PCs
•
20 items
•
Updated
llama-3.1-instruct-ov is an ONNX int4 quantized version of Llama 3.1 Instruct, providing a fast inference implementation, optimized for AI PCs using Intel GPU, CPU and NPU.
llama-3.1-instruct is a leading open source general foundation model from Meta.
Base model
meta-llama/Llama-3.1-8B