amd/Qwen1.5-7B-Chat-awq-g128-int4-asym-fp32-onnx-ryzen-strix
Text Generation
Models based on the ONNX Runtime generate() API, quantized with Quark and optimized for the Ryzen AI Strix Point NPU.