Xenova
/

Phi-3-mini-4k-instruct

Text Generation

Transformers.js

Model card Files Files and versions Community

Phi-3-mini-4k-instruct / README.md

Xenova's picture

Xenova HF staff

Update README.md

b294d83 verified 7 months ago

|

583 Bytes

metadata

license: mit
pipeline_tag: text-generation
library_name: transformers.js
tags:
  - ONNX
  - DML
  - ONNXRuntime
  - nlp
  - conversational

Phi-3 Mini-4K-Instruct ONNX model for onnxruntime-web

This is the same models as the official phi3 onnx model with a few changes to make it work for onnxruntime-web:

the model is fp16 with int4 block quantization for weights
the 'logits' output is fp32
the model uses MHA instead of GQA
onnx and external data file need to stay below 2GB to be cacheable in chromium