AlessandroW's picture
Update README.md
27e96b2 verified
|
raw
history blame
3.01 kB
metadata
license: mit
license_link: https://huggingface.co/microsoft/Phi-3-mini-128k-instruct/resolve/main/LICENSE
language:
  - en
pipeline_tag: text-generation
tags:
  - nlp
  - code

Model Summary

This repo provides the GGUF format for the Phi-3-Mini-128K-Instruct.

For more details check out the original model at microsoft/Phi-3-mini-128k-instruct.

The Phi-3-Mini-4K-Instruct is a 3.8B parameters, lightweight, state-of-the-art open model trained with the Phi-3 datasets that includes both synthetic data and the filtered publicly available websites data with a focus on high-quality and reasoning dense properties. The model belongs to the Phi-3 family with the Mini version in two variants 4K and 128K which is the context length (in tokens) it can support. The model has underwent a post-training process that incorporates both supervised fine-tuning and direct preference optimization to ensure precise instruction adherence and robust safety measures. When assessed against benchmarks testing common sense, language understanding, math, code, long context and logical reasoning, Phi-3 Mini-4K-Instruct showcased a robust and state-of-the-art performance among models with less than 13 billion parameters.

Resources and Technical Documentation:

This repo provides GGUF files for the Phi-3 Mini-128K-Instruct model.

Name Quant method Bits Size Use case
Phi-3-mini-128k-instruct-Q4_K_M.gguf Q4_K_M 4 2.39 GB medium, balanced quality - recommended
Phi-3-mini-128k-instruct-f16.gguf None 16 7.2 GB minimal quality loss

License

The model is licensed under the MIT license.

Trademarks

This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft’s Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos are subject to those third-party’s policies.