Phi-3-mini ran at 4k max length.
Phi-3-Mini on llama.cpp based server
Experiment with and compare different tokenizers