optimum-neuron-cache / inference-cache-config

Commit History

Remove variants from main mistral config
ef07aca
verified

dacorvo HF staff committed on

Add mistral most popular variants
d3983e8
verified

dacorvo HF staff committed on

Add most popular llama variants
594abb2
verified

dacorvo HF staff committed on

Added teknium/OpenHermes-2.5-Mistral-7B
1518247
verified

dacorvo HF staff committed on

Added Llama-70b batch_size 4 to inference cache
593822e
verified

dacorvo HF staff committed on

Create mistral.json
b5d0afd
verified

philschmid HF staff committed on

Create gpt2.json
3bdb891
verified

philschmid HF staff committed on

Create inference-cache-config/llama.json
1960ccb
verified

philschmid HF staff committed on