Please add the following model to the neuron cache
Llama 7B is already present in the cache. Please go to the model card, select Deploy, and look at the Inferentia code snippet.