vLLM multi-LoRA deployment
#3 · opened by zhongwei
This is great software and a great model; I would like to deploy Llama-3-8B-Instruct-80K-QLoRA-Merged with vLLM.
Is there a way to deploy this QLoRA model on top of the base model meta-llama/Meta-Llama-3-8B-Instruct in vLLM, using its multi-LoRA deployment feature?
Hi, I'm not familiar with vLLM, but this LoRA model is no different from any other LoRA model, so the default loading method should work. Just remember to set rope_theta to 200M for the base model (i.e. meta-llama/Meta-Llama-3-8B-Instruct) when using this LoRA.
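
For reference, here is a minimal sketch of what multi-LoRA serving could look like with vLLM's offline Python API. The adapter path, adapter name, rank, and context length below are placeholder assumptions, not tested values, and it assumes a vLLM version recent enough to expose a `rope_theta` engine override; if yours doesn't, you can instead edit `rope_theta` in the base model's `config.json` before loading it:

```python
# Minimal multi-LoRA sketch with vLLM (offline API). Values marked as
# assumptions should be adjusted to the actual adapter.
from vllm import LLM, SamplingParams
from vllm.lora.request import LoRARequest

llm = LLM(
    model="meta-llama/Meta-Llama-3-8B-Instruct",
    enable_lora=True,
    max_lora_rank=32,         # assumption: set to the adapter's actual rank
    max_model_len=81920,      # assumption: the 80K context this LoRA targets
    rope_theta=200_000_000,   # per the author: rope_theta must be 200M
)

prompt = "Summarize the following document: ..."
params = SamplingParams(temperature=0.0, max_tokens=256)

# Route this request through the LoRA adapter; requests that omit
# lora_request hit the plain base model in the same engine.
outputs = llm.generate(
    [prompt],
    params,
    lora_request=LoRARequest(
        "llama3-80k",                            # adapter name (arbitrary)
        1,                                       # unique integer adapter id
        "/path/to/Llama-3-8B-Instruct-80K-QLoRA" # assumption: local adapter path
    ),
)
print(outputs[0].outputs[0].text)
```

The OpenAI-compatible server should support the same setup via flags along the lines of `--enable-lora --lora-modules llama3-80k=/path/to/adapter --rope-theta 200000000`, though the exact flag names may vary by vLLM version, so check the docs for the release you're running.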