Qwen1.5-32B?

#4
by haili-tian - opened

Qwen1.5-72B needs a lot of memory for the KV cache, so it is not a deployment-friendly model. Maybe a Qwen1.5-32B would be suitable for most deployment situations.
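To put a rough number on the KV cache concern, here is a minimal back-of-the-envelope sketch. The function name `kv_cache_bytes` and every configuration value below are my own illustrative assumptions (80 layers, 64 KV heads, head dimension 128, fp16), not figures confirmed in this thread; please check the model's `config.json` before relying on them.

```python
def kv_cache_bytes(num_layers, num_kv_heads, head_dim, seq_len,
                   batch_size=1, bytes_per_elem=2):
    """Estimate KV cache size in bytes for a decoder-only transformer.

    The leading factor of 2 accounts for storing both keys and values.
    bytes_per_elem=2 assumes fp16/bf16 cache entries.
    """
    return (2 * num_layers * num_kv_heads * head_dim
            * seq_len * batch_size * bytes_per_elem)


GIB = 1024 ** 3

# Assumed 72B-class configuration (illustrative, not taken from this thread).
large = kv_cache_bytes(num_layers=80, num_kv_heads=64, head_dim=128,
                       seq_len=32768)

# Hypothetical smaller model with fewer layers and fewer KV heads.
small = kv_cache_bytes(num_layers=64, num_kv_heads=8, head_dim=128,
                       seq_len=32768)

print(f"assumed large config: {large / GIB:.1f} GiB per 32k-token sequence")
print(f"hypothetical small config: {small / GIB:.1f} GiB per 32k-token sequence")
```

Under these assumptions the cache grows linearly with sequence length and batch size, so the number of layers and KV heads is what decides how many concurrent long requests fit on a given GPU.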
