Qwen1.5-32B?
#4, opened by haili-tian
Qwen1.5-72B needs a lot of memory for its KV cache, so it is not a deployment-friendly model. Maybe a Qwen1.5-32B would be suitable for most deployment situations.
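To make the memory concern concrete, here is a rough sketch of the usual KV cache size estimate for a decoder-only transformer. The function name `kv_cache_bytes` is hypothetical, and the layer count, KV head count, and head dimension below are illustrative assumptions for a 72B-class model without grouped-query attention. They are not read from the Qwen1.5-72B `config.json`, so check the real values before relying on the numbers.

```python
def kv_cache_bytes(num_layers, num_kv_heads, head_dim,
                   seq_len, batch_size=1, bytes_per_elem=2):
    """Estimate KV cache size in bytes.

    The factor 2 covers one key tensor and one value tensor per layer.
    bytes_per_elem=2 corresponds to fp16/bf16 cache entries.
    """
    return (2 * num_layers * num_kv_heads * head_dim
            * seq_len * batch_size * bytes_per_elem)


# Assumed, illustrative shape (NOT taken from the actual model config):
# 80 layers, 64 KV heads, head dimension 128.
per_token = kv_cache_bytes(80, 64, 128, seq_len=1)
full_context = kv_cache_bytes(80, 64, 128, seq_len=32768)

print(f"per token:     {per_token / 2**20:.2f} MiB")
print(f"32768 tokens:  {full_context / 2**30:.2f} GiB")
```

Under these assumptions the cache grows linearly with sequence length and batch size, on top of the memory needed for the weights themselves. A smaller model, or one with fewer KV heads, shrinks the estimate proportionally.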