intermediate_size which was incompatible with vLLM parallel inference

#1
by cduk - opened

Can someone please explain what the intermediate_size issue is?
