Mismatched `inputs_embeds` and `logits` shape for decoder_with_past and other decoders?

#9
by carlesonielfa - opened

Hi! Thanks for uploading the conversion to ONNX. I'm trying to use this on a Triton server where the merged graph does not seem to be supported. However, I can see that for the decoder_with_past the shapes are

  • inputs_embeds [batch_size,16,768]
  • logits [batch_size, 16, 51289]

And for the decoder and decoder_merged exports the shapes are

  • inputs_embeds [batch_size, decoder_sequence_length, 768]
  • logits [batch_size,decoder_sequence_length,51289]

Is this right? Thank you in advance!

Same error

Sign up or log in to comment