How to use this onnx model

by mkj69 - opened Mar 8

Discussion

mkj69

Mar 8

Ask how to use this onnx model, and especially how to configure this splitter based on these json files?

fxmarty

Owner Mar 22

Hi @mkj69 this model is not meant to be used. Please use the last up to date optimum version to export decoder models in a single ONNX file:

optimum-cli export onnx --model gpt2 gpt2_onnx

The prefill step needs to pass 0-length past key values (KV cache). You can inspect the exported model with Netron to understand what are the inputs/outputs.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment