Credit to original: https://huggingface.co/deepghs/ml-danbooru-onnx
Modified to add an intermediate vector (B, 384) from decoder output (sum reduced) for your random use, like similiar image search.
Also added sigmoid to graph end so you don't do it manually.