--- license: cc-by-4.0 tags: - yahoo-open-source-software-incubator - image-to-text - image-captioning inference: false --- # Object Relation Transformer The Object Relation Transformer is a Transformer-based image captioning model. You can find more details about the model in our [NeurIPS 2019 paper](https://papers.nips.cc/paper/9293-image-captioning-transforming-objects-into-words.pdf). This model repository contains two variants of the Object Relation Transformer, as well as a couple of baseline models. Please find more details about all these models within the [README of our Github repository](https://github.com/yahoo/object_relation_transformer?tab=readme-ov-file#model-zoo-and-results). ## Citation If you find these models useful, please consider citing (no obligation at all): ``` @article{herdade2019image, title={Image Captioning: Transforming Objects into Words}, author={Herdade, Simao and Kappeler, Armin and Boakye, Kofi and Soares, Joao}, journal={arXiv preprint arXiv:1906.05963}, year={2019} } ``` ## Maintainers - Joao Soares: jvbsoares@yahooinc.com ## License The contents of this repository are (c) by Verizon Media. The contents of this repository are licensed under a Creative Commons Attribution 4.0 International License. You should have received a copy of the license along with this work. If not, see .