arxiv:2310.03668

GoLLIE: Annotation Guidelines improve Zero-Shot Information-Extraction

Published on Oct 5, 2023


Authors:

Oscar Sainz, Iker García-Ferrero, Rodrigo Agerri, Oier Lopez de Lacalle, Eneko Agirre

Abstract

Large Language Models (LLMs) combined with instruction tuning have made significant progress when generalizing to unseen tasks. However, they have been less successful in Information Extraction (IE), lagging behind task-specific models. Typically, IE tasks are characterized by complex annotation guidelines which describe the task and give examples to humans. Previous attempts to leverage such information have failed, even with the largest models, as they are not able to follow the guidelines out-of-the-box. In this paper, we propose GoLLIE (Guideline-following Large Language Model for IE), a model able to improve zero-shot results on unseen IE tasks by virtue of being fine-tuned to comply with annotation guidelines. Comprehensive evaluation empirically demonstrates that GoLLIE is able to generalize to and follow unseen guidelines, outperforming previous attempts at zero-shot information extraction. The ablation study shows that detailed guidelines are key for good results.
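As a rough illustration of what supplying annotation guidelines to a model at inference time could look like, the sketch below assembles a zero-shot prompt that states each label's guideline before the input text. The `LabelGuideline` structure, the `build_prompt` helper, the prompt layout, and the guideline wording are all hypothetical and written for this example only; the abstract does not specify the input format used by GoLLIE.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass
class LabelGuideline:
    """A label plus the guideline text that defines it (hypothetical structure)."""
    name: str
    guideline: str


def build_prompt(guidelines: Sequence[LabelGuideline], text: str) -> str:
    """Assemble a zero-shot prompt that lists each label's guideline before the input."""
    lines = ["Annotation guidelines:"]
    for g in guidelines:
        lines.append(f"- {g.name}: {g.guideline}")
    lines.append("")
    lines.append(f"Text: {text}")
    lines.append("Entities:")
    return "\n".join(lines)


# Illustrative guidelines, invented for this example (not taken from any dataset).
guidelines = [
    LabelGuideline("Person", "Names of individual people, excluding titles such as 'Dr.'."),
    LabelGuideline("Organization", "Companies, institutions, and agencies referred to by name."),
]

prompt = build_prompt(guidelines, "Ada Lovelace worked with Charles Babbage.")
print(prompt)
```

Keeping the guideline text next to each label name means an unseen label can be introduced at inference time simply by writing a new guideline entry, which is the kind of setting the abstract describes.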


Models citing this paper 3

Datasets citing this paper 0


