Explicit, Implicit, and Scattered: Revisiting Event Extraction to Capture Complex Arguments
Prior works formulate the extraction of event-specific arguments as a span extraction problem, where event arguments are explicit -- i.e. assumed to be contiguous spans of text in a document. In this study, we revisit this definition of Event Extraction (EE) by introducing two key argument types tha...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Prior works formulate the extraction of event-specific arguments as a span
extraction problem, where event arguments are explicit -- i.e. assumed to be
contiguous spans of text in a document. In this study, we revisit this
definition of Event Extraction (EE) by introducing two key argument types that
cannot be modeled by existing EE frameworks. First, implicit arguments are
event arguments which are not explicitly mentioned in the text, but can be
inferred through context. Second, scattered arguments are event arguments that
are composed of information scattered throughout the text. These two argument
types are crucial to elicit the full breadth of information required for proper
event modeling.
To support the extraction of explicit, implicit, and scattered arguments, we
develop a novel dataset, DiscourseEE, which includes 7,464 argument annotations
from online health discourse. Notably, 51.2% of the arguments are implicit, and
17.4% are scattered, making DiscourseEE a unique corpus for complex event
extraction. Additionally, we formulate argument extraction as a text generation
problem to facilitate the extraction of complex argument types. We provide a
comprehensive evaluation of state-of-the-art models and highlight critical open
challenges in generative event extraction. Our data and codebase are available
at https://omar-sharif03.github.io/DiscourseEE. |
---|---|
DOI: | 10.48550/arxiv.2410.03594 |