Complex Question Enhanced Transfer Learning for Zero-shot Joint Information Extraction

Zero-shot information extraction (IE) tasks have attracted great attention recently. However, how to jointly model multiple IE tasks in the zero-shot scenario is still an open question. In this paper, we focus on zero-shot joint IE tasks and highlight how to transfer the knowledge of cross-task rela...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE/ACM transactions on audio, speech, and language processing speech, and language processing, 2024-01, Vol.32, p.1-15
Hauptverfasser:	Zhang, Ying, Meng, Fandong, Chen, Yufeng, Xu, Jinan, Zhou, Jie
Format:	Artikel
Sprache:	eng
Schlagworte:	Complex Question Data mining Data models Event Detection Information Extraction Information retrieval Joining processes Knowledge management Machine Reading Comprehension Named Entity Recognition Questions Relation Extraction Speech processing Synthetic data Task analysis Training Training data
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Zero-shot information extraction (IE) tasks have attracted great attention recently. However, how to jointly model multiple IE tasks in the zero-shot scenario is still an open question. In this paper, we focus on zero-shot joint IE tasks and highlight how to transfer the knowledge of cross-task relations from the source domain to the target domain. To solve this problem, we first unify all IE tasks with a machine reading comprehension (MRC) framework, which can make the most of training data and enhance its ability on span extraction. Then, we generate complex questions to explicitly model cross-task relations with natural language descriptions, thereby providing prior knowledge for pre-defined types and building more general linkages among different entities and triggers as well. Specifically, we define three operations for generating templates for complex questions, i.e., intersecting , connecting , and composing . Besides, we design an efficient training strategy to exploit the synthetic data with complex questions. We evaluate our approach on four datasets from different domains for various IE tasks. Experimental results show the effectiveness of our approach in improving the performance of zero-shot joint IE tasks in multiple domains.
ISSN:	2329-9290 2329-9304
DOI:	10.1109/TASLP.2023.3304481