Automatic extraction and incorporation of purpose data into PurposeNet
PurposeNet is a knowledge base of objects and actions in which the knowledge is organized around purpose. Such knowledge also connects with language - namely, verbs for related actions. It can be used with an embedded reasoner, resulting in an effective system for QA, topic-listing, summarization an...
Gespeichert in:
Hauptverfasser: | , , |
---|---|
Format: | Tagungsbericht |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | PurposeNet is a knowledge base of objects and actions in which the knowledge is organized around purpose. Such knowledge also connects with language - namely, verbs for related actions. It can be used with an embedded reasoner, resulting in an effective system for QA, topic-listing, summarization and other tasks. However, extracting PurposeNet related data manually is time-consuming, labor-intensive, and expensive. This paper describes a framework for automatic purpose data extraction, given a corpus. It identifies a set of lexico-syntactic patterns that are easily recognizable, that occur frequently and across text genre boundaries, and that indisputably indicate the lexical relation of purpose data. It also deals with the subsequent automatic incorporation of this data into the PurposeNet resource. The results are used to augment and critique the structure of a large hand-built resource. The cases where purpose data is incomplete has also been analyzed. The extent of success, in terms of richness of the resource, achieved in the process is also discussed. |
---|---|
DOI: | 10.1109/ICCET.2010.5486346 |