Repairing inconsistent taxonomies using MAP inference and rules of thumb

Several authors have developed relation extraction methods for automatically learning or refining taxonomies from large text corpora such as the Web. However, without appropriate post-processing, such taxonomies are often inconsistent (e.g. they contain cycles). A standard approach to repairing such...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Merhej, Elie, Schockaert, Steven, De Cock, Martine, Blondeel, Marjon, Alfarone, Daniele, Davis, Jesse
Format: Tagungsbericht
Sprache:eng
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Several authors have developed relation extraction methods for automatically learning or refining taxonomies from large text corpora such as the Web. However, without appropriate post-processing, such taxonomies are often inconsistent (e.g. they contain cycles). A standard approach to repairing such inconsistencies is to identify a minimally consistent subset of the extracted facts. For example, we could aim to minimize the sum of the confidence weights of the facts that are removed for restoring consistency. In this paper, we present MAP inference as a base method for this approach, and analyze how it can be improved by taking into account dependencies between the extracted facts. These dependencies correspond to rules of thumb such as "if a given fact is wrong then all facts that have been extracted from the same sentence are also likely to be wrong", which we encode in Markov logic. We present experimental results to demonstrate the potential of this idea.