Repairing inconsistent taxonomies using MAP inference and rules of thumb
Several authors have developed relation extraction methods for automatically learning or refining taxonomies from large text corpora such as the Web. However, without appropriate post-processing, such taxonomies are often inconsistent (e.g. they contain cycles). A standard approach to repairing such...
Gespeichert in:
Hauptverfasser: | , , , , , |
---|---|
Format: | Tagungsbericht |
Sprache: | eng |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Several authors have developed relation extraction methods for automatically learning or refining taxonomies from large text corpora such as the Web. However, without appropriate post-processing,
such taxonomies are often inconsistent (e.g. they contain cycles). A standard approach to repairing such inconsistencies is to identify a minimally consistent subset of the extracted facts. For example, we could aim to minimize the sum of the confidence weights of the facts that are removed for restoring consistency. In this paper, we present MAP inference as a base method for this approach, and analyze how it can be improved by taking into account dependencies between the extracted facts. These dependencies correspond
to rules of thumb such as "if a given fact is wrong then all facts that have been extracted from the same sentence are also likely to be wrong", which we encode in Markov logic. We present experimental results to demonstrate the potential of this idea. |
---|