Learning domain taxonomies: the TaxoLine approach
PurposeThe aim of this paper is to present an online framework for building a domain taxonomy, called TaxoLine, from Web documents automatically.Design/methodology/approachTaxoLine proposes an innovative methodology that combines frequency and conditional mutual information to improve the quality of...
Gespeichert in:
Veröffentlicht in: | International journal of Web information systems 2017-08, Vol.13 (3), p.281-301 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | PurposeThe aim of this paper is to present an online framework for building a domain taxonomy, called TaxoLine, from Web documents automatically.Design/methodology/approachTaxoLine proposes an innovative methodology that combines frequency and conditional mutual information to improve the quality of the domain taxonomy. The system also includes a set of mechanisms that improve the execution time needed to build the ontology.FindingsThe performance of the TaxoLine framework was applied to nine different financial corpora. The generated taxonomies are evaluated against a gold-standard ontology and are compared to state-of-the-art ontology learning methods.Originality/valueThe experimental results show that TaxoLine produces high precision and recall for both concept and relation extraction than well-known ontology learning algorithms. Furthermore, it also shows promising results in terms of execution time needed to build the domain taxonomy. |
---|---|
ISSN: | 1744-0084 1744-0092 |
DOI: | 10.1108/IJWIS-04-2017-0024 |