Learning domain taxonomies: the TaxoLine approach

PurposeThe aim of this paper is to present an online framework for building a domain taxonomy, called TaxoLine, from Web documents automatically.Design/methodology/approachTaxoLine proposes an innovative methodology that combines frequency and conditional mutual information to improve the quality of...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:International journal of Web information systems 2017-08, Vol.13 (3), p.281-301
Hauptverfasser: El Idrissi Esserhrouchni, Omar, Frikh, Bouchra, Ouhbi, Brahim, Ibrahim, Ismail Khalil
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:PurposeThe aim of this paper is to present an online framework for building a domain taxonomy, called TaxoLine, from Web documents automatically.Design/methodology/approachTaxoLine proposes an innovative methodology that combines frequency and conditional mutual information to improve the quality of the domain taxonomy. The system also includes a set of mechanisms that improve the execution time needed to build the ontology.FindingsThe performance of the TaxoLine framework was applied to nine different financial corpora. The generated taxonomies are evaluated against a gold-standard ontology and are compared to state-of-the-art ontology learning methods.Originality/valueThe experimental results show that TaxoLine produces high precision and recall for both concept and relation extraction than well-known ontology learning algorithms. Furthermore, it also shows promising results in terms of execution time needed to build the domain taxonomy.
ISSN:1744-0084
1744-0092
DOI:10.1108/IJWIS-04-2017-0024