A graph based keyword extraction model using collective node weight

•This paper proposes a keyword extraction method called KECNW.•KECNW is a novel unsupervised graph based keyword extraction method.•It determines node weight by collectively taking various influencing parameters.•This model is validated with five datasets.•Its performance is far better than the othe...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Expert systems with applications 2018-05, Vol.97, p.51-59
Hauptverfasser: Biswas, Saroj Kr, Bordoloi, Monali, Shreya, Jacob
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:•This paper proposes a keyword extraction method called KECNW.•KECNW is a novel unsupervised graph based keyword extraction method.•It determines node weight by collectively taking various influencing parameters.•This model is validated with five datasets.•Its performance is far better than the other existing methods. In the recent times, a huge amount of text is being generated for social purposes on twitter social networking site. Summarizing and analysing of twitter content is an important task as it benefits many applications such as information retrieval, automatic indexing, automatic classification, automatic clustering, automatic filtering etc. One of the most important tasks in analyzing tweets is automatic keyword extraction. There are some graph based approaches for keyword extraction which determine keywords only based on centrality measure. However, the importance of a keyword in twitter depends on various parameters such as frequency, centrality, position and strength of neighbors of the keyword. Therefore, this paper proposes a novel unsupervised graph based keyword extraction method called Keyword Extraction using Collective Node Weight (KECNW) which determines the importance of a keyword by collectively taking various influencing parameters. The KECNW is based on Node Edge rank centrality with node weight depending on various parameters. The model is validated with five datasets: Uri Attack, American Election, Harry Potter, IPL and Donald Trump. The result of KECMW is compared with three existing models. It is observed from the experimental results that the proposed method is far better than the others. The performances are shown in terms of precision, recall and F-measure.
ISSN:0957-4174
1873-6793
DOI:10.1016/j.eswa.2017.12.025