Design and analysis of text document clustering using salp swarm algorithm

In the technological era, exponential increase of unorganized text documents offers increased difficulties retrieving the most relevant data. The document clustering is a most prominent technique that transforms unorganized contents into organized contents in the form of clusters. The recognition te...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:The Journal of supercomputing 2022-09, Vol.78 (14), p.16197-16213
Hauptverfasser: Ponnusamy, Muruganantham, Bedi, Pradeep, Suresh, Tamilarasi, Alagarsamy, Aravindhan, Manikandan, R., Yuvaraj, N.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:In the technological era, exponential increase of unorganized text documents offers increased difficulties retrieving the most relevant data. The document clustering is a most prominent technique that transforms unorganized contents into organized contents in the form of clusters. The recognition technique always undergoes clustering of text documents with misleading or redundant information that degrades document clustering quality. In this study, a salp swarm algorithm (SSA) is used for clustering the text documents. The study is improved with a similarity and a distance-based measurements as an objective function in the clustering domain. The experimental validation is conducted to show the efficacy of SSA-based similarity distance measurement that prominently improves the quality of clustering the text documents. The comparison with existing methods shows that the proposed SSA offers better clustering of text documents in accuracy, sensitivity, specificity, and f -measure.
ISSN:0920-8542
1573-0484
DOI:10.1007/s11227-022-04525-0