Clustering of text documents by projective dimension of subspaces using PART neural network

Bibliographic details
Main authors: Krakovsky, R., Mokris, I.
Format: Conference proceeding
Language: English
Description
Summary: The paper deals with clustering of text documents by neural networks. Text documents are represented with the Vector Space (VS) model, which describes them by the VS matrix X. Clustering text documents in the multidimensional space of matrix X has high computational complexity, so the space needs to be reduced. In our approach, the text document space is reduced by decomposing the multidimensional space of matrix X through projection into subspaces. The subspaces are created with the Projective Adaptive Resonance Theory (PART) neural network, which both reduces the multidimensional text document space and clusters the text documents. The efficiency of clustering text documents in subspaces of the multidimensional space is influenced by the properties of PART, so its parameters have to be set optimally. With exact settings of the distance and vigilance parameters of PART, it is possible to find the clusters and their centers in the projective dimensions of the subspaces, and to create an outlier cluster for noisy data sets. Applying the PART neural network to text document clustering can easily discover the intrinsic clusters in the document sets used.
DOI: 10.1109/SACI.2012.6250002
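
Illustration: the summary mentions a VS matrix X and subspaces defined by projective dimensions. The sketch below is not the PART network from the paper. It is a minimal plain-Python illustration, under stated assumptions, of two ideas: building a term-count matrix X, and finding the dimensions on which a group of documents agree within a distance threshold. The function names, the whitespace tokenizer, the rule of ignoring all-zero terms, and the values chosen for `sigma` and `rho` are all hypothetical and are not taken from the paper.

```python
from collections import Counter


def build_vs_matrix(docs):
    """Return (vocab, X), where X[i][j] counts term vocab[j] in document i.

    Assumption: naive lowercase whitespace tokenization and raw counts.
    """
    tokenized = [doc.lower().split() for doc in docs]
    vocab = sorted({term for tokens in tokenized for term in tokens})
    X = []
    for tokens in tokenized:
        counts = Counter(tokens)
        X.append([counts[term] for term in vocab])
    return vocab, X


def shared_dimensions(rows, sigma):
    """Indices of dimensions where all rows lie within sigma of each other.

    Assumption: dimensions that are zero in every row are skipped, so that
    terms absent from all documents do not count as agreement.
    """
    dims = []
    for j in range(len(rows[0])):
        column = [row[j] for row in rows]
        if max(column) == 0:
            continue
        if max(column) - min(column) <= sigma:
            dims.append(j)
    return dims


docs = [
    "neural network clustering",
    "neural network training",
    "text document clustering",
]
vocab, X = build_vs_matrix(docs)

# Hypothetical parameters: sigma plays the role of a distance threshold,
# rho the role of a vigilance-like minimum number of shared dimensions.
sigma = 0
rho = 2

dims = shared_dimensions(X[:2], sigma)
shared_terms = [vocab[j] for j in dims]
forms_cluster = len(dims) >= rho
```

Here the first two documents agree on the terms "network" and "neural", so under the hypothetical threshold `rho = 2` they would be grouped in the two-dimensional subspace spanned by those terms. How PART itself selects dimensions and updates cluster centers is described in the paper, not in this sketch.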