Extending the Growing Hierarchal SOM for clustering documents in graphs domain

The growing hierarchal self-organizing map (GHSOM) is the most efficient model among the variants of SOM. It is used successfully in document clustering and in various pattern recognition applications effectively. The main constraint that limits the implementation of this model and all the other var...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Hussin, M.F., Farra, M.R., El-Sonbaty, Y.
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The growing hierarchal self-organizing map (GHSOM) is the most efficient model among the variants of SOM. It is used successfully in document clustering and in various pattern recognition applications effectively. The main constraint that limits the implementation of this model and all the other variants of SOM models is that they work only with vector space model (VSM). In this paper, we extend the GHSOM to work in the graph domain to enhance the quality of clusters. Specifically, we represent the documents by graphs and then cluster those documents by using a new algorithm G-GHSOM: graph-based growing merarchal SOM after modifying its operations to work with the graph instead of vector space. We have tested the G-GHSOM on two different document collections using three different measures for evaluating clustering quality. The experimental results of the proposed G-GHSOM show an improvement in terms of clustering quality compared to classical GHSOM.
ISSN:2161-4393
2161-4407
DOI:10.1109/IJCNN.2008.4634377