Network Optimization based on Genetic Algorithm for High-Level Data Classification

High-level data classification techniques are capable of considering not only physical aspects of the data, such as space, distance, proximity, distribution, but can also consider their functional, topological and structural aspects. High-level techniques are commonly defined in two major steps: the...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Revista IEEE América Latina 2023-02, Vol.21 (2), p.295-301
Hauptverfasser: Moura Fernandes, Janayna, Barbosa de Oliveira, Gina Maira, Guimaraes Carneiro, Murillo
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:High-level data classification techniques are capable of considering not only physical aspects of the data, such as space, distance, proximity, distribution, but can also consider their functional, topological and structural aspects. High-level techniques are commonly defined in two major steps: the construction of a network from the feature vector data and the uncovering of its underlying patterns using complex networks properties. In the network construction step, heuristics based on k-nearest neighbors strategies have been widely adopted, while several complex network measures (e.g. PageRank) have been modeled to learn high-level patterns of the input data. As both steps are directly related, i.e., the network configuration impacts directly the results obtained by the classifier, in this paper we develop a genetic algorithm (GA) to optimize the network construction step. To be specific, we hypothesize that the salient features of GAs, such as their robust search mechanism and binary representation, may provide a more powerful network representation in the context of the high-level classification based on importance characterization. In summary, extensive experiments with real data sets demonstrate that the networks provided by our GA strategy achieved higher predictive accuracy than those of a widely adopted method based on the nearest neighbors heuristic and competitive results against state-of-the-art ones.
ISSN:1548-0992
1548-0992
DOI:10.1109/TLA.2023.10015222