Graph based semi-supervised classification with probabilistic nearest neighbors
•PNN jointly learns graph structure and probability transition matrix for inference.•PNN explore complex data structure preferably, with low computation complexity.•PNN enhances relevance between graph construction and inference.•PNN is optimized according to min-max normalization for discriminate d...
Gespeichert in:
Veröffentlicht in: | Pattern recognition letters 2020-05, Vol.133, p.94-101 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | •PNN jointly learns graph structure and probability transition matrix for inference.•PNN explore complex data structure preferably, with low computation complexity.•PNN enhances relevance between graph construction and inference.•PNN is optimized according to min-max normalization for discriminate data.•PNN is more conducive to classification accuracy and efficiency.
Label propagation (LP) is one of the state-of-the-art graph based semi-supervised learning (GSSL) algorithm. Probability transition matrix (PTM) is the key for LP to propagate label information among samples. Conventionally, PTM is calculated based on the graph constructed in advance, and graph construction independent of PTM calculation. It leads to complex steps for acquiring PTM, and more importantly, brings about the lack of correlation between graph construction and inference. Based on adaptive neighbors-based method, probabilistic nearest neighbors (PNN) based graph construction algorithm is proposed for effective ℓ2 norm optimization, and the solving process of the objective function is optimized by incorporating min-max normalization. The derived PNN matrix is more discriminative and directly serve as PTM for LP. It makes PTM computation more conveniently and more applicable for classification task. In addition, number of neighbors is adaptively determined on the premise of its preset value. Experimental results show that the proposed PNN algorithm specializes in reflecting probability differences of neighboring nodes in a graph, and positive results are achieved in semi-supervised classification. The average classification accuracy on synthetic data sets is 84.24%, and that on image data sets achieves 89.08%. |
---|---|
ISSN: | 0167-8655 1872-7344 |
DOI: | 10.1016/j.patrec.2020.01.021 |