Contrastive Graph Convolutional Networks with adaptive augmentation for text classification

Text classification is an important research topic in natural language processing (NLP), and Graph Neural Networks (GNNs) have recently been applied in this task. However, in existing graph-based models, text graphs constructed by rules are not real graph data and introduce massive noise. More impor...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Information processing & management 2022-07, Vol.59 (4), p.102946, Article 102946
Hauptverfasser: Yang, Yintao, Miao, Rui, Wang, Yili, Wang, Xin
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Text classification is an important research topic in natural language processing (NLP), and Graph Neural Networks (GNNs) have recently been applied in this task. However, in existing graph-based models, text graphs constructed by rules are not real graph data and introduce massive noise. More importantly, for fixed corpus-level graph structure, these models cannot sufficiently exploit the labeled and unlabeled information of nodes. Meanwhile, contrastive learning has been developed as an effective method in graph domain to fully utilize the information of nodes. Therefore, we propose a new graph-based model for text classification named CGA2TC, which introduces contrastive learning with an adaptive augmentation strategy into obtaining more robust node representation. First, we explore word co-occurrence and document word relationships to construct a text graph. Then, we design an adaptive augmentation strategy for the text graph with noise to generate two contrastive views that effectively solve the noise problem and preserve essential structure. Specifically, we design noise-based and centrality-based augmentation strategies on the topological structure of text graph to disturb the unimportant connections and thus highlight the relatively important edges. As for the labeled nodes, we take the nodes with same label as multiple positive samples and assign them to anchor node, while we employ consistency training on unlabeled nodes to constrain model predictions. Finally, to reduce the resource consumption of contrastive learning, we adopt a random sample method to select some nodes to calculate contrastive loss. The experimental results on several benchmark datasets can demonstrate the effectiveness of CGA2TC on the text classification task. •We study a novel problem of applying supervised graph contrastive learning to text classification, and propose a contrastive graph representation learning framework called CGA2TC.•The CGA2TC not only makes full use of labeled and unlabeled data but also randomly utilizes some nodes in the contrastive training process instead of all nodes to reduce resource consumption.•The CGA2TC with adaptive augmentation enables more effective preservation of the graph’s structure and obtains robust text representations for the text classification task.•We select two centrality measures for nodes in the adaptive augmentation section to measure the importance of the nodes, and then remove the edges of the nodes based on their importa
ISSN:0306-4573
1873-5371
DOI:10.1016/j.ipm.2022.102946