Heterogeneous hypergraph learning for literature retrieval based on citation intents
Literature retrieval helps scientists find previous work that is relative to their own research or even get new research ideas. However, the discrepancy between retrieval results and the ultimate intention of citation is neglected by most literature retrieval models. Citation intent refers to the re...
Gespeichert in:
Veröffentlicht in: | Scientometrics 2024-07, Vol.129 (7), p.4167-4188 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Literature retrieval helps scientists find previous work that is relative to their own research or even get new research ideas. However, the discrepancy between retrieval results and the ultimate intention of citation is neglected by most literature retrieval models. Citation intent refers to the researcher’s motivation for citing a paper. A citation intent graph with homogeneous nodes and heterogeneous hyperedges can represent different types of citation intents. By leveraging the citation intent information included in a hypergraph, a retrieval model can guide researchers on where to cite its retrieval result by understanding the citation behaviour in the graph. We present a ranking model called CitenGL (
Ci
tation In
ten
t
G
raph
L
earning) that aims to extract citation intent information and textual matching signals. The proposed model consists of a heterogeneous hypergraph encoder and a lightweight deep fusion unit for efficiency trade-offs. Compared to traditional literature retrieval, our model fills the gap between retrieval results and citation intention and yields an understandable graph-structured output. We evaluated our model on publicly available full-text paper datasets. Experimental results show that CitenGL outperforms most existing neural ranking models that only consider textual information, which illustrates the effectiveness of integrating citation intent information with textual information. Further ablation analyses show how citation intent information complements text-matching signals and citation networks. |
---|---|
ISSN: | 0138-9130 1588-2861 |
DOI: | 10.1007/s11192-024-05066-4 |