Inconsistency of evaluation metrics in link prediction
Link prediction is a paradigmatic and challenging problem in network science, which aims to predict missing links, future links and temporal links based on known topology. Along with the increasing number of link prediction algorithms, a critical yet previously ignored risk is that the evaluation me...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Link prediction is a paradigmatic and challenging problem in network science,
which aims to predict missing links, future links and temporal links based on
known topology. Along with the increasing number of link prediction algorithms,
a critical yet previously ignored risk is that the evaluation metrics for
algorithm performance are usually chosen at will. This paper implements
extensive experiments on hundreds of real networks and 25 well-known
algorithms, revealing significant inconsistency among evaluation metrics,
namely different metrics probably produce remarkably different rankings of
algorithms. Therefore, we conclude that any single metric cannot
comprehensively or credibly evaluate algorithm performance. Further analysis
suggests the usage of at least two metrics: one is the area under the receiver
operating characteristic curve (AUC), and the other is one of the following
three candidates, say the area under the precision-recall curve (AUPR), the
area under the precision curve (AUC-Precision), and the normalized discounted
cumulative gain (NDCG). In addition, as we have proved the essential
equivalence of threshold-dependent metrics, if in a link prediction task, some
specific thresholds are meaningful, we can consider any one threshold-dependent
metric with those thresholds. This work completes a missing part in the
landscape of link prediction, and provides a starting point toward a
well-accepted criterion or standard to select proper evaluation metrics for
link prediction. |
---|---|
DOI: | 10.48550/arxiv.2402.08893 |