Reducing systematic review workload through certainty-based screening
[Display omitted] •Active learning is promising in the areas with complex topics in systematic reviews.•Certainty criteria is promising to accelerate screening regardless of the topic.•Certainty criteria performs as well as uncertainty criteria in classification.•Weighting positive instances is prom...
Gespeichert in:
Veröffentlicht in: | Journal of biomedical informatics 2014-10, Vol.51, p.242-253 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | [Display omitted]
•Active learning is promising in the areas with complex topics in systematic reviews.•Certainty criteria is promising to accelerate screening regardless of the topic.•Certainty criteria performs as well as uncertainty criteria in classification.•Weighting positive instances is promising to overcome the data imbalance.•Unsupervised methods enhance the classification performance.
In systematic reviews, the growing number of published studies imposes a significant screening workload on reviewers. Active learning is a promising approach to reduce the workload by automating some of the screening decisions, but it has been evaluated for a limited number of disciplines. The suitability of applying active learning to complex topics in disciplines such as social science has not been studied, and the selection of useful criteria and enhancements to address the data imbalance problem in systematic reviews remains an open problem. We applied active learning with two criteria (certainty and uncertainty) and several enhancements in both clinical medicine and social science (specifically, public health) areas, and compared the results in both. The results show that the certainty criterion is useful for finding relevant documents, and weighting positive instances is promising to overcome the data imbalance problem in both data sets. Latent dirichlet allocation (LDA) is also shown to be promising when little manually-assigned information is available. Active learning is effective in complex topics, although its efficiency is limited due to the difficulties in text classification. The most promising criterion and weighting method are the same regardless of the review topic, and unsupervised techniques like LDA have a possibility to boost the performance of active learning without manual annotation. |
---|---|
ISSN: | 1532-0464 1532-0480 1532-0480 |
DOI: | 10.1016/j.jbi.2014.06.005 |