SWIFT-Active Screener: Accelerated document screening through active learning and integrated recall estimation


Bibliographic details
Published in: Environment international 2020-05, Vol.138, p.105623-105623, Article 105623
Main authors: Howard, Brian E., Phillips, Jason, Tandon, Arpit, Maharana, Adyasha, Elmore, Rebecca, Mav, Deepak, Sedykh, Alex, Thayer, Kristina, Merrick, B. Alex, Walker, Vickie, Rooney, Andrew, Shah, Ruchir R.
Format: Article
Language: English
Description
Summary:
•Machine learning tools can reduce screening burden in systematic reviews.
•SWIFT-Active Screener is a machine learning web app for document screening.
•It uses one model to prioritize documents and another to estimate recall.
•On average, 95% of relevant articles are revealed after screening 40% of the total.
•Savings are increased on larger reviews with more references to screen.

In the screening phase of systematic review, researchers use detailed inclusion/exclusion criteria to decide whether each article in a set of candidate articles is relevant to the research question under consideration. A typical review may require screening thousands or tens of thousands of articles and can consume hundreds of person-hours of labor. Here we introduce SWIFT-Active Screener, a web-based, collaborative systematic review software application designed to reduce the overall screening burden during this resource-intensive phase of the review process. To prioritize articles for review, SWIFT-Active Screener uses active learning, a type of machine learning that incorporates user feedback during screening. Meanwhile, a negative binomial model is employed to estimate the number of relevant articles remaining in the unscreened document list. Using a simulation involving 26 diverse systematic review datasets that were previously screened by reviewers, we evaluated both the document prioritization and recall estimation methods. On average, 95% of the relevant articles were identified after screening only 40% of the total reference list. In the 5 document sets with 5,000 or more references, 95% recall was achieved after screening only 34% of the available references, on average. Furthermore, the recall estimator we have proposed provides a useful, conservative estimate of the percentage of relevant documents identified during the screening process.
SWIFT-Active Screener can result in significant time savings compared to traditional screening and the savings are increased for larger project sizes. Moreover, the integration of explicit recall estimation during screening solves an important challenge faced by all machine learning systems for document screening: when to stop screening a prioritized reference list. The software is currently available in the form of a multi-user, collaborative, online web application.
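The two quantities evaluated in the summary can be illustrated with a minimal Python sketch. It is not the SWIFT-Active Screener implementation: the function names, the `lookback` parameter, and the simplified rate-based estimator are assumptions made here for illustration. The first function computes the fraction of a prioritized list that must be screened to reach a target recall (the kind of figure reported as "95% recall after 40% screened"). The second treats the distance back to the last few relevant documents as a negative binomial observation and uses the resulting rate to guess how many relevant documents remain unscreened.

```python
import math


def fraction_screened_at_recall(labels, target_recall=0.95):
    """Fraction of the list screened when target recall is first reached.

    labels: outcomes in screening order, True = relevant, False = irrelevant.
    """
    total_relevant = sum(labels)
    if total_relevant == 0:
        raise ValueError("no relevant documents in the list")
    needed = math.ceil(target_recall * total_relevant)
    found = 0
    for position, is_relevant in enumerate(labels, start=1):
        found += is_relevant
        if found >= needed:
            return position / len(labels)


def estimate_recall(labels_so_far, unscreened_count, lookback=2):
    """Rough recall estimate from the recent rate of relevant documents.

    Assumption (hypothetical simplification, not the paper's estimator):
    the number of documents screened back to the `lookback`-th most recent
    relevant one is a negative binomial observation, so the maximum
    likelihood rate is lookback / gap. That rate is applied to the
    unscreened documents to estimate how many relevant ones remain.
    """
    if lookback < 1:
        raise ValueError("lookback must be at least 1")
    found = sum(labels_so_far)
    if found < lookback:
        return None  # too few relevant documents to estimate a rate
    seen = 0
    gap = 0
    # Walk backwards until `lookback` relevant documents have been counted.
    for is_relevant in reversed(labels_so_far):
        gap += 1
        seen += is_relevant
        if seen == lookback:
            break
    rate = lookback / gap
    expected_remaining = rate * unscreened_count
    return found / (found + expected_remaining)
```

If prioritization works and relevant documents become rarer further down the list, the recent rate likely overstates the rate among unscreened documents, so an estimate of this kind would tend to understate recall. A reviewer could stop once the estimate exceeds a chosen threshold such as 0.95.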
ISSN: 0160-4120, 1873-6750
DOI: 10.1016/j.envint.2020.105623