Streamlining Quality Review of Mass Spectrometry Data in the Clinical Laboratory by Use of Machine Learning

Bibliographic details
Published in:	Archives of Pathology & Laboratory Medicine (1976), 2019-08, Vol. 143 (8), p. 990-998
Main authors:	Yu, Min; Bazydlo, Lindsay A L; Bruns, David E; Harrison, James H, Jr
Format:	Article
Language:	English
Subjects:	Algorithms; Artificial intelligence; Automation; Automation, Laboratory - methods; Automation, Laboratory - standards; Bioinformatics; Biomedical laboratories; Cancer; Cannabinoids; Chromatography; Classification; Clinical Laboratory Techniques - methods; Clinical Laboratory Techniques - standards; Data mining; Datasets; Dronabinol - analogs & derivatives; Dronabinol - urine; Gas chromatography; Gas Chromatography-Mass Spectrometry; Humans; Laboratories; Learning algorithms; Learning strategies; Libraries; Liquid chromatography; Machine Learning; Mass spectrometry; Mass Spectrometry - methods; Mass Spectrometry - standards; Mass spectroscopy; Production management; Prostate; Proteins; Proteomics; Quality standards; Ratios; Reference Standards; Reproducibility of Results; Retention; Retrospective Studies; Reviews; Scientific imaging; Spectroscopy; Urine
Online access:	Full text

Description
Summary:	Turnaround time and productivity of clinical mass spectrometric (MS) testing are hampered by time-consuming manual review of the analytical quality of MS data before release of patient results. The objective was to determine whether a classification model created by using standard machine learning algorithms can verify analytically acceptable MS results and thereby reduce manual review requirements. We obtained retrospective data from gas chromatography-MS analyses of 11-nor-9-carboxy-delta-9-tetrahydrocannabinol (THC-COOH) in 1267 urine samples. The data for each sample had previously been labeled as either analytically acceptable or unacceptable by manual review. The dataset was randomly split into training and test sets (848 and 419 samples, respectively), maintaining the same proportions of acceptable (90%) and unacceptable (10%) results in each set. We used stratified 10-fold cross-validation to assess the ability of 6 supervised machine learning algorithms to distinguish unacceptable from acceptable assay results in the training dataset. The classifier with the highest recall was used to build a final model, and its performance was evaluated against the test dataset. In comparison testing of the 6 classifiers, a model based on the support vector machine algorithm yielded the highest recall and acceptable precision. After optimization, this model correctly identified all unacceptable results in the test dataset (100% recall) with a precision of 81%. Automated data review thus identified all analytically unacceptable assays in the test dataset while reducing the manual review requirement by about 87%. This automation strategy can focus manual review on only those assays likely to be problematic, allowing improved throughput and turnaround time without reducing quality.
ISSN:	0003-9985, 1543-2165
DOI:	10.5858/arpa.2018-0238-OA
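
Note on the reported figures: the summary gives a test set of 419 samples, about 10% unacceptable results, 100% recall, and 81% precision, and reports a reduction in manual review of about 87%. The short Python sketch below shows how these quantities relate. It is not the authors' code. The function name review_workload and the rounding of counts are assumptions made for illustration, and because the record does not give exact counts, the result only approximates the published figure.

```python
def review_workload(n_test, unacceptable_fraction, recall, precision):
    """Estimate how many test-set results a classifier flags for manual review.

    Hypothetical helper: counts are rounded to whole samples, which is an
    assumption, since the catalog record does not report the exact counts.
    """
    # Number of truly unacceptable results in the test set.
    n_unacceptable = round(n_test * unacceptable_fraction)
    # Unacceptable results that the classifier catches (true positives).
    true_positives = round(n_unacceptable * recall)
    # precision = true_positives / flagged, so flagged = true_positives / precision.
    flagged = round(true_positives / precision)
    # Only flagged results go to manual review; the rest are auto-verified.
    reduction = 1 - flagged / n_test
    return n_unacceptable, flagged, reduction


n_bad, flagged, reduction = review_workload(
    n_test=419, unacceptable_fraction=0.10, recall=1.0, precision=0.81
)
print(f"unacceptable results: {n_bad}")
print(f"flagged for manual review: {flagged}")
print(f"estimated reduction in manual review: {reduction:.1%}")
```

Under these assumptions, roughly one result in eight would still be reviewed by hand, which is in line with the reduction of about 87% stated in the summary.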