Revealing Unknown Controlled Substances and New Psychoactive Substances Using High-Resolution LC–MS-MS Machine Learning Models and the Hybrid Similarity Search Algorithm

Abstract High-resolution liquid chromatography–tandem mass spectrometry (LC--MS-MS)-based machine learning models are constructed to address the analytical challenge of identifying unknown controlled substances and new psychoactive substances (NPSs). Using a training set composed of 770 LC–MS-MS bar...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of analytical toxicology 2022-08, Vol.46 (7), p.732-742
Hauptverfasser: Lee, So Yeon, Lee, Sang Tak, Suh, Sungill, Ko, Bum Jun, Oh, Han Bin
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 742
container_issue 7
container_start_page 732
container_title Journal of analytical toxicology
container_volume 46
creator Lee, So Yeon
Lee, Sang Tak
Suh, Sungill
Ko, Bum Jun
Oh, Han Bin
description Abstract High-resolution liquid chromatography–tandem mass spectrometry (LC--MS-MS)-based machine learning models are constructed to address the analytical challenge of identifying unknown controlled substances and new psychoactive substances (NPSs). Using a training set composed of 770 LC–MS-MS barcode spectra (with binary entries 0 or 1) obtained generally by high-resolution mass spectrometers, three classification machine learning models were generated and evaluated. The three models are artificial neural network (ANN), support vector machine (SVM) and k-nearest neighbor (k-NN) models. In these models, controlled substances and NPSs were classified into 13 subgroups (benzylpiperazine, opiate, benzodiazepine, amphetamine, cocaine, methcathinone, classical cannabinoid, fentanyl, 2C series, indazole carbonyl compound, indole carbonyl compound, phencyclidine and others). Using 193 LC–MS-MS barcode spectra as an external test set, accuracy of the ANN, SVM and k-NN models were evaluated as 72.5%, 90.0% and 94.3%, respectively. Also, the hybrid similarity search (HSS) algorithm was evaluated to examine whether this algorithm can successfully identify unknown controlled substances and NPSs whose data are unavailable in the database. When only 24 representative LC–MS-MS spectra of controlled substances and NPSs were selectively included in the database, it was found that HSS can successfully identify compounds with high reliability. The machine learning models and HSS algorithms are incorporated into our home-coded artificial intelligence screener for narcotic drugs and psychotropic substances standalone software that is equipped with a graphic user interface. The use of this software allows unknown controlled substances and NPSs to be identified in a convenient manner.
doi_str_mv 10.1093/jat/bkab098
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2571053220</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><oup_id>10.1093/jat/bkab098</oup_id><sourcerecordid>2571053220</sourcerecordid><originalsourceid>FETCH-LOGICAL-c297t-805568c0782105a08971183acc6afb44ec97a4474e3cd31b5584689918dd738f3</originalsourceid><addsrcrecordid>eNp9kUFO4zAUQC3ESJQOKy7gFUJCATt2EmeJKqBILTNq6TpynN_G4NrFdoq6mzvMMbgVJyFVWcxqVl9f_-n9xUPonJJrSkp28yLjTf0qa1KKIzSgJc-SlBN2jAaE8jzhRU5O0GkIL4TQXORsgD5msAVptF3hhX217t3ikbPRO2OgwfOuDlFaBQFL2-AneMe_w061Tqqot_DvfRH2jrFetckMgjNd1M7iyejzz9_pPJnO8VSqVlvAE5De7tmpa8AcxLEFPN7VXvcv9Vob6XXc4XlPqhbfmpXr93b9E_1YShPg7HsO0eL-7nk0Tia_Hh5Ht5NEpWURE0GyLBeKFCKlJJNElAWlgkmlcrmsOQdVFpLzggNTDaN1lgmei7KkomkKJpZsiC4P3o13bx2EWK11UGCMtOC6UKVZ0YtZmpIevTqgyrsQPCyrjddr6XcVJdU-SdUnqb6T9PTFgXbd5r_gF1W_kBU</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2571053220</pqid></control><display><type>article</type><title>Revealing Unknown Controlled Substances and New Psychoactive Substances Using High-Resolution LC–MS-MS Machine Learning Models and the Hybrid Similarity Search Algorithm</title><source>Oxford University Press Journals All Titles (1996-Current)</source><source>EZB-FREE-00999 freely available EZB journals</source><source>Alma/SFX Local Collection</source><creator>Lee, So Yeon ; Lee, Sang Tak ; Suh, Sungill ; Ko, Bum Jun ; Oh, Han Bin</creator><creatorcontrib>Lee, So Yeon ; Lee, Sang Tak ; Suh, Sungill ; Ko, Bum Jun ; Oh, Han Bin</creatorcontrib><description>Abstract High-resolution liquid chromatography–tandem mass spectrometry (LC--MS-MS)-based machine learning models are constructed to address the analytical challenge of identifying unknown controlled substances and new psychoactive substances (NPSs). Using a training set composed of 770 LC–MS-MS barcode spectra (with binary entries 0 or 1) obtained generally by high-resolution mass spectrometers, three classification machine learning models were generated and evaluated. The three models are artificial neural network (ANN), support vector machine (SVM) and k-nearest neighbor (k-NN) models. In these models, controlled substances and NPSs were classified into 13 subgroups (benzylpiperazine, opiate, benzodiazepine, amphetamine, cocaine, methcathinone, classical cannabinoid, fentanyl, 2C series, indazole carbonyl compound, indole carbonyl compound, phencyclidine and others). Using 193 LC–MS-MS barcode spectra as an external test set, accuracy of the ANN, SVM and k-NN models were evaluated as 72.5%, 90.0% and 94.3%, respectively. Also, the hybrid similarity search (HSS) algorithm was evaluated to examine whether this algorithm can successfully identify unknown controlled substances and NPSs whose data are unavailable in the database. When only 24 representative LC–MS-MS spectra of controlled substances and NPSs were selectively included in the database, it was found that HSS can successfully identify compounds with high reliability. The machine learning models and HSS algorithms are incorporated into our home-coded artificial intelligence screener for narcotic drugs and psychotropic substances standalone software that is equipped with a graphic user interface. The use of this software allows unknown controlled substances and NPSs to be identified in a convenient manner.</description><identifier>ISSN: 0146-4760</identifier><identifier>EISSN: 1945-2403</identifier><identifier>DOI: 10.1093/jat/bkab098</identifier><language>eng</language><publisher>US: Oxford University Press</publisher><ispartof>Journal of analytical toxicology, 2022-08, Vol.46 (7), p.732-742</ispartof><rights>The Author(s) 2021. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com 2021</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c297t-805568c0782105a08971183acc6afb44ec97a4474e3cd31b5584689918dd738f3</citedby><cites>FETCH-LOGICAL-c297t-805568c0782105a08971183acc6afb44ec97a4474e3cd31b5584689918dd738f3</cites><orcidid>0000-0001-5900-2623</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,1584,27924,27925</link.rule.ids></links><search><creatorcontrib>Lee, So Yeon</creatorcontrib><creatorcontrib>Lee, Sang Tak</creatorcontrib><creatorcontrib>Suh, Sungill</creatorcontrib><creatorcontrib>Ko, Bum Jun</creatorcontrib><creatorcontrib>Oh, Han Bin</creatorcontrib><title>Revealing Unknown Controlled Substances and New Psychoactive Substances Using High-Resolution LC–MS-MS Machine Learning Models and the Hybrid Similarity Search Algorithm</title><title>Journal of analytical toxicology</title><description>Abstract High-resolution liquid chromatography–tandem mass spectrometry (LC--MS-MS)-based machine learning models are constructed to address the analytical challenge of identifying unknown controlled substances and new psychoactive substances (NPSs). Using a training set composed of 770 LC–MS-MS barcode spectra (with binary entries 0 or 1) obtained generally by high-resolution mass spectrometers, three classification machine learning models were generated and evaluated. The three models are artificial neural network (ANN), support vector machine (SVM) and k-nearest neighbor (k-NN) models. In these models, controlled substances and NPSs were classified into 13 subgroups (benzylpiperazine, opiate, benzodiazepine, amphetamine, cocaine, methcathinone, classical cannabinoid, fentanyl, 2C series, indazole carbonyl compound, indole carbonyl compound, phencyclidine and others). Using 193 LC–MS-MS barcode spectra as an external test set, accuracy of the ANN, SVM and k-NN models were evaluated as 72.5%, 90.0% and 94.3%, respectively. Also, the hybrid similarity search (HSS) algorithm was evaluated to examine whether this algorithm can successfully identify unknown controlled substances and NPSs whose data are unavailable in the database. When only 24 representative LC–MS-MS spectra of controlled substances and NPSs were selectively included in the database, it was found that HSS can successfully identify compounds with high reliability. The machine learning models and HSS algorithms are incorporated into our home-coded artificial intelligence screener for narcotic drugs and psychotropic substances standalone software that is equipped with a graphic user interface. The use of this software allows unknown controlled substances and NPSs to be identified in a convenient manner.</description><issn>0146-4760</issn><issn>1945-2403</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><recordid>eNp9kUFO4zAUQC3ESJQOKy7gFUJCATt2EmeJKqBILTNq6TpynN_G4NrFdoq6mzvMMbgVJyFVWcxqVl9f_-n9xUPonJJrSkp28yLjTf0qa1KKIzSgJc-SlBN2jAaE8jzhRU5O0GkIL4TQXORsgD5msAVptF3hhX217t3ikbPRO2OgwfOuDlFaBQFL2-AneMe_w061Tqqot_DvfRH2jrFetckMgjNd1M7iyejzz9_pPJnO8VSqVlvAE5De7tmpa8AcxLEFPN7VXvcv9Vob6XXc4XlPqhbfmpXr93b9E_1YShPg7HsO0eL-7nk0Tia_Hh5Ht5NEpWURE0GyLBeKFCKlJJNElAWlgkmlcrmsOQdVFpLzggNTDaN1lgmei7KkomkKJpZsiC4P3o13bx2EWK11UGCMtOC6UKVZ0YtZmpIevTqgyrsQPCyrjddr6XcVJdU-SdUnqb6T9PTFgXbd5r_gF1W_kBU</recordid><startdate>20220813</startdate><enddate>20220813</enddate><creator>Lee, So Yeon</creator><creator>Lee, Sang Tak</creator><creator>Suh, Sungill</creator><creator>Ko, Bum Jun</creator><creator>Oh, Han Bin</creator><general>Oxford University Press</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0001-5900-2623</orcidid></search><sort><creationdate>20220813</creationdate><title>Revealing Unknown Controlled Substances and New Psychoactive Substances Using High-Resolution LC–MS-MS Machine Learning Models and the Hybrid Similarity Search Algorithm</title><author>Lee, So Yeon ; Lee, Sang Tak ; Suh, Sungill ; Ko, Bum Jun ; Oh, Han Bin</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c297t-805568c0782105a08971183acc6afb44ec97a4474e3cd31b5584689918dd738f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Lee, So Yeon</creatorcontrib><creatorcontrib>Lee, Sang Tak</creatorcontrib><creatorcontrib>Suh, Sungill</creatorcontrib><creatorcontrib>Ko, Bum Jun</creatorcontrib><creatorcontrib>Oh, Han Bin</creatorcontrib><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Journal of analytical toxicology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Lee, So Yeon</au><au>Lee, Sang Tak</au><au>Suh, Sungill</au><au>Ko, Bum Jun</au><au>Oh, Han Bin</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Revealing Unknown Controlled Substances and New Psychoactive Substances Using High-Resolution LC–MS-MS Machine Learning Models and the Hybrid Similarity Search Algorithm</atitle><jtitle>Journal of analytical toxicology</jtitle><date>2022-08-13</date><risdate>2022</risdate><volume>46</volume><issue>7</issue><spage>732</spage><epage>742</epage><pages>732-742</pages><issn>0146-4760</issn><eissn>1945-2403</eissn><abstract>Abstract High-resolution liquid chromatography–tandem mass spectrometry (LC--MS-MS)-based machine learning models are constructed to address the analytical challenge of identifying unknown controlled substances and new psychoactive substances (NPSs). Using a training set composed of 770 LC–MS-MS barcode spectra (with binary entries 0 or 1) obtained generally by high-resolution mass spectrometers, three classification machine learning models were generated and evaluated. The three models are artificial neural network (ANN), support vector machine (SVM) and k-nearest neighbor (k-NN) models. In these models, controlled substances and NPSs were classified into 13 subgroups (benzylpiperazine, opiate, benzodiazepine, amphetamine, cocaine, methcathinone, classical cannabinoid, fentanyl, 2C series, indazole carbonyl compound, indole carbonyl compound, phencyclidine and others). Using 193 LC–MS-MS barcode spectra as an external test set, accuracy of the ANN, SVM and k-NN models were evaluated as 72.5%, 90.0% and 94.3%, respectively. Also, the hybrid similarity search (HSS) algorithm was evaluated to examine whether this algorithm can successfully identify unknown controlled substances and NPSs whose data are unavailable in the database. When only 24 representative LC–MS-MS spectra of controlled substances and NPSs were selectively included in the database, it was found that HSS can successfully identify compounds with high reliability. The machine learning models and HSS algorithms are incorporated into our home-coded artificial intelligence screener for narcotic drugs and psychotropic substances standalone software that is equipped with a graphic user interface. The use of this software allows unknown controlled substances and NPSs to be identified in a convenient manner.</abstract><cop>US</cop><pub>Oxford University Press</pub><doi>10.1093/jat/bkab098</doi><tpages>11</tpages><orcidid>https://orcid.org/0000-0001-5900-2623</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0146-4760
ispartof Journal of analytical toxicology, 2022-08, Vol.46 (7), p.732-742
issn 0146-4760
1945-2403
language eng
recordid cdi_proquest_miscellaneous_2571053220
source Oxford University Press Journals All Titles (1996-Current); EZB-FREE-00999 freely available EZB journals; Alma/SFX Local Collection
title Revealing Unknown Controlled Substances and New Psychoactive Substances Using High-Resolution LC–MS-MS Machine Learning Models and the Hybrid Similarity Search Algorithm
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T13%3A00%3A12IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Revealing%20Unknown%20Controlled%20Substances%20and%20New%20Psychoactive%20Substances%20Using%20High-Resolution%20LC%E2%80%93MS-MS%20Machine%20Learning%20Models%20and%20the%20Hybrid%20Similarity%20Search%20Algorithm&rft.jtitle=Journal%20of%20analytical%20toxicology&rft.au=Lee,%20So%20Yeon&rft.date=2022-08-13&rft.volume=46&rft.issue=7&rft.spage=732&rft.epage=742&rft.pages=732-742&rft.issn=0146-4760&rft.eissn=1945-2403&rft_id=info:doi/10.1093/jat/bkab098&rft_dat=%3Cproquest_cross%3E2571053220%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2571053220&rft_id=info:pmid/&rft_oup_id=10.1093/jat/bkab098&rfr_iscdi=true