Revealing Unknown Controlled Substances and New Psychoactive Substances Using High-Resolution LC–MS-MS Machine Learning Models and the Hybrid Similarity Search Algorithm
Abstract High-resolution liquid chromatography–tandem mass spectrometry (LC--MS-MS)-based machine learning models are constructed to address the analytical challenge of identifying unknown controlled substances and new psychoactive substances (NPSs). Using a training set composed of 770 LC–MS-MS bar...
Gespeichert in:
Veröffentlicht in: | Journal of analytical toxicology 2022-08, Vol.46 (7), p.732-742 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 742 |
---|---|
container_issue | 7 |
container_start_page | 732 |
container_title | Journal of analytical toxicology |
container_volume | 46 |
creator | Lee, So Yeon Lee, Sang Tak Suh, Sungill Ko, Bum Jun Oh, Han Bin |
description | Abstract
High-resolution liquid chromatography–tandem mass spectrometry (LC--MS-MS)-based machine learning models are constructed to address the analytical challenge of identifying unknown controlled substances and new psychoactive substances (NPSs). Using a training set composed of 770 LC–MS-MS barcode spectra (with binary entries 0 or 1) obtained generally by high-resolution mass spectrometers, three classification machine learning models were generated and evaluated. The three models are artificial neural network (ANN), support vector machine (SVM) and k-nearest neighbor (k-NN) models. In these models, controlled substances and NPSs were classified into 13 subgroups (benzylpiperazine, opiate, benzodiazepine, amphetamine, cocaine, methcathinone, classical cannabinoid, fentanyl, 2C series, indazole carbonyl compound, indole carbonyl compound, phencyclidine and others). Using 193 LC–MS-MS barcode spectra as an external test set, accuracy of the ANN, SVM and k-NN models were evaluated as 72.5%, 90.0% and 94.3%, respectively. Also, the hybrid similarity search (HSS) algorithm was evaluated to examine whether this algorithm can successfully identify unknown controlled substances and NPSs whose data are unavailable in the database. When only 24 representative LC–MS-MS spectra of controlled substances and NPSs were selectively included in the database, it was found that HSS can successfully identify compounds with high reliability. The machine learning models and HSS algorithms are incorporated into our home-coded artificial intelligence screener for narcotic drugs and psychotropic substances standalone software that is equipped with a graphic user interface. The use of this software allows unknown controlled substances and NPSs to be identified in a convenient manner. |
doi_str_mv | 10.1093/jat/bkab098 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2571053220</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><oup_id>10.1093/jat/bkab098</oup_id><sourcerecordid>2571053220</sourcerecordid><originalsourceid>FETCH-LOGICAL-c297t-805568c0782105a08971183acc6afb44ec97a4474e3cd31b5584689918dd738f3</originalsourceid><addsrcrecordid>eNp9kUFO4zAUQC3ESJQOKy7gFUJCATt2EmeJKqBILTNq6TpynN_G4NrFdoq6mzvMMbgVJyFVWcxqVl9f_-n9xUPonJJrSkp28yLjTf0qa1KKIzSgJc-SlBN2jAaE8jzhRU5O0GkIL4TQXORsgD5msAVptF3hhX217t3ikbPRO2OgwfOuDlFaBQFL2-AneMe_w061Tqqot_DvfRH2jrFetckMgjNd1M7iyejzz9_pPJnO8VSqVlvAE5De7tmpa8AcxLEFPN7VXvcv9Vob6XXc4XlPqhbfmpXr93b9E_1YShPg7HsO0eL-7nk0Tia_Hh5Ht5NEpWURE0GyLBeKFCKlJJNElAWlgkmlcrmsOQdVFpLzggNTDaN1lgmei7KkomkKJpZsiC4P3o13bx2EWK11UGCMtOC6UKVZ0YtZmpIevTqgyrsQPCyrjddr6XcVJdU-SdUnqb6T9PTFgXbd5r_gF1W_kBU</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2571053220</pqid></control><display><type>article</type><title>Revealing Unknown Controlled Substances and New Psychoactive Substances Using High-Resolution LC–MS-MS Machine Learning Models and the Hybrid Similarity Search Algorithm</title><source>Oxford University Press Journals All Titles (1996-Current)</source><source>EZB-FREE-00999 freely available EZB journals</source><source>Alma/SFX Local Collection</source><creator>Lee, So Yeon ; Lee, Sang Tak ; Suh, Sungill ; Ko, Bum Jun ; Oh, Han Bin</creator><creatorcontrib>Lee, So Yeon ; Lee, Sang Tak ; Suh, Sungill ; Ko, Bum Jun ; Oh, Han Bin</creatorcontrib><description>Abstract
High-resolution liquid chromatography–tandem mass spectrometry (LC--MS-MS)-based machine learning models are constructed to address the analytical challenge of identifying unknown controlled substances and new psychoactive substances (NPSs). Using a training set composed of 770 LC–MS-MS barcode spectra (with binary entries 0 or 1) obtained generally by high-resolution mass spectrometers, three classification machine learning models were generated and evaluated. The three models are artificial neural network (ANN), support vector machine (SVM) and k-nearest neighbor (k-NN) models. In these models, controlled substances and NPSs were classified into 13 subgroups (benzylpiperazine, opiate, benzodiazepine, amphetamine, cocaine, methcathinone, classical cannabinoid, fentanyl, 2C series, indazole carbonyl compound, indole carbonyl compound, phencyclidine and others). Using 193 LC–MS-MS barcode spectra as an external test set, accuracy of the ANN, SVM and k-NN models were evaluated as 72.5%, 90.0% and 94.3%, respectively. Also, the hybrid similarity search (HSS) algorithm was evaluated to examine whether this algorithm can successfully identify unknown controlled substances and NPSs whose data are unavailable in the database. When only 24 representative LC–MS-MS spectra of controlled substances and NPSs were selectively included in the database, it was found that HSS can successfully identify compounds with high reliability. The machine learning models and HSS algorithms are incorporated into our home-coded artificial intelligence screener for narcotic drugs and psychotropic substances standalone software that is equipped with a graphic user interface. The use of this software allows unknown controlled substances and NPSs to be identified in a convenient manner.</description><identifier>ISSN: 0146-4760</identifier><identifier>EISSN: 1945-2403</identifier><identifier>DOI: 10.1093/jat/bkab098</identifier><language>eng</language><publisher>US: Oxford University Press</publisher><ispartof>Journal of analytical toxicology, 2022-08, Vol.46 (7), p.732-742</ispartof><rights>The Author(s) 2021. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com 2021</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c297t-805568c0782105a08971183acc6afb44ec97a4474e3cd31b5584689918dd738f3</citedby><cites>FETCH-LOGICAL-c297t-805568c0782105a08971183acc6afb44ec97a4474e3cd31b5584689918dd738f3</cites><orcidid>0000-0001-5900-2623</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,1584,27924,27925</link.rule.ids></links><search><creatorcontrib>Lee, So Yeon</creatorcontrib><creatorcontrib>Lee, Sang Tak</creatorcontrib><creatorcontrib>Suh, Sungill</creatorcontrib><creatorcontrib>Ko, Bum Jun</creatorcontrib><creatorcontrib>Oh, Han Bin</creatorcontrib><title>Revealing Unknown Controlled Substances and New Psychoactive Substances Using High-Resolution LC–MS-MS Machine Learning Models and the Hybrid Similarity Search Algorithm</title><title>Journal of analytical toxicology</title><description>Abstract
High-resolution liquid chromatography–tandem mass spectrometry (LC--MS-MS)-based machine learning models are constructed to address the analytical challenge of identifying unknown controlled substances and new psychoactive substances (NPSs). Using a training set composed of 770 LC–MS-MS barcode spectra (with binary entries 0 or 1) obtained generally by high-resolution mass spectrometers, three classification machine learning models were generated and evaluated. The three models are artificial neural network (ANN), support vector machine (SVM) and k-nearest neighbor (k-NN) models. In these models, controlled substances and NPSs were classified into 13 subgroups (benzylpiperazine, opiate, benzodiazepine, amphetamine, cocaine, methcathinone, classical cannabinoid, fentanyl, 2C series, indazole carbonyl compound, indole carbonyl compound, phencyclidine and others). Using 193 LC–MS-MS barcode spectra as an external test set, accuracy of the ANN, SVM and k-NN models were evaluated as 72.5%, 90.0% and 94.3%, respectively. Also, the hybrid similarity search (HSS) algorithm was evaluated to examine whether this algorithm can successfully identify unknown controlled substances and NPSs whose data are unavailable in the database. When only 24 representative LC–MS-MS spectra of controlled substances and NPSs were selectively included in the database, it was found that HSS can successfully identify compounds with high reliability. The machine learning models and HSS algorithms are incorporated into our home-coded artificial intelligence screener for narcotic drugs and psychotropic substances standalone software that is equipped with a graphic user interface. The use of this software allows unknown controlled substances and NPSs to be identified in a convenient manner.</description><issn>0146-4760</issn><issn>1945-2403</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><recordid>eNp9kUFO4zAUQC3ESJQOKy7gFUJCATt2EmeJKqBILTNq6TpynN_G4NrFdoq6mzvMMbgVJyFVWcxqVl9f_-n9xUPonJJrSkp28yLjTf0qa1KKIzSgJc-SlBN2jAaE8jzhRU5O0GkIL4TQXORsgD5msAVptF3hhX217t3ikbPRO2OgwfOuDlFaBQFL2-AneMe_w061Tqqot_DvfRH2jrFetckMgjNd1M7iyejzz9_pPJnO8VSqVlvAE5De7tmpa8AcxLEFPN7VXvcv9Vob6XXc4XlPqhbfmpXr93b9E_1YShPg7HsO0eL-7nk0Tia_Hh5Ht5NEpWURE0GyLBeKFCKlJJNElAWlgkmlcrmsOQdVFpLzggNTDaN1lgmei7KkomkKJpZsiC4P3o13bx2EWK11UGCMtOC6UKVZ0YtZmpIevTqgyrsQPCyrjddr6XcVJdU-SdUnqb6T9PTFgXbd5r_gF1W_kBU</recordid><startdate>20220813</startdate><enddate>20220813</enddate><creator>Lee, So Yeon</creator><creator>Lee, Sang Tak</creator><creator>Suh, Sungill</creator><creator>Ko, Bum Jun</creator><creator>Oh, Han Bin</creator><general>Oxford University Press</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0001-5900-2623</orcidid></search><sort><creationdate>20220813</creationdate><title>Revealing Unknown Controlled Substances and New Psychoactive Substances Using High-Resolution LC–MS-MS Machine Learning Models and the Hybrid Similarity Search Algorithm</title><author>Lee, So Yeon ; Lee, Sang Tak ; Suh, Sungill ; Ko, Bum Jun ; Oh, Han Bin</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c297t-805568c0782105a08971183acc6afb44ec97a4474e3cd31b5584689918dd738f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Lee, So Yeon</creatorcontrib><creatorcontrib>Lee, Sang Tak</creatorcontrib><creatorcontrib>Suh, Sungill</creatorcontrib><creatorcontrib>Ko, Bum Jun</creatorcontrib><creatorcontrib>Oh, Han Bin</creatorcontrib><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Journal of analytical toxicology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Lee, So Yeon</au><au>Lee, Sang Tak</au><au>Suh, Sungill</au><au>Ko, Bum Jun</au><au>Oh, Han Bin</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Revealing Unknown Controlled Substances and New Psychoactive Substances Using High-Resolution LC–MS-MS Machine Learning Models and the Hybrid Similarity Search Algorithm</atitle><jtitle>Journal of analytical toxicology</jtitle><date>2022-08-13</date><risdate>2022</risdate><volume>46</volume><issue>7</issue><spage>732</spage><epage>742</epage><pages>732-742</pages><issn>0146-4760</issn><eissn>1945-2403</eissn><abstract>Abstract
High-resolution liquid chromatography–tandem mass spectrometry (LC--MS-MS)-based machine learning models are constructed to address the analytical challenge of identifying unknown controlled substances and new psychoactive substances (NPSs). Using a training set composed of 770 LC–MS-MS barcode spectra (with binary entries 0 or 1) obtained generally by high-resolution mass spectrometers, three classification machine learning models were generated and evaluated. The three models are artificial neural network (ANN), support vector machine (SVM) and k-nearest neighbor (k-NN) models. In these models, controlled substances and NPSs were classified into 13 subgroups (benzylpiperazine, opiate, benzodiazepine, amphetamine, cocaine, methcathinone, classical cannabinoid, fentanyl, 2C series, indazole carbonyl compound, indole carbonyl compound, phencyclidine and others). Using 193 LC–MS-MS barcode spectra as an external test set, accuracy of the ANN, SVM and k-NN models were evaluated as 72.5%, 90.0% and 94.3%, respectively. Also, the hybrid similarity search (HSS) algorithm was evaluated to examine whether this algorithm can successfully identify unknown controlled substances and NPSs whose data are unavailable in the database. When only 24 representative LC–MS-MS spectra of controlled substances and NPSs were selectively included in the database, it was found that HSS can successfully identify compounds with high reliability. The machine learning models and HSS algorithms are incorporated into our home-coded artificial intelligence screener for narcotic drugs and psychotropic substances standalone software that is equipped with a graphic user interface. The use of this software allows unknown controlled substances and NPSs to be identified in a convenient manner.</abstract><cop>US</cop><pub>Oxford University Press</pub><doi>10.1093/jat/bkab098</doi><tpages>11</tpages><orcidid>https://orcid.org/0000-0001-5900-2623</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0146-4760 |
ispartof | Journal of analytical toxicology, 2022-08, Vol.46 (7), p.732-742 |
issn | 0146-4760 1945-2403 |
language | eng |
recordid | cdi_proquest_miscellaneous_2571053220 |
source | Oxford University Press Journals All Titles (1996-Current); EZB-FREE-00999 freely available EZB journals; Alma/SFX Local Collection |
title | Revealing Unknown Controlled Substances and New Psychoactive Substances Using High-Resolution LC–MS-MS Machine Learning Models and the Hybrid Similarity Search Algorithm |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T13%3A00%3A12IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Revealing%20Unknown%20Controlled%20Substances%20and%20New%20Psychoactive%20Substances%20Using%20High-Resolution%20LC%E2%80%93MS-MS%20Machine%20Learning%20Models%20and%20the%20Hybrid%20Similarity%20Search%20Algorithm&rft.jtitle=Journal%20of%20analytical%20toxicology&rft.au=Lee,%20So%20Yeon&rft.date=2022-08-13&rft.volume=46&rft.issue=7&rft.spage=732&rft.epage=742&rft.pages=732-742&rft.issn=0146-4760&rft.eissn=1945-2403&rft_id=info:doi/10.1093/jat/bkab098&rft_dat=%3Cproquest_cross%3E2571053220%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2571053220&rft_id=info:pmid/&rft_oup_id=10.1093/jat/bkab098&rfr_iscdi=true |