K 1 K 2 NN: A novel multi-label classification approach based on neighbors for predicting COVID-19 drug side effects

COVID-19, a novel ailment, has received comparatively fewer drugs for its treatment. Side Effects (SE) of a COVID-19 drug could cause long-term health issues. Hence, SE prediction is essential in COVID-19 drug development. Efficient models are also needed to predict COVID-19 drug SE since most exist...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Computational biology and chemistry 2024-04, Vol.110, p.108066
Hauptverfasser:	Das, Pranab, Mazumder, Dilwar Hussain
Format:	Artikel
Sprache:	eng
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page	108066
container_title	Computational biology and chemistry
container_volume	110
creator	Das, Pranab Mazumder, Dilwar Hussain
description	COVID-19, a novel ailment, has received comparatively fewer drugs for its treatment. Side Effects (SE) of a COVID-19 drug could cause long-term health issues. Hence, SE prediction is essential in COVID-19 drug development. Efficient models are also needed to predict COVID-19 drug SE since most existing research has proposed many classifiers to predict SE for diseases other than COVID-19. This work proposes a novel classifier based on neighbors named K K Nearest Neighbors (K K NN) to predict the SE of the COVID-19 drug from 17 molecules' descriptors and the chemical 1D structure of the drugs. The model is implemented based on the proposition that chemically similar drugs may be assigned similar drug SE, and co-occurring SE may be assigned to chemically similar drugs. The K K NN model chooses the first K neighbors to the test drug sample by calculating its similarity with the train drug samples. It then assigns the test sample with the SE label having the majority count on the SE labels of these K neighbor drugs obtained through a voting mechanism. The model then calculates the SE-SE similarity using the Jaccard similarity measure from the SE co-occurrence values. Finally, the model chooses the most similar K SE neighbors for those SE determined by the K neighbor drugs and assigns these SE to that test drug sample. The proposed K K NN model has showcased promising performance with the highest accuracy of 97.53% on chemical 1D drug structure and outperforms the state-of-the-art multi-label classifiers. In addition, we demonstrate the successful application of the proposed model on gene expression signature datasets, which aided in evaluating its performance and confirming its accuracy and robustness.
doi_str_mv	10.1016/j.compbiolchem.2024.108066
format	Article
fullrecord	<record><control><sourceid>pubmed</sourceid><recordid>TN_cdi_pubmed_primary_38579549</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>38579549</sourcerecordid><originalsourceid>FETCH-pubmed_primary_385795493</originalsourceid><addsrcrecordid>eNqFjktLxDAYRYMgzvj4C_LhvjVJH9O6k9FBGRg3Iu6GPNsMSROSVPDf24WuXd3LuWdxEbojuCSYtPenUngXuPFWjMqVFNN6GTrctmdoTepNW_S0-1yhy5ROGNMK4-YCraqu2fRN3a9R3gOBPVA4HB7gESb_pSy42WZTWMaXLixLyWgjWDZ-AhZC9EyMwFlSEhYyKTOM3McE2kcIUUkjspkG2L59vD4VpAcZ5wGSkQqU1krkdI3ONbNJ3fzmFbrdPb9vX4owc6fkMUTjWPw-_t2s_hV-AAPxUMI</addsrcrecordid><sourcetype>Index Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>K 1 K 2 NN: A novel multi-label classification approach based on neighbors for predicting COVID-19 drug side effects</title><source>Elsevier ScienceDirect Journals</source><creator>Das, Pranab ; Mazumder, Dilwar Hussain</creator><creatorcontrib>Das, Pranab ; Mazumder, Dilwar Hussain</creatorcontrib><description>COVID-19, a novel ailment, has received comparatively fewer drugs for its treatment. Side Effects (SE) of a COVID-19 drug could cause long-term health issues. Hence, SE prediction is essential in COVID-19 drug development. Efficient models are also needed to predict COVID-19 drug SE since most existing research has proposed many classifiers to predict SE for diseases other than COVID-19. This work proposes a novel classifier based on neighbors named K K Nearest Neighbors (K K NN) to predict the SE of the COVID-19 drug from 17 molecules' descriptors and the chemical 1D structure of the drugs. The model is implemented based on the proposition that chemically similar drugs may be assigned similar drug SE, and co-occurring SE may be assigned to chemically similar drugs. The K K NN model chooses the first K neighbors to the test drug sample by calculating its similarity with the train drug samples. It then assigns the test sample with the SE label having the majority count on the SE labels of these K neighbor drugs obtained through a voting mechanism. The model then calculates the SE-SE similarity using the Jaccard similarity measure from the SE co-occurrence values. Finally, the model chooses the most similar K SE neighbors for those SE determined by the K neighbor drugs and assigns these SE to that test drug sample. The proposed K K NN model has showcased promising performance with the highest accuracy of 97.53% on chemical 1D drug structure and outperforms the state-of-the-art multi-label classifiers. In addition, we demonstrate the successful application of the proposed model on gene expression signature datasets, which aided in evaluating its performance and confirming its accuracy and robustness.</description><identifier>EISSN: 1476-928X</identifier><identifier>DOI: 10.1016/j.compbiolchem.2024.108066</identifier><identifier>PMID: 38579549</identifier><language>eng</language><publisher>England</publisher><ispartof>Computational biology and chemistry, 2024-04, Vol.110, p.108066</ispartof><rights>Copyright © 2024 Elsevier Ltd. All rights reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/38579549$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Das, Pranab</creatorcontrib><creatorcontrib>Mazumder, Dilwar Hussain</creatorcontrib><title>K 1 K 2 NN: A novel multi-label classification approach based on neighbors for predicting COVID-19 drug side effects</title><title>Computational biology and chemistry</title><addtitle>Comput Biol Chem</addtitle><description>COVID-19, a novel ailment, has received comparatively fewer drugs for its treatment. Side Effects (SE) of a COVID-19 drug could cause long-term health issues. Hence, SE prediction is essential in COVID-19 drug development. Efficient models are also needed to predict COVID-19 drug SE since most existing research has proposed many classifiers to predict SE for diseases other than COVID-19. This work proposes a novel classifier based on neighbors named K K Nearest Neighbors (K K NN) to predict the SE of the COVID-19 drug from 17 molecules' descriptors and the chemical 1D structure of the drugs. The model is implemented based on the proposition that chemically similar drugs may be assigned similar drug SE, and co-occurring SE may be assigned to chemically similar drugs. The K K NN model chooses the first K neighbors to the test drug sample by calculating its similarity with the train drug samples. It then assigns the test sample with the SE label having the majority count on the SE labels of these K neighbor drugs obtained through a voting mechanism. The model then calculates the SE-SE similarity using the Jaccard similarity measure from the SE co-occurrence values. Finally, the model chooses the most similar K SE neighbors for those SE determined by the K neighbor drugs and assigns these SE to that test drug sample. The proposed K K NN model has showcased promising performance with the highest accuracy of 97.53% on chemical 1D drug structure and outperforms the state-of-the-art multi-label classifiers. In addition, we demonstrate the successful application of the proposed model on gene expression signature datasets, which aided in evaluating its performance and confirming its accuracy and robustness.</description><issn>1476-928X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNqFjktLxDAYRYMgzvj4C_LhvjVJH9O6k9FBGRg3Iu6GPNsMSROSVPDf24WuXd3LuWdxEbojuCSYtPenUngXuPFWjMqVFNN6GTrctmdoTepNW_S0-1yhy5ROGNMK4-YCraqu2fRN3a9R3gOBPVA4HB7gESb_pSy42WZTWMaXLixLyWgjWDZ-AhZC9EyMwFlSEhYyKTOM3McE2kcIUUkjspkG2L59vD4VpAcZ5wGSkQqU1krkdI3ONbNJ3fzmFbrdPb9vX4owc6fkMUTjWPw-_t2s_hV-AAPxUMI</recordid><startdate>20240402</startdate><enddate>20240402</enddate><creator>Das, Pranab</creator><creator>Mazumder, Dilwar Hussain</creator><scope>NPM</scope></search><sort><creationdate>20240402</creationdate><title>K 1 K 2 NN: A novel multi-label classification approach based on neighbors for predicting COVID-19 drug side effects</title><author>Das, Pranab ; Mazumder, Dilwar Hussain</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-pubmed_primary_385795493</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Das, Pranab</creatorcontrib><creatorcontrib>Mazumder, Dilwar Hussain</creatorcontrib><collection>PubMed</collection><jtitle>Computational biology and chemistry</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Das, Pranab</au><au>Mazumder, Dilwar Hussain</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>K 1 K 2 NN: A novel multi-label classification approach based on neighbors for predicting COVID-19 drug side effects</atitle><jtitle>Computational biology and chemistry</jtitle><addtitle>Comput Biol Chem</addtitle><date>2024-04-02</date><risdate>2024</risdate><volume>110</volume><spage>108066</spage><pages>108066-</pages><eissn>1476-928X</eissn><abstract>COVID-19, a novel ailment, has received comparatively fewer drugs for its treatment. Side Effects (SE) of a COVID-19 drug could cause long-term health issues. Hence, SE prediction is essential in COVID-19 drug development. Efficient models are also needed to predict COVID-19 drug SE since most existing research has proposed many classifiers to predict SE for diseases other than COVID-19. This work proposes a novel classifier based on neighbors named K K Nearest Neighbors (K K NN) to predict the SE of the COVID-19 drug from 17 molecules' descriptors and the chemical 1D structure of the drugs. The model is implemented based on the proposition that chemically similar drugs may be assigned similar drug SE, and co-occurring SE may be assigned to chemically similar drugs. The K K NN model chooses the first K neighbors to the test drug sample by calculating its similarity with the train drug samples. It then assigns the test sample with the SE label having the majority count on the SE labels of these K neighbor drugs obtained through a voting mechanism. The model then calculates the SE-SE similarity using the Jaccard similarity measure from the SE co-occurrence values. Finally, the model chooses the most similar K SE neighbors for those SE determined by the K neighbor drugs and assigns these SE to that test drug sample. The proposed K K NN model has showcased promising performance with the highest accuracy of 97.53% on chemical 1D drug structure and outperforms the state-of-the-art multi-label classifiers. In addition, we demonstrate the successful application of the proposed model on gene expression signature datasets, which aided in evaluating its performance and confirming its accuracy and robustness.</abstract><cop>England</cop><pmid>38579549</pmid><doi>10.1016/j.compbiolchem.2024.108066</doi></addata></record>
fulltext	fulltext
identifier	EISSN: 1476-928X
ispartof	Computational biology and chemistry, 2024-04, Vol.110, p.108066
issn	1476-928X
language	eng
recordid	cdi_pubmed_primary_38579549
source	Elsevier ScienceDirect Journals
title	K 1 K 2 NN: A novel multi-label classification approach based on neighbors for predicting COVID-19 drug side effects
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-05T23%3A09%3A19IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-pubmed&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=K%201%20K%202%20NN:%20A%20novel%20multi-label%20classification%20approach%20based%20on%20neighbors%20for%20predicting%20COVID-19%20drug%20side%20effects&rft.jtitle=Computational%20biology%20and%20chemistry&rft.au=Das,%20Pranab&rft.date=2024-04-02&rft.volume=110&rft.spage=108066&rft.pages=108066-&rft.eissn=1476-928X&rft_id=info:doi/10.1016/j.compbiolchem.2024.108066&rft_dat=%3Cpubmed%3E38579549%3C/pubmed%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/38579549&rfr_iscdi=true