Prediction and Optimization of NaV1.7 Sodium Channel Inhibitors Based on Machine Learning and Simulated Annealing

Although the NaV1.7 sodium channel is a promising drug target for pain, traditional screening strategies for discovery of NaV1.7 inhibitors are very painstaking and time-consuming. Herein, we aimed to build machine learning models for screening and design of potent and effective NaV1.7 sodium channe...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of chemical information and modeling 2020-06, Vol.60 (6), p.2739-2753
Hauptverfasser: Kong, Weikaixin, Tu, Xinyu, Huang, Weiran, Yang, Yang, Xie, Zhengwei, Huang, Zhuo
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 2753
container_issue 6
container_start_page 2739
container_title Journal of chemical information and modeling
container_volume 60
creator Kong, Weikaixin
Tu, Xinyu
Huang, Weiran
Yang, Yang
Xie, Zhengwei
Huang, Zhuo
description Although the NaV1.7 sodium channel is a promising drug target for pain, traditional screening strategies for discovery of NaV1.7 inhibitors are very painstaking and time-consuming. Herein, we aimed to build machine learning models for screening and design of potent and effective NaV1.7 sodium channel inhibitors. We customized the imbalanced data set from ChEMBL and BindingDB to train and filter the best classification model. Then, the whole-cell voltage-clamp was employed to validate the inhibitors. We assembled a molecular group optimization method by combining the Grammar Variational Autoencoder, classification model, and simulated annealing. We found that the RF-CDK model (random forest + CDK fingerprint) performs best in the imbalanced data set. Of the three compounds that may have inhibitory effects, nortriptyline has been experimentally verified. In the molecule optimization process, 40 molecules located in the applicability domain of RF-CDK were used as a starting point, among which 34 molecules evolved to molecules with greater molecular scores (MS). The molecule with the highest MS was derived from CHEMBL2325245. The model and method we developed for NaV1.7 inhibitors are also applicable to other targets.
doi_str_mv 10.1021/acs.jcim.9b01180
format Article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_miscellaneous_2404641660</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2404641660</sourcerecordid><originalsourceid>FETCH-LOGICAL-j282t-a2dd73fb957d298beeeea20f3b607134d0de0e563bb54b82db61d08cfd3a7b5f3</originalsourceid><addsrcrecordid>eNpdjr1PwzAUxC0EEqWwM1piYUl4thMnGUvFR6VCkQqIrXqOHeoocdo4WfjrsQos3PJOp987HSGXDGIGnN1g6eO6tG1cKGAshyMyYWlSRIWEj-M_nxbylJx5XwMIUUg-IfuX3mhbDrZzFJ2mq91gW_uFh6Cr6DO-szij607bsaXzLTpnGrpwW6vs0PWe3qI3mgb4CcutdYYuDfbOus9D3dq2Y4NDIGbhEZuQn5OTChtvLn7vlLzd373OH6Pl6mExny2jmud8iJBrnYlKFWmmeZErE4QcKqEkZEwkGrQBk0qhVJqonGslmYa8rLTATKWVmJLrn95d3-1H44dNa31pmgad6Ua_4QkkMmFSQkCv_qF1N_YurAsUywAYZyC-ASTVbHU</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2417001210</pqid></control><display><type>article</type><title>Prediction and Optimization of NaV1.7 Sodium Channel Inhibitors Based on Machine Learning and Simulated Annealing</title><source>ACS Publications</source><creator>Kong, Weikaixin ; Tu, Xinyu ; Huang, Weiran ; Yang, Yang ; Xie, Zhengwei ; Huang, Zhuo</creator><creatorcontrib>Kong, Weikaixin ; Tu, Xinyu ; Huang, Weiran ; Yang, Yang ; Xie, Zhengwei ; Huang, Zhuo</creatorcontrib><description>Although the NaV1.7 sodium channel is a promising drug target for pain, traditional screening strategies for discovery of NaV1.7 inhibitors are very painstaking and time-consuming. Herein, we aimed to build machine learning models for screening and design of potent and effective NaV1.7 sodium channel inhibitors. We customized the imbalanced data set from ChEMBL and BindingDB to train and filter the best classification model. Then, the whole-cell voltage-clamp was employed to validate the inhibitors. We assembled a molecular group optimization method by combining the Grammar Variational Autoencoder, classification model, and simulated annealing. We found that the RF-CDK model (random forest + CDK fingerprint) performs best in the imbalanced data set. Of the three compounds that may have inhibitory effects, nortriptyline has been experimentally verified. In the molecule optimization process, 40 molecules located in the applicability domain of RF-CDK were used as a starting point, among which 34 molecules evolved to molecules with greater molecular scores (MS). The molecule with the highest MS was derived from CHEMBL2325245. The model and method we developed for NaV1.7 inhibitors are also applicable to other targets.</description><identifier>ISSN: 1549-9596</identifier><identifier>EISSN: 1549-960X</identifier><identifier>DOI: 10.1021/acs.jcim.9b01180</identifier><language>eng</language><publisher>Washington: American Chemical Society</publisher><subject>Classification ; Computer simulation ; Datasets ; Inhibitors ; Machine learning ; Optimization ; Screening ; Simulated annealing ; Sodium</subject><ispartof>Journal of chemical information and modeling, 2020-06, Vol.60 (6), p.2739-2753</ispartof><rights>Copyright American Chemical Society Jun 22, 2020</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27923,27924</link.rule.ids></links><search><creatorcontrib>Kong, Weikaixin</creatorcontrib><creatorcontrib>Tu, Xinyu</creatorcontrib><creatorcontrib>Huang, Weiran</creatorcontrib><creatorcontrib>Yang, Yang</creatorcontrib><creatorcontrib>Xie, Zhengwei</creatorcontrib><creatorcontrib>Huang, Zhuo</creatorcontrib><title>Prediction and Optimization of NaV1.7 Sodium Channel Inhibitors Based on Machine Learning and Simulated Annealing</title><title>Journal of chemical information and modeling</title><description>Although the NaV1.7 sodium channel is a promising drug target for pain, traditional screening strategies for discovery of NaV1.7 inhibitors are very painstaking and time-consuming. Herein, we aimed to build machine learning models for screening and design of potent and effective NaV1.7 sodium channel inhibitors. We customized the imbalanced data set from ChEMBL and BindingDB to train and filter the best classification model. Then, the whole-cell voltage-clamp was employed to validate the inhibitors. We assembled a molecular group optimization method by combining the Grammar Variational Autoencoder, classification model, and simulated annealing. We found that the RF-CDK model (random forest + CDK fingerprint) performs best in the imbalanced data set. Of the three compounds that may have inhibitory effects, nortriptyline has been experimentally verified. In the molecule optimization process, 40 molecules located in the applicability domain of RF-CDK were used as a starting point, among which 34 molecules evolved to molecules with greater molecular scores (MS). The molecule with the highest MS was derived from CHEMBL2325245. The model and method we developed for NaV1.7 inhibitors are also applicable to other targets.</description><subject>Classification</subject><subject>Computer simulation</subject><subject>Datasets</subject><subject>Inhibitors</subject><subject>Machine learning</subject><subject>Optimization</subject><subject>Screening</subject><subject>Simulated annealing</subject><subject>Sodium</subject><issn>1549-9596</issn><issn>1549-960X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNpdjr1PwzAUxC0EEqWwM1piYUl4thMnGUvFR6VCkQqIrXqOHeoocdo4WfjrsQos3PJOp987HSGXDGIGnN1g6eO6tG1cKGAshyMyYWlSRIWEj-M_nxbylJx5XwMIUUg-IfuX3mhbDrZzFJ2mq91gW_uFh6Cr6DO-szij607bsaXzLTpnGrpwW6vs0PWe3qI3mgb4CcutdYYuDfbOus9D3dq2Y4NDIGbhEZuQn5OTChtvLn7vlLzd373OH6Pl6mExny2jmud8iJBrnYlKFWmmeZErE4QcKqEkZEwkGrQBk0qhVJqonGslmYa8rLTATKWVmJLrn95d3-1H44dNa31pmgad6Ua_4QkkMmFSQkCv_qF1N_YurAsUywAYZyC-ASTVbHU</recordid><startdate>20200622</startdate><enddate>20200622</enddate><creator>Kong, Weikaixin</creator><creator>Tu, Xinyu</creator><creator>Huang, Weiran</creator><creator>Yang, Yang</creator><creator>Xie, Zhengwei</creator><creator>Huang, Zhuo</creator><general>American Chemical Society</general><scope>7SC</scope><scope>7SR</scope><scope>7U5</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>7X8</scope></search><sort><creationdate>20200622</creationdate><title>Prediction and Optimization of NaV1.7 Sodium Channel Inhibitors Based on Machine Learning and Simulated Annealing</title><author>Kong, Weikaixin ; Tu, Xinyu ; Huang, Weiran ; Yang, Yang ; Xie, Zhengwei ; Huang, Zhuo</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-j282t-a2dd73fb957d298beeeea20f3b607134d0de0e563bb54b82db61d08cfd3a7b5f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Classification</topic><topic>Computer simulation</topic><topic>Datasets</topic><topic>Inhibitors</topic><topic>Machine learning</topic><topic>Optimization</topic><topic>Screening</topic><topic>Simulated annealing</topic><topic>Sodium</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kong, Weikaixin</creatorcontrib><creatorcontrib>Tu, Xinyu</creatorcontrib><creatorcontrib>Huang, Weiran</creatorcontrib><creatorcontrib>Yang, Yang</creatorcontrib><creatorcontrib>Xie, Zhengwei</creatorcontrib><creatorcontrib>Huang, Zhuo</creatorcontrib><collection>Computer and Information Systems Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>Solid State and Superconductivity Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>MEDLINE - Academic</collection><jtitle>Journal of chemical information and modeling</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kong, Weikaixin</au><au>Tu, Xinyu</au><au>Huang, Weiran</au><au>Yang, Yang</au><au>Xie, Zhengwei</au><au>Huang, Zhuo</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Prediction and Optimization of NaV1.7 Sodium Channel Inhibitors Based on Machine Learning and Simulated Annealing</atitle><jtitle>Journal of chemical information and modeling</jtitle><date>2020-06-22</date><risdate>2020</risdate><volume>60</volume><issue>6</issue><spage>2739</spage><epage>2753</epage><pages>2739-2753</pages><issn>1549-9596</issn><eissn>1549-960X</eissn><abstract>Although the NaV1.7 sodium channel is a promising drug target for pain, traditional screening strategies for discovery of NaV1.7 inhibitors are very painstaking and time-consuming. Herein, we aimed to build machine learning models for screening and design of potent and effective NaV1.7 sodium channel inhibitors. We customized the imbalanced data set from ChEMBL and BindingDB to train and filter the best classification model. Then, the whole-cell voltage-clamp was employed to validate the inhibitors. We assembled a molecular group optimization method by combining the Grammar Variational Autoencoder, classification model, and simulated annealing. We found that the RF-CDK model (random forest + CDK fingerprint) performs best in the imbalanced data set. Of the three compounds that may have inhibitory effects, nortriptyline has been experimentally verified. In the molecule optimization process, 40 molecules located in the applicability domain of RF-CDK were used as a starting point, among which 34 molecules evolved to molecules with greater molecular scores (MS). The molecule with the highest MS was derived from CHEMBL2325245. The model and method we developed for NaV1.7 inhibitors are also applicable to other targets.</abstract><cop>Washington</cop><pub>American Chemical Society</pub><doi>10.1021/acs.jcim.9b01180</doi><tpages>15</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1549-9596
ispartof Journal of chemical information and modeling, 2020-06, Vol.60 (6), p.2739-2753
issn 1549-9596
1549-960X
language eng
recordid cdi_proquest_miscellaneous_2404641660
source ACS Publications
subjects Classification
Computer simulation
Datasets
Inhibitors
Machine learning
Optimization
Screening
Simulated annealing
Sodium
title Prediction and Optimization of NaV1.7 Sodium Channel Inhibitors Based on Machine Learning and Simulated Annealing
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-11T09%3A59%3A02IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Prediction%20and%20Optimization%20of%20NaV1.7%20Sodium%20Channel%20Inhibitors%20Based%20on%20Machine%20Learning%20and%20Simulated%20Annealing&rft.jtitle=Journal%20of%20chemical%20information%20and%20modeling&rft.au=Kong,%20Weikaixin&rft.date=2020-06-22&rft.volume=60&rft.issue=6&rft.spage=2739&rft.epage=2753&rft.pages=2739-2753&rft.issn=1549-9596&rft.eissn=1549-960X&rft_id=info:doi/10.1021/acs.jcim.9b01180&rft_dat=%3Cproquest%3E2404641660%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2417001210&rft_id=info:pmid/&rfr_iscdi=true