Machine learning-assisted data-driven optimization and understanding of the multiple stage process for extraction of polysaccharides and secondary metabolites from natural products

Currently, extraction process optimization is generally based on a few features, regardless of their different changing trends and the panoramic view of the extraction process. Comprehensive evaluation and understanding is hard to establish due to the small number of experiments. Here, machine learn...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Green chemistry : an international journal and green chemistry resource : GC 2023-04, Vol.25 (8), p.357-368
Hauptverfasser: Ma, Jiamu, Yao, Jianling, Ren, Xueyang, Dong, Ying, Song, Ruolan, Zhong, Xiangjian, Zheng, Yuan, Shan, Dongjie, Lv, Fang, Li, Xianxian, Deng, Qingyue, He, Yingyu, Yuan, Ruijuan, She, Gaimei
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 368
container_issue 8
container_start_page 357
container_title Green chemistry : an international journal and green chemistry resource : GC
container_volume 25
creator Ma, Jiamu
Yao, Jianling
Ren, Xueyang
Dong, Ying
Song, Ruolan
Zhong, Xiangjian
Zheng, Yuan
Shan, Dongjie
Lv, Fang
Li, Xianxian
Deng, Qingyue
He, Yingyu
Yuan, Ruijuan
She, Gaimei
description Currently, extraction process optimization is generally based on a few features, regardless of their different changing trends and the panoramic view of the extraction process. Comprehensive evaluation and understanding is hard to establish due to the small number of experiments. Here, machine learning-assisted optimization is demonstrated for better understanding the complex extraction process based on data from an orthogonal experimental design (OED). From two perspectives of panoramic characteristics and specific characteristics, several observations are adopted to evaluate the performance of the extraction process, including quantitative 1 H NMR, HPLC fingerprint, molecular weight, yield of dry extract and content of components. The close relationship between influencing factors and the extraction performance is described by grey relation analysis. With the help of radial basis function neural network (RBFNN), a nonlinear fitting regression equation is developed for every observation and influencing factor. A genetic algorithm is then introduced for multi-objective optimization and Pareto fronts are obtained. To select the best combination of water extraction process and ethanol extraction process, a list of the combinations of Pareto front points from those extraction processes is formed and ranked using CRITIC-TOPSIS. Finally, the ideal extraction is characterized by molecular weight, monosaccharide composition and UHPLC-MS/MS. With the verification between OED experiments and machine learning, the changing rates of all observations range from 1.33% to 30.11%, which confirms that machine learning-assisted optimization gives better performance than conventional OED. Molecular weight could range from 61.5~594.9 kDa with some are over measuring range, furthermore mannose and glucose are the most abuntant monosaccharides of the polysaccharide from ideal extraction. 160 components are identified via UHPLC-MS/MS as well. In conclusion, ML is a powerful tool for predicting and understanding extraction processes, thus accelerating the development of eco-friendly extraction processes. A machine learning strategy mainly consist of radial basis function neural network and genetic algorithm for predicting and understanding multi-objective extraction process.
doi_str_mv 10.1039/d2gc04574e
format Article
fullrecord <record><control><sourceid>proquest_rsc_p</sourceid><recordid>TN_cdi_proquest_journals_2804879200</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2804879200</sourcerecordid><originalsourceid>FETCH-LOGICAL-c281t-653dbc046aef454e1b9cc915e01aa20ed16cfde931265a2e757780e52d16e2903</originalsourceid><addsrcrecordid>eNpFkU1LxDAQhosouH5cvAsBb0I1Sb-2R1k_QfGi5zKbTHeztEnNpKL-Ln-g2V3R0wy8z7zDzJskJ4JfCJ7Vl1ouFM-LKsedZCLyMktrWfHdv76U-8kB0YpzIaoynyTfT6CWxiLrELw1dpECkaGAmmkIkGpv3tEyNwTTmy8IxlkGVrPRavQUYhtnmGtZWCLrxy6YoUMWhQWywTuFRKx1nuFH8KA24xEeXPdJoNQSvNFIG0dC5awG_8l6DDB3nQlRab3rmYUweujWhnpUgY6SvRY6wuPfepi83t68zO7Tx-e7h9nVY6rkVIS0LDI9j-8oAdu8yFHMa6VqUSAXAJKjFqVqNdaZkGUBEquiqqYcCxkFlDXPDpOzrW9c_DYihWblRm_jykZOeT6tasnX1PmWUt4ReWybwZs-HtII3qxTaa7l3WyTyk2ET7ewJ_XH_aeW_QDTso8i</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2804879200</pqid></control><display><type>article</type><title>Machine learning-assisted data-driven optimization and understanding of the multiple stage process for extraction of polysaccharides and secondary metabolites from natural products</title><source>Royal Society Of Chemistry Journals 2008-</source><source>Alma/SFX Local Collection</source><creator>Ma, Jiamu ; Yao, Jianling ; Ren, Xueyang ; Dong, Ying ; Song, Ruolan ; Zhong, Xiangjian ; Zheng, Yuan ; Shan, Dongjie ; Lv, Fang ; Li, Xianxian ; Deng, Qingyue ; He, Yingyu ; Yuan, Ruijuan ; She, Gaimei</creator><creatorcontrib>Ma, Jiamu ; Yao, Jianling ; Ren, Xueyang ; Dong, Ying ; Song, Ruolan ; Zhong, Xiangjian ; Zheng, Yuan ; Shan, Dongjie ; Lv, Fang ; Li, Xianxian ; Deng, Qingyue ; He, Yingyu ; Yuan, Ruijuan ; She, Gaimei</creatorcontrib><description>Currently, extraction process optimization is generally based on a few features, regardless of their different changing trends and the panoramic view of the extraction process. Comprehensive evaluation and understanding is hard to establish due to the small number of experiments. Here, machine learning-assisted optimization is demonstrated for better understanding the complex extraction process based on data from an orthogonal experimental design (OED). From two perspectives of panoramic characteristics and specific characteristics, several observations are adopted to evaluate the performance of the extraction process, including quantitative 1 H NMR, HPLC fingerprint, molecular weight, yield of dry extract and content of components. The close relationship between influencing factors and the extraction performance is described by grey relation analysis. With the help of radial basis function neural network (RBFNN), a nonlinear fitting regression equation is developed for every observation and influencing factor. A genetic algorithm is then introduced for multi-objective optimization and Pareto fronts are obtained. To select the best combination of water extraction process and ethanol extraction process, a list of the combinations of Pareto front points from those extraction processes is formed and ranked using CRITIC-TOPSIS. Finally, the ideal extraction is characterized by molecular weight, monosaccharide composition and UHPLC-MS/MS. With the verification between OED experiments and machine learning, the changing rates of all observations range from 1.33% to 30.11%, which confirms that machine learning-assisted optimization gives better performance than conventional OED. Molecular weight could range from 61.5~594.9 kDa with some are over measuring range, furthermore mannose and glucose are the most abuntant monosaccharides of the polysaccharide from ideal extraction. 160 components are identified via UHPLC-MS/MS as well. In conclusion, ML is a powerful tool for predicting and understanding extraction processes, thus accelerating the development of eco-friendly extraction processes. A machine learning strategy mainly consist of radial basis function neural network and genetic algorithm for predicting and understanding multi-objective extraction process.</description><identifier>ISSN: 1463-9262</identifier><identifier>EISSN: 1463-9270</identifier><identifier>DOI: 10.1039/d2gc04574e</identifier><language>eng</language><publisher>Cambridge: Royal Society of Chemistry</publisher><subject>Design of experiments ; Ethanol ; Experimental design ; Genetic algorithms ; Green chemistry ; Learning algorithms ; Machine learning ; Mannose ; Metabolites ; Molecular weight ; Monosaccharides ; Multiple objective analysis ; Natural products ; Neural networks ; NMR ; Nuclear magnetic resonance ; Optimization ; Pareto optimization ; Performance evaluation ; Polysaccharides ; Radial basis function ; Saccharides ; Secondary metabolites</subject><ispartof>Green chemistry : an international journal and green chemistry resource : GC, 2023-04, Vol.25 (8), p.357-368</ispartof><rights>Copyright Royal Society of Chemistry 2023</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c281t-653dbc046aef454e1b9cc915e01aa20ed16cfde931265a2e757780e52d16e2903</citedby><cites>FETCH-LOGICAL-c281t-653dbc046aef454e1b9cc915e01aa20ed16cfde931265a2e757780e52d16e2903</cites><orcidid>0000-0003-1754-2140</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids></links><search><creatorcontrib>Ma, Jiamu</creatorcontrib><creatorcontrib>Yao, Jianling</creatorcontrib><creatorcontrib>Ren, Xueyang</creatorcontrib><creatorcontrib>Dong, Ying</creatorcontrib><creatorcontrib>Song, Ruolan</creatorcontrib><creatorcontrib>Zhong, Xiangjian</creatorcontrib><creatorcontrib>Zheng, Yuan</creatorcontrib><creatorcontrib>Shan, Dongjie</creatorcontrib><creatorcontrib>Lv, Fang</creatorcontrib><creatorcontrib>Li, Xianxian</creatorcontrib><creatorcontrib>Deng, Qingyue</creatorcontrib><creatorcontrib>He, Yingyu</creatorcontrib><creatorcontrib>Yuan, Ruijuan</creatorcontrib><creatorcontrib>She, Gaimei</creatorcontrib><title>Machine learning-assisted data-driven optimization and understanding of the multiple stage process for extraction of polysaccharides and secondary metabolites from natural products</title><title>Green chemistry : an international journal and green chemistry resource : GC</title><description>Currently, extraction process optimization is generally based on a few features, regardless of their different changing trends and the panoramic view of the extraction process. Comprehensive evaluation and understanding is hard to establish due to the small number of experiments. Here, machine learning-assisted optimization is demonstrated for better understanding the complex extraction process based on data from an orthogonal experimental design (OED). From two perspectives of panoramic characteristics and specific characteristics, several observations are adopted to evaluate the performance of the extraction process, including quantitative 1 H NMR, HPLC fingerprint, molecular weight, yield of dry extract and content of components. The close relationship between influencing factors and the extraction performance is described by grey relation analysis. With the help of radial basis function neural network (RBFNN), a nonlinear fitting regression equation is developed for every observation and influencing factor. A genetic algorithm is then introduced for multi-objective optimization and Pareto fronts are obtained. To select the best combination of water extraction process and ethanol extraction process, a list of the combinations of Pareto front points from those extraction processes is formed and ranked using CRITIC-TOPSIS. Finally, the ideal extraction is characterized by molecular weight, monosaccharide composition and UHPLC-MS/MS. With the verification between OED experiments and machine learning, the changing rates of all observations range from 1.33% to 30.11%, which confirms that machine learning-assisted optimization gives better performance than conventional OED. Molecular weight could range from 61.5~594.9 kDa with some are over measuring range, furthermore mannose and glucose are the most abuntant monosaccharides of the polysaccharide from ideal extraction. 160 components are identified via UHPLC-MS/MS as well. In conclusion, ML is a powerful tool for predicting and understanding extraction processes, thus accelerating the development of eco-friendly extraction processes. A machine learning strategy mainly consist of radial basis function neural network and genetic algorithm for predicting and understanding multi-objective extraction process.</description><subject>Design of experiments</subject><subject>Ethanol</subject><subject>Experimental design</subject><subject>Genetic algorithms</subject><subject>Green chemistry</subject><subject>Learning algorithms</subject><subject>Machine learning</subject><subject>Mannose</subject><subject>Metabolites</subject><subject>Molecular weight</subject><subject>Monosaccharides</subject><subject>Multiple objective analysis</subject><subject>Natural products</subject><subject>Neural networks</subject><subject>NMR</subject><subject>Nuclear magnetic resonance</subject><subject>Optimization</subject><subject>Pareto optimization</subject><subject>Performance evaluation</subject><subject>Polysaccharides</subject><subject>Radial basis function</subject><subject>Saccharides</subject><subject>Secondary metabolites</subject><issn>1463-9262</issn><issn>1463-9270</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNpFkU1LxDAQhosouH5cvAsBb0I1Sb-2R1k_QfGi5zKbTHeztEnNpKL-Ln-g2V3R0wy8z7zDzJskJ4JfCJ7Vl1ouFM-LKsedZCLyMktrWfHdv76U-8kB0YpzIaoynyTfT6CWxiLrELw1dpECkaGAmmkIkGpv3tEyNwTTmy8IxlkGVrPRavQUYhtnmGtZWCLrxy6YoUMWhQWywTuFRKx1nuFH8KA24xEeXPdJoNQSvNFIG0dC5awG_8l6DDB3nQlRab3rmYUweujWhnpUgY6SvRY6wuPfepi83t68zO7Tx-e7h9nVY6rkVIS0LDI9j-8oAdu8yFHMa6VqUSAXAJKjFqVqNdaZkGUBEquiqqYcCxkFlDXPDpOzrW9c_DYihWblRm_jykZOeT6tasnX1PmWUt4ReWybwZs-HtII3qxTaa7l3WyTyk2ET7ewJ_XH_aeW_QDTso8i</recordid><startdate>20230424</startdate><enddate>20230424</enddate><creator>Ma, Jiamu</creator><creator>Yao, Jianling</creator><creator>Ren, Xueyang</creator><creator>Dong, Ying</creator><creator>Song, Ruolan</creator><creator>Zhong, Xiangjian</creator><creator>Zheng, Yuan</creator><creator>Shan, Dongjie</creator><creator>Lv, Fang</creator><creator>Li, Xianxian</creator><creator>Deng, Qingyue</creator><creator>He, Yingyu</creator><creator>Yuan, Ruijuan</creator><creator>She, Gaimei</creator><general>Royal Society of Chemistry</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SR</scope><scope>7ST</scope><scope>7U6</scope><scope>8BQ</scope><scope>8FD</scope><scope>C1K</scope><scope>JG9</scope><orcidid>https://orcid.org/0000-0003-1754-2140</orcidid></search><sort><creationdate>20230424</creationdate><title>Machine learning-assisted data-driven optimization and understanding of the multiple stage process for extraction of polysaccharides and secondary metabolites from natural products</title><author>Ma, Jiamu ; Yao, Jianling ; Ren, Xueyang ; Dong, Ying ; Song, Ruolan ; Zhong, Xiangjian ; Zheng, Yuan ; Shan, Dongjie ; Lv, Fang ; Li, Xianxian ; Deng, Qingyue ; He, Yingyu ; Yuan, Ruijuan ; She, Gaimei</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c281t-653dbc046aef454e1b9cc915e01aa20ed16cfde931265a2e757780e52d16e2903</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Design of experiments</topic><topic>Ethanol</topic><topic>Experimental design</topic><topic>Genetic algorithms</topic><topic>Green chemistry</topic><topic>Learning algorithms</topic><topic>Machine learning</topic><topic>Mannose</topic><topic>Metabolites</topic><topic>Molecular weight</topic><topic>Monosaccharides</topic><topic>Multiple objective analysis</topic><topic>Natural products</topic><topic>Neural networks</topic><topic>NMR</topic><topic>Nuclear magnetic resonance</topic><topic>Optimization</topic><topic>Pareto optimization</topic><topic>Performance evaluation</topic><topic>Polysaccharides</topic><topic>Radial basis function</topic><topic>Saccharides</topic><topic>Secondary metabolites</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Ma, Jiamu</creatorcontrib><creatorcontrib>Yao, Jianling</creatorcontrib><creatorcontrib>Ren, Xueyang</creatorcontrib><creatorcontrib>Dong, Ying</creatorcontrib><creatorcontrib>Song, Ruolan</creatorcontrib><creatorcontrib>Zhong, Xiangjian</creatorcontrib><creatorcontrib>Zheng, Yuan</creatorcontrib><creatorcontrib>Shan, Dongjie</creatorcontrib><creatorcontrib>Lv, Fang</creatorcontrib><creatorcontrib>Li, Xianxian</creatorcontrib><creatorcontrib>Deng, Qingyue</creatorcontrib><creatorcontrib>He, Yingyu</creatorcontrib><creatorcontrib>Yuan, Ruijuan</creatorcontrib><creatorcontrib>She, Gaimei</creatorcontrib><collection>CrossRef</collection><collection>Engineered Materials Abstracts</collection><collection>Environment Abstracts</collection><collection>Sustainability Science Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Environmental Sciences and Pollution Management</collection><collection>Materials Research Database</collection><jtitle>Green chemistry : an international journal and green chemistry resource : GC</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Ma, Jiamu</au><au>Yao, Jianling</au><au>Ren, Xueyang</au><au>Dong, Ying</au><au>Song, Ruolan</au><au>Zhong, Xiangjian</au><au>Zheng, Yuan</au><au>Shan, Dongjie</au><au>Lv, Fang</au><au>Li, Xianxian</au><au>Deng, Qingyue</au><au>He, Yingyu</au><au>Yuan, Ruijuan</au><au>She, Gaimei</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Machine learning-assisted data-driven optimization and understanding of the multiple stage process for extraction of polysaccharides and secondary metabolites from natural products</atitle><jtitle>Green chemistry : an international journal and green chemistry resource : GC</jtitle><date>2023-04-24</date><risdate>2023</risdate><volume>25</volume><issue>8</issue><spage>357</spage><epage>368</epage><pages>357-368</pages><issn>1463-9262</issn><eissn>1463-9270</eissn><abstract>Currently, extraction process optimization is generally based on a few features, regardless of their different changing trends and the panoramic view of the extraction process. Comprehensive evaluation and understanding is hard to establish due to the small number of experiments. Here, machine learning-assisted optimization is demonstrated for better understanding the complex extraction process based on data from an orthogonal experimental design (OED). From two perspectives of panoramic characteristics and specific characteristics, several observations are adopted to evaluate the performance of the extraction process, including quantitative 1 H NMR, HPLC fingerprint, molecular weight, yield of dry extract and content of components. The close relationship between influencing factors and the extraction performance is described by grey relation analysis. With the help of radial basis function neural network (RBFNN), a nonlinear fitting regression equation is developed for every observation and influencing factor. A genetic algorithm is then introduced for multi-objective optimization and Pareto fronts are obtained. To select the best combination of water extraction process and ethanol extraction process, a list of the combinations of Pareto front points from those extraction processes is formed and ranked using CRITIC-TOPSIS. Finally, the ideal extraction is characterized by molecular weight, monosaccharide composition and UHPLC-MS/MS. With the verification between OED experiments and machine learning, the changing rates of all observations range from 1.33% to 30.11%, which confirms that machine learning-assisted optimization gives better performance than conventional OED. Molecular weight could range from 61.5~594.9 kDa with some are over measuring range, furthermore mannose and glucose are the most abuntant monosaccharides of the polysaccharide from ideal extraction. 160 components are identified via UHPLC-MS/MS as well. In conclusion, ML is a powerful tool for predicting and understanding extraction processes, thus accelerating the development of eco-friendly extraction processes. A machine learning strategy mainly consist of radial basis function neural network and genetic algorithm for predicting and understanding multi-objective extraction process.</abstract><cop>Cambridge</cop><pub>Royal Society of Chemistry</pub><doi>10.1039/d2gc04574e</doi><tpages>12</tpages><orcidid>https://orcid.org/0000-0003-1754-2140</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 1463-9262
ispartof Green chemistry : an international journal and green chemistry resource : GC, 2023-04, Vol.25 (8), p.357-368
issn 1463-9262
1463-9270
language eng
recordid cdi_proquest_journals_2804879200
source Royal Society Of Chemistry Journals 2008-; Alma/SFX Local Collection
subjects Design of experiments
Ethanol
Experimental design
Genetic algorithms
Green chemistry
Learning algorithms
Machine learning
Mannose
Metabolites
Molecular weight
Monosaccharides
Multiple objective analysis
Natural products
Neural networks
NMR
Nuclear magnetic resonance
Optimization
Pareto optimization
Performance evaluation
Polysaccharides
Radial basis function
Saccharides
Secondary metabolites
title Machine learning-assisted data-driven optimization and understanding of the multiple stage process for extraction of polysaccharides and secondary metabolites from natural products
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-11T05%3A21%3A45IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_rsc_p&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Machine%20learning-assisted%20data-driven%20optimization%20and%20understanding%20of%20the%20multiple%20stage%20process%20for%20extraction%20of%20polysaccharides%20and%20secondary%20metabolites%20from%20natural%20products&rft.jtitle=Green%20chemistry%20:%20an%20international%20journal%20and%20green%20chemistry%20resource%20:%20GC&rft.au=Ma,%20Jiamu&rft.date=2023-04-24&rft.volume=25&rft.issue=8&rft.spage=357&rft.epage=368&rft.pages=357-368&rft.issn=1463-9262&rft.eissn=1463-9270&rft_id=info:doi/10.1039/d2gc04574e&rft_dat=%3Cproquest_rsc_p%3E2804879200%3C/proquest_rsc_p%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2804879200&rft_id=info:pmid/&rfr_iscdi=true