Prediction of biopersistence of hydrocarbons using a single parameter

Aerobic biodegradation is an important attenuation process for petroleum hydrocarbons (PHCs) in the natural environment. It has also been frequently used in engineered systems to remediate PHC-contaminated sites. A model such as a quantitative structure property relationship (QSPR) that can predict...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Chemosphere (Oxford) 2018-12, Vol.213, p.76-83
Hauptverfasser: Xiao, Feng, Huisman, Quinn E.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 83
container_issue
container_start_page 76
container_title Chemosphere (Oxford)
container_volume 213
creator Xiao, Feng
Huisman, Quinn E.
description Aerobic biodegradation is an important attenuation process for petroleum hydrocarbons (PHCs) in the natural environment. It has also been frequently used in engineered systems to remediate PHC-contaminated sites. A model such as a quantitative structure property relationship (QSPR) that can predict the biodegradation rate of PHCs would be helpful prior to implementing any extensive environmental measurements and bioremediation strategies. Existing QSPRs either have a large number of predictor variables that may cause overfitting or are based on a small dataset of PHCs. The goal of this study is to develop a simple, portable QSPR that has only a few predicator variables but can accurately predict the biodegradation half-lives of a large group of PHCs. To this end, more than 500 molecular variables were screened, and candidate variables were refined by a feature selection method and fitted to biodegradation data of a group of structurally heterogeneous PHCs (n = 173). The model was established by means of hierarchical clustering and classification and regression tree algorithms, which was optimized by an internal validation procedure and validated by an external dataset. The optimal QSPR model, containing only one predictor variable (the number of bonds that do not contain hydrogen), was able to accurately predict biodegradation half-lives for a wide variety of PHCs. The internal validation test indicated an overall prediction accuracy of 93%, and predictions applied to an independent external set of 64 PHCs yielded 95% accuracy. The new model is transparent and easily portable from one user to another. [Display omitted] •A new QSPR was built for the biodegradation half-lives of hydrocarbons.•This new QSPR contains only one predictor variable.•Number of bonds that do not contain hydrogen is the most significant predictor.•This model was validated by an external test set of 64 hydrocarbons.•The model shows a prediction accuracy of 95% over the external validation set.
doi_str_mv 10.1016/j.chemosphere.2018.09.035
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2105065402</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0045653518316886</els_id><sourcerecordid>2105065402</sourcerecordid><originalsourceid>FETCH-LOGICAL-c377t-48ea51d29ff81ee532262426358786e80dfbbb220e6c96ca9ad83585716328e13</originalsourceid><addsrcrecordid>eNqNkM1OwzAQhC0EoqXwCijcuCSsndqxj6gqP1IlOMDZcpwNddXEwU6R-vYkakEcOY20O7Oj_Qi5oZBRoOJuk9k1Nj52awyYMaAyA5VBzk_IlMpCpZQpeUqmAHOeCp7zCbmIcQMwhLk6J5McGGUFo1OyfA1YOds73ya-TkrnOwzRxR5bi-Nkva-CtyaUvo3JLrr2IzHJKFtMOhNMgz2GS3JWm23Eq6POyPvD8m3xlK5eHp8X96vU5kXRp3OJhtOKqbqWFJHnjAk2ZyLnspACJVR1WZaMAQqrhDXKVHLY8YKKnEmk-YzcHu52wX_uMPa6cdHidmta9LuoGQUOgs-BDVZ1sNrgYwxY6y64xoS9pqBHinqj_1DUI0UNSg8Uh-z1sWZXNlj9Jn-wDYbFwYDDs18Og47WjcQqF9D2uvLuHzXf6eeJOA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2105065402</pqid></control><display><type>article</type><title>Prediction of biopersistence of hydrocarbons using a single parameter</title><source>MEDLINE</source><source>Access via ScienceDirect (Elsevier)</source><creator>Xiao, Feng ; Huisman, Quinn E.</creator><creatorcontrib>Xiao, Feng ; Huisman, Quinn E.</creatorcontrib><description>Aerobic biodegradation is an important attenuation process for petroleum hydrocarbons (PHCs) in the natural environment. It has also been frequently used in engineered systems to remediate PHC-contaminated sites. A model such as a quantitative structure property relationship (QSPR) that can predict the biodegradation rate of PHCs would be helpful prior to implementing any extensive environmental measurements and bioremediation strategies. Existing QSPRs either have a large number of predictor variables that may cause overfitting or are based on a small dataset of PHCs. The goal of this study is to develop a simple, portable QSPR that has only a few predicator variables but can accurately predict the biodegradation half-lives of a large group of PHCs. To this end, more than 500 molecular variables were screened, and candidate variables were refined by a feature selection method and fitted to biodegradation data of a group of structurally heterogeneous PHCs (n = 173). The model was established by means of hierarchical clustering and classification and regression tree algorithms, which was optimized by an internal validation procedure and validated by an external dataset. The optimal QSPR model, containing only one predictor variable (the number of bonds that do not contain hydrogen), was able to accurately predict biodegradation half-lives for a wide variety of PHCs. The internal validation test indicated an overall prediction accuracy of 93%, and predictions applied to an independent external set of 64 PHCs yielded 95% accuracy. The new model is transparent and easily portable from one user to another. [Display omitted] •A new QSPR was built for the biodegradation half-lives of hydrocarbons.•This new QSPR contains only one predictor variable.•Number of bonds that do not contain hydrogen is the most significant predictor.•This model was validated by an external test set of 64 hydrocarbons.•The model shows a prediction accuracy of 95% over the external validation set.</description><identifier>ISSN: 0045-6535</identifier><identifier>EISSN: 1879-1298</identifier><identifier>DOI: 10.1016/j.chemosphere.2018.09.035</identifier><identifier>PMID: 30212721</identifier><language>eng</language><publisher>England: Elsevier Ltd</publisher><subject>Biodegradation ; Hydrocarbons - chemistry ; Hydrocarbons - metabolism ; Molecular descriptors ; Petroleum - metabolism ; Petroleum hydrocarbons ; Quantitative structure property relationship</subject><ispartof>Chemosphere (Oxford), 2018-12, Vol.213, p.76-83</ispartof><rights>2018 Elsevier Ltd</rights><rights>Copyright © 2018 Elsevier Ltd. All rights reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c377t-48ea51d29ff81ee532262426358786e80dfbbb220e6c96ca9ad83585716328e13</citedby><cites>FETCH-LOGICAL-c377t-48ea51d29ff81ee532262426358786e80dfbbb220e6c96ca9ad83585716328e13</cites><orcidid>0000-0001-5686-6055</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.chemosphere.2018.09.035$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,780,784,3550,27924,27925,45995</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/30212721$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Xiao, Feng</creatorcontrib><creatorcontrib>Huisman, Quinn E.</creatorcontrib><title>Prediction of biopersistence of hydrocarbons using a single parameter</title><title>Chemosphere (Oxford)</title><addtitle>Chemosphere</addtitle><description>Aerobic biodegradation is an important attenuation process for petroleum hydrocarbons (PHCs) in the natural environment. It has also been frequently used in engineered systems to remediate PHC-contaminated sites. A model such as a quantitative structure property relationship (QSPR) that can predict the biodegradation rate of PHCs would be helpful prior to implementing any extensive environmental measurements and bioremediation strategies. Existing QSPRs either have a large number of predictor variables that may cause overfitting or are based on a small dataset of PHCs. The goal of this study is to develop a simple, portable QSPR that has only a few predicator variables but can accurately predict the biodegradation half-lives of a large group of PHCs. To this end, more than 500 molecular variables were screened, and candidate variables were refined by a feature selection method and fitted to biodegradation data of a group of structurally heterogeneous PHCs (n = 173). The model was established by means of hierarchical clustering and classification and regression tree algorithms, which was optimized by an internal validation procedure and validated by an external dataset. The optimal QSPR model, containing only one predictor variable (the number of bonds that do not contain hydrogen), was able to accurately predict biodegradation half-lives for a wide variety of PHCs. The internal validation test indicated an overall prediction accuracy of 93%, and predictions applied to an independent external set of 64 PHCs yielded 95% accuracy. The new model is transparent and easily portable from one user to another. [Display omitted] •A new QSPR was built for the biodegradation half-lives of hydrocarbons.•This new QSPR contains only one predictor variable.•Number of bonds that do not contain hydrogen is the most significant predictor.•This model was validated by an external test set of 64 hydrocarbons.•The model shows a prediction accuracy of 95% over the external validation set.</description><subject>Biodegradation</subject><subject>Hydrocarbons - chemistry</subject><subject>Hydrocarbons - metabolism</subject><subject>Molecular descriptors</subject><subject>Petroleum - metabolism</subject><subject>Petroleum hydrocarbons</subject><subject>Quantitative structure property relationship</subject><issn>0045-6535</issn><issn>1879-1298</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqNkM1OwzAQhC0EoqXwCijcuCSsndqxj6gqP1IlOMDZcpwNddXEwU6R-vYkakEcOY20O7Oj_Qi5oZBRoOJuk9k1Nj52awyYMaAyA5VBzk_IlMpCpZQpeUqmAHOeCp7zCbmIcQMwhLk6J5McGGUFo1OyfA1YOds73ya-TkrnOwzRxR5bi-Nkva-CtyaUvo3JLrr2IzHJKFtMOhNMgz2GS3JWm23Eq6POyPvD8m3xlK5eHp8X96vU5kXRp3OJhtOKqbqWFJHnjAk2ZyLnspACJVR1WZaMAQqrhDXKVHLY8YKKnEmk-YzcHu52wX_uMPa6cdHidmta9LuoGQUOgs-BDVZ1sNrgYwxY6y64xoS9pqBHinqj_1DUI0UNSg8Uh-z1sWZXNlj9Jn-wDYbFwYDDs18Og47WjcQqF9D2uvLuHzXf6eeJOA</recordid><startdate>201812</startdate><enddate>201812</enddate><creator>Xiao, Feng</creator><creator>Huisman, Quinn E.</creator><general>Elsevier Ltd</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0001-5686-6055</orcidid></search><sort><creationdate>201812</creationdate><title>Prediction of biopersistence of hydrocarbons using a single parameter</title><author>Xiao, Feng ; Huisman, Quinn E.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c377t-48ea51d29ff81ee532262426358786e80dfbbb220e6c96ca9ad83585716328e13</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Biodegradation</topic><topic>Hydrocarbons - chemistry</topic><topic>Hydrocarbons - metabolism</topic><topic>Molecular descriptors</topic><topic>Petroleum - metabolism</topic><topic>Petroleum hydrocarbons</topic><topic>Quantitative structure property relationship</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Xiao, Feng</creatorcontrib><creatorcontrib>Huisman, Quinn E.</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Chemosphere (Oxford)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Xiao, Feng</au><au>Huisman, Quinn E.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Prediction of biopersistence of hydrocarbons using a single parameter</atitle><jtitle>Chemosphere (Oxford)</jtitle><addtitle>Chemosphere</addtitle><date>2018-12</date><risdate>2018</risdate><volume>213</volume><spage>76</spage><epage>83</epage><pages>76-83</pages><issn>0045-6535</issn><eissn>1879-1298</eissn><abstract>Aerobic biodegradation is an important attenuation process for petroleum hydrocarbons (PHCs) in the natural environment. It has also been frequently used in engineered systems to remediate PHC-contaminated sites. A model such as a quantitative structure property relationship (QSPR) that can predict the biodegradation rate of PHCs would be helpful prior to implementing any extensive environmental measurements and bioremediation strategies. Existing QSPRs either have a large number of predictor variables that may cause overfitting or are based on a small dataset of PHCs. The goal of this study is to develop a simple, portable QSPR that has only a few predicator variables but can accurately predict the biodegradation half-lives of a large group of PHCs. To this end, more than 500 molecular variables were screened, and candidate variables were refined by a feature selection method and fitted to biodegradation data of a group of structurally heterogeneous PHCs (n = 173). The model was established by means of hierarchical clustering and classification and regression tree algorithms, which was optimized by an internal validation procedure and validated by an external dataset. The optimal QSPR model, containing only one predictor variable (the number of bonds that do not contain hydrogen), was able to accurately predict biodegradation half-lives for a wide variety of PHCs. The internal validation test indicated an overall prediction accuracy of 93%, and predictions applied to an independent external set of 64 PHCs yielded 95% accuracy. The new model is transparent and easily portable from one user to another. [Display omitted] •A new QSPR was built for the biodegradation half-lives of hydrocarbons.•This new QSPR contains only one predictor variable.•Number of bonds that do not contain hydrogen is the most significant predictor.•This model was validated by an external test set of 64 hydrocarbons.•The model shows a prediction accuracy of 95% over the external validation set.</abstract><cop>England</cop><pub>Elsevier Ltd</pub><pmid>30212721</pmid><doi>10.1016/j.chemosphere.2018.09.035</doi><tpages>8</tpages><orcidid>https://orcid.org/0000-0001-5686-6055</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0045-6535
ispartof Chemosphere (Oxford), 2018-12, Vol.213, p.76-83
issn 0045-6535
1879-1298
language eng
recordid cdi_proquest_miscellaneous_2105065402
source MEDLINE; Access via ScienceDirect (Elsevier)
subjects Biodegradation
Hydrocarbons - chemistry
Hydrocarbons - metabolism
Molecular descriptors
Petroleum - metabolism
Petroleum hydrocarbons
Quantitative structure property relationship
title Prediction of biopersistence of hydrocarbons using a single parameter
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-24T01%3A17%3A55IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Prediction%20of%20biopersistence%20of%20hydrocarbons%20using%20a%20single%20parameter&rft.jtitle=Chemosphere%20(Oxford)&rft.au=Xiao,%20Feng&rft.date=2018-12&rft.volume=213&rft.spage=76&rft.epage=83&rft.pages=76-83&rft.issn=0045-6535&rft.eissn=1879-1298&rft_id=info:doi/10.1016/j.chemosphere.2018.09.035&rft_dat=%3Cproquest_cross%3E2105065402%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2105065402&rft_id=info:pmid/30212721&rft_els_id=S0045653518316886&rfr_iscdi=true