Prediction of biopersistence of hydrocarbons using a single parameter
Aerobic biodegradation is an important attenuation process for petroleum hydrocarbons (PHCs) in the natural environment. It has also been frequently used in engineered systems to remediate PHC-contaminated sites. A model such as a quantitative structure property relationship (QSPR) that can predict...
Gespeichert in:
Veröffentlicht in: | Chemosphere (Oxford) 2018-12, Vol.213, p.76-83 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 83 |
---|---|
container_issue | |
container_start_page | 76 |
container_title | Chemosphere (Oxford) |
container_volume | 213 |
creator | Xiao, Feng Huisman, Quinn E. |
description | Aerobic biodegradation is an important attenuation process for petroleum hydrocarbons (PHCs) in the natural environment. It has also been frequently used in engineered systems to remediate PHC-contaminated sites. A model such as a quantitative structure property relationship (QSPR) that can predict the biodegradation rate of PHCs would be helpful prior to implementing any extensive environmental measurements and bioremediation strategies. Existing QSPRs either have a large number of predictor variables that may cause overfitting or are based on a small dataset of PHCs. The goal of this study is to develop a simple, portable QSPR that has only a few predicator variables but can accurately predict the biodegradation half-lives of a large group of PHCs. To this end, more than 500 molecular variables were screened, and candidate variables were refined by a feature selection method and fitted to biodegradation data of a group of structurally heterogeneous PHCs (n = 173). The model was established by means of hierarchical clustering and classification and regression tree algorithms, which was optimized by an internal validation procedure and validated by an external dataset. The optimal QSPR model, containing only one predictor variable (the number of bonds that do not contain hydrogen), was able to accurately predict biodegradation half-lives for a wide variety of PHCs. The internal validation test indicated an overall prediction accuracy of 93%, and predictions applied to an independent external set of 64 PHCs yielded 95% accuracy. The new model is transparent and easily portable from one user to another.
[Display omitted]
•A new QSPR was built for the biodegradation half-lives of hydrocarbons.•This new QSPR contains only one predictor variable.•Number of bonds that do not contain hydrogen is the most significant predictor.•This model was validated by an external test set of 64 hydrocarbons.•The model shows a prediction accuracy of 95% over the external validation set. |
doi_str_mv | 10.1016/j.chemosphere.2018.09.035 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2105065402</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0045653518316886</els_id><sourcerecordid>2105065402</sourcerecordid><originalsourceid>FETCH-LOGICAL-c377t-48ea51d29ff81ee532262426358786e80dfbbb220e6c96ca9ad83585716328e13</originalsourceid><addsrcrecordid>eNqNkM1OwzAQhC0EoqXwCijcuCSsndqxj6gqP1IlOMDZcpwNddXEwU6R-vYkakEcOY20O7Oj_Qi5oZBRoOJuk9k1Nj52awyYMaAyA5VBzk_IlMpCpZQpeUqmAHOeCp7zCbmIcQMwhLk6J5McGGUFo1OyfA1YOds73ya-TkrnOwzRxR5bi-Nkva-CtyaUvo3JLrr2IzHJKFtMOhNMgz2GS3JWm23Eq6POyPvD8m3xlK5eHp8X96vU5kXRp3OJhtOKqbqWFJHnjAk2ZyLnspACJVR1WZaMAQqrhDXKVHLY8YKKnEmk-YzcHu52wX_uMPa6cdHidmta9LuoGQUOgs-BDVZ1sNrgYwxY6y64xoS9pqBHinqj_1DUI0UNSg8Uh-z1sWZXNlj9Jn-wDYbFwYDDs18Og47WjcQqF9D2uvLuHzXf6eeJOA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2105065402</pqid></control><display><type>article</type><title>Prediction of biopersistence of hydrocarbons using a single parameter</title><source>MEDLINE</source><source>Access via ScienceDirect (Elsevier)</source><creator>Xiao, Feng ; Huisman, Quinn E.</creator><creatorcontrib>Xiao, Feng ; Huisman, Quinn E.</creatorcontrib><description>Aerobic biodegradation is an important attenuation process for petroleum hydrocarbons (PHCs) in the natural environment. It has also been frequently used in engineered systems to remediate PHC-contaminated sites. A model such as a quantitative structure property relationship (QSPR) that can predict the biodegradation rate of PHCs would be helpful prior to implementing any extensive environmental measurements and bioremediation strategies. Existing QSPRs either have a large number of predictor variables that may cause overfitting or are based on a small dataset of PHCs. The goal of this study is to develop a simple, portable QSPR that has only a few predicator variables but can accurately predict the biodegradation half-lives of a large group of PHCs. To this end, more than 500 molecular variables were screened, and candidate variables were refined by a feature selection method and fitted to biodegradation data of a group of structurally heterogeneous PHCs (n = 173). The model was established by means of hierarchical clustering and classification and regression tree algorithms, which was optimized by an internal validation procedure and validated by an external dataset. The optimal QSPR model, containing only one predictor variable (the number of bonds that do not contain hydrogen), was able to accurately predict biodegradation half-lives for a wide variety of PHCs. The internal validation test indicated an overall prediction accuracy of 93%, and predictions applied to an independent external set of 64 PHCs yielded 95% accuracy. The new model is transparent and easily portable from one user to another.
[Display omitted]
•A new QSPR was built for the biodegradation half-lives of hydrocarbons.•This new QSPR contains only one predictor variable.•Number of bonds that do not contain hydrogen is the most significant predictor.•This model was validated by an external test set of 64 hydrocarbons.•The model shows a prediction accuracy of 95% over the external validation set.</description><identifier>ISSN: 0045-6535</identifier><identifier>EISSN: 1879-1298</identifier><identifier>DOI: 10.1016/j.chemosphere.2018.09.035</identifier><identifier>PMID: 30212721</identifier><language>eng</language><publisher>England: Elsevier Ltd</publisher><subject>Biodegradation ; Hydrocarbons - chemistry ; Hydrocarbons - metabolism ; Molecular descriptors ; Petroleum - metabolism ; Petroleum hydrocarbons ; Quantitative structure property relationship</subject><ispartof>Chemosphere (Oxford), 2018-12, Vol.213, p.76-83</ispartof><rights>2018 Elsevier Ltd</rights><rights>Copyright © 2018 Elsevier Ltd. All rights reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c377t-48ea51d29ff81ee532262426358786e80dfbbb220e6c96ca9ad83585716328e13</citedby><cites>FETCH-LOGICAL-c377t-48ea51d29ff81ee532262426358786e80dfbbb220e6c96ca9ad83585716328e13</cites><orcidid>0000-0001-5686-6055</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.chemosphere.2018.09.035$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,780,784,3550,27924,27925,45995</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/30212721$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Xiao, Feng</creatorcontrib><creatorcontrib>Huisman, Quinn E.</creatorcontrib><title>Prediction of biopersistence of hydrocarbons using a single parameter</title><title>Chemosphere (Oxford)</title><addtitle>Chemosphere</addtitle><description>Aerobic biodegradation is an important attenuation process for petroleum hydrocarbons (PHCs) in the natural environment. It has also been frequently used in engineered systems to remediate PHC-contaminated sites. A model such as a quantitative structure property relationship (QSPR) that can predict the biodegradation rate of PHCs would be helpful prior to implementing any extensive environmental measurements and bioremediation strategies. Existing QSPRs either have a large number of predictor variables that may cause overfitting or are based on a small dataset of PHCs. The goal of this study is to develop a simple, portable QSPR that has only a few predicator variables but can accurately predict the biodegradation half-lives of a large group of PHCs. To this end, more than 500 molecular variables were screened, and candidate variables were refined by a feature selection method and fitted to biodegradation data of a group of structurally heterogeneous PHCs (n = 173). The model was established by means of hierarchical clustering and classification and regression tree algorithms, which was optimized by an internal validation procedure and validated by an external dataset. The optimal QSPR model, containing only one predictor variable (the number of bonds that do not contain hydrogen), was able to accurately predict biodegradation half-lives for a wide variety of PHCs. The internal validation test indicated an overall prediction accuracy of 93%, and predictions applied to an independent external set of 64 PHCs yielded 95% accuracy. The new model is transparent and easily portable from one user to another.
[Display omitted]
•A new QSPR was built for the biodegradation half-lives of hydrocarbons.•This new QSPR contains only one predictor variable.•Number of bonds that do not contain hydrogen is the most significant predictor.•This model was validated by an external test set of 64 hydrocarbons.•The model shows a prediction accuracy of 95% over the external validation set.</description><subject>Biodegradation</subject><subject>Hydrocarbons - chemistry</subject><subject>Hydrocarbons - metabolism</subject><subject>Molecular descriptors</subject><subject>Petroleum - metabolism</subject><subject>Petroleum hydrocarbons</subject><subject>Quantitative structure property relationship</subject><issn>0045-6535</issn><issn>1879-1298</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqNkM1OwzAQhC0EoqXwCijcuCSsndqxj6gqP1IlOMDZcpwNddXEwU6R-vYkakEcOY20O7Oj_Qi5oZBRoOJuk9k1Nj52awyYMaAyA5VBzk_IlMpCpZQpeUqmAHOeCp7zCbmIcQMwhLk6J5McGGUFo1OyfA1YOds73ya-TkrnOwzRxR5bi-Nkva-CtyaUvo3JLrr2IzHJKFtMOhNMgz2GS3JWm23Eq6POyPvD8m3xlK5eHp8X96vU5kXRp3OJhtOKqbqWFJHnjAk2ZyLnspACJVR1WZaMAQqrhDXKVHLY8YKKnEmk-YzcHu52wX_uMPa6cdHidmta9LuoGQUOgs-BDVZ1sNrgYwxY6y64xoS9pqBHinqj_1DUI0UNSg8Uh-z1sWZXNlj9Jn-wDYbFwYDDs18Og47WjcQqF9D2uvLuHzXf6eeJOA</recordid><startdate>201812</startdate><enddate>201812</enddate><creator>Xiao, Feng</creator><creator>Huisman, Quinn E.</creator><general>Elsevier Ltd</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0001-5686-6055</orcidid></search><sort><creationdate>201812</creationdate><title>Prediction of biopersistence of hydrocarbons using a single parameter</title><author>Xiao, Feng ; Huisman, Quinn E.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c377t-48ea51d29ff81ee532262426358786e80dfbbb220e6c96ca9ad83585716328e13</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Biodegradation</topic><topic>Hydrocarbons - chemistry</topic><topic>Hydrocarbons - metabolism</topic><topic>Molecular descriptors</topic><topic>Petroleum - metabolism</topic><topic>Petroleum hydrocarbons</topic><topic>Quantitative structure property relationship</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Xiao, Feng</creatorcontrib><creatorcontrib>Huisman, Quinn E.</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Chemosphere (Oxford)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Xiao, Feng</au><au>Huisman, Quinn E.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Prediction of biopersistence of hydrocarbons using a single parameter</atitle><jtitle>Chemosphere (Oxford)</jtitle><addtitle>Chemosphere</addtitle><date>2018-12</date><risdate>2018</risdate><volume>213</volume><spage>76</spage><epage>83</epage><pages>76-83</pages><issn>0045-6535</issn><eissn>1879-1298</eissn><abstract>Aerobic biodegradation is an important attenuation process for petroleum hydrocarbons (PHCs) in the natural environment. It has also been frequently used in engineered systems to remediate PHC-contaminated sites. A model such as a quantitative structure property relationship (QSPR) that can predict the biodegradation rate of PHCs would be helpful prior to implementing any extensive environmental measurements and bioremediation strategies. Existing QSPRs either have a large number of predictor variables that may cause overfitting or are based on a small dataset of PHCs. The goal of this study is to develop a simple, portable QSPR that has only a few predicator variables but can accurately predict the biodegradation half-lives of a large group of PHCs. To this end, more than 500 molecular variables were screened, and candidate variables were refined by a feature selection method and fitted to biodegradation data of a group of structurally heterogeneous PHCs (n = 173). The model was established by means of hierarchical clustering and classification and regression tree algorithms, which was optimized by an internal validation procedure and validated by an external dataset. The optimal QSPR model, containing only one predictor variable (the number of bonds that do not contain hydrogen), was able to accurately predict biodegradation half-lives for a wide variety of PHCs. The internal validation test indicated an overall prediction accuracy of 93%, and predictions applied to an independent external set of 64 PHCs yielded 95% accuracy. The new model is transparent and easily portable from one user to another.
[Display omitted]
•A new QSPR was built for the biodegradation half-lives of hydrocarbons.•This new QSPR contains only one predictor variable.•Number of bonds that do not contain hydrogen is the most significant predictor.•This model was validated by an external test set of 64 hydrocarbons.•The model shows a prediction accuracy of 95% over the external validation set.</abstract><cop>England</cop><pub>Elsevier Ltd</pub><pmid>30212721</pmid><doi>10.1016/j.chemosphere.2018.09.035</doi><tpages>8</tpages><orcidid>https://orcid.org/0000-0001-5686-6055</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0045-6535 |
ispartof | Chemosphere (Oxford), 2018-12, Vol.213, p.76-83 |
issn | 0045-6535 1879-1298 |
language | eng |
recordid | cdi_proquest_miscellaneous_2105065402 |
source | MEDLINE; Access via ScienceDirect (Elsevier) |
subjects | Biodegradation Hydrocarbons - chemistry Hydrocarbons - metabolism Molecular descriptors Petroleum - metabolism Petroleum hydrocarbons Quantitative structure property relationship |
title | Prediction of biopersistence of hydrocarbons using a single parameter |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-24T01%3A17%3A55IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Prediction%20of%20biopersistence%20of%20hydrocarbons%20using%20a%20single%20parameter&rft.jtitle=Chemosphere%20(Oxford)&rft.au=Xiao,%20Feng&rft.date=2018-12&rft.volume=213&rft.spage=76&rft.epage=83&rft.pages=76-83&rft.issn=0045-6535&rft.eissn=1879-1298&rft_id=info:doi/10.1016/j.chemosphere.2018.09.035&rft_dat=%3Cproquest_cross%3E2105065402%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2105065402&rft_id=info:pmid/30212721&rft_els_id=S0045653518316886&rfr_iscdi=true |