Definition of the binding specificity of the T7 bacteriophage primase by analysis of a protein binding microarray using a thermodynamic model
Protein binding microarrays (PBM), SELEX, RNAcompete and chromatin-immunoprecipitation have been intensively used to determine the specificity of nucleic acid binding proteins. While the specificity of proteins with pronounced sequence specificity is straightforward, the determination of the sequenc...
Published in: | Nucleic acids research 2024-05, Vol.52 (9), p.4818-4829 |
---|---|
Main author: | Lipps, Georg |
Format: | Article |
Language: | eng |
Subjects: | Computational Biology |
Online access: | Full text |
container_end_page | 4829 |
---|---|
container_issue | 9 |
container_start_page | 4818 |
container_title | Nucleic acids research |
container_volume | 52 |
creator | Lipps, Georg |
description | Protein binding microarrays (PBM), SELEX, RNAcompete and chromatin-immunoprecipitation have been intensively used to determine the specificity of nucleic acid binding proteins. While the specificity of proteins with pronounced sequence specificity is straightforward, the determination of the sequence specificity of proteins of modest sequence specificity is more difficult. In this work, an explorative data analysis workflow for nucleic acid binding data was developed that can be used by scientists that want to analyse their binding data. The workflow is based on a regressor realized in scikit-learn, the major machine learning module for the scripting language Python. The regressor is built on a thermodynamic model of nucleic acid binding and describes the sequence specificity with base- and position-specific energies. The regressor was used to determine the binding specificity of the T7 primase. For this, we reanalysed the binding data of the T7 primase obtained with a custom PBM. The binding specificity of the T7 primase agrees with the priming specificity (5'-GTC) and the template (5'-GGGTC) for the preferentially synthesized tetraribonucleotide primer (5'-pppACCC) but is more relaxed. The dominant contribution of two positions in the motif can be explained by the involvement of the initiating and elongating nucleotides for template binding. |
doi_str_mv | 10.1093/nar/gkae215 |
format | Article |
addtitle | Nucleic Acids Res |
backlink | https://www.ncbi.nlm.nih.gov/pubmed/38597656 |
collection | PubMed; CrossRef; MEDLINE - Academic; PubMed Central (Full Participant titles) |
date | 2024-05-22 |
eissn | 1362-4962 |
linktohtml | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11109968/ |
linktopdf | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11109968/pdf/ |
oa | free_for_read |
orcidid | https://orcid.org/0000-0002-5376-9716 |
pmid | 38597656 |
publisher | England: Oxford University Press |
rights | The Author(s) 2024. Published by Oxford University Press on behalf of Nucleic Acids Research. |
toplevel | peer_reviewed; online_resources |
tpages | 12 |
fulltext | fulltext |
identifier | ISSN: 0305-1048 |
ispartof | Nucleic acids research, 2024-05, Vol.52 (9), p.4818-4829 |
issn | 0305-1048 1362-4962 |
language | eng |
recordid | cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_11109968 |
source | DOAJ Directory of Open Access Journals; Oxford Journals Open Access Collection; PubMed Central; Free Full-Text Journals in Chemistry |
subjects | Computational Biology |
title | Definition of the binding specificity of the T7 bacteriophage primase by analysis of a protein binding microarray using a thermodynamic model |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-10T12%3A39%3A51IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Definition%20of%20the%20binding%20specificity%20of%20the%20T7%20bacteriophage%20primase%20by%20analysis%20of%20a%20protein%20binding%20microarray%20using%20a%20thermodynamic%20model&rft.jtitle=Nucleic%20acids%20research&rft.au=Lipps,%20Georg&rft.date=2024-05-22&rft.volume=52&rft.issue=9&rft.spage=4818&rft.epage=4829&rft.pages=4818-4829&rft.issn=0305-1048&rft.eissn=1362-4962&rft_id=info:doi/10.1093/nar/gkae215&rft_dat=%3Cproquest_pubme%3E3035537377%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3035537377&rft_id=info:pmid/38597656&rfr_iscdi=true |
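
The description field above mentions a scikit-learn regressor that describes sequence specificity with base- and position-specific energies. The following is only a minimal sketch of that general idea and not the workflow from the article. It assumes one binding site per fixed-length sequence and a low-occupancy regime in which the signal is proportional to exp(-E/RT), so that the logarithm of the signal is linear in a one-hot encoding of the sequence. The class name `EnergyMatrixRegressor`, the helper `one_hot`, the choice of ridge regression and the synthetic data are all hypothetical.

```python
from itertools import product

import numpy as np
from sklearn.base import BaseEstimator, RegressorMixin
from sklearn.linear_model import Ridge

BASES = "ACGT"


def one_hot(seqs):
    """Encode equal-length DNA strings as a (n_seqs, 4 * length) 0/1 matrix."""
    length = len(seqs[0])
    X = np.zeros((len(seqs), 4 * length))
    for i, seq in enumerate(seqs):
        for pos, base in enumerate(seq):
            X[i, 4 * pos + BASES.index(base)] = 1.0
    return X


class EnergyMatrixRegressor(RegressorMixin, BaseEstimator):
    """Hypothetical regressor: ln(signal) = const - sum of per-position energies.

    Energies are expressed in units of RT (assumption of this sketch).
    """

    def __init__(self, alpha=1e-3):
        self.alpha = alpha  # ridge penalty; also resolves one-hot collinearity

    def fit(self, seqs, signal):
        X = one_hot(seqs)
        log_signal = np.log(np.asarray(signal, dtype=float))
        self.model_ = Ridge(alpha=self.alpha).fit(X, log_signal)
        length = X.shape[1] // 4
        # Negated coefficients are energies up to a per-position offset.
        energies = -self.model_.coef_.reshape(length, 4)
        # Shift each position so its most favourable base has energy 0.
        self.energies_ = energies - energies.min(axis=1, keepdims=True)
        return self

    def predict(self, seqs):
        return np.exp(self.model_.predict(one_hot(seqs)))


# Synthetic demonstration data (not measurements): all 64 trinucleotides,
# with invented energies whose optimum is the motif GTC.
rng = np.random.default_rng(0)
true_energies = rng.uniform(0.5, 2.0, size=(3, 4))
for pos, base in enumerate("GTC"):
    true_energies[pos, BASES.index(base)] = 0.0

seqs = ["".join(p) for p in product(BASES, repeat=3)]
signal = np.exp(-one_hot(seqs) @ true_energies.ravel())

est = EnergyMatrixRegressor().fit(seqs, signal)
best_motif = "".join(BASES[i] for i in est.energies_.argmin(axis=1))
print(best_motif)
print(np.round(est.energies_, 2))
```

Working in log space keeps the fit a plain linear regression, which is why a standard scikit-learn estimator can be reused here. Real microarray data would add noise, background signal and several overlapping binding windows per probe, none of which this sketch models.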