DIVAA: analysis of amino acid diversity in multiple aligned protein sequences

Motivation: Multiple alignments of proteins are an effective way of identifying conserved amino acids that provide clues to functional relationships among proteins. Quantitation of the abundances of amino acids found at each position in a sequence motif can provide a basis for understanding the stru...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Bioinformatics 2004-12, Vol.20 (18), p.3481-3489
Hauptverfasser: Rodi, Diane J., Mandava, Suneeta, Makowski, Lee
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 3489
container_issue 18
container_start_page 3481
container_title Bioinformatics
container_volume 20
creator Rodi, Diane J.
Mandava, Suneeta
Makowski, Lee
description Motivation: Multiple alignments of proteins are an effective way of identifying conserved amino acids that provide clues to functional relationships among proteins. Quantitation of the abundances of amino acids found at each position in a sequence motif can provide a basis for understanding the structural and functional constraints at each point. Distribution of information across a motif has been used previously, but the non-intuitive nature of the analysis has limited its impact. Results: Here, we introduce a quantitative measure of amino acid sequence diversity (DIVAA) that has a simple, intuitive meaning. Diversity, as a measure of sequence conservation or variation, is inextricably linked to the probability of selecting identical pairs from a distribution. We demonstrate its utility through the analysis of four populations: ATP-binding P-loops, hypervariable domains of kappa light chains, signal sequences, and the N- and C- termini of proteins. DIVAA provides a simple means to generate hypotheses concerning the contribution of individual residues to the functional and evolutionary relationships among proteins. Availability: Access to DIVAA software is available at RELIC (http://relic.bio.anl.gov)
doi_str_mv 10.1093/bioinformatics/bth432
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_67171969</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>67171969</sourcerecordid><originalsourceid>FETCH-LOGICAL-c546t-fb74485e3c16a07913457af761e4e2d62fdc89713f45daa9e7c0b97a065811123</originalsourceid><addsrcrecordid>eNqF0VtrFDEUB_BQlLbWfgQlCPo2NpncfVvWyxa3FEFL6Us4k0k0dS5rMiPutzfLLhZ98SmB_M7JSf4IPaPkNSWGXTRxjEMYUw9TdPmimb5xVh-hU8olqWoizKOyZ1JVXBN2gp7kfE-IoJzzY3RCRa05JfIUXb29vFks3mAYoNvmmPEYMPRxGDG42OI2_vQpx2mL44D7uZvipvMYuvh18C3epHHy5SD7H7MfnM9P0eMAXfbnh_UMfXn_7vNyVa2vP1wuF-vKCS6nKjSKcy08c1QCUYYyLhQEJannvm5lHVqnjaIscNECGK8caYwCIoWmlNbsDL3a9y0TlKvzZPuYne86GPw4ZysVVdRI81-4Q0ZrVuCLf-D9OKfyKTujpSjTkILEHrk05px8sJsUe0hbS4ndpWL_TsXuUyl1zw_N56b37UPVIYYCXh4AZAddSDC4mB-c5IRruXPV3sU8-V9_ziF9L09mStjV7Z39-Ikt12q9soT9BvAMqK0</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>198659710</pqid></control><display><type>article</type><title>DIVAA: analysis of amino acid diversity in multiple aligned protein sequences</title><source>MEDLINE</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>Oxford Journals Open Access Collection</source><source>Alma/SFX Local Collection</source><creator>Rodi, Diane J. ; Mandava, Suneeta ; Makowski, Lee</creator><creatorcontrib>Rodi, Diane J. ; Mandava, Suneeta ; Makowski, Lee</creatorcontrib><description>Motivation: Multiple alignments of proteins are an effective way of identifying conserved amino acids that provide clues to functional relationships among proteins. Quantitation of the abundances of amino acids found at each position in a sequence motif can provide a basis for understanding the structural and functional constraints at each point. Distribution of information across a motif has been used previously, but the non-intuitive nature of the analysis has limited its impact. Results: Here, we introduce a quantitative measure of amino acid sequence diversity (DIVAA) that has a simple, intuitive meaning. Diversity, as a measure of sequence conservation or variation, is inextricably linked to the probability of selecting identical pairs from a distribution. We demonstrate its utility through the analysis of four populations: ATP-binding P-loops, hypervariable domains of kappa light chains, signal sequences, and the N- and C- termini of proteins. DIVAA provides a simple means to generate hypotheses concerning the contribution of individual residues to the functional and evolutionary relationships among proteins. Availability: Access to DIVAA software is available at RELIC (http://relic.bio.anl.gov)</description><identifier>ISSN: 1367-4803</identifier><identifier>EISSN: 1460-2059</identifier><identifier>EISSN: 1367-4811</identifier><identifier>DOI: 10.1093/bioinformatics/bth432</identifier><identifier>PMID: 15284106</identifier><identifier>CODEN: BOINFP</identifier><language>eng</language><publisher>Oxford: Oxford University Press</publisher><subject>Algorithms ; Biological and medical sciences ; Chromosome Mapping - methods ; Evolution, Molecular ; Fundamental and applied biological sciences. Psychology ; General aspects ; Genetic Variation - genetics ; Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects) ; Proteins - chemistry ; Proteins - genetics ; Sequence Alignment - methods ; Sequence Analysis, Protein - methods ; Software</subject><ispartof>Bioinformatics, 2004-12, Vol.20 (18), p.3481-3489</ispartof><rights>2005 INIST-CNRS</rights><rights>Copyright Oxford University Press(England) Dec 12, 2004</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c546t-fb74485e3c16a07913457af761e4e2d62fdc89713f45daa9e7c0b97a065811123</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=16404866$$DView record in Pascal Francis$$Hfree_for_read</backlink><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/15284106$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Rodi, Diane J.</creatorcontrib><creatorcontrib>Mandava, Suneeta</creatorcontrib><creatorcontrib>Makowski, Lee</creatorcontrib><title>DIVAA: analysis of amino acid diversity in multiple aligned protein sequences</title><title>Bioinformatics</title><addtitle>Bioinformatics</addtitle><description>Motivation: Multiple alignments of proteins are an effective way of identifying conserved amino acids that provide clues to functional relationships among proteins. Quantitation of the abundances of amino acids found at each position in a sequence motif can provide a basis for understanding the structural and functional constraints at each point. Distribution of information across a motif has been used previously, but the non-intuitive nature of the analysis has limited its impact. Results: Here, we introduce a quantitative measure of amino acid sequence diversity (DIVAA) that has a simple, intuitive meaning. Diversity, as a measure of sequence conservation or variation, is inextricably linked to the probability of selecting identical pairs from a distribution. We demonstrate its utility through the analysis of four populations: ATP-binding P-loops, hypervariable domains of kappa light chains, signal sequences, and the N- and C- termini of proteins. DIVAA provides a simple means to generate hypotheses concerning the contribution of individual residues to the functional and evolutionary relationships among proteins. Availability: Access to DIVAA software is available at RELIC (http://relic.bio.anl.gov)</description><subject>Algorithms</subject><subject>Biological and medical sciences</subject><subject>Chromosome Mapping - methods</subject><subject>Evolution, Molecular</subject><subject>Fundamental and applied biological sciences. Psychology</subject><subject>General aspects</subject><subject>Genetic Variation - genetics</subject><subject>Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects)</subject><subject>Proteins - chemistry</subject><subject>Proteins - genetics</subject><subject>Sequence Alignment - methods</subject><subject>Sequence Analysis, Protein - methods</subject><subject>Software</subject><issn>1367-4803</issn><issn>1460-2059</issn><issn>1367-4811</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2004</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqF0VtrFDEUB_BQlLbWfgQlCPo2NpncfVvWyxa3FEFL6Us4k0k0dS5rMiPutzfLLhZ98SmB_M7JSf4IPaPkNSWGXTRxjEMYUw9TdPmimb5xVh-hU8olqWoizKOyZ1JVXBN2gp7kfE-IoJzzY3RCRa05JfIUXb29vFks3mAYoNvmmPEYMPRxGDG42OI2_vQpx2mL44D7uZvipvMYuvh18C3epHHy5SD7H7MfnM9P0eMAXfbnh_UMfXn_7vNyVa2vP1wuF-vKCS6nKjSKcy08c1QCUYYyLhQEJannvm5lHVqnjaIscNECGK8caYwCIoWmlNbsDL3a9y0TlKvzZPuYne86GPw4ZysVVdRI81-4Q0ZrVuCLf-D9OKfyKTujpSjTkILEHrk05px8sJsUe0hbS4ndpWL_TsXuUyl1zw_N56b37UPVIYYCXh4AZAddSDC4mB-c5IRruXPV3sU8-V9_ziF9L09mStjV7Z39-Ikt12q9soT9BvAMqK0</recordid><startdate>20041212</startdate><enddate>20041212</enddate><creator>Rodi, Diane J.</creator><creator>Mandava, Suneeta</creator><creator>Makowski, Lee</creator><general>Oxford University Press</general><general>Oxford Publishing Limited (England)</general><scope>BSCLL</scope><scope>IQODW</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7QF</scope><scope>7QO</scope><scope>7QQ</scope><scope>7SC</scope><scope>7SE</scope><scope>7SP</scope><scope>7SR</scope><scope>7TA</scope><scope>7TB</scope><scope>7TM</scope><scope>7TO</scope><scope>7U5</scope><scope>8BQ</scope><scope>8FD</scope><scope>F28</scope><scope>FR3</scope><scope>H8D</scope><scope>H8G</scope><scope>H94</scope><scope>JG9</scope><scope>JQ2</scope><scope>K9.</scope><scope>KR7</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>P64</scope><scope>7X8</scope></search><sort><creationdate>20041212</creationdate><title>DIVAA: analysis of amino acid diversity in multiple aligned protein sequences</title><author>Rodi, Diane J. ; Mandava, Suneeta ; Makowski, Lee</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c546t-fb74485e3c16a07913457af761e4e2d62fdc89713f45daa9e7c0b97a065811123</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2004</creationdate><topic>Algorithms</topic><topic>Biological and medical sciences</topic><topic>Chromosome Mapping - methods</topic><topic>Evolution, Molecular</topic><topic>Fundamental and applied biological sciences. Psychology</topic><topic>General aspects</topic><topic>Genetic Variation - genetics</topic><topic>Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects)</topic><topic>Proteins - chemistry</topic><topic>Proteins - genetics</topic><topic>Sequence Alignment - methods</topic><topic>Sequence Analysis, Protein - methods</topic><topic>Software</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Rodi, Diane J.</creatorcontrib><creatorcontrib>Mandava, Suneeta</creatorcontrib><creatorcontrib>Makowski, Lee</creatorcontrib><collection>Istex</collection><collection>Pascal-Francis</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Aluminium Industry Abstracts</collection><collection>Biotechnology Research Abstracts</collection><collection>Ceramic Abstracts</collection><collection>Computer and Information Systems Abstracts</collection><collection>Corrosion Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>Materials Business File</collection><collection>Mechanical &amp; Transportation Engineering Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Oncogenes and Growth Factors Abstracts</collection><collection>Solid State and Superconductivity Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>ANTE: Abstracts in New Technology &amp; Engineering</collection><collection>Engineering Research Database</collection><collection>Aerospace Database</collection><collection>Copper Technical Reference Library</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Civil Engineering Abstracts</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>MEDLINE - Academic</collection><jtitle>Bioinformatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Rodi, Diane J.</au><au>Mandava, Suneeta</au><au>Makowski, Lee</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>DIVAA: analysis of amino acid diversity in multiple aligned protein sequences</atitle><jtitle>Bioinformatics</jtitle><addtitle>Bioinformatics</addtitle><date>2004-12-12</date><risdate>2004</risdate><volume>20</volume><issue>18</issue><spage>3481</spage><epage>3489</epage><pages>3481-3489</pages><issn>1367-4803</issn><eissn>1460-2059</eissn><eissn>1367-4811</eissn><coden>BOINFP</coden><abstract>Motivation: Multiple alignments of proteins are an effective way of identifying conserved amino acids that provide clues to functional relationships among proteins. Quantitation of the abundances of amino acids found at each position in a sequence motif can provide a basis for understanding the structural and functional constraints at each point. Distribution of information across a motif has been used previously, but the non-intuitive nature of the analysis has limited its impact. Results: Here, we introduce a quantitative measure of amino acid sequence diversity (DIVAA) that has a simple, intuitive meaning. Diversity, as a measure of sequence conservation or variation, is inextricably linked to the probability of selecting identical pairs from a distribution. We demonstrate its utility through the analysis of four populations: ATP-binding P-loops, hypervariable domains of kappa light chains, signal sequences, and the N- and C- termini of proteins. DIVAA provides a simple means to generate hypotheses concerning the contribution of individual residues to the functional and evolutionary relationships among proteins. Availability: Access to DIVAA software is available at RELIC (http://relic.bio.anl.gov)</abstract><cop>Oxford</cop><pub>Oxford University Press</pub><pmid>15284106</pmid><doi>10.1093/bioinformatics/bth432</doi><tpages>9</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1367-4803
ispartof Bioinformatics, 2004-12, Vol.20 (18), p.3481-3489
issn 1367-4803
1460-2059
1367-4811
language eng
recordid cdi_proquest_miscellaneous_67171969
source MEDLINE; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; Oxford Journals Open Access Collection; Alma/SFX Local Collection
subjects Algorithms
Biological and medical sciences
Chromosome Mapping - methods
Evolution, Molecular
Fundamental and applied biological sciences. Psychology
General aspects
Genetic Variation - genetics
Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects)
Proteins - chemistry
Proteins - genetics
Sequence Alignment - methods
Sequence Analysis, Protein - methods
Software
title DIVAA: analysis of amino acid diversity in multiple aligned protein sequences
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-04T04%3A47%3A38IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=DIVAA:%20analysis%20of%20amino%20acid%20diversity%20in%20multiple%20aligned%20protein%20sequences&rft.jtitle=Bioinformatics&rft.au=Rodi,%20Diane%20J.&rft.date=2004-12-12&rft.volume=20&rft.issue=18&rft.spage=3481&rft.epage=3489&rft.pages=3481-3489&rft.issn=1367-4803&rft.eissn=1460-2059&rft.coden=BOINFP&rft_id=info:doi/10.1093/bioinformatics/bth432&rft_dat=%3Cproquest_cross%3E67171969%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=198659710&rft_id=info:pmid/15284106&rfr_iscdi=true