DIVAA: analysis of amino acid diversity in multiple aligned protein sequences
Motivation: Multiple alignments of proteins are an effective way of identifying conserved amino acids that provide clues to functional relationships among proteins. Quantitation of the abundances of amino acids found at each position in a sequence motif can provide a basis for understanding the stru...
Saved in:
Published in: | Bioinformatics 2004-12, Vol.20 (18), p.3481-3489 |
---|---|
Main authors: | Rodi, Diane J.; Mandava, Suneeta; Makowski, Lee |
Format: | Article |
Language: | eng |
Subjects: | |
Online access: | Full text |
container_end_page | 3489 |
---|---|
container_issue | 18 |
container_start_page | 3481 |
container_title | Bioinformatics |
container_volume | 20 |
creator | Rodi, Diane J.; Mandava, Suneeta; Makowski, Lee |
description | Motivation: Multiple alignments of proteins are an effective way of identifying conserved amino acids that provide clues to functional relationships among proteins. Quantitation of the abundances of amino acids found at each position in a sequence motif can provide a basis for understanding the structural and functional constraints at each point. Distribution of information across a motif has been used previously, but the non-intuitive nature of the analysis has limited its impact. Results: Here, we introduce a quantitative measure of amino acid sequence diversity (DIVAA) that has a simple, intuitive meaning. Diversity, as a measure of sequence conservation or variation, is inextricably linked to the probability of selecting identical pairs from a distribution. We demonstrate its utility through the analysis of four populations: ATP-binding P-loops, hypervariable domains of kappa light chains, signal sequences, and the N- and C- termini of proteins. DIVAA provides a simple means to generate hypotheses concerning the contribution of individual residues to the functional and evolutionary relationships among proteins. Availability: Access to DIVAA software is available at RELIC (http://relic.bio.anl.gov) |
doi_str_mv | 10.1093/bioinformatics/bth432 |
format | Article |
fullrecord | Type: article; Publisher: Oxford: Oxford University Press; Published: 2004-12-12; Pages: 9; ISSN: 1367-4803; EISSN: 1460-2059, 1367-4811; DOI: 10.1093/bioinformatics/bth432; PMID: 15284106; CODEN: BOINFP; Rights: 2005 INIST-CNRS; Copyright Oxford University Press(England) Dec 12, 2004; Peer reviewed: yes; Open access: free to read; Record links: http://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=16404866 (Pascal Francis), https://www.ncbi.nlm.nih.gov/pubmed/15284106 (MEDLINE/PubMed) |
fulltext | fulltext |
identifier | ISSN: 1367-4803 |
ispartof | Bioinformatics, 2004-12, Vol.20 (18), p.3481-3489 |
issn | 1367-4803; 1460-2059; 1367-4811 |
language | eng |
recordid | cdi_proquest_miscellaneous_67171969 |
source | MEDLINE; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; Oxford Journals Open Access Collection; Alma/SFX Local Collection |
subjects | Algorithms; Biological and medical sciences; Chromosome Mapping - methods; Evolution, Molecular; Fundamental and applied biological sciences. Psychology; General aspects; Genetic Variation - genetics; Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects); Proteins - chemistry; Proteins - genetics; Sequence Alignment - methods; Sequence Analysis, Protein - methods; Software |
title | DIVAA: analysis of amino acid diversity in multiple aligned protein sequences |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-04T04%3A47%3A38IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=DIVAA:%20analysis%20of%20amino%20acid%20diversity%20in%20multiple%20aligned%20protein%20sequences&rft.jtitle=Bioinformatics&rft.au=Rodi,%20Diane%20J.&rft.date=2004-12-12&rft.volume=20&rft.issue=18&rft.spage=3481&rft.epage=3489&rft.pages=3481-3489&rft.issn=1367-4803&rft.eissn=1460-2059&rft.coden=BOINFP&rft_id=info:doi/10.1093/bioinformatics/bth432&rft_dat=%3Cproquest_cross%3E67171969%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=198659710&rft_id=info:pmid/15284106&rfr_iscdi=true |
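
The abstract in this record links diversity to the probability of selecting identical pairs from a distribution. The record does not give the formula itself, so the following Python sketch is only an illustration under stated assumptions: the pair probability at an alignment column is taken as the sum of squared residue frequencies (sampling with replacement), and diversity is taken as its reciprocal, normalised by the 20 standard amino acids so that values run from 0.05 (fully conserved) to 1.0 (all residues equally frequent). The function names and the normalisation are hypothetical and are not taken from the DIVAA software.

```python
from collections import Counter

# The 20 standard amino acids; gaps and other symbols are ignored (an assumption).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def identical_pair_probability(column):
    """Probability that two residues drawn with replacement from a column match."""
    residues = [r for r in column.upper() if r in AMINO_ACIDS]
    if not residues:
        raise ValueError("column contains no standard amino acids")
    counts = Counter(residues)
    total = len(residues)
    # Sum of squared frequencies over the residues observed in the column.
    return sum((n / total) ** 2 for n in counts.values())


def column_diversity(column):
    """Assumed diversity: reciprocal of the pair probability, scaled by 20."""
    return 1.0 / (len(AMINO_ACIDS) * identical_pair_probability(column))


def alignment_diversity(sequences):
    """Per-position diversity for equal-length aligned sequences."""
    if len({len(s) for s in sequences}) != 1:
        raise ValueError("aligned sequences must be non-empty and equal in length")
    # zip(*sequences) walks the alignment column by column.
    return [column_diversity("".join(col)) for col in zip(*sequences)]
```

Under these assumptions, a fully conserved column such as `"GGGG"` gives the minimum value of 0.05, while a column split evenly between two residues gives 0.1, so low values flag positions that are likely to be functionally constrained.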