Mathematical Characterization of Protein Sequences Using Patterns as Chemical Group Combinations of Amino Acids

Comparison of amino acid sequence similarity is the fundamental concept behind the protein phylogenetic tree formation. By virtue of this method, we can explain the evolutionary relationships, but further explanations are not possible unless sequences are studied through the chemical nature of indiv...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:PloS one 2016-12, Vol.11 (12), p.e0167651-e0167651
Hauptverfasser: Das, Jayanta Kumar, Das, Provas, Ray, Korak Kumar, Choudhury, Pabitra Pal, Jana, Siddhartha Sankar
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page e0167651
container_issue 12
container_start_page e0167651
container_title PloS one
container_volume 11
creator Das, Jayanta Kumar
Das, Provas
Ray, Korak Kumar
Choudhury, Pabitra Pal
Jana, Siddhartha Sankar
description Comparison of amino acid sequence similarity is the fundamental concept behind the protein phylogenetic tree formation. By virtue of this method, we can explain the evolutionary relationships, but further explanations are not possible unless sequences are studied through the chemical nature of individual amino acids. Here we develop a new methodology to characterize the protein sequences on the basis of the chemical nature of the amino acids. We design various algorithms for studying the variation of chemical group transitions and various chemical group combinations as patterns in the protein sequences. The amino acid sequence of conventional myosin II head domain of 14 family members are taken to illustrate this new approach. We find two blocks of maximum length 6 aa as 'FPKATD' and 'Y/FTNEKL' without repeating the same chemical nature and one block of maximum length 20 aa with the repetition of chemical nature which are common among all 14 members. We also check commonality with another motor protein sub-family kinesin, KIF1A. Based on our analysis we find a common block of length 8 aa both in myosin II and KIF1A. This motif is located in the neck linker region which could be responsible for the generation of mechanical force, enabling us to find the unique blocks which remain chemically conserved across the family. We also validate our methodology with different protein families such as MYOI, Myosin light chain kinase (MLCK) and Rho-associated protein kinase (ROCK), Na+/K+-ATPase and Ca2+-ATPase. Altogether, our studies provide a new methodology for investigating the conserved amino acids' pattern in different proteins.
doi_str_mv 10.1371/journal.pone.0167651
format Article
fullrecord <record><control><sourceid>gale_plos_</sourceid><recordid>TN_cdi_plos_journals_1847440338</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A473276067</galeid><doaj_id>oai_doaj_org_article_470b50cb432946bf9cbc5029ab9ed2f1</doaj_id><sourcerecordid>A473276067</sourcerecordid><originalsourceid>FETCH-LOGICAL-c725t-3ccdec199f9905d290b609ef3de381aa241e3625a8cf13d10c91e6495bf122863</originalsourceid><addsrcrecordid>eNqNk19v0zAUxSMEYqPwDRBEQkLw0OL_jl-QqgpGpaFNjPFqOY7TekrsYicI-PQ4bTY1aA9THhLd_M7Jzbm-WfYSggXEHH648X1wqlnsvDMLABlnFD7KTqHAaM4QwI-Pnk-yZzHeAEBxwdjT7ARxgQEr-Gnmv6pua1rVWa2afLVVQenOBPs3VbzLfZ1fBt8Z6_Ir87M3TpuYX0frNvml6hLoYq5i0pl2b3AWfL_LV74trds7xMFi2Vrn86W2VXyePalVE82L8T7Lrj9_-r76Mj-_OFuvludzzRHt5ljrymgoRC0EoBUSoGRAmBpXBhdQKUSgwQxRVega4goCLaBhRNCyhggVDM-y1wffXeOjHLOKEhaEEwIwLhKxPhCVVzdyF2yrwh_plZX7gg8bqUKKpTGScFBSoEuCkSCsrIUuNQVIqFKYCqUGZtnH8Wt92ZpKG9cF1UxMp2-c3cqN_yUpJBTyweDdaBB8ijl2srVRm6ZRzvh-6JtSUBCCxQNQwgsBmBhc3_yH3h_ESG1U-lfrap9a1IOpXBKOEWeA8UQt7qHSVQ2jT2ewtqk-EbyfCBLTmd_dRvUxyvXVt4ezFz-m7NsjdmtU022jb_r9aZuC5ADq4GMMpr6bBwRyWKHbNOSwQnJcoSR7dTzLO9HtzuB_WqcWMw</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1847440338</pqid></control><display><type>article</type><title>Mathematical Characterization of Protein Sequences Using Patterns as Chemical Group Combinations of Amino Acids</title><source>MEDLINE</source><source>Public Library of Science</source><source>DOAJ Directory of Open Access Journals</source><source>PubMed Central</source><source>Free Full-Text Journals in Chemistry</source><source>EZB Electronic Journals Library</source><creator>Das, Jayanta Kumar ; Das, Provas ; Ray, Korak Kumar ; Choudhury, Pabitra Pal ; Jana, Siddhartha Sankar</creator><contributor>Permyakov, Eugene A.</contributor><creatorcontrib>Das, Jayanta Kumar ; Das, Provas ; Ray, Korak Kumar ; Choudhury, Pabitra Pal ; Jana, Siddhartha Sankar ; Permyakov, Eugene A.</creatorcontrib><description>Comparison of amino acid sequence similarity is the fundamental concept behind the protein phylogenetic tree formation. By virtue of this method, we can explain the evolutionary relationships, but further explanations are not possible unless sequences are studied through the chemical nature of individual amino acids. Here we develop a new methodology to characterize the protein sequences on the basis of the chemical nature of the amino acids. We design various algorithms for studying the variation of chemical group transitions and various chemical group combinations as patterns in the protein sequences. The amino acid sequence of conventional myosin II head domain of 14 family members are taken to illustrate this new approach. We find two blocks of maximum length 6 aa as 'FPKATD' and 'Y/FTNEKL' without repeating the same chemical nature and one block of maximum length 20 aa with the repetition of chemical nature which are common among all 14 members. We also check commonality with another motor protein sub-family kinesin, KIF1A. Based on our analysis we find a common block of length 8 aa both in myosin II and KIF1A. This motif is located in the neck linker region which could be responsible for the generation of mechanical force, enabling us to find the unique blocks which remain chemically conserved across the family. We also validate our methodology with different protein families such as MYOI, Myosin light chain kinase (MLCK) and Rho-associated protein kinase (ROCK), Na+/K+-ATPase and Ca2+-ATPase. Altogether, our studies provide a new methodology for investigating the conserved amino acids' pattern in different proteins.</description><identifier>ISSN: 1932-6203</identifier><identifier>EISSN: 1932-6203</identifier><identifier>DOI: 10.1371/journal.pone.0167651</identifier><identifier>PMID: 27930687</identifier><language>eng</language><publisher>United States: Public Library of Science</publisher><subject>Algorithms ; Amino acid sequence ; Amino acids ; Amino Acids - chemistry ; Analysis ; Bioinformatics ; Biology and Life Sciences ; Ca2+-transporting ATPase ; Calcium ; Chemistry ; Combinations (mathematics) ; Commonality ; Enzymes ; Kinesin ; Mathematical models ; Methodology ; Myosin ; Myosin-light-chain kinase ; Na+/K+-exchanging ATPase ; Neck ; Phylogenetics ; Phylogeny ; Protein families ; Protein kinase ; Proteins ; Proteins - chemistry ; Research and Analysis Methods ; Sequences ; Studies ; Yttrium</subject><ispartof>PloS one, 2016-12, Vol.11 (12), p.e0167651-e0167651</ispartof><rights>COPYRIGHT 2016 Public Library of Science</rights><rights>2016 Das et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>2016 Das et al 2016 Das et al</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c725t-3ccdec199f9905d290b609ef3de381aa241e3625a8cf13d10c91e6495bf122863</citedby><cites>FETCH-LOGICAL-c725t-3ccdec199f9905d290b609ef3de381aa241e3625a8cf13d10c91e6495bf122863</cites><orcidid>0000-0001-5390-698X ; 0000-0002-4339-7445 ; 0000-0002-7630-7194</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC5145171/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC5145171/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,724,777,781,861,882,2096,2915,23847,27905,27906,53772,53774,79349,79350</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/27930687$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><contributor>Permyakov, Eugene A.</contributor><creatorcontrib>Das, Jayanta Kumar</creatorcontrib><creatorcontrib>Das, Provas</creatorcontrib><creatorcontrib>Ray, Korak Kumar</creatorcontrib><creatorcontrib>Choudhury, Pabitra Pal</creatorcontrib><creatorcontrib>Jana, Siddhartha Sankar</creatorcontrib><title>Mathematical Characterization of Protein Sequences Using Patterns as Chemical Group Combinations of Amino Acids</title><title>PloS one</title><addtitle>PLoS One</addtitle><description>Comparison of amino acid sequence similarity is the fundamental concept behind the protein phylogenetic tree formation. By virtue of this method, we can explain the evolutionary relationships, but further explanations are not possible unless sequences are studied through the chemical nature of individual amino acids. Here we develop a new methodology to characterize the protein sequences on the basis of the chemical nature of the amino acids. We design various algorithms for studying the variation of chemical group transitions and various chemical group combinations as patterns in the protein sequences. The amino acid sequence of conventional myosin II head domain of 14 family members are taken to illustrate this new approach. We find two blocks of maximum length 6 aa as 'FPKATD' and 'Y/FTNEKL' without repeating the same chemical nature and one block of maximum length 20 aa with the repetition of chemical nature which are common among all 14 members. We also check commonality with another motor protein sub-family kinesin, KIF1A. Based on our analysis we find a common block of length 8 aa both in myosin II and KIF1A. This motif is located in the neck linker region which could be responsible for the generation of mechanical force, enabling us to find the unique blocks which remain chemically conserved across the family. We also validate our methodology with different protein families such as MYOI, Myosin light chain kinase (MLCK) and Rho-associated protein kinase (ROCK), Na+/K+-ATPase and Ca2+-ATPase. Altogether, our studies provide a new methodology for investigating the conserved amino acids' pattern in different proteins.</description><subject>Algorithms</subject><subject>Amino acid sequence</subject><subject>Amino acids</subject><subject>Amino Acids - chemistry</subject><subject>Analysis</subject><subject>Bioinformatics</subject><subject>Biology and Life Sciences</subject><subject>Ca2+-transporting ATPase</subject><subject>Calcium</subject><subject>Chemistry</subject><subject>Combinations (mathematics)</subject><subject>Commonality</subject><subject>Enzymes</subject><subject>Kinesin</subject><subject>Mathematical models</subject><subject>Methodology</subject><subject>Myosin</subject><subject>Myosin-light-chain kinase</subject><subject>Na+/K+-exchanging ATPase</subject><subject>Neck</subject><subject>Phylogenetics</subject><subject>Phylogeny</subject><subject>Protein families</subject><subject>Protein kinase</subject><subject>Proteins</subject><subject>Proteins - chemistry</subject><subject>Research and Analysis Methods</subject><subject>Sequences</subject><subject>Studies</subject><subject>Yttrium</subject><issn>1932-6203</issn><issn>1932-6203</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><sourceid>DOA</sourceid><recordid>eNqNk19v0zAUxSMEYqPwDRBEQkLw0OL_jl-QqgpGpaFNjPFqOY7TekrsYicI-PQ4bTY1aA9THhLd_M7Jzbm-WfYSggXEHH648X1wqlnsvDMLABlnFD7KTqHAaM4QwI-Pnk-yZzHeAEBxwdjT7ARxgQEr-Gnmv6pua1rVWa2afLVVQenOBPs3VbzLfZ1fBt8Z6_Ir87M3TpuYX0frNvml6hLoYq5i0pl2b3AWfL_LV74trds7xMFi2Vrn86W2VXyePalVE82L8T7Lrj9_-r76Mj-_OFuvludzzRHt5ljrymgoRC0EoBUSoGRAmBpXBhdQKUSgwQxRVega4goCLaBhRNCyhggVDM-y1wffXeOjHLOKEhaEEwIwLhKxPhCVVzdyF2yrwh_plZX7gg8bqUKKpTGScFBSoEuCkSCsrIUuNQVIqFKYCqUGZtnH8Wt92ZpKG9cF1UxMp2-c3cqN_yUpJBTyweDdaBB8ijl2srVRm6ZRzvh-6JtSUBCCxQNQwgsBmBhc3_yH3h_ESG1U-lfrap9a1IOpXBKOEWeA8UQt7qHSVQ2jT2ewtqk-EbyfCBLTmd_dRvUxyvXVt4ezFz-m7NsjdmtU022jb_r9aZuC5ADq4GMMpr6bBwRyWKHbNOSwQnJcoSR7dTzLO9HtzuB_WqcWMw</recordid><startdate>20161208</startdate><enddate>20161208</enddate><creator>Das, Jayanta Kumar</creator><creator>Das, Provas</creator><creator>Ray, Korak Kumar</creator><creator>Choudhury, Pabitra Pal</creator><creator>Jana, Siddhartha Sankar</creator><general>Public Library of Science</general><general>Public Library of Science (PLoS)</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>IOV</scope><scope>ISR</scope><scope>3V.</scope><scope>7QG</scope><scope>7QL</scope><scope>7QO</scope><scope>7RV</scope><scope>7SN</scope><scope>7SS</scope><scope>7T5</scope><scope>7TG</scope><scope>7TM</scope><scope>7U9</scope><scope>7X2</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>8AO</scope><scope>8C1</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AEUYN</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>ATCPS</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>C1K</scope><scope>CCPQU</scope><scope>D1I</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>H94</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>KB.</scope><scope>KB0</scope><scope>KL.</scope><scope>L6V</scope><scope>LK8</scope><scope>M0K</scope><scope>M0S</scope><scope>M1P</scope><scope>M7N</scope><scope>M7P</scope><scope>M7S</scope><scope>NAPCQ</scope><scope>P5Z</scope><scope>P62</scope><scope>P64</scope><scope>PATMY</scope><scope>PDBOC</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PTHSS</scope><scope>PYCSY</scope><scope>RC3</scope><scope>7X8</scope><scope>5PM</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0001-5390-698X</orcidid><orcidid>https://orcid.org/0000-0002-4339-7445</orcidid><orcidid>https://orcid.org/0000-0002-7630-7194</orcidid></search><sort><creationdate>20161208</creationdate><title>Mathematical Characterization of Protein Sequences Using Patterns as Chemical Group Combinations of Amino Acids</title><author>Das, Jayanta Kumar ; Das, Provas ; Ray, Korak Kumar ; Choudhury, Pabitra Pal ; Jana, Siddhartha Sankar</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c725t-3ccdec199f9905d290b609ef3de381aa241e3625a8cf13d10c91e6495bf122863</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Algorithms</topic><topic>Amino acid sequence</topic><topic>Amino acids</topic><topic>Amino Acids - chemistry</topic><topic>Analysis</topic><topic>Bioinformatics</topic><topic>Biology and Life Sciences</topic><topic>Ca2+-transporting ATPase</topic><topic>Calcium</topic><topic>Chemistry</topic><topic>Combinations (mathematics)</topic><topic>Commonality</topic><topic>Enzymes</topic><topic>Kinesin</topic><topic>Mathematical models</topic><topic>Methodology</topic><topic>Myosin</topic><topic>Myosin-light-chain kinase</topic><topic>Na+/K+-exchanging ATPase</topic><topic>Neck</topic><topic>Phylogenetics</topic><topic>Phylogeny</topic><topic>Protein families</topic><topic>Protein kinase</topic><topic>Proteins</topic><topic>Proteins - chemistry</topic><topic>Research and Analysis Methods</topic><topic>Sequences</topic><topic>Studies</topic><topic>Yttrium</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Das, Jayanta Kumar</creatorcontrib><creatorcontrib>Das, Provas</creatorcontrib><creatorcontrib>Ray, Korak Kumar</creatorcontrib><creatorcontrib>Choudhury, Pabitra Pal</creatorcontrib><creatorcontrib>Jana, Siddhartha Sankar</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Gale In Context: Opposing Viewpoints</collection><collection>Science (Gale in Context)</collection><collection>ProQuest Central (Corporate)</collection><collection>Animal Behavior Abstracts</collection><collection>Bacteriology Abstracts (Microbiology B)</collection><collection>Biotechnology Research Abstracts</collection><collection>ProQuest Nursing and Allied Health Journals</collection><collection>Ecology Abstracts</collection><collection>Entomology Abstracts (Full archive)</collection><collection>Immunology Abstracts</collection><collection>Meteorological &amp; Geoastrophysical Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>Agricultural Science Collection</collection><collection>ProQuest Health and Medical</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Public Health Database (ProQuest Medical &amp; Health Databases)</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest One Sustainability</collection><collection>ProQuest Central</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>Agricultural &amp; Environmental Science Collection</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>AUTh Library subscriptions: ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ProQuest One Community College</collection><collection>ProQuest Materials Science Collection</collection><collection>ProQuest Central</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>SciTech Premium Collection (Proquest) (PQ_SDU_P3)</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Materials Science Database</collection><collection>Nursing &amp; Allied Health Database (Alumni Edition)</collection><collection>Meteorological &amp; Geoastrophysical Abstracts - Academic</collection><collection>ProQuest Engineering Collection</collection><collection>ProQuest Biological Science Collection</collection><collection>Agriculture Science Database</collection><collection>Health &amp; Medical Collection (Alumni Edition)</collection><collection>PML(ProQuest Medical Library)</collection><collection>Algology Mycology and Protozoology Abstracts (Microbiology C)</collection><collection>ProQuest Biological Science Journals</collection><collection>Engineering Database</collection><collection>Nursing &amp; Allied Health Premium</collection><collection>ProQuest advanced technologies &amp; aerospace journals</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Environmental Science Database</collection><collection>Materials science collection</collection><collection>Publicly Available Content Database (Proquest) (PQ_SDU_P3)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>Engineering collection</collection><collection>Environmental Science Collection</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>PloS one</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Das, Jayanta Kumar</au><au>Das, Provas</au><au>Ray, Korak Kumar</au><au>Choudhury, Pabitra Pal</au><au>Jana, Siddhartha Sankar</au><au>Permyakov, Eugene A.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Mathematical Characterization of Protein Sequences Using Patterns as Chemical Group Combinations of Amino Acids</atitle><jtitle>PloS one</jtitle><addtitle>PLoS One</addtitle><date>2016-12-08</date><risdate>2016</risdate><volume>11</volume><issue>12</issue><spage>e0167651</spage><epage>e0167651</epage><pages>e0167651-e0167651</pages><issn>1932-6203</issn><eissn>1932-6203</eissn><abstract>Comparison of amino acid sequence similarity is the fundamental concept behind the protein phylogenetic tree formation. By virtue of this method, we can explain the evolutionary relationships, but further explanations are not possible unless sequences are studied through the chemical nature of individual amino acids. Here we develop a new methodology to characterize the protein sequences on the basis of the chemical nature of the amino acids. We design various algorithms for studying the variation of chemical group transitions and various chemical group combinations as patterns in the protein sequences. The amino acid sequence of conventional myosin II head domain of 14 family members are taken to illustrate this new approach. We find two blocks of maximum length 6 aa as 'FPKATD' and 'Y/FTNEKL' without repeating the same chemical nature and one block of maximum length 20 aa with the repetition of chemical nature which are common among all 14 members. We also check commonality with another motor protein sub-family kinesin, KIF1A. Based on our analysis we find a common block of length 8 aa both in myosin II and KIF1A. This motif is located in the neck linker region which could be responsible for the generation of mechanical force, enabling us to find the unique blocks which remain chemically conserved across the family. We also validate our methodology with different protein families such as MYOI, Myosin light chain kinase (MLCK) and Rho-associated protein kinase (ROCK), Na+/K+-ATPase and Ca2+-ATPase. Altogether, our studies provide a new methodology for investigating the conserved amino acids' pattern in different proteins.</abstract><cop>United States</cop><pub>Public Library of Science</pub><pmid>27930687</pmid><doi>10.1371/journal.pone.0167651</doi><tpages>e0167651</tpages><orcidid>https://orcid.org/0000-0001-5390-698X</orcidid><orcidid>https://orcid.org/0000-0002-4339-7445</orcidid><orcidid>https://orcid.org/0000-0002-7630-7194</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1932-6203
ispartof PloS one, 2016-12, Vol.11 (12), p.e0167651-e0167651
issn 1932-6203
1932-6203
language eng
recordid cdi_plos_journals_1847440338
source MEDLINE; Public Library of Science; DOAJ Directory of Open Access Journals; PubMed Central; Free Full-Text Journals in Chemistry; EZB Electronic Journals Library
subjects Algorithms
Amino acid sequence
Amino acids
Amino Acids - chemistry
Analysis
Bioinformatics
Biology and Life Sciences
Ca2+-transporting ATPase
Calcium
Chemistry
Combinations (mathematics)
Commonality
Enzymes
Kinesin
Mathematical models
Methodology
Myosin
Myosin-light-chain kinase
Na+/K+-exchanging ATPase
Neck
Phylogenetics
Phylogeny
Protein families
Protein kinase
Proteins
Proteins - chemistry
Research and Analysis Methods
Sequences
Studies
Yttrium
title Mathematical Characterization of Protein Sequences Using Patterns as Chemical Group Combinations of Amino Acids
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-17T14%3A23%3A41IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_plos_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Mathematical%20Characterization%20of%20Protein%20Sequences%20Using%20Patterns%20as%20Chemical%20Group%20Combinations%20of%20Amino%20Acids&rft.jtitle=PloS%20one&rft.au=Das,%20Jayanta%20Kumar&rft.date=2016-12-08&rft.volume=11&rft.issue=12&rft.spage=e0167651&rft.epage=e0167651&rft.pages=e0167651-e0167651&rft.issn=1932-6203&rft.eissn=1932-6203&rft_id=info:doi/10.1371/journal.pone.0167651&rft_dat=%3Cgale_plos_%3EA473276067%3C/gale_plos_%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1847440338&rft_id=info:pmid/27930687&rft_galeid=A473276067&rft_doaj_id=oai_doaj_org_article_470b50cb432946bf9cbc5029ab9ed2f1&rfr_iscdi=true