Automated de novo Sequencing of Proteins by Tandem High-Resolution Mass Spectrometry

A de novo sequencing program for proteins is described that uses tandem MS data from electron capture dissociation and collisionally activated dissociation of electrosprayed protein ions. Computer automation is used to convert the fragment ion mass values derived from these spectra into the most pro...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Proceedings of the National Academy of Sciences - PNAS 2000-09, Vol.97 (19), p.10313-10317
Hauptverfasser: Horn, David M., Zubarev, Roman A., McLafferty, Fred W.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 10317
container_issue 19
container_start_page 10313
container_title Proceedings of the National Academy of Sciences - PNAS
container_volume 97
creator Horn, David M.
Zubarev, Roman A.
McLafferty, Fred W.
description A de novo sequencing program for proteins is described that uses tandem MS data from electron capture dissociation and collisionally activated dissociation of electrosprayed protein ions. Computer automation is used to convert the fragment ion mass values derived from these spectra into the most probable protein sequence, without distinguishing Leu/Ile. Minimum human input is necessary for the data reduction and interpretation. No extra chemistry is necessary to distinguish N- and C-terminal fragments in the mass spectra, as this is determined from the electron capture dissociation data. With parts-per-million mass accuracy (now available by using higher field Fourier transform MS instruments), the complete sequences of ubiquitin (8.6 kDa) and melittin (2.8 kDa) were predicted correctly by the program. The data available also provided 91% of the cytochrome c(12.4 kDa) sequence (essentially complete except for the tandem MS-resistant region K13V20that contains the cyclic heme). Uncorrected mass values from a 6-T instrument still gave 86% of the sequence for ubiquitin, except for distinguishing Gln/Lys. Extensive sequencing of larger proteins should be possible by applying the algorithm to pieces of ≈ 10-kDa size, such as products of limited proteolysis.
doi_str_mv 10.1073/pnas.97.19.10313
format Article
fullrecord <record><control><sourceid>jstor_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_27020</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><jstor_id>123557</jstor_id><sourcerecordid>123557</sourcerecordid><originalsourceid>FETCH-LOGICAL-c494t-322303bc41e735bd75bca2ad8deaed92e9d9d7604862ef4abdd1cac434bafe723</originalsourceid><addsrcrecordid>eNp9kU1vEzEQhi0EoqFwR0ICiwPissFfG68lLlVVKFIRiIaz5bVn04127dT2VuTf1yEBAgdO1mieZzyjF6HnlMwpkfzdxps0V3JOVak55Q_QjBJFq4VQ5CGaEcJk1QgmTtCTlNaEEFU35DE6KVAjaqZmaHk25TCaDA47wD7cBXwNtxN42_sVDh3-GkOG3ifcbvHSeAcjvuxXN9U3SGGYch88_mxSwtcbsDmGEXLcPkWPOjMkeHZ4T9H3DxfL88vq6svHT-dnV5UVSuSKM8YJb62gIHndOlm31jDjGgcGnGKgnHJyQUSzYNAJ0zpHrbGCi9Z0IBk_Re_3czdTO4Kz4HM0g97EfjRxq4Pp9d8d39_oVbjTTBJGiv7moMdQTk5Zj32yMAzGQ5iSlowtSCNUAV__A67DFH05TTNCuRKq2UFkD9kYUorQ_d6DEr1LS-_S0kpqqvTPtIry8nj_I2EfTwHeHoCd-qt9NEJ30zBk-JEL-ur_aCFe7Il1yiH--Yzxupb8HiQQtLw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>201394989</pqid></control><display><type>article</type><title>Automated de novo Sequencing of Proteins by Tandem High-Resolution Mass Spectrometry</title><source>Jstor Complete Legacy</source><source>MEDLINE</source><source>PubMed Central</source><source>Alma/SFX Local Collection</source><source>Free Full-Text Journals in Chemistry</source><creator>Horn, David M. ; Zubarev, Roman A. ; McLafferty, Fred W.</creator><creatorcontrib>Horn, David M. ; Zubarev, Roman A. ; McLafferty, Fred W.</creatorcontrib><description>A de novo sequencing program for proteins is described that uses tandem MS data from electron capture dissociation and collisionally activated dissociation of electrosprayed protein ions. Computer automation is used to convert the fragment ion mass values derived from these spectra into the most probable protein sequence, without distinguishing Leu/Ile. Minimum human input is necessary for the data reduction and interpretation. No extra chemistry is necessary to distinguish N- and C-terminal fragments in the mass spectra, as this is determined from the electron capture dissociation data. With parts-per-million mass accuracy (now available by using higher field Fourier transform MS instruments), the complete sequences of ubiquitin (8.6 kDa) and melittin (2.8 kDa) were predicted correctly by the program. The data available also provided 91% of the cytochrome c(12.4 kDa) sequence (essentially complete except for the tandem MS-resistant region K13V20that contains the cyclic heme). Uncorrected mass values from a 6-T instrument still gave 86% of the sequence for ubiquitin, except for distinguishing Gln/Lys. Extensive sequencing of larger proteins should be possible by applying the algorithm to pieces of ≈ 10-kDa size, such as products of limited proteolysis.</description><identifier>ISSN: 0027-8424</identifier><identifier>EISSN: 1091-6490</identifier><identifier>DOI: 10.1073/pnas.97.19.10313</identifier><identifier>PMID: 10984529</identifier><language>eng</language><publisher>United States: National Academy of Sciences of the United States of America</publisher><subject>Algorithms ; Amino Acid Sequence ; Amino acids ; Chemistry ; Computer aided design ; Cytochrome c Group - chemistry ; Cytochromes ; Humans ; Ions ; Mass Spectrometry - methods ; Mass spectroscopy ; Molecular Sequence Data ; Molecular spectra ; Physical Sciences ; Proteins ; Sequence Analysis, Protein - methods ; Sequence Homology, Amino Acid ; Sequencing ; Ubiquitins</subject><ispartof>Proceedings of the National Academy of Sciences - PNAS, 2000-09, Vol.97 (19), p.10313-10317</ispartof><rights>Copyright 1993-2000 National Academy of Sciences of the United States of America</rights><rights>Copyright National Academy of Sciences Sep 12, 2000</rights><rights>Copyright © 2000, The National Academy of Sciences 2000</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c494t-322303bc41e735bd75bca2ad8deaed92e9d9d7604862ef4abdd1cac434bafe723</citedby><cites>FETCH-LOGICAL-c494t-322303bc41e735bd75bca2ad8deaed92e9d9d7604862ef4abdd1cac434bafe723</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Uhttp://www.pnas.org/content/97/19.cover.gif</thumbnail><linktopdf>$$Uhttps://www.jstor.org/stable/pdf/123557$$EPDF$$P50$$Gjstor$$H</linktopdf><linktohtml>$$Uhttps://www.jstor.org/stable/123557$$EHTML$$P50$$Gjstor$$H</linktohtml><link.rule.ids>230,314,723,776,780,799,881,27901,27902,53766,53768,57992,58225</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/10984529$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Horn, David M.</creatorcontrib><creatorcontrib>Zubarev, Roman A.</creatorcontrib><creatorcontrib>McLafferty, Fred W.</creatorcontrib><title>Automated de novo Sequencing of Proteins by Tandem High-Resolution Mass Spectrometry</title><title>Proceedings of the National Academy of Sciences - PNAS</title><addtitle>Proc Natl Acad Sci U S A</addtitle><description>A de novo sequencing program for proteins is described that uses tandem MS data from electron capture dissociation and collisionally activated dissociation of electrosprayed protein ions. Computer automation is used to convert the fragment ion mass values derived from these spectra into the most probable protein sequence, without distinguishing Leu/Ile. Minimum human input is necessary for the data reduction and interpretation. No extra chemistry is necessary to distinguish N- and C-terminal fragments in the mass spectra, as this is determined from the electron capture dissociation data. With parts-per-million mass accuracy (now available by using higher field Fourier transform MS instruments), the complete sequences of ubiquitin (8.6 kDa) and melittin (2.8 kDa) were predicted correctly by the program. The data available also provided 91% of the cytochrome c(12.4 kDa) sequence (essentially complete except for the tandem MS-resistant region K13V20that contains the cyclic heme). Uncorrected mass values from a 6-T instrument still gave 86% of the sequence for ubiquitin, except for distinguishing Gln/Lys. Extensive sequencing of larger proteins should be possible by applying the algorithm to pieces of ≈ 10-kDa size, such as products of limited proteolysis.</description><subject>Algorithms</subject><subject>Amino Acid Sequence</subject><subject>Amino acids</subject><subject>Chemistry</subject><subject>Computer aided design</subject><subject>Cytochrome c Group - chemistry</subject><subject>Cytochromes</subject><subject>Humans</subject><subject>Ions</subject><subject>Mass Spectrometry - methods</subject><subject>Mass spectroscopy</subject><subject>Molecular Sequence Data</subject><subject>Molecular spectra</subject><subject>Physical Sciences</subject><subject>Proteins</subject><subject>Sequence Analysis, Protein - methods</subject><subject>Sequence Homology, Amino Acid</subject><subject>Sequencing</subject><subject>Ubiquitins</subject><issn>0027-8424</issn><issn>1091-6490</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2000</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNp9kU1vEzEQhi0EoqFwR0ICiwPissFfG68lLlVVKFIRiIaz5bVn04127dT2VuTf1yEBAgdO1mieZzyjF6HnlMwpkfzdxps0V3JOVak55Q_QjBJFq4VQ5CGaEcJk1QgmTtCTlNaEEFU35DE6KVAjaqZmaHk25TCaDA47wD7cBXwNtxN42_sVDh3-GkOG3ifcbvHSeAcjvuxXN9U3SGGYch88_mxSwtcbsDmGEXLcPkWPOjMkeHZ4T9H3DxfL88vq6svHT-dnV5UVSuSKM8YJb62gIHndOlm31jDjGgcGnGKgnHJyQUSzYNAJ0zpHrbGCi9Z0IBk_Re_3czdTO4Kz4HM0g97EfjRxq4Pp9d8d39_oVbjTTBJGiv7moMdQTk5Zj32yMAzGQ5iSlowtSCNUAV__A67DFH05TTNCuRKq2UFkD9kYUorQ_d6DEr1LS-_S0kpqqvTPtIry8nj_I2EfTwHeHoCd-qt9NEJ30zBk-JEL-ur_aCFe7Il1yiH--Yzxupb8HiQQtLw</recordid><startdate>20000912</startdate><enddate>20000912</enddate><creator>Horn, David M.</creator><creator>Zubarev, Roman A.</creator><creator>McLafferty, Fred W.</creator><general>National Academy of Sciences of the United States of America</general><general>National Acad Sciences</general><general>National Academy of Sciences</general><general>The National Academy of Sciences</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7QG</scope><scope>7QL</scope><scope>7QP</scope><scope>7QR</scope><scope>7SN</scope><scope>7SS</scope><scope>7T5</scope><scope>7TK</scope><scope>7TM</scope><scope>7TO</scope><scope>7U9</scope><scope>8FD</scope><scope>C1K</scope><scope>FR3</scope><scope>H94</scope><scope>M7N</scope><scope>P64</scope><scope>RC3</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20000912</creationdate><title>Automated de novo Sequencing of Proteins by Tandem High-Resolution Mass Spectrometry</title><author>Horn, David M. ; Zubarev, Roman A. ; McLafferty, Fred W.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c494t-322303bc41e735bd75bca2ad8deaed92e9d9d7604862ef4abdd1cac434bafe723</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2000</creationdate><topic>Algorithms</topic><topic>Amino Acid Sequence</topic><topic>Amino acids</topic><topic>Chemistry</topic><topic>Computer aided design</topic><topic>Cytochrome c Group - chemistry</topic><topic>Cytochromes</topic><topic>Humans</topic><topic>Ions</topic><topic>Mass Spectrometry - methods</topic><topic>Mass spectroscopy</topic><topic>Molecular Sequence Data</topic><topic>Molecular spectra</topic><topic>Physical Sciences</topic><topic>Proteins</topic><topic>Sequence Analysis, Protein - methods</topic><topic>Sequence Homology, Amino Acid</topic><topic>Sequencing</topic><topic>Ubiquitins</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Horn, David M.</creatorcontrib><creatorcontrib>Zubarev, Roman A.</creatorcontrib><creatorcontrib>McLafferty, Fred W.</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Animal Behavior Abstracts</collection><collection>Bacteriology Abstracts (Microbiology B)</collection><collection>Calcium &amp; Calcified Tissue Abstracts</collection><collection>Chemoreception Abstracts</collection><collection>Ecology Abstracts</collection><collection>Entomology Abstracts (Full archive)</collection><collection>Immunology Abstracts</collection><collection>Neurosciences Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Oncogenes and Growth Factors Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>Technology Research Database</collection><collection>Environmental Sciences and Pollution Management</collection><collection>Engineering Research Database</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>Algology Mycology and Protozoology Abstracts (Microbiology C)</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Proceedings of the National Academy of Sciences - PNAS</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Horn, David M.</au><au>Zubarev, Roman A.</au><au>McLafferty, Fred W.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Automated de novo Sequencing of Proteins by Tandem High-Resolution Mass Spectrometry</atitle><jtitle>Proceedings of the National Academy of Sciences - PNAS</jtitle><addtitle>Proc Natl Acad Sci U S A</addtitle><date>2000-09-12</date><risdate>2000</risdate><volume>97</volume><issue>19</issue><spage>10313</spage><epage>10317</epage><pages>10313-10317</pages><issn>0027-8424</issn><eissn>1091-6490</eissn><abstract>A de novo sequencing program for proteins is described that uses tandem MS data from electron capture dissociation and collisionally activated dissociation of electrosprayed protein ions. Computer automation is used to convert the fragment ion mass values derived from these spectra into the most probable protein sequence, without distinguishing Leu/Ile. Minimum human input is necessary for the data reduction and interpretation. No extra chemistry is necessary to distinguish N- and C-terminal fragments in the mass spectra, as this is determined from the electron capture dissociation data. With parts-per-million mass accuracy (now available by using higher field Fourier transform MS instruments), the complete sequences of ubiquitin (8.6 kDa) and melittin (2.8 kDa) were predicted correctly by the program. The data available also provided 91% of the cytochrome c(12.4 kDa) sequence (essentially complete except for the tandem MS-resistant region K13V20that contains the cyclic heme). Uncorrected mass values from a 6-T instrument still gave 86% of the sequence for ubiquitin, except for distinguishing Gln/Lys. Extensive sequencing of larger proteins should be possible by applying the algorithm to pieces of ≈ 10-kDa size, such as products of limited proteolysis.</abstract><cop>United States</cop><pub>National Academy of Sciences of the United States of America</pub><pmid>10984529</pmid><doi>10.1073/pnas.97.19.10313</doi><tpages>5</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0027-8424
ispartof Proceedings of the National Academy of Sciences - PNAS, 2000-09, Vol.97 (19), p.10313-10317
issn 0027-8424
1091-6490
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_27020
source Jstor Complete Legacy; MEDLINE; PubMed Central; Alma/SFX Local Collection; Free Full-Text Journals in Chemistry
subjects Algorithms
Amino Acid Sequence
Amino acids
Chemistry
Computer aided design
Cytochrome c Group - chemistry
Cytochromes
Humans
Ions
Mass Spectrometry - methods
Mass spectroscopy
Molecular Sequence Data
Molecular spectra
Physical Sciences
Proteins
Sequence Analysis, Protein - methods
Sequence Homology, Amino Acid
Sequencing
Ubiquitins
title Automated de novo Sequencing of Proteins by Tandem High-Resolution Mass Spectrometry
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-12T20%3A27%3A57IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-jstor_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Automated%20de%20novo%20Sequencing%20of%20Proteins%20by%20Tandem%20High-Resolution%20Mass%20Spectrometry&rft.jtitle=Proceedings%20of%20the%20National%20Academy%20of%20Sciences%20-%20PNAS&rft.au=Horn,%20David%20M.&rft.date=2000-09-12&rft.volume=97&rft.issue=19&rft.spage=10313&rft.epage=10317&rft.pages=10313-10317&rft.issn=0027-8424&rft.eissn=1091-6490&rft_id=info:doi/10.1073/pnas.97.19.10313&rft_dat=%3Cjstor_pubme%3E123557%3C/jstor_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=201394989&rft_id=info:pmid/10984529&rft_jstor_id=123557&rfr_iscdi=true