Automated de novo Sequencing of Proteins by Tandem High-Resolution Mass Spectrometry

A de novo sequencing program for proteins is described that uses tandem MS data from electron capture dissociation and collisionally activated dissociation of electrosprayed protein ions. Computer automation is used to convert the fragment ion mass values derived from these spectra into the most pro...

Bibliographic Details
Published in:	Proceedings of the National Academy of Sciences - PNAS 2000-09, Vol.97 (19), p.10313-10317
Main authors:	Horn, David M.; Zubarev, Roman A.; McLafferty, Fred W.
Format:	Article
Language:	eng
Subjects:	Algorithms; Amino Acid Sequence; Amino acids; Chemistry; Computer aided design; Cytochrome c Group - chemistry; Cytochromes; Humans; Ions; Mass Spectrometry - methods; Mass spectroscopy; Molecular Sequence Data; Molecular spectra; Physical Sciences; Proteins; Sequence Analysis, Protein - methods; Sequence Homology, Amino Acid; Sequencing; Ubiquitins
Online access:	Full text

container_end_page	10317
container_issue	19
container_start_page	10313
container_title	Proceedings of the National Academy of Sciences - PNAS
container_volume	97
creator	Horn, David M.; Zubarev, Roman A.; McLafferty, Fred W.
description	A de novo sequencing program for proteins is described that uses tandem MS data from electron capture dissociation and collisionally activated dissociation of electrosprayed protein ions. Computer automation is used to convert the fragment ion mass values derived from these spectra into the most probable protein sequence, without distinguishing Leu/Ile. Minimum human input is necessary for the data reduction and interpretation. No extra chemistry is necessary to distinguish N- and C-terminal fragments in the mass spectra, as this is determined from the electron capture dissociation data. With parts-per-million mass accuracy (now available by using higher field Fourier transform MS instruments), the complete sequences of ubiquitin (8.6 kDa) and melittin (2.8 kDa) were predicted correctly by the program. The data available also provided 91% of the cytochrome c (12.4 kDa) sequence (essentially complete except for the tandem MS-resistant region K13-V20 that contains the cyclic heme). Uncorrected mass values from a 6-T instrument still gave 86% of the sequence for ubiquitin, except for distinguishing Gln/Lys. Extensive sequencing of larger proteins should be possible by applying the algorithm to pieces of ≈ 10-kDa size, such as products of limited proteolysis.
doi_str_mv	10.1073/pnas.97.19.10313
format	Article
fullrecord	<record><control><sourceid>jstor_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_27020</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><jstor_id>123557</jstor_id><sourcerecordid>123557</sourcerecordid><originalsourceid>FETCH-LOGICAL-c494t-322303bc41e735bd75bca2ad8deaed92e9d9d7604862ef4abdd1cac434bafe723</originalsourceid><addsrcrecordid>eNp9kU1vEzEQhi0EoqFwR0ICiwPissFfG68lLlVVKFIRiIaz5bVn04127dT2VuTf1yEBAgdO1mieZzyjF6HnlMwpkfzdxps0V3JOVak55Q_QjBJFq4VQ5CGaEcJk1QgmTtCTlNaEEFU35DE6KVAjaqZmaHk25TCaDA47wD7cBXwNtxN42_sVDh3-GkOG3ifcbvHSeAcjvuxXN9U3SGGYch88_mxSwtcbsDmGEXLcPkWPOjMkeHZ4T9H3DxfL88vq6svHT-dnV5UVSuSKM8YJb62gIHndOlm31jDjGgcGnGKgnHJyQUSzYNAJ0zpHrbGCi9Z0IBk_Re_3czdTO4Kz4HM0g97EfjRxq4Pp9d8d39_oVbjTTBJGiv7moMdQTk5Zj32yMAzGQ5iSlowtSCNUAV__A67DFH05TTNCuRKq2UFkD9kYUorQ_d6DEr1LS-_S0kpqqvTPtIry8nj_I2EfTwHeHoCd-qt9NEJ30zBk-JEL-ur_aCFe7Il1yiH--Yzxupb8HiQQtLw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>201394989</pqid></control><display><type>article</type><title>Automated de novo Sequencing of Proteins by Tandem High-Resolution Mass Spectrometry</title><source>Jstor Complete Legacy</source><source>MEDLINE</source><source>PubMed Central</source><source>Alma/SFX Local Collection</source><source>Free Full-Text Journals in Chemistry</source><creator>Horn, David M. ; Zubarev, Roman A. ; McLafferty, Fred W.</creator><creatorcontrib>Horn, David M. ; Zubarev, Roman A. ; McLafferty, Fred W.</creatorcontrib><description>A de novo sequencing program for proteins is described that uses tandem MS data from electron capture dissociation and collisionally activated dissociation of electrosprayed protein ions. Computer automation is used to convert the fragment ion mass values derived from these spectra into the most probable protein sequence, without distinguishing Leu/Ile. 
Minimum human input is necessary for the data reduction and interpretation. No extra chemistry is necessary to distinguish N- and C-terminal fragments in the mass spectra, as this is determined from the electron capture dissociation data. With parts-per-million mass accuracy (now available by using higher field Fourier transform MS instruments), the complete sequences of ubiquitin (8.6 kDa) and melittin (2.8 kDa) were predicted correctly by the program. The data available also provided 91% of the cytochrome c(12.4 kDa) sequence (essentially complete except for the tandem MS-resistant region K13V20that contains the cyclic heme). Uncorrected mass values from a 6-T instrument still gave 86% of the sequence for ubiquitin, except for distinguishing Gln/Lys. Extensive sequencing of larger proteins should be possible by applying the algorithm to pieces of ≈ 10-kDa size, such as products of limited proteolysis.</description><identifier>ISSN: 0027-8424</identifier><identifier>EISSN: 1091-6490</identifier><identifier>DOI: 10.1073/pnas.97.19.10313</identifier><identifier>PMID: 10984529</identifier><language>eng</language><publisher>United States: National Academy of Sciences of the United States of America</publisher><subject>Algorithms ; Amino Acid Sequence ; Amino acids ; Chemistry ; Computer aided design ; Cytochrome c Group - chemistry ; Cytochromes ; Humans ; Ions ; Mass Spectrometry - methods ; Mass spectroscopy ; Molecular Sequence Data ; Molecular spectra ; Physical Sciences ; Proteins ; Sequence Analysis, Protein - methods ; Sequence Homology, Amino Acid ; Sequencing ; Ubiquitins</subject><ispartof>Proceedings of the National Academy of Sciences - PNAS, 2000-09, Vol.97 (19), p.10313-10317</ispartof><rights>Copyright 1993-2000 National Academy of Sciences of the United States of America</rights><rights>Copyright National Academy of Sciences Sep 12, 2000</rights><rights>Copyright © 2000, The National Academy of Sciences 
2000</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c494t-322303bc41e735bd75bca2ad8deaed92e9d9d7604862ef4abdd1cac434bafe723</citedby><cites>FETCH-LOGICAL-c494t-322303bc41e735bd75bca2ad8deaed92e9d9d7604862ef4abdd1cac434bafe723</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Uhttp://www.pnas.org/content/97/19.cover.gif</thumbnail><linktopdf>$$Uhttps://www.jstor.org/stable/pdf/123557$$EPDF$$P50$$Gjstor$$H</linktopdf><linktohtml>$$Uhttps://www.jstor.org/stable/123557$$EHTML$$P50$$Gjstor$$H</linktohtml><link.rule.ids>230,314,723,776,780,799,881,27901,27902,53766,53768,57992,58225</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/10984529$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Horn, David M.</creatorcontrib><creatorcontrib>Zubarev, Roman A.</creatorcontrib><creatorcontrib>McLafferty, Fred W.</creatorcontrib><title>Automated de novo Sequencing of Proteins by Tandem High-Resolution Mass Spectrometry</title><title>Proceedings of the National Academy of Sciences - PNAS</title><addtitle>Proc Natl Acad Sci U S A</addtitle><description>A de novo sequencing program for proteins is described that uses tandem MS data from electron capture dissociation and collisionally activated dissociation of electrosprayed protein ions. Computer automation is used to convert the fragment ion mass values derived from these spectra into the most probable protein sequence, without distinguishing Leu/Ile. Minimum human input is necessary for the data reduction and interpretation. No extra chemistry is necessary to distinguish N- and C-terminal fragments in the mass spectra, as this is determined from the electron capture dissociation data. 
With parts-per-million mass accuracy (now available by using higher field Fourier transform MS instruments), the complete sequences of ubiquitin (8.6 kDa) and melittin (2.8 kDa) were predicted correctly by the program. The data available also provided 91% of the cytochrome c(12.4 kDa) sequence (essentially complete except for the tandem MS-resistant region K13V20that contains the cyclic heme). Uncorrected mass values from a 6-T instrument still gave 86% of the sequence for ubiquitin, except for distinguishing Gln/Lys. Extensive sequencing of larger proteins should be possible by applying the algorithm to pieces of ≈ 10-kDa size, such as products of limited proteolysis.</description><subject>Algorithms</subject><subject>Amino Acid Sequence</subject><subject>Amino acids</subject><subject>Chemistry</subject><subject>Computer aided design</subject><subject>Cytochrome c Group - chemistry</subject><subject>Cytochromes</subject><subject>Humans</subject><subject>Ions</subject><subject>Mass Spectrometry - methods</subject><subject>Mass spectroscopy</subject><subject>Molecular Sequence Data</subject><subject>Molecular spectra</subject><subject>Physical Sciences</subject><subject>Proteins</subject><subject>Sequence Analysis, Protein - methods</subject><subject>Sequence Homology, Amino 
Acid</subject><subject>Sequencing</subject><subject>Ubiquitins</subject><issn>0027-8424</issn><issn>1091-6490</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2000</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNp9kU1vEzEQhi0EoqFwR0ICiwPissFfG68lLlVVKFIRiIaz5bVn04127dT2VuTf1yEBAgdO1mieZzyjF6HnlMwpkfzdxps0V3JOVak55Q_QjBJFq4VQ5CGaEcJk1QgmTtCTlNaEEFU35DE6KVAjaqZmaHk25TCaDA47wD7cBXwNtxN42_sVDh3-GkOG3ifcbvHSeAcjvuxXN9U3SGGYch88_mxSwtcbsDmGEXLcPkWPOjMkeHZ4T9H3DxfL88vq6svHT-dnV5UVSuSKM8YJb62gIHndOlm31jDjGgcGnGKgnHJyQUSzYNAJ0zpHrbGCi9Z0IBk_Re_3czdTO4Kz4HM0g97EfjRxq4Pp9d8d39_oVbjTTBJGiv7moMdQTk5Zj32yMAzGQ5iSlowtSCNUAV__A67DFH05TTNCuRKq2UFkD9kYUorQ_d6DEr1LS-_S0kpqqvTPtIry8nj_I2EfTwHeHoCd-qt9NEJ30zBk-JEL-ur_aCFe7Il1yiH--Yzxupb8HiQQtLw</recordid><startdate>20000912</startdate><enddate>20000912</enddate><creator>Horn, David M.</creator><creator>Zubarev, Roman A.</creator><creator>McLafferty, Fred W.</creator><general>National Academy of Sciences of the United States of America</general><general>National Acad Sciences</general><general>National Academy of Sciences</general><general>The National Academy of Sciences</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7QG</scope><scope>7QL</scope><scope>7QP</scope><scope>7QR</scope><scope>7SN</scope><scope>7SS</scope><scope>7T5</scope><scope>7TK</scope><scope>7TM</scope><scope>7TO</scope><scope>7U9</scope><scope>8FD</scope><scope>C1K</scope><scope>FR3</scope><scope>H94</scope><scope>M7N</scope><scope>P64</scope><scope>RC3</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20000912</creationdate><title>Automated de novo Sequencing of Proteins by Tandem High-Resolution Mass Spectrometry</title><author>Horn, David M. ; Zubarev, Roman A. 
; McLafferty, Fred W.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c494t-322303bc41e735bd75bca2ad8deaed92e9d9d7604862ef4abdd1cac434bafe723</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2000</creationdate><topic>Algorithms</topic><topic>Amino Acid Sequence</topic><topic>Amino acids</topic><topic>Chemistry</topic><topic>Computer aided design</topic><topic>Cytochrome c Group - chemistry</topic><topic>Cytochromes</topic><topic>Humans</topic><topic>Ions</topic><topic>Mass Spectrometry - methods</topic><topic>Mass spectroscopy</topic><topic>Molecular Sequence Data</topic><topic>Molecular spectra</topic><topic>Physical Sciences</topic><topic>Proteins</topic><topic>Sequence Analysis, Protein - methods</topic><topic>Sequence Homology, Amino Acid</topic><topic>Sequencing</topic><topic>Ubiquitins</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Horn, David M.</creatorcontrib><creatorcontrib>Zubarev, Roman A.</creatorcontrib><creatorcontrib>McLafferty, Fred W.</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Animal Behavior Abstracts</collection><collection>Bacteriology Abstracts (Microbiology B)</collection><collection>Calcium & Calcified Tissue Abstracts</collection><collection>Chemoreception Abstracts</collection><collection>Ecology Abstracts</collection><collection>Entomology Abstracts (Full archive)</collection><collection>Immunology Abstracts</collection><collection>Neurosciences Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Oncogenes and Growth Factors Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>Technology Research 
Database</collection><collection>Environmental Sciences and Pollution Management</collection><collection>Engineering Research Database</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>Algology Mycology and Protozoology Abstracts (Microbiology C)</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Proceedings of the National Academy of Sciences - PNAS</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Horn, David M.</au><au>Zubarev, Roman A.</au><au>McLafferty, Fred W.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Automated de novo Sequencing of Proteins by Tandem High-Resolution Mass Spectrometry</atitle><jtitle>Proceedings of the National Academy of Sciences - PNAS</jtitle><addtitle>Proc Natl Acad Sci U S A</addtitle><date>2000-09-12</date><risdate>2000</risdate><volume>97</volume><issue>19</issue><spage>10313</spage><epage>10317</epage><pages>10313-10317</pages><issn>0027-8424</issn><eissn>1091-6490</eissn><abstract>A de novo sequencing program for proteins is described that uses tandem MS data from electron capture dissociation and collisionally activated dissociation of electrosprayed protein ions. Computer automation is used to convert the fragment ion mass values derived from these spectra into the most probable protein sequence, without distinguishing Leu/Ile. Minimum human input is necessary for the data reduction and interpretation. No extra chemistry is necessary to distinguish N- and C-terminal fragments in the mass spectra, as this is determined from the electron capture dissociation data. 
With parts-per-million mass accuracy (now available by using higher field Fourier transform MS instruments), the complete sequences of ubiquitin (8.6 kDa) and melittin (2.8 kDa) were predicted correctly by the program. The data available also provided 91% of the cytochrome c(12.4 kDa) sequence (essentially complete except for the tandem MS-resistant region K13V20that contains the cyclic heme). Uncorrected mass values from a 6-T instrument still gave 86% of the sequence for ubiquitin, except for distinguishing Gln/Lys. Extensive sequencing of larger proteins should be possible by applying the algorithm to pieces of ≈ 10-kDa size, such as products of limited proteolysis.</abstract><cop>United States</cop><pub>National Academy of Sciences of the United States of America</pub><pmid>10984529</pmid><doi>10.1073/pnas.97.19.10313</doi><tpages>5</tpages><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 0027-8424
ispartof	Proceedings of the National Academy of Sciences - PNAS, 2000-09, Vol.97 (19), p.10313-10317
issn	0027-8424 1091-6490
language	eng
recordid	cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_27020
source	Jstor Complete Legacy; MEDLINE; PubMed Central; Alma/SFX Local Collection; Free Full-Text Journals in Chemistry
subjects	Algorithms; Amino Acid Sequence; Amino acids; Chemistry; Computer aided design; Cytochrome c Group - chemistry; Cytochromes; Humans; Ions; Mass Spectrometry - methods; Mass spectroscopy; Molecular Sequence Data; Molecular spectra; Physical Sciences; Proteins; Sequence Analysis, Protein - methods; Sequence Homology, Amino Acid; Sequencing; Ubiquitins
title	Automated de novo Sequencing of Proteins by Tandem High-Resolution Mass Spectrometry
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-12T20%3A27%3A57IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-jstor_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Automated%20de%20novo%20Sequencing%20of%20Proteins%20by%20Tandem%20High-Resolution%20Mass%20Spectrometry&rft.jtitle=Proceedings%20of%20the%20National%20Academy%20of%20Sciences%20-%20PNAS&rft.au=Horn,%20David%20M.&rft.date=2000-09-12&rft.volume=97&rft.issue=19&rft.spage=10313&rft.epage=10317&rft.pages=10313-10317&rft.issn=0027-8424&rft.eissn=1091-6490&rft_id=info:doi/10.1073/pnas.97.19.10313&rft_dat=%3Cjstor_pubme%3E123557%3C/jstor_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=201394989&rft_id=info:pmid/10984529&rft_jstor_id=123557&rfr_iscdi=true