Reproducibility of SELDI-TOF protein patterns in serum: comparing datasets from different experiments

Motivation: There has been much interest in using patterns derived from surface-enhanced laser desorption and ionization (SELDI) protein mass spectra from serum to differentiate samples from patients both with and without disease. Such patterns have been used without identification of the underlying...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Bioinformatics 2004-03, Vol.20 (5), p.777-785
Hauptverfasser: Baggerly, Keith A., Morris, Jeffrey S., Coombes, Kevin R.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 785
container_issue 5
container_start_page 777
container_title Bioinformatics
container_volume 20
creator Baggerly, Keith A.
Morris, Jeffrey S.
Coombes, Kevin R.
description Motivation: There has been much interest in using patterns derived from surface-enhanced laser desorption and ionization (SELDI) protein mass spectra from serum to differentiate samples from patients both with and without disease. Such patterns have been used without identification of the underlying proteins responsible. However, there are questions as to the stability of this procedure over multiple experiments. Results: We compared SELDI proteomic spectra from serum from three experiments by the same group on separating ovarian cancer from normal tissue. These spectra are available on the web at http://clinicalproteomics.steem.com. In general, the results were not reproducible across experiments. Baseline correction prevents reproduction of the results for two of the experiments. In one experiment, there is evidence of a major shift in protocol mid-experiment which could bias the results. In another, structure in the noise regions of the spectra allows us to distinguish normal from cancer, suggesting that the normals and cancers were processed differently. Sets of features found to discriminate well in one experiment do not generalize to other experiments. Finally, the mass calibration in all three experiments appears suspect. Taken together, these and other concerns suggest that much of the structure uncovered in these experiments could be due to artifacts of sample processing, not to the underlying biology of cancer. We provide some guidelines for design and analysis in experiments like these to ensure better reproducible, biologically meaningfully results. Availability: The MATLAB and Perl code used in our analyses is available at http://bioinformatics.mdanderson.org
doi_str_mv 10.1093/bioinformatics/btg484
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_71768857</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>71768857</sourcerecordid><originalsourceid>FETCH-LOGICAL-c544t-29890898591331b0aa66c7bb726e7ba86b0ad5357e44b0cde70f2ff99298040c3</originalsourceid><addsrcrecordid>eNqFkV1rFTEQhhex2Fr7E5Qg6N22yebbO62tp3CgB21RehOy2UlJ3S-TLLT_3pRzsOiNV3nJPDPMO29VvSb4mGBNT9owhdFPcbA5uHTS5lum2LPqgDCB6wZz_bxoKmTNFKb71cuU7jDmhDH2otonTHKiNT-o4CvMceoWF9rQh_yAJo--na0_X9RXl-eolDKEEc02Z4hjQkUniMvwAblpmG0M4y3qbLYJckI-TgPqgvcQYcwI7meIYSgyvar2vO0THO3ew-r6_OzqdFWvL79cnH5c144zlutGK42VVlwTSkmLrRXCybaVjQDZWiXKV8cpl8BYi10HEvvGe61LI2bY0cPq_XZuWfzXAimbISQHfW9HmJZkJJFCKS7_CxItdFlFFfDtP-DdtMSxmCiMEqLBjBaIbyEXp5QieDMX4zY-GILNY1rm77TMNq3S92Y3fGkH6J66dvEU4N0OsMnZ3kc7upCeOC7KqfCjnXrLhZTh_k_dxp9GSCq5Wf24MasN3XzafL8xlP4GviayeQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>198662043</pqid></control><display><type>article</type><title>Reproducibility of SELDI-TOF protein patterns in serum: comparing datasets from different experiments</title><source>MEDLINE</source><source>Oxford Journals Open Access Collection</source><source>EZB-FREE-00999 freely available EZB journals</source><source>Alma/SFX Local Collection</source><creator>Baggerly, Keith A. ; Morris, Jeffrey S. ; Coombes, Kevin R.</creator><creatorcontrib>Baggerly, Keith A. ; Morris, Jeffrey S. ; Coombes, Kevin R.</creatorcontrib><description>Motivation: There has been much interest in using patterns derived from surface-enhanced laser desorption and ionization (SELDI) protein mass spectra from serum to differentiate samples from patients both with and without disease. Such patterns have been used without identification of the underlying proteins responsible. However, there are questions as to the stability of this procedure over multiple experiments. Results: We compared SELDI proteomic spectra from serum from three experiments by the same group on separating ovarian cancer from normal tissue. These spectra are available on the web at http://clinicalproteomics.steem.com. In general, the results were not reproducible across experiments. Baseline correction prevents reproduction of the results for two of the experiments. In one experiment, there is evidence of a major shift in protocol mid-experiment which could bias the results. In another, structure in the noise regions of the spectra allows us to distinguish normal from cancer, suggesting that the normals and cancers were processed differently. Sets of features found to discriminate well in one experiment do not generalize to other experiments. Finally, the mass calibration in all three experiments appears suspect. Taken together, these and other concerns suggest that much of the structure uncovered in these experiments could be due to artifacts of sample processing, not to the underlying biology of cancer. We provide some guidelines for design and analysis in experiments like these to ensure better reproducible, biologically meaningfully results. Availability: The MATLAB and Perl code used in our analyses is available at http://bioinformatics.mdanderson.org</description><identifier>ISSN: 1367-4803</identifier><identifier>EISSN: 1460-2059</identifier><identifier>EISSN: 1367-4811</identifier><identifier>DOI: 10.1093/bioinformatics/btg484</identifier><identifier>PMID: 14751995</identifier><identifier>CODEN: BOINFP</identifier><language>eng</language><publisher>Oxford: Oxford University Press</publisher><subject>Algorithms ; Biological and medical sciences ; Blood Proteins - analysis ; Blood Proteins - chemistry ; Blood Proteins - classification ; Databases, Protein ; Diagnosis, Computer-Assisted - methods ; Diagnosis, Computer-Assisted - standards ; Female ; Fundamental and applied biological sciences. Psychology ; General aspects ; Humans ; Lasers ; Mass Spectrometry - methods ; Mass Spectrometry - standards ; Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects) ; Ovarian Neoplasms - blood ; Ovarian Neoplasms - diagnosis ; Reproducibility of Results ; Sensitivity and Specificity ; Sequence Analysis, Protein - methods ; Software</subject><ispartof>Bioinformatics, 2004-03, Vol.20 (5), p.777-785</ispartof><rights>2004 INIST-CNRS</rights><rights>Copyright Oxford University Press(England) Mar 22, 2004</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c544t-29890898591331b0aa66c7bb726e7ba86b0ad5357e44b0cde70f2ff99298040c3</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27923,27924</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=15659107$$DView record in Pascal Francis$$Hfree_for_read</backlink><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/14751995$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Baggerly, Keith A.</creatorcontrib><creatorcontrib>Morris, Jeffrey S.</creatorcontrib><creatorcontrib>Coombes, Kevin R.</creatorcontrib><title>Reproducibility of SELDI-TOF protein patterns in serum: comparing datasets from different experiments</title><title>Bioinformatics</title><addtitle>Bioinformatics</addtitle><description>Motivation: There has been much interest in using patterns derived from surface-enhanced laser desorption and ionization (SELDI) protein mass spectra from serum to differentiate samples from patients both with and without disease. Such patterns have been used without identification of the underlying proteins responsible. However, there are questions as to the stability of this procedure over multiple experiments. Results: We compared SELDI proteomic spectra from serum from three experiments by the same group on separating ovarian cancer from normal tissue. These spectra are available on the web at http://clinicalproteomics.steem.com. In general, the results were not reproducible across experiments. Baseline correction prevents reproduction of the results for two of the experiments. In one experiment, there is evidence of a major shift in protocol mid-experiment which could bias the results. In another, structure in the noise regions of the spectra allows us to distinguish normal from cancer, suggesting that the normals and cancers were processed differently. Sets of features found to discriminate well in one experiment do not generalize to other experiments. Finally, the mass calibration in all three experiments appears suspect. Taken together, these and other concerns suggest that much of the structure uncovered in these experiments could be due to artifacts of sample processing, not to the underlying biology of cancer. We provide some guidelines for design and analysis in experiments like these to ensure better reproducible, biologically meaningfully results. Availability: The MATLAB and Perl code used in our analyses is available at http://bioinformatics.mdanderson.org</description><subject>Algorithms</subject><subject>Biological and medical sciences</subject><subject>Blood Proteins - analysis</subject><subject>Blood Proteins - chemistry</subject><subject>Blood Proteins - classification</subject><subject>Databases, Protein</subject><subject>Diagnosis, Computer-Assisted - methods</subject><subject>Diagnosis, Computer-Assisted - standards</subject><subject>Female</subject><subject>Fundamental and applied biological sciences. Psychology</subject><subject>General aspects</subject><subject>Humans</subject><subject>Lasers</subject><subject>Mass Spectrometry - methods</subject><subject>Mass Spectrometry - standards</subject><subject>Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects)</subject><subject>Ovarian Neoplasms - blood</subject><subject>Ovarian Neoplasms - diagnosis</subject><subject>Reproducibility of Results</subject><subject>Sensitivity and Specificity</subject><subject>Sequence Analysis, Protein - methods</subject><subject>Software</subject><issn>1367-4803</issn><issn>1460-2059</issn><issn>1367-4811</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2004</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqFkV1rFTEQhhex2Fr7E5Qg6N22yebbO62tp3CgB21RehOy2UlJ3S-TLLT_3pRzsOiNV3nJPDPMO29VvSb4mGBNT9owhdFPcbA5uHTS5lum2LPqgDCB6wZz_bxoKmTNFKb71cuU7jDmhDH2otonTHKiNT-o4CvMceoWF9rQh_yAJo--na0_X9RXl-eolDKEEc02Z4hjQkUniMvwAblpmG0M4y3qbLYJckI-TgPqgvcQYcwI7meIYSgyvar2vO0THO3ew-r6_OzqdFWvL79cnH5c144zlutGK42VVlwTSkmLrRXCybaVjQDZWiXKV8cpl8BYi10HEvvGe61LI2bY0cPq_XZuWfzXAimbISQHfW9HmJZkJJFCKS7_CxItdFlFFfDtP-DdtMSxmCiMEqLBjBaIbyEXp5QieDMX4zY-GILNY1rm77TMNq3S92Y3fGkH6J66dvEU4N0OsMnZ3kc7upCeOC7KqfCjnXrLhZTh_k_dxp9GSCq5Wf24MasN3XzafL8xlP4GviayeQ</recordid><startdate>20040322</startdate><enddate>20040322</enddate><creator>Baggerly, Keith A.</creator><creator>Morris, Jeffrey S.</creator><creator>Coombes, Kevin R.</creator><general>Oxford University Press</general><general>Oxford Publishing Limited (England)</general><scope>BSCLL</scope><scope>IQODW</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7QF</scope><scope>7QO</scope><scope>7QQ</scope><scope>7SC</scope><scope>7SE</scope><scope>7SP</scope><scope>7SR</scope><scope>7TA</scope><scope>7TB</scope><scope>7TM</scope><scope>7TO</scope><scope>7U5</scope><scope>8BQ</scope><scope>8FD</scope><scope>F28</scope><scope>FR3</scope><scope>H8D</scope><scope>H8G</scope><scope>H94</scope><scope>JG9</scope><scope>JQ2</scope><scope>K9.</scope><scope>KR7</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>P64</scope><scope>7X8</scope></search><sort><creationdate>20040322</creationdate><title>Reproducibility of SELDI-TOF protein patterns in serum: comparing datasets from different experiments</title><author>Baggerly, Keith A. ; Morris, Jeffrey S. ; Coombes, Kevin R.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c544t-29890898591331b0aa66c7bb726e7ba86b0ad5357e44b0cde70f2ff99298040c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2004</creationdate><topic>Algorithms</topic><topic>Biological and medical sciences</topic><topic>Blood Proteins - analysis</topic><topic>Blood Proteins - chemistry</topic><topic>Blood Proteins - classification</topic><topic>Databases, Protein</topic><topic>Diagnosis, Computer-Assisted - methods</topic><topic>Diagnosis, Computer-Assisted - standards</topic><topic>Female</topic><topic>Fundamental and applied biological sciences. Psychology</topic><topic>General aspects</topic><topic>Humans</topic><topic>Lasers</topic><topic>Mass Spectrometry - methods</topic><topic>Mass Spectrometry - standards</topic><topic>Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects)</topic><topic>Ovarian Neoplasms - blood</topic><topic>Ovarian Neoplasms - diagnosis</topic><topic>Reproducibility of Results</topic><topic>Sensitivity and Specificity</topic><topic>Sequence Analysis, Protein - methods</topic><topic>Software</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Baggerly, Keith A.</creatorcontrib><creatorcontrib>Morris, Jeffrey S.</creatorcontrib><creatorcontrib>Coombes, Kevin R.</creatorcontrib><collection>Istex</collection><collection>Pascal-Francis</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Aluminium Industry Abstracts</collection><collection>Biotechnology Research Abstracts</collection><collection>Ceramic Abstracts</collection><collection>Computer and Information Systems Abstracts</collection><collection>Corrosion Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>Materials Business File</collection><collection>Mechanical &amp; Transportation Engineering Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Oncogenes and Growth Factors Abstracts</collection><collection>Solid State and Superconductivity Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>ANTE: Abstracts in New Technology &amp; Engineering</collection><collection>Engineering Research Database</collection><collection>Aerospace Database</collection><collection>Copper Technical Reference Library</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Civil Engineering Abstracts</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>MEDLINE - Academic</collection><jtitle>Bioinformatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Baggerly, Keith A.</au><au>Morris, Jeffrey S.</au><au>Coombes, Kevin R.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Reproducibility of SELDI-TOF protein patterns in serum: comparing datasets from different experiments</atitle><jtitle>Bioinformatics</jtitle><addtitle>Bioinformatics</addtitle><date>2004-03-22</date><risdate>2004</risdate><volume>20</volume><issue>5</issue><spage>777</spage><epage>785</epage><pages>777-785</pages><issn>1367-4803</issn><eissn>1460-2059</eissn><eissn>1367-4811</eissn><coden>BOINFP</coden><abstract>Motivation: There has been much interest in using patterns derived from surface-enhanced laser desorption and ionization (SELDI) protein mass spectra from serum to differentiate samples from patients both with and without disease. Such patterns have been used without identification of the underlying proteins responsible. However, there are questions as to the stability of this procedure over multiple experiments. Results: We compared SELDI proteomic spectra from serum from three experiments by the same group on separating ovarian cancer from normal tissue. These spectra are available on the web at http://clinicalproteomics.steem.com. In general, the results were not reproducible across experiments. Baseline correction prevents reproduction of the results for two of the experiments. In one experiment, there is evidence of a major shift in protocol mid-experiment which could bias the results. In another, structure in the noise regions of the spectra allows us to distinguish normal from cancer, suggesting that the normals and cancers were processed differently. Sets of features found to discriminate well in one experiment do not generalize to other experiments. Finally, the mass calibration in all three experiments appears suspect. Taken together, these and other concerns suggest that much of the structure uncovered in these experiments could be due to artifacts of sample processing, not to the underlying biology of cancer. We provide some guidelines for design and analysis in experiments like these to ensure better reproducible, biologically meaningfully results. Availability: The MATLAB and Perl code used in our analyses is available at http://bioinformatics.mdanderson.org</abstract><cop>Oxford</cop><pub>Oxford University Press</pub><pmid>14751995</pmid><doi>10.1093/bioinformatics/btg484</doi><tpages>9</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1367-4803
ispartof Bioinformatics, 2004-03, Vol.20 (5), p.777-785
issn 1367-4803
1460-2059
1367-4811
language eng
recordid cdi_proquest_miscellaneous_71768857
source MEDLINE; Oxford Journals Open Access Collection; EZB-FREE-00999 freely available EZB journals; Alma/SFX Local Collection
subjects Algorithms
Biological and medical sciences
Blood Proteins - analysis
Blood Proteins - chemistry
Blood Proteins - classification
Databases, Protein
Diagnosis, Computer-Assisted - methods
Diagnosis, Computer-Assisted - standards
Female
Fundamental and applied biological sciences. Psychology
General aspects
Humans
Lasers
Mass Spectrometry - methods
Mass Spectrometry - standards
Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects)
Ovarian Neoplasms - blood
Ovarian Neoplasms - diagnosis
Reproducibility of Results
Sensitivity and Specificity
Sequence Analysis, Protein - methods
Software
title Reproducibility of SELDI-TOF protein patterns in serum: comparing datasets from different experiments
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-09T07%3A13%3A03IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Reproducibility%20of%20SELDI-TOF%20protein%20patterns%20in%20serum:%20comparing%20datasets%20from%20different%20experiments&rft.jtitle=Bioinformatics&rft.au=Baggerly,%20Keith%20A.&rft.date=2004-03-22&rft.volume=20&rft.issue=5&rft.spage=777&rft.epage=785&rft.pages=777-785&rft.issn=1367-4803&rft.eissn=1460-2059&rft.coden=BOINFP&rft_id=info:doi/10.1093/bioinformatics/btg484&rft_dat=%3Cproquest_cross%3E71768857%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=198662043&rft_id=info:pmid/14751995&rfr_iscdi=true