Disentangling the Complex Multiplexed DIA Spectra in De Novo Peptide Sequencing

Data-Independent Acquisition (DIA) was introduced to improve sensitivity by covering all peptides in a range, rather than sampling only high-intensity peaks as in Data-Dependent Acquisition (DDA) mass spectrometry. However, it remains unclear how useful DIA data are for de novo peptide sequencing as th...

Bibliographic Details
Main authors: Ma, Zheng; Mao, Zeping; Zhang, Ruixue; Chen, Jiazhen; Xin, Lei; Shan, Paul; Ghodsi, Ali; Li, Ming
Format: Article
Language: eng
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Ma, Zheng
Mao, Zeping
Zhang, Ruixue
Chen, Jiazhen
Xin, Lei
Shan, Paul
Ghodsi, Ali
Li, Ming
description Data-Independent Acquisition (DIA) was introduced to improve sensitivity by covering all peptides in a range, rather than sampling only high-intensity peaks as in Data-Dependent Acquisition (DDA) mass spectrometry. However, it remains unclear how useful DIA data are for de novo peptide sequencing, because DIA data are marred by coeluted peptides, high noise, and varying data quality. We present a new deep learning method, DIANovo, that addresses each of these difficulties. By equipping the model with a deeper understanding of coeluted DIA spectra, it improves on the previously established system DeepNovo-DIA by 25% to 81% (averaging 48%) in amino acid recall and by 27% to 89% (averaging 57%) in peptide recall. This paper also provides criteria for when DIA data can and cannot be used for de novo peptide sequencing, by comparing DDA and DIA in both de novo and database search modes. We find that while DIA excels with narrow isolation windows on older-generation instruments, it loses its advantage with wider windows. With the Orbitrap Astral, however, DIA consistently outperforms DDA because narrow-window mode is enabled. We also provide a theoretical explanation of this phenomenon, emphasizing the critical role of the signal-to-noise profile in the successful application of de novo sequencing.
doi_str_mv 10.48550/arxiv.2411.15684
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2411_15684</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2411_15684</sourcerecordid><originalsourceid>FETCH-arxiv_primary_2411_156843</originalsourceid><addsrcrecordid>eNpjYJA0NNAzsTA1NdBPLKrILNMzMjE01DM0NbMw4WTwd8ksTs0rScxLz8nMS1coyUhVcM7PLchJrVDwLc0pyQSxUlMUXDwdFYILUpNLihIVMvMUXFIV_PLL8hUCUgtKMlNSFYJTC0tT85KBJvAwsKYl5hSn8kJpbgZ5N9cQZw9dsNXxBUWZuYlFlfEgJ8SDnWBMWAUAfz87rw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Disentangling the Complex Multiplexed DIA Spectra in De Novo Peptide Sequencing</title><source>arXiv.org</source><creator>Ma, Zheng ; Mao, Zeping ; Zhang, Ruixue ; Chen, Jiazhen ; Xin, Lei ; Shan, Paul ; Ghodsi, Ali ; Li, Ming</creator><creatorcontrib>Ma, Zheng ; Mao, Zeping ; Zhang, Ruixue ; Chen, Jiazhen ; Xin, Lei ; Shan, Paul ; Ghodsi, Ali ; Li, Ming</creatorcontrib><description>Data-Independent Acquisition (DIA) was introduced to improve sensitivity to cover all peptides in a range rather than only sampling high-intensity peaks as in Data-Dependent Acquisition (DDA) mass spectrometry. However, it is not very clear how useful DIA data is for de novo peptide sequencing as the DIA data are marred with coeluted peptides, high noises, and varying data quality. We present a new deep learning method DIANovo, and address each of these difficulties, and improves the previous established system DeepNovo-DIA by from 25% to 81%, averaging 48%, for amino acid recall, and by from 27% to 89%, averaging 57%, for peptide recall, by equipping the model with a deeper understanding of coeluted DIA spectra. This paper also provides criteria about when DIA data could be used for de novo peptide sequencing and when not to by providing a comparison between DDA and DIA, in both de novo and database search mode. 
We find that while DIA excels with narrow isolation windows on older-generation instruments, it loses its advantage with wider windows. However, with Orbitrap Astral, DIA consistently outperforms DDA due to narrow window mode enabled. We also provide a theoretical explanation of this phenomenon, emphasizing the critical role of the signal-to-noise profile in the successful application of de novo sequencing.</description><identifier>DOI: 10.48550/arxiv.2411.15684</identifier><language>eng</language><subject>Computer Science - Learning ; Quantitative Biology - Biomolecules</subject><creationdate>2024-11</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2411.15684$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2411.15684$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Ma, Zheng</creatorcontrib><creatorcontrib>Mao, Zeping</creatorcontrib><creatorcontrib>Zhang, Ruixue</creatorcontrib><creatorcontrib>Chen, Jiazhen</creatorcontrib><creatorcontrib>Xin, Lei</creatorcontrib><creatorcontrib>Shan, Paul</creatorcontrib><creatorcontrib>Ghodsi, Ali</creatorcontrib><creatorcontrib>Li, Ming</creatorcontrib><title>Disentangling the Complex Multiplexed DIA Spectra in De Novo Peptide Sequencing</title><description>Data-Independent Acquisition (DIA) was introduced to improve sensitivity to cover all peptides in a range rather than only sampling high-intensity peaks as in Data-Dependent Acquisition (DDA) mass spectrometry. 
However, it is not very clear how useful DIA data is for de novo peptide sequencing as the DIA data are marred with coeluted peptides, high noises, and varying data quality. We present a new deep learning method DIANovo, and address each of these difficulties, and improves the previous established system DeepNovo-DIA by from 25% to 81%, averaging 48%, for amino acid recall, and by from 27% to 89%, averaging 57%, for peptide recall, by equipping the model with a deeper understanding of coeluted DIA spectra. This paper also provides criteria about when DIA data could be used for de novo peptide sequencing and when not to by providing a comparison between DDA and DIA, in both de novo and database search mode. We find that while DIA excels with narrow isolation windows on older-generation instruments, it loses its advantage with wider windows. However, with Orbitrap Astral, DIA consistently outperforms DDA due to narrow window mode enabled. We also provide a theoretical explanation of this phenomenon, emphasizing the critical role of the signal-to-noise profile in the successful application of de novo sequencing.</description><subject>Computer Science - Learning</subject><subject>Quantitative Biology - Biomolecules</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNpjYJA0NNAzsTA1NdBPLKrILNMzMjE01DM0NbMw4WTwd8ksTs0rScxLz8nMS1coyUhVcM7PLchJrVDwLc0pyQSxUlMUXDwdFYILUpNLihIVMvMUXFIV_PLL8hUCUgtKMlNSFYJTC0tT85KBJvAwsKYl5hSn8kJpbgZ5N9cQZw9dsNXxBUWZuYlFlfEgJ8SDnWBMWAUAfz87rw</recordid><startdate>20241123</startdate><enddate>20241123</enddate><creator>Ma, Zheng</creator><creator>Mao, Zeping</creator><creator>Zhang, Ruixue</creator><creator>Chen, Jiazhen</creator><creator>Xin, Lei</creator><creator>Shan, Paul</creator><creator>Ghodsi, Ali</creator><creator>Li, 
Ming</creator><scope>AKY</scope><scope>ALC</scope><scope>GOX</scope></search><sort><creationdate>20241123</creationdate><title>Disentangling the Complex Multiplexed DIA Spectra in De Novo Peptide Sequencing</title><author>Ma, Zheng ; Mao, Zeping ; Zhang, Ruixue ; Chen, Jiazhen ; Xin, Lei ; Shan, Paul ; Ghodsi, Ali ; Li, Ming</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-arxiv_primary_2411_156843</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Science - Learning</topic><topic>Quantitative Biology - Biomolecules</topic><toplevel>online_resources</toplevel><creatorcontrib>Ma, Zheng</creatorcontrib><creatorcontrib>Mao, Zeping</creatorcontrib><creatorcontrib>Zhang, Ruixue</creatorcontrib><creatorcontrib>Chen, Jiazhen</creatorcontrib><creatorcontrib>Xin, Lei</creatorcontrib><creatorcontrib>Shan, Paul</creatorcontrib><creatorcontrib>Ghodsi, Ali</creatorcontrib><creatorcontrib>Li, Ming</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv Quantitative Biology</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Ma, Zheng</au><au>Mao, Zeping</au><au>Zhang, Ruixue</au><au>Chen, Jiazhen</au><au>Xin, Lei</au><au>Shan, Paul</au><au>Ghodsi, Ali</au><au>Li, Ming</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Disentangling the Complex Multiplexed DIA Spectra in De Novo Peptide Sequencing</atitle><date>2024-11-23</date><risdate>2024</risdate><abstract>Data-Independent Acquisition (DIA) was introduced to improve sensitivity to cover all peptides in a range rather than only sampling high-intensity peaks as in Data-Dependent Acquisition (DDA) mass spectrometry. 
However, it is not very clear how useful DIA data is for de novo peptide sequencing as the DIA data are marred with coeluted peptides, high noises, and varying data quality. We present a new deep learning method DIANovo, and address each of these difficulties, and improves the previous established system DeepNovo-DIA by from 25% to 81%, averaging 48%, for amino acid recall, and by from 27% to 89%, averaging 57%, for peptide recall, by equipping the model with a deeper understanding of coeluted DIA spectra. This paper also provides criteria about when DIA data could be used for de novo peptide sequencing and when not to by providing a comparison between DDA and DIA, in both de novo and database search mode. We find that while DIA excels with narrow isolation windows on older-generation instruments, it loses its advantage with wider windows. However, with Orbitrap Astral, DIA consistently outperforms DDA due to narrow window mode enabled. We also provide a theoretical explanation of this phenomenon, emphasizing the critical role of the signal-to-noise profile in the successful application of de novo sequencing.</abstract><doi>10.48550/arxiv.2411.15684</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2411.15684
ispartof
issn
language eng
recordid cdi_arxiv_primary_2411_15684
source arXiv.org
subjects Computer Science - Learning
Quantitative Biology - Biomolecules
title Disentangling the Complex Multiplexed DIA Spectra in De Novo Peptide Sequencing
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-21T21%3A07%3A53IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Disentangling%20the%20Complex%20Multiplexed%20DIA%20Spectra%20in%20De%20Novo%20Peptide%20Sequencing&rft.au=Ma,%20Zheng&rft.date=2024-11-23&rft_id=info:doi/10.48550/arxiv.2411.15684&rft_dat=%3Carxiv_GOX%3E2411_15684%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true