Addressing big data challenges in mass spectrometry-based metabolomics

Advancements in computer science and software engineering have greatly facilitated mass spectrometry (MS)-based untargeted metabolomics. Nowadays, gigabytes of metabolomics data are routinely generated from MS platforms, containing condensed structural and quantitative information from thousands of...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Chemical communications (Cambridge, England) England), 2022-09, Vol.58 (72), p.9979-999
Hauptverfasser: Guo, Jian, Yu, Huaxu, Xing, Shipei, Huan, Tao
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 999
container_issue 72
container_start_page 9979
container_title Chemical communications (Cambridge, England)
container_volume 58
creator Guo, Jian
Yu, Huaxu
Xing, Shipei
Huan, Tao
description Advancements in computer science and software engineering have greatly facilitated mass spectrometry (MS)-based untargeted metabolomics. Nowadays, gigabytes of metabolomics data are routinely generated from MS platforms, containing condensed structural and quantitative information from thousands of metabolites. Manual data processing is almost impossible due to the large data size. Therefore, in the "omics" era, we are faced with new challenges, the big data challenges of how to accurately and efficiently process the raw data, extract the biological information, and visualize the results from the gigantic amount of collected data. Although important, proposing solutions to address these big data challenges requires broad interdisciplinary knowledge, which can be challenging for many metabolomics practitioners. Our laboratory in the Department of Chemistry at the University of British Columbia is committed to combining analytical chemistry, computer science, and statistics to develop bioinformatics tools that address these big data challenges. In this Feature Article, we elaborate on the major big data challenges in metabolomics, including data acquisition, feature extraction, quantitative measurements, statistical analysis, and metabolite annotation. We also introduce our recently developed bioinformatics solutions for these challenges. Notably, all of the bioinformatics tools and source codes are freely available on GitHub ( https://www.github.com/HuanLab ), along with revised and regularly updated content. This work elaborates on a suite of bioinformatics solutions developed in the Huan lab to address big-data challenges in metabolomics.
doi_str_mv 10.1039/d2cc03598g
format Article
fullrecord <record><control><sourceid>proquest_rsc_p</sourceid><recordid>TN_cdi_rsc_primary_d2cc03598g</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2705750393</sourcerecordid><originalsourceid>FETCH-LOGICAL-c280t-7f6560cb83e4f2ee366dd51d5006c8ae419294df0bec2d7c97f6b83f5b66d8313</originalsourceid><addsrcrecordid>eNpd0E1Lw0AQBuBFFKzVi3ch4EWE6H4m2WOJtgoFLwrewmZ3ElPyUXfSQ_-9WyMKzmXm8MwwvIRcMnrHqND3jltLhdJZfURmTCQyVjJ7Pz7MSsepkOqUnCFuaCimshlZLpzzgNj0dVQ2deTMaCL7YdoW-howavqoM4gRbsGOfuhg9Pu4NAguCrMph3boGovn5KQyLcLFT5-Tt-Xja_4Ur19Wz_liHVue0TFOq0Ql1JaZAFlxAJEkzinmFKWJzQxIprmWrqIlWO5Sq8NCwJUqA8wEE3NyM93d-uFzBzgWXYMW2tb0MOyw4ClVqQpJiECv_9HNsPN9-C4oxrjkWqdB3U7K-gHRQ1VsfdMZvy8YLQ6RFg88z78jXQV8NWGP9tf9RS6-ACrecow</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2711242997</pqid></control><display><type>article</type><title>Addressing big data challenges in mass spectrometry-based metabolomics</title><source>Royal Society Of Chemistry Journals 2008-</source><source>Alma/SFX Local Collection</source><creator>Guo, Jian ; Yu, Huaxu ; Xing, Shipei ; Huan, Tao</creator><creatorcontrib>Guo, Jian ; Yu, Huaxu ; Xing, Shipei ; Huan, Tao</creatorcontrib><description>Advancements in computer science and software engineering have greatly facilitated mass spectrometry (MS)-based untargeted metabolomics. Nowadays, gigabytes of metabolomics data are routinely generated from MS platforms, containing condensed structural and quantitative information from thousands of metabolites. Manual data processing is almost impossible due to the large data size. Therefore, in the "omics" era, we are faced with new challenges, the big data challenges of how to accurately and efficiently process the raw data, extract the biological information, and visualize the results from the gigantic amount of collected data. Although important, proposing solutions to address these big data challenges requires broad interdisciplinary knowledge, which can be challenging for many metabolomics practitioners. Our laboratory in the Department of Chemistry at the University of British Columbia is committed to combining analytical chemistry, computer science, and statistics to develop bioinformatics tools that address these big data challenges. In this Feature Article, we elaborate on the major big data challenges in metabolomics, including data acquisition, feature extraction, quantitative measurements, statistical analysis, and metabolite annotation. We also introduce our recently developed bioinformatics solutions for these challenges. Notably, all of the bioinformatics tools and source codes are freely available on GitHub ( https://www.github.com/HuanLab ), along with revised and regularly updated content. This work elaborates on a suite of bioinformatics solutions developed in the Huan lab to address big-data challenges in metabolomics.</description><identifier>ISSN: 1359-7345</identifier><identifier>EISSN: 1364-548X</identifier><identifier>DOI: 10.1039/d2cc03598g</identifier><language>eng</language><publisher>Cambridge: Royal Society of Chemistry</publisher><subject>Analytical chemistry ; Annotations ; Big Data ; Bioinformatics ; Computer science ; Data acquisition ; Data collection ; Data processing ; Feature extraction ; Mass spectrometry ; Metabolites ; Scientific imaging ; Software engineering ; Spectroscopy ; Statistical analysis</subject><ispartof>Chemical communications (Cambridge, England), 2022-09, Vol.58 (72), p.9979-999</ispartof><rights>Copyright Royal Society of Chemistry 2022</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c280t-7f6560cb83e4f2ee366dd51d5006c8ae419294df0bec2d7c97f6b83f5b66d8313</citedby><cites>FETCH-LOGICAL-c280t-7f6560cb83e4f2ee366dd51d5006c8ae419294df0bec2d7c97f6b83f5b66d8313</cites><orcidid>0000-0001-6295-2435</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids></links><search><creatorcontrib>Guo, Jian</creatorcontrib><creatorcontrib>Yu, Huaxu</creatorcontrib><creatorcontrib>Xing, Shipei</creatorcontrib><creatorcontrib>Huan, Tao</creatorcontrib><title>Addressing big data challenges in mass spectrometry-based metabolomics</title><title>Chemical communications (Cambridge, England)</title><description>Advancements in computer science and software engineering have greatly facilitated mass spectrometry (MS)-based untargeted metabolomics. Nowadays, gigabytes of metabolomics data are routinely generated from MS platforms, containing condensed structural and quantitative information from thousands of metabolites. Manual data processing is almost impossible due to the large data size. Therefore, in the "omics" era, we are faced with new challenges, the big data challenges of how to accurately and efficiently process the raw data, extract the biological information, and visualize the results from the gigantic amount of collected data. Although important, proposing solutions to address these big data challenges requires broad interdisciplinary knowledge, which can be challenging for many metabolomics practitioners. Our laboratory in the Department of Chemistry at the University of British Columbia is committed to combining analytical chemistry, computer science, and statistics to develop bioinformatics tools that address these big data challenges. In this Feature Article, we elaborate on the major big data challenges in metabolomics, including data acquisition, feature extraction, quantitative measurements, statistical analysis, and metabolite annotation. We also introduce our recently developed bioinformatics solutions for these challenges. Notably, all of the bioinformatics tools and source codes are freely available on GitHub ( https://www.github.com/HuanLab ), along with revised and regularly updated content. This work elaborates on a suite of bioinformatics solutions developed in the Huan lab to address big-data challenges in metabolomics.</description><subject>Analytical chemistry</subject><subject>Annotations</subject><subject>Big Data</subject><subject>Bioinformatics</subject><subject>Computer science</subject><subject>Data acquisition</subject><subject>Data collection</subject><subject>Data processing</subject><subject>Feature extraction</subject><subject>Mass spectrometry</subject><subject>Metabolites</subject><subject>Scientific imaging</subject><subject>Software engineering</subject><subject>Spectroscopy</subject><subject>Statistical analysis</subject><issn>1359-7345</issn><issn>1364-548X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><recordid>eNpd0E1Lw0AQBuBFFKzVi3ch4EWE6H4m2WOJtgoFLwrewmZ3ElPyUXfSQ_-9WyMKzmXm8MwwvIRcMnrHqND3jltLhdJZfURmTCQyVjJ7Pz7MSsepkOqUnCFuaCimshlZLpzzgNj0dVQ2deTMaCL7YdoW-howavqoM4gRbsGOfuhg9Pu4NAguCrMph3boGovn5KQyLcLFT5-Tt-Xja_4Ur19Wz_liHVue0TFOq0Ql1JaZAFlxAJEkzinmFKWJzQxIprmWrqIlWO5Sq8NCwJUqA8wEE3NyM93d-uFzBzgWXYMW2tb0MOyw4ClVqQpJiECv_9HNsPN9-C4oxrjkWqdB3U7K-gHRQ1VsfdMZvy8YLQ6RFg88z78jXQV8NWGP9tf9RS6-ACrecow</recordid><startdate>20220908</startdate><enddate>20220908</enddate><creator>Guo, Jian</creator><creator>Yu, Huaxu</creator><creator>Xing, Shipei</creator><creator>Huan, Tao</creator><general>Royal Society of Chemistry</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SR</scope><scope>7U5</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>L7M</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0001-6295-2435</orcidid></search><sort><creationdate>20220908</creationdate><title>Addressing big data challenges in mass spectrometry-based metabolomics</title><author>Guo, Jian ; Yu, Huaxu ; Xing, Shipei ; Huan, Tao</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c280t-7f6560cb83e4f2ee366dd51d5006c8ae419294df0bec2d7c97f6b83f5b66d8313</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Analytical chemistry</topic><topic>Annotations</topic><topic>Big Data</topic><topic>Bioinformatics</topic><topic>Computer science</topic><topic>Data acquisition</topic><topic>Data collection</topic><topic>Data processing</topic><topic>Feature extraction</topic><topic>Mass spectrometry</topic><topic>Metabolites</topic><topic>Scientific imaging</topic><topic>Software engineering</topic><topic>Spectroscopy</topic><topic>Statistical analysis</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Guo, Jian</creatorcontrib><creatorcontrib>Yu, Huaxu</creatorcontrib><creatorcontrib>Xing, Shipei</creatorcontrib><creatorcontrib>Huan, Tao</creatorcontrib><collection>CrossRef</collection><collection>Engineered Materials Abstracts</collection><collection>Solid State and Superconductivity Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>MEDLINE - Academic</collection><jtitle>Chemical communications (Cambridge, England)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Guo, Jian</au><au>Yu, Huaxu</au><au>Xing, Shipei</au><au>Huan, Tao</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Addressing big data challenges in mass spectrometry-based metabolomics</atitle><jtitle>Chemical communications (Cambridge, England)</jtitle><date>2022-09-08</date><risdate>2022</risdate><volume>58</volume><issue>72</issue><spage>9979</spage><epage>999</epage><pages>9979-999</pages><issn>1359-7345</issn><eissn>1364-548X</eissn><abstract>Advancements in computer science and software engineering have greatly facilitated mass spectrometry (MS)-based untargeted metabolomics. Nowadays, gigabytes of metabolomics data are routinely generated from MS platforms, containing condensed structural and quantitative information from thousands of metabolites. Manual data processing is almost impossible due to the large data size. Therefore, in the "omics" era, we are faced with new challenges, the big data challenges of how to accurately and efficiently process the raw data, extract the biological information, and visualize the results from the gigantic amount of collected data. Although important, proposing solutions to address these big data challenges requires broad interdisciplinary knowledge, which can be challenging for many metabolomics practitioners. Our laboratory in the Department of Chemistry at the University of British Columbia is committed to combining analytical chemistry, computer science, and statistics to develop bioinformatics tools that address these big data challenges. In this Feature Article, we elaborate on the major big data challenges in metabolomics, including data acquisition, feature extraction, quantitative measurements, statistical analysis, and metabolite annotation. We also introduce our recently developed bioinformatics solutions for these challenges. Notably, all of the bioinformatics tools and source codes are freely available on GitHub ( https://www.github.com/HuanLab ), along with revised and regularly updated content. This work elaborates on a suite of bioinformatics solutions developed in the Huan lab to address big-data challenges in metabolomics.</abstract><cop>Cambridge</cop><pub>Royal Society of Chemistry</pub><doi>10.1039/d2cc03598g</doi><tpages>12</tpages><orcidid>https://orcid.org/0000-0001-6295-2435</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1359-7345
ispartof Chemical communications (Cambridge, England), 2022-09, Vol.58 (72), p.9979-999
issn 1359-7345
1364-548X
language eng
recordid cdi_rsc_primary_d2cc03598g
source Royal Society Of Chemistry Journals 2008-; Alma/SFX Local Collection
subjects Analytical chemistry
Annotations
Big Data
Bioinformatics
Computer science
Data acquisition
Data collection
Data processing
Feature extraction
Mass spectrometry
Metabolites
Scientific imaging
Software engineering
Spectroscopy
Statistical analysis
title Addressing big data challenges in mass spectrometry-based metabolomics
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-10T05%3A15%3A25IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_rsc_p&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Addressing%20big%20data%20challenges%20in%20mass%20spectrometry-based%20metabolomics&rft.jtitle=Chemical%20communications%20(Cambridge,%20England)&rft.au=Guo,%20Jian&rft.date=2022-09-08&rft.volume=58&rft.issue=72&rft.spage=9979&rft.epage=999&rft.pages=9979-999&rft.issn=1359-7345&rft.eissn=1364-548X&rft_id=info:doi/10.1039/d2cc03598g&rft_dat=%3Cproquest_rsc_p%3E2705750393%3C/proquest_rsc_p%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2711242997&rft_id=info:pmid/&rfr_iscdi=true