Benchmark of structured machine learning methods for microbial identification from mass-spectrometry data
Microbial identification is a central issue in microbiology, in particular in the fields of infectious diseases diagnosis and industrial quality control. The concept of species is tightly linked to the concept of biological and clinical classification where the proximity between species is generally...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Vervier, Kévin Mahé, Pierre Veyrieras, Jean-Baptiste Vert, Jean-Philippe |
description | Microbial identification is a central issue in microbiology, in particular in
the fields of infectious diseases diagnosis and industrial quality control. The
concept of species is tightly linked to the concept of biological and clinical
classification where the proximity between species is generally measured in
terms of evolutionary distances and/or clinical phenotypes. Surprisingly, the
information provided by this well-known hierarchical structure is rarely used
by machine learning-based automatic microbial identification systems.
Structured machine learning methods were recently proposed for taking into
account the structure embedded in a hierarchy and using it as additional a
priori information, and could therefore allow to improve microbial
identification systems. We test and compare several state-of-the-art machine
learning methods for microbial identification on a new Matrix-Assisted Laser
Desorption/Ionization Time-of-Flight mass spectrometry (MALDI-TOF MS) dataset.
We include in the benchmark standard and structured methods, that leverage the
knowledge of the underlying hierarchical structure in the learning process. Our
results show that although some methods perform better than others, structured
methods do not consistently perform better than their "flat" counterparts. We
postulate that this is partly due to the fact that standard methods already
reach a high level of accuracy in this context, and that they mainly confuse
species close to each other in the tree, a case where using the known hierarchy
is not helpful. |
doi_str_mv | 10.48550/arxiv.1506.07251 |
format | Article |
fullrecord | <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_1506_07251</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1506_07251</sourcerecordid><originalsourceid>FETCH-LOGICAL-a671-2b9fe72d75c76f967735a0803d354b571f766070fa7538b3d54db53be62263d13</originalsourceid><addsrcrecordid>eNotz81OAyEUBWA2Lkz1AVzJC8wIwwDTpTb-JU266X5ygYtDnIEGqLFvb62uTs7inOQj5I6zth-kZA-Qv8NXyyVTLdOd5NckPGG00wL5kyZPS81HW48ZHV3ATiEinRFyDPGDLlin5Ar1KdMl2JxMgJkGh7EGHyzUkCL1OS3naSlNOaCt54Y1n6iDCjfkysNc8PY_V2T_8rzfvDXb3ev75nHbgNK86czao-6cllYrv1ZaCwlsYMIJ2RupuddKMc08aCkGI5zsnZHCoOo6JRwXK3L_d3uxjocczrjT-GseL2bxAwjuU1Y</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Benchmark of structured machine learning methods for microbial identification from mass-spectrometry data</title><source>arXiv.org</source><creator>Vervier, Kévin ; Mahé, Pierre ; Veyrieras, Jean-Baptiste ; Vert, Jean-Philippe</creator><creatorcontrib>Vervier, Kévin ; Mahé, Pierre ; Veyrieras, Jean-Baptiste ; Vert, Jean-Philippe</creatorcontrib><description>Microbial identification is a central issue in microbiology, in particular in
the fields of infectious diseases diagnosis and industrial quality control. The
concept of species is tightly linked to the concept of biological and clinical
classification where the proximity between species is generally measured in
terms of evolutionary distances and/or clinical phenotypes. Surprisingly, the
information provided by this well-known hierarchical structure is rarely used
by machine learning-based automatic microbial identification systems.
Structured machine learning methods were recently proposed for taking into
account the structure embedded in a hierarchy and using it as additional a
priori information, and could therefore allow to improve microbial
identification systems. We test and compare several state-of-the-art machine
learning methods for microbial identification on a new Matrix-Assisted Laser
Desorption/Ionization Time-of-Flight mass spectrometry (MALDI-TOF MS) dataset.
We include in the benchmark standard and structured methods, that leverage the
knowledge of the underlying hierarchical structure in the learning process. Our
results show that although some methods perform better than others, structured
methods do not consistently perform better than their "flat" counterparts. We
postulate that this is partly due to the fact that standard methods already
reach a high level of accuracy in this context, and that they mainly confuse
species close to each other in the tree, a case where using the known hierarchy
is not helpful.</description><identifier>DOI: 10.48550/arxiv.1506.07251</identifier><language>eng</language><subject>Computer Science - Learning ; Quantitative Biology - Quantitative Methods ; Statistics - Machine Learning</subject><creationdate>2015-06</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/1506.07251$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.1506.07251$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Vervier, Kévin</creatorcontrib><creatorcontrib>Mahé, Pierre</creatorcontrib><creatorcontrib>Veyrieras, Jean-Baptiste</creatorcontrib><creatorcontrib>Vert, Jean-Philippe</creatorcontrib><title>Benchmark of structured machine learning methods for microbial identification from mass-spectrometry data</title><description>Microbial identification is a central issue in microbiology, in particular in
the fields of infectious diseases diagnosis and industrial quality control. The
concept of species is tightly linked to the concept of biological and clinical
classification where the proximity between species is generally measured in
terms of evolutionary distances and/or clinical phenotypes. Surprisingly, the
information provided by this well-known hierarchical structure is rarely used
by machine learning-based automatic microbial identification systems.
Structured machine learning methods were recently proposed for taking into
account the structure embedded in a hierarchy and using it as additional a
priori information, and could therefore allow to improve microbial
identification systems. We test and compare several state-of-the-art machine
learning methods for microbial identification on a new Matrix-Assisted Laser
Desorption/Ionization Time-of-Flight mass spectrometry (MALDI-TOF MS) dataset.
We include in the benchmark standard and structured methods, that leverage the
knowledge of the underlying hierarchical structure in the learning process. Our
results show that although some methods perform better than others, structured
methods do not consistently perform better than their "flat" counterparts. We
postulate that this is partly due to the fact that standard methods already
reach a high level of accuracy in this context, and that they mainly confuse
species close to each other in the tree, a case where using the known hierarchy
is not helpful.</description><subject>Computer Science - Learning</subject><subject>Quantitative Biology - Quantitative Methods</subject><subject>Statistics - Machine Learning</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2015</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotz81OAyEUBWA2Lkz1AVzJC8wIwwDTpTb-JU266X5ygYtDnIEGqLFvb62uTs7inOQj5I6zth-kZA-Qv8NXyyVTLdOd5NckPGG00wL5kyZPS81HW48ZHV3ATiEinRFyDPGDLlin5Ar1KdMl2JxMgJkGh7EGHyzUkCL1OS3naSlNOaCt54Y1n6iDCjfkysNc8PY_V2T_8rzfvDXb3ev75nHbgNK86czao-6cllYrv1ZaCwlsYMIJ2RupuddKMc08aCkGI5zsnZHCoOo6JRwXK3L_d3uxjocczrjT-GseL2bxAwjuU1Y</recordid><startdate>20150624</startdate><enddate>20150624</enddate><creator>Vervier, Kévin</creator><creator>Mahé, Pierre</creator><creator>Veyrieras, Jean-Baptiste</creator><creator>Vert, Jean-Philippe</creator><scope>AKY</scope><scope>ALC</scope><scope>EPD</scope><scope>GOX</scope></search><sort><creationdate>20150624</creationdate><title>Benchmark of structured machine learning methods for microbial identification from mass-spectrometry data</title><author>Vervier, Kévin ; Mahé, Pierre ; Veyrieras, Jean-Baptiste ; Vert, Jean-Philippe</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a671-2b9fe72d75c76f967735a0803d354b571f766070fa7538b3d54db53be62263d13</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2015</creationdate><topic>Computer Science - Learning</topic><topic>Quantitative Biology - Quantitative Methods</topic><topic>Statistics - Machine Learning</topic><toplevel>online_resources</toplevel><creatorcontrib>Vervier, Kévin</creatorcontrib><creatorcontrib>Mahé, Pierre</creatorcontrib><creatorcontrib>Veyrieras, Jean-Baptiste</creatorcontrib><creatorcontrib>Vert, Jean-Philippe</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv Quantitative Biology</collection><collection>arXiv Statistics</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Vervier, Kévin</au><au>Mahé, Pierre</au><au>Veyrieras, Jean-Baptiste</au><au>Vert, Jean-Philippe</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Benchmark of structured machine learning methods for microbial identification from mass-spectrometry data</atitle><date>2015-06-24</date><risdate>2015</risdate><abstract>Microbial identification is a central issue in microbiology, in particular in
the fields of infectious diseases diagnosis and industrial quality control. The
concept of species is tightly linked to the concept of biological and clinical
classification where the proximity between species is generally measured in
terms of evolutionary distances and/or clinical phenotypes. Surprisingly, the
information provided by this well-known hierarchical structure is rarely used
by machine learning-based automatic microbial identification systems.
Structured machine learning methods were recently proposed for taking into
account the structure embedded in a hierarchy and using it as additional a
priori information, and could therefore allow to improve microbial
identification systems. We test and compare several state-of-the-art machine
learning methods for microbial identification on a new Matrix-Assisted Laser
Desorption/Ionization Time-of-Flight mass spectrometry (MALDI-TOF MS) dataset.
We include in the benchmark standard and structured methods, that leverage the
knowledge of the underlying hierarchical structure in the learning process. Our
results show that although some methods perform better than others, structured
methods do not consistently perform better than their "flat" counterparts. We
postulate that this is partly due to the fact that standard methods already
reach a high level of accuracy in this context, and that they mainly confuse
species close to each other in the tree, a case where using the known hierarchy
is not helpful.</abstract><doi>10.48550/arxiv.1506.07251</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.48550/arxiv.1506.07251 |
ispartof | |
issn | |
language | eng |
recordid | cdi_arxiv_primary_1506_07251 |
source | arXiv.org |
subjects | Computer Science - Learning Quantitative Biology - Quantitative Methods Statistics - Machine Learning |
title | Benchmark of structured machine learning methods for microbial identification from mass-spectrometry data |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-19T17%3A12%3A29IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Benchmark%20of%20structured%20machine%20learning%20methods%20for%20microbial%20identification%20from%20mass-spectrometry%20data&rft.au=Vervier,%20K%C3%A9vin&rft.date=2015-06-24&rft_id=info:doi/10.48550/arxiv.1506.07251&rft_dat=%3Carxiv_GOX%3E1506_07251%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |