Machine Learning to predict tuberculosis in cattle from the state of Sao Paulo, Brazil

Abstract Tuberculosis is a well-known and worldwide spread zoonosis. In Brazil 1.594.787 cases were confirmed cases since 2001, where, in Sao Paulo state, 8.226 deaths were reported. This study aims to present steps related to the use of machine learning algorithms for predictive analysis for bovine...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:European journal of public health 2020-09, Vol.30 (Supplement_5)
Hauptverfasser: Pereira, L E C, Ferraudo, A S, Panosso, A R, Carvalho, A A B, Mathias, L A, Saches, A C, Hellwig, K S, Ancêncio, R A
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue Supplement_5
container_start_page
container_title European journal of public health
container_volume 30
creator Pereira, L E C
Ferraudo, A S
Panosso, A R
Carvalho, A A B
Mathias, L A
Saches, A C
Hellwig, K S
Ancêncio, R A
description Abstract Tuberculosis is a well-known and worldwide spread zoonosis. In Brazil 1.594.787 cases were confirmed cases since 2001, where, in Sao Paulo state, 8.226 deaths were reported. This study aims to present steps related to the use of machine learning algorithms for predictive analysis for bovine tuberculosis. For this, an application was made based on data from farms in state of São Paulo, Brazil, of an epidemiological survey, using a specific questionnaire, carried out on farms (n = 1,743). Response variable was presented by apparent prevalence of positive properties for disease, and predictors by (k = 77) predictors related to type of farm, type of lactation, number of animals on property. Application was organized according to following steps: division of data in training (75%) and testing (25%), pre-processing of predictors, learning and model evaluation. In the learning step, algorithm for adjusting gradient boosted trees models was used. The hyperparameters of algorithms were optimized by 10-fold cross-validation, to select those corresponding to best models. Models showed an accuracy of 88.07%, with an error in learning process equal to 3%. In the test / model validation procedure (n = 436), an error in 12% estimate was observed. Five important predictors were daily milk production, number of cows, type of farm, bovine breed and slaughter of adult animals. Proportion of false positives among all individuals whose response of interest was observed was 2.06%, and proportion of false negatives among those with a response of absent interest was 9.86%. It is hoped that, with increase in trained surveillance to detect the disease and availability of data, it will be possible to develop predictive models of machine learning with potential to efficiently assist professionals in disease control and assist in education program in animal health Key messages Predictive analyzes in health: application for tuberculosis in cattle from the state of Sao Paulo, Brazil. An infectious disease and zoonosis important to the world that needs support to develop means to control and consequently eradicate it.
doi_str_mv 10.1093/eurpub/ckaa166.849
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2476167871</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><oup_id>10.1093/eurpub/ckaa166.849</oup_id><sourcerecordid>2476167871</sourcerecordid><originalsourceid>FETCH-LOGICAL-c2029-6e9aeb93462897ea090ec24bb6798757a229f44d25d5741aedc973e264960ced3</originalsourceid><addsrcrecordid>eNqNkMlOwzAURS0EEqXwA6wssSWt7Th2vATEJBWBxCB2luO80JQ0Dh4W8PUEpR_A6t3FufdJB6FTShaUqHwJyQ-pWtpPY6gQi5KrPTSjXPAsF-R9f8yU0IwywQ7RUQgbQkghSzZDbw_Grtse8AqM79v-A0eHBw91ayOOqQJvU-dCG3DbY2ti7AA33m1xXAMO0UTArsHPxuEnM4Ln-NKbn7Y7RgeN6QKc7O4cvd5cv1zdZavH2_uri1VmGWEqE6AMVCrngpVKgiGKgGW8qoRUpSykYUw1nNesqAvJqYHaKpkDE1wJYqHO5-hs2h28-0oQot645PvxpWZcCipkKelIsYmy3oXgodGDb7fGf2tK9J8_PfnTO3969DeWsqnk0vAf_hfo0nUF</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2476167871</pqid></control><display><type>article</type><title>Machine Learning to predict tuberculosis in cattle from the state of Sao Paulo, Brazil</title><source>PAIS Index</source><source>Oxford Journals Open Access Collection</source><source>EZB-FREE-00999 freely available EZB journals</source><source>PubMed Central</source><source>Alma/SFX Local Collection</source><creator>Pereira, L E C ; Ferraudo, A S ; Panosso, A R ; Carvalho, A A B ; Mathias, L A ; Saches, A C ; Hellwig, K S ; Ancêncio, R A</creator><creatorcontrib>Pereira, L E C ; Ferraudo, A S ; Panosso, A R ; Carvalho, A A B ; Mathias, L A ; Saches, A C ; Hellwig, K S ; Ancêncio, R A</creatorcontrib><description>Abstract Tuberculosis is a well-known and worldwide spread zoonosis. In Brazil 1.594.787 cases were confirmed cases since 2001, where, in Sao Paulo state, 8.226 deaths were reported. This study aims to present steps related to the use of machine learning algorithms for predictive analysis for bovine tuberculosis. For this, an application was made based on data from farms in state of São Paulo, Brazil, of an epidemiological survey, using a specific questionnaire, carried out on farms (n = 1,743). Response variable was presented by apparent prevalence of positive properties for disease, and predictors by (k = 77) predictors related to type of farm, type of lactation, number of animals on property. Application was organized according to following steps: division of data in training (75%) and testing (25%), pre-processing of predictors, learning and model evaluation. In the learning step, algorithm for adjusting gradient boosted trees models was used. The hyperparameters of algorithms were optimized by 10-fold cross-validation, to select those corresponding to best models. Models showed an accuracy of 88.07%, with an error in learning process equal to 3%. In the test / model validation procedure (n = 436), an error in 12% estimate was observed. Five important predictors were daily milk production, number of cows, type of farm, bovine breed and slaughter of adult animals. Proportion of false positives among all individuals whose response of interest was observed was 2.06%, and proportion of false negatives among those with a response of absent interest was 9.86%. It is hoped that, with increase in trained surveillance to detect the disease and availability of data, it will be possible to develop predictive models of machine learning with potential to efficiently assist professionals in disease control and assist in education program in animal health Key messages Predictive analyzes in health: application for tuberculosis in cattle from the state of Sao Paulo, Brazil. An infectious disease and zoonosis important to the world that needs support to develop means to control and consequently eradicate it.</description><identifier>ISSN: 1101-1262</identifier><identifier>EISSN: 1464-360X</identifier><identifier>DOI: 10.1093/eurpub/ckaa166.849</identifier><language>eng</language><publisher>Oxford: Oxford University Press</publisher><subject>Algorithms ; Animal health ; Animals ; Cattle ; Cow's milk ; Dairy cattle ; Data processing ; Disease control ; Epidemiology ; Farms ; Health education ; Infectious diseases ; Lactation ; Learning algorithms ; Livestock industry ; Machine learning ; Milk ; Milk production ; Model accuracy ; Model testing ; Prediction models ; Property ; Public health ; Surveillance ; Training ; Tuberculosis ; Validity ; Zoonoses</subject><ispartof>European journal of public health, 2020-09, Vol.30 (Supplement_5)</ispartof><rights>The Author(s) 2020. Published by Oxford University Press on behalf of the European Public Health Association. All rights reserved. 2020</rights><rights>The Author(s) 2020. Published by Oxford University Press on behalf of the European Public Health Association. All rights reserved.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c2029-6e9aeb93462897ea090ec24bb6798757a229f44d25d5741aedc973e264960ced3</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27866,27924,27925</link.rule.ids></links><search><creatorcontrib>Pereira, L E C</creatorcontrib><creatorcontrib>Ferraudo, A S</creatorcontrib><creatorcontrib>Panosso, A R</creatorcontrib><creatorcontrib>Carvalho, A A B</creatorcontrib><creatorcontrib>Mathias, L A</creatorcontrib><creatorcontrib>Saches, A C</creatorcontrib><creatorcontrib>Hellwig, K S</creatorcontrib><creatorcontrib>Ancêncio, R A</creatorcontrib><title>Machine Learning to predict tuberculosis in cattle from the state of Sao Paulo, Brazil</title><title>European journal of public health</title><description>Abstract Tuberculosis is a well-known and worldwide spread zoonosis. In Brazil 1.594.787 cases were confirmed cases since 2001, where, in Sao Paulo state, 8.226 deaths were reported. This study aims to present steps related to the use of machine learning algorithms for predictive analysis for bovine tuberculosis. For this, an application was made based on data from farms in state of São Paulo, Brazil, of an epidemiological survey, using a specific questionnaire, carried out on farms (n = 1,743). Response variable was presented by apparent prevalence of positive properties for disease, and predictors by (k = 77) predictors related to type of farm, type of lactation, number of animals on property. Application was organized according to following steps: division of data in training (75%) and testing (25%), pre-processing of predictors, learning and model evaluation. In the learning step, algorithm for adjusting gradient boosted trees models was used. The hyperparameters of algorithms were optimized by 10-fold cross-validation, to select those corresponding to best models. Models showed an accuracy of 88.07%, with an error in learning process equal to 3%. In the test / model validation procedure (n = 436), an error in 12% estimate was observed. Five important predictors were daily milk production, number of cows, type of farm, bovine breed and slaughter of adult animals. Proportion of false positives among all individuals whose response of interest was observed was 2.06%, and proportion of false negatives among those with a response of absent interest was 9.86%. It is hoped that, with increase in trained surveillance to detect the disease and availability of data, it will be possible to develop predictive models of machine learning with potential to efficiently assist professionals in disease control and assist in education program in animal health Key messages Predictive analyzes in health: application for tuberculosis in cattle from the state of Sao Paulo, Brazil. An infectious disease and zoonosis important to the world that needs support to develop means to control and consequently eradicate it.</description><subject>Algorithms</subject><subject>Animal health</subject><subject>Animals</subject><subject>Cattle</subject><subject>Cow's milk</subject><subject>Dairy cattle</subject><subject>Data processing</subject><subject>Disease control</subject><subject>Epidemiology</subject><subject>Farms</subject><subject>Health education</subject><subject>Infectious diseases</subject><subject>Lactation</subject><subject>Learning algorithms</subject><subject>Livestock industry</subject><subject>Machine learning</subject><subject>Milk</subject><subject>Milk production</subject><subject>Model accuracy</subject><subject>Model testing</subject><subject>Prediction models</subject><subject>Property</subject><subject>Public health</subject><subject>Surveillance</subject><subject>Training</subject><subject>Tuberculosis</subject><subject>Validity</subject><subject>Zoonoses</subject><issn>1101-1262</issn><issn>1464-360X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>7TQ</sourceid><recordid>eNqNkMlOwzAURS0EEqXwA6wssSWt7Th2vATEJBWBxCB2luO80JQ0Dh4W8PUEpR_A6t3FufdJB6FTShaUqHwJyQ-pWtpPY6gQi5KrPTSjXPAsF-R9f8yU0IwywQ7RUQgbQkghSzZDbw_Grtse8AqM79v-A0eHBw91ayOOqQJvU-dCG3DbY2ti7AA33m1xXAMO0UTArsHPxuEnM4Ln-NKbn7Y7RgeN6QKc7O4cvd5cv1zdZavH2_uri1VmGWEqE6AMVCrngpVKgiGKgGW8qoRUpSykYUw1nNesqAvJqYHaKpkDE1wJYqHO5-hs2h28-0oQot645PvxpWZcCipkKelIsYmy3oXgodGDb7fGf2tK9J8_PfnTO3969DeWsqnk0vAf_hfo0nUF</recordid><startdate>20200901</startdate><enddate>20200901</enddate><creator>Pereira, L E C</creator><creator>Ferraudo, A S</creator><creator>Panosso, A R</creator><creator>Carvalho, A A B</creator><creator>Mathias, L A</creator><creator>Saches, A C</creator><creator>Hellwig, K S</creator><creator>Ancêncio, R A</creator><general>Oxford University Press</general><general>Oxford Publishing Limited (England)</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7T2</scope><scope>7TQ</scope><scope>C1K</scope><scope>DHY</scope><scope>DON</scope><scope>K9.</scope><scope>NAPCQ</scope></search><sort><creationdate>20200901</creationdate><title>Machine Learning to predict tuberculosis in cattle from the state of Sao Paulo, Brazil</title><author>Pereira, L E C ; Ferraudo, A S ; Panosso, A R ; Carvalho, A A B ; Mathias, L A ; Saches, A C ; Hellwig, K S ; Ancêncio, R A</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c2029-6e9aeb93462897ea090ec24bb6798757a229f44d25d5741aedc973e264960ced3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Algorithms</topic><topic>Animal health</topic><topic>Animals</topic><topic>Cattle</topic><topic>Cow's milk</topic><topic>Dairy cattle</topic><topic>Data processing</topic><topic>Disease control</topic><topic>Epidemiology</topic><topic>Farms</topic><topic>Health education</topic><topic>Infectious diseases</topic><topic>Lactation</topic><topic>Learning algorithms</topic><topic>Livestock industry</topic><topic>Machine learning</topic><topic>Milk</topic><topic>Milk production</topic><topic>Model accuracy</topic><topic>Model testing</topic><topic>Prediction models</topic><topic>Property</topic><topic>Public health</topic><topic>Surveillance</topic><topic>Training</topic><topic>Tuberculosis</topic><topic>Validity</topic><topic>Zoonoses</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Pereira, L E C</creatorcontrib><creatorcontrib>Ferraudo, A S</creatorcontrib><creatorcontrib>Panosso, A R</creatorcontrib><creatorcontrib>Carvalho, A A B</creatorcontrib><creatorcontrib>Mathias, L A</creatorcontrib><creatorcontrib>Saches, A C</creatorcontrib><creatorcontrib>Hellwig, K S</creatorcontrib><creatorcontrib>Ancêncio, R A</creatorcontrib><collection>CrossRef</collection><collection>Health and Safety Science Abstracts (Full archive)</collection><collection>PAIS Index</collection><collection>Environmental Sciences and Pollution Management</collection><collection>PAIS International</collection><collection>PAIS International (Ovid)</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Nursing &amp; Allied Health Premium</collection><jtitle>European journal of public health</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Pereira, L E C</au><au>Ferraudo, A S</au><au>Panosso, A R</au><au>Carvalho, A A B</au><au>Mathias, L A</au><au>Saches, A C</au><au>Hellwig, K S</au><au>Ancêncio, R A</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Machine Learning to predict tuberculosis in cattle from the state of Sao Paulo, Brazil</atitle><jtitle>European journal of public health</jtitle><date>2020-09-01</date><risdate>2020</risdate><volume>30</volume><issue>Supplement_5</issue><issn>1101-1262</issn><eissn>1464-360X</eissn><abstract>Abstract Tuberculosis is a well-known and worldwide spread zoonosis. In Brazil 1.594.787 cases were confirmed cases since 2001, where, in Sao Paulo state, 8.226 deaths were reported. This study aims to present steps related to the use of machine learning algorithms for predictive analysis for bovine tuberculosis. For this, an application was made based on data from farms in state of São Paulo, Brazil, of an epidemiological survey, using a specific questionnaire, carried out on farms (n = 1,743). Response variable was presented by apparent prevalence of positive properties for disease, and predictors by (k = 77) predictors related to type of farm, type of lactation, number of animals on property. Application was organized according to following steps: division of data in training (75%) and testing (25%), pre-processing of predictors, learning and model evaluation. In the learning step, algorithm for adjusting gradient boosted trees models was used. The hyperparameters of algorithms were optimized by 10-fold cross-validation, to select those corresponding to best models. Models showed an accuracy of 88.07%, with an error in learning process equal to 3%. In the test / model validation procedure (n = 436), an error in 12% estimate was observed. Five important predictors were daily milk production, number of cows, type of farm, bovine breed and slaughter of adult animals. Proportion of false positives among all individuals whose response of interest was observed was 2.06%, and proportion of false negatives among those with a response of absent interest was 9.86%. It is hoped that, with increase in trained surveillance to detect the disease and availability of data, it will be possible to develop predictive models of machine learning with potential to efficiently assist professionals in disease control and assist in education program in animal health Key messages Predictive analyzes in health: application for tuberculosis in cattle from the state of Sao Paulo, Brazil. An infectious disease and zoonosis important to the world that needs support to develop means to control and consequently eradicate it.</abstract><cop>Oxford</cop><pub>Oxford University Press</pub><doi>10.1093/eurpub/ckaa166.849</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1101-1262
ispartof European journal of public health, 2020-09, Vol.30 (Supplement_5)
issn 1101-1262
1464-360X
language eng
recordid cdi_proquest_journals_2476167871
source PAIS Index; Oxford Journals Open Access Collection; EZB-FREE-00999 freely available EZB journals; PubMed Central; Alma/SFX Local Collection
subjects Algorithms
Animal health
Animals
Cattle
Cow's milk
Dairy cattle
Data processing
Disease control
Epidemiology
Farms
Health education
Infectious diseases
Lactation
Learning algorithms
Livestock industry
Machine learning
Milk
Milk production
Model accuracy
Model testing
Prediction models
Property
Public health
Surveillance
Training
Tuberculosis
Validity
Zoonoses
title Machine Learning to predict tuberculosis in cattle from the state of Sao Paulo, Brazil
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-03T20%3A55%3A19IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Machine%20Learning%20to%20predict%20tuberculosis%20in%20cattle%20from%20the%20state%20of%20Sao%20Paulo,%20Brazil&rft.jtitle=European%20journal%20of%20public%20health&rft.au=Pereira,%20L%20E%20C&rft.date=2020-09-01&rft.volume=30&rft.issue=Supplement_5&rft.issn=1101-1262&rft.eissn=1464-360X&rft_id=info:doi/10.1093/eurpub/ckaa166.849&rft_dat=%3Cproquest_cross%3E2476167871%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2476167871&rft_id=info:pmid/&rft_oup_id=10.1093/eurpub/ckaa166.849&rfr_iscdi=true