Application of Machine Learning Techniques for Clinical Predictive Modeling: A Cross-Sectional Study on Nonalcoholic Fatty Liver Disease in China

Background. Nonalcoholic fatty liver disease (NAFLD) is one of the most common chronic liver diseases. Machine learning techniques were introduced to evaluate the optimal predictive clinical model of NAFLD. Methods. A cross-sectional study was performed with subjects who attended a health examinatio...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:BioMed research international 2018-01, Vol.2018 (2018), p.1-9
Hauptverfasser: Yu, Chaohui, Shen, Zhe, Li, Youming, Ma, Han, Li, You-ming
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 9
container_issue 2018
container_start_page 1
container_title BioMed research international
container_volume 2018
creator Yu, Chaohui
Shen, Zhe
Li, Youming
Ma, Han
Li, You-ming
description Background. Nonalcoholic fatty liver disease (NAFLD) is one of the most common chronic liver diseases. Machine learning techniques were introduced to evaluate the optimal predictive clinical model of NAFLD. Methods. A cross-sectional study was performed with subjects who attended a health examination at the First Affiliated Hospital, Zhejiang University. Questionnaires, laboratory tests, physical examinations, and liver ultrasonography were employed. Machine learning techniques were then implemented using the open source software Weka. The tasks included feature selection and classification. Feature selection techniques built a screening model by removing the redundant features. Classification was used to build a prediction model, which was evaluated by the F-measure. 11 state-of-the-art machine learning techniques were investigated. Results. Among the 10,508 enrolled subjects, 2,522 (24%) met the diagnostic criteria of NAFLD. By leveraging a set of statistical testing techniques, BMI, triglycerides, gamma-glutamyl transpeptidase (γGT), the serum alanine aminotransferase (ALT), and uric acid were the top 5 features contributing to NAFLD. A 10-fold cross-validation was used in the classification. According to the results, the Bayesian network model demonstrated the best performance from among the 11 different techniques. It achieved accuracy, specificity, sensitivity, and F-measure scores of up to 83%, 0.878, 0.675, and 0.655, respectively. Compared with logistic regression, the Bayesian network model improves the F-measure score by 9.17%. Conclusion. Novel machine learning techniques may have screening and predictive value for NAFLD.
doi_str_mv 10.1155/2018/4304376
format Article
fullrecord <record><control><sourceid>gale_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_6192080</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A621800480</galeid><sourcerecordid>A621800480</sourcerecordid><originalsourceid>FETCH-LOGICAL-c565t-7416aa6a2b6b913e7b9424e6bc6038b010de0e5ec4d0b2bc809d718816d4251d3</originalsourceid><addsrcrecordid>eNqNkk9v0zAYxiMEYtPYjTOyxAWJhfm1HSfhgFQFBkgdIG2cLcd503pK7c5Oh_ox9o1xaOmAE77438_PYz9-s-w50DcARXHOKFTnglPBS_koO2YcRC5BwOPDmPOj7DTGG5paBZLW8ml2lA5QJsrqOLufrdeDNXq03hHfk0ttltYhmaMOzroFuUazdPZ2g5H0PpBmsC7hA_kWsLNmtHdILn2HaXnxlsxIE3yM-RWaSTBhV-Om25Kk_WWaGr_0yY1c6HHcknk6HMh7G1FHJNaRJlnrZ9mTXg8RT_f9Sfb94sN18ymff_34uZnNc1PIYsxLAVJrqVkr2xo4lm0tmEDZGkl51VKgHVIs0IiOtqw1Fa27EqoUQSdYAR0_yd7tdNebdoWdQTcGPah1sCsdtsprq_7ecXapFv5OSagZrWgSeLUXCH7KZ1QrGw0Og3boN1Ex4AmDSpYJffkPeuM3IQUyUekPqeRMPFALPaCyrvfJ10yiaiYZVJSKX7ZnO8pMUQfsD1cGqqaiUFNRqH1RJPzFn888wL9LIAGvd0AKv9M_7H_KYWKw1w80MFlAzX8C0ebIEA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2120106324</pqid></control><display><type>article</type><title>Application of Machine Learning Techniques for Clinical Predictive Modeling: A Cross-Sectional Study on Nonalcoholic Fatty Liver Disease in China</title><source>MEDLINE</source><source>Wiley Online Library Open Access</source><source>PubMed Central</source><source>Alma/SFX Local Collection</source><source>PubMed Central Open Access</source><creator>Yu, Chaohui ; Shen, Zhe ; Li, Youming ; Ma, Han ; Li, You-ming</creator><contributor>Imazeki, Fumio</contributor><creatorcontrib>Yu, Chaohui ; Shen, Zhe ; Li, Youming ; Ma, Han ; Li, You-ming ; Imazeki, Fumio</creatorcontrib><description>Background. Nonalcoholic fatty liver disease (NAFLD) is one of the most common chronic liver diseases. Machine learning techniques were introduced to evaluate the optimal predictive clinical model of NAFLD. Methods. A cross-sectional study was performed with subjects who attended a health examination at the First Affiliated Hospital, Zhejiang University. Questionnaires, laboratory tests, physical examinations, and liver ultrasonography were employed. Machine learning techniques were then implemented using the open source software Weka. The tasks included feature selection and classification. Feature selection techniques built a screening model by removing the redundant features. Classification was used to build a prediction model, which was evaluated by the F-measure. 11 state-of-the-art machine learning techniques were investigated. Results. Among the 10,508 enrolled subjects, 2,522 (24%) met the diagnostic criteria of NAFLD. By leveraging a set of statistical testing techniques, BMI, triglycerides, gamma-glutamyl transpeptidase (γGT), the serum alanine aminotransferase (ALT), and uric acid were the top 5 features contributing to NAFLD. A 10-fold cross-validation was used in the classification. According to the results, the Bayesian network model demonstrated the best performance from among the 11 different techniques. It achieved accuracy, specificity, sensitivity, and F-measure scores of up to 83%, 0.878, 0.675, and 0.655, respectively. Compared with logistic regression, the Bayesian network model improves the F-measure score by 9.17%. Conclusion. Novel machine learning techniques may have screening and predictive value for NAFLD.</description><identifier>ISSN: 2314-6133</identifier><identifier>EISSN: 2314-6141</identifier><identifier>DOI: 10.1155/2018/4304376</identifier><identifier>PMID: 30402478</identifier><language>eng</language><publisher>Cairo, Egypt: Hindawi Publishing Corporation</publisher><subject>Adult ; Alanine ; Alanine transaminase ; Alanine Transaminase - blood ; Algorithms ; Artificial intelligence ; Bayesian analysis ; Big Data ; Biomarkers ; Biomedical research ; Body mass ; Body Mass Index ; China ; Classification ; Colleges &amp; universities ; Computer science ; Cross-Sectional Studies ; Data collection ; Data mining ; Diabetes ; Diagnostic imaging ; Diagnostic systems ; Fatty liver ; Female ; gamma-Glutamyltransferase - blood ; Hepatitis ; Humans ; Information systems ; Laboratory tests ; Learning algorithms ; Liver ; Liver diseases ; Machine Learning ; Male ; Medical research ; Medicine, Experimental ; Methods ; Middle Aged ; Models, Biological ; Non-alcoholic Fatty Liver Disease - blood ; Non-alcoholic Fatty Liver Disease - pathology ; Non-alcoholic Fatty Liver Disease - physiopathology ; Pattern recognition ; Physical examinations ; Prediction models ; Public software ; Regression models ; Screening ; Software ; Statistical analysis ; Statistical methods ; Studies ; Triglycerides ; Triglycerides - blood ; Type 2 diabetes ; Ultrasound ; Uric acid ; γ-Glutamyltransferase</subject><ispartof>BioMed research international, 2018-01, Vol.2018 (2018), p.1-9</ispartof><rights>Copyright © 2018 Han Ma et al.</rights><rights>COPYRIGHT 2018 John Wiley &amp; Sons, Inc.</rights><rights>Copyright © 2018 Han Ma et al. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. https://creativecommons.org/licenses/by/4.0</rights><rights>Copyright © 2018 Han Ma et al. 2018</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c565t-7416aa6a2b6b913e7b9424e6bc6038b010de0e5ec4d0b2bc809d718816d4251d3</citedby><cites>FETCH-LOGICAL-c565t-7416aa6a2b6b913e7b9424e6bc6038b010de0e5ec4d0b2bc809d718816d4251d3</cites><orcidid>0000-0001-9985-3035 ; 0000-0001-9279-2903</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC6192080/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC6192080/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,723,776,780,881,27901,27902,53766,53768</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/30402478$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><contributor>Imazeki, Fumio</contributor><creatorcontrib>Yu, Chaohui</creatorcontrib><creatorcontrib>Shen, Zhe</creatorcontrib><creatorcontrib>Li, Youming</creatorcontrib><creatorcontrib>Ma, Han</creatorcontrib><creatorcontrib>Li, You-ming</creatorcontrib><title>Application of Machine Learning Techniques for Clinical Predictive Modeling: A Cross-Sectional Study on Nonalcoholic Fatty Liver Disease in China</title><title>BioMed research international</title><addtitle>Biomed Res Int</addtitle><description>Background. Nonalcoholic fatty liver disease (NAFLD) is one of the most common chronic liver diseases. Machine learning techniques were introduced to evaluate the optimal predictive clinical model of NAFLD. Methods. A cross-sectional study was performed with subjects who attended a health examination at the First Affiliated Hospital, Zhejiang University. Questionnaires, laboratory tests, physical examinations, and liver ultrasonography were employed. Machine learning techniques were then implemented using the open source software Weka. The tasks included feature selection and classification. Feature selection techniques built a screening model by removing the redundant features. Classification was used to build a prediction model, which was evaluated by the F-measure. 11 state-of-the-art machine learning techniques were investigated. Results. Among the 10,508 enrolled subjects, 2,522 (24%) met the diagnostic criteria of NAFLD. By leveraging a set of statistical testing techniques, BMI, triglycerides, gamma-glutamyl transpeptidase (γGT), the serum alanine aminotransferase (ALT), and uric acid were the top 5 features contributing to NAFLD. A 10-fold cross-validation was used in the classification. According to the results, the Bayesian network model demonstrated the best performance from among the 11 different techniques. It achieved accuracy, specificity, sensitivity, and F-measure scores of up to 83%, 0.878, 0.675, and 0.655, respectively. Compared with logistic regression, the Bayesian network model improves the F-measure score by 9.17%. Conclusion. Novel machine learning techniques may have screening and predictive value for NAFLD.</description><subject>Adult</subject><subject>Alanine</subject><subject>Alanine transaminase</subject><subject>Alanine Transaminase - blood</subject><subject>Algorithms</subject><subject>Artificial intelligence</subject><subject>Bayesian analysis</subject><subject>Big Data</subject><subject>Biomarkers</subject><subject>Biomedical research</subject><subject>Body mass</subject><subject>Body Mass Index</subject><subject>China</subject><subject>Classification</subject><subject>Colleges &amp; universities</subject><subject>Computer science</subject><subject>Cross-Sectional Studies</subject><subject>Data collection</subject><subject>Data mining</subject><subject>Diabetes</subject><subject>Diagnostic imaging</subject><subject>Diagnostic systems</subject><subject>Fatty liver</subject><subject>Female</subject><subject>gamma-Glutamyltransferase - blood</subject><subject>Hepatitis</subject><subject>Humans</subject><subject>Information systems</subject><subject>Laboratory tests</subject><subject>Learning algorithms</subject><subject>Liver</subject><subject>Liver diseases</subject><subject>Machine Learning</subject><subject>Male</subject><subject>Medical research</subject><subject>Medicine, Experimental</subject><subject>Methods</subject><subject>Middle Aged</subject><subject>Models, Biological</subject><subject>Non-alcoholic Fatty Liver Disease - blood</subject><subject>Non-alcoholic Fatty Liver Disease - pathology</subject><subject>Non-alcoholic Fatty Liver Disease - physiopathology</subject><subject>Pattern recognition</subject><subject>Physical examinations</subject><subject>Prediction models</subject><subject>Public software</subject><subject>Regression models</subject><subject>Screening</subject><subject>Software</subject><subject>Statistical analysis</subject><subject>Statistical methods</subject><subject>Studies</subject><subject>Triglycerides</subject><subject>Triglycerides - blood</subject><subject>Type 2 diabetes</subject><subject>Ultrasound</subject><subject>Uric acid</subject><subject>γ-Glutamyltransferase</subject><issn>2314-6133</issn><issn>2314-6141</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><sourceid>RHX</sourceid><sourceid>EIF</sourceid><sourceid>BENPR</sourceid><recordid>eNqNkk9v0zAYxiMEYtPYjTOyxAWJhfm1HSfhgFQFBkgdIG2cLcd503pK7c5Oh_ox9o1xaOmAE77438_PYz9-s-w50DcARXHOKFTnglPBS_koO2YcRC5BwOPDmPOj7DTGG5paBZLW8ml2lA5QJsrqOLufrdeDNXq03hHfk0ttltYhmaMOzroFuUazdPZ2g5H0PpBmsC7hA_kWsLNmtHdILn2HaXnxlsxIE3yM-RWaSTBhV-Om25Kk_WWaGr_0yY1c6HHcknk6HMh7G1FHJNaRJlnrZ9mTXg8RT_f9Sfb94sN18ymff_34uZnNc1PIYsxLAVJrqVkr2xo4lm0tmEDZGkl51VKgHVIs0IiOtqw1Fa27EqoUQSdYAR0_yd7tdNebdoWdQTcGPah1sCsdtsprq_7ecXapFv5OSagZrWgSeLUXCH7KZ1QrGw0Og3boN1Ex4AmDSpYJffkPeuM3IQUyUekPqeRMPFALPaCyrvfJ10yiaiYZVJSKX7ZnO8pMUQfsD1cGqqaiUFNRqH1RJPzFn888wL9LIAGvd0AKv9M_7H_KYWKw1w80MFlAzX8C0ebIEA</recordid><startdate>20180101</startdate><enddate>20180101</enddate><creator>Yu, Chaohui</creator><creator>Shen, Zhe</creator><creator>Li, Youming</creator><creator>Ma, Han</creator><creator>Li, You-ming</creator><general>Hindawi Publishing Corporation</general><general>Hindawi</general><general>John Wiley &amp; Sons, Inc</general><general>Hindawi Limited</general><scope>ADJCN</scope><scope>AHFXO</scope><scope>RHU</scope><scope>RHW</scope><scope>RHX</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7QL</scope><scope>7QO</scope><scope>7T7</scope><scope>7TK</scope><scope>7U7</scope><scope>7U9</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>C1K</scope><scope>CCPQU</scope><scope>CWDGH</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>H94</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>LK8</scope><scope>M0S</scope><scope>M1P</scope><scope>M7N</scope><scope>M7P</scope><scope>P5Z</scope><scope>P62</scope><scope>P64</scope><scope>PHGZM</scope><scope>PHGZT</scope><scope>PIMPY</scope><scope>PJZUB</scope><scope>PKEHL</scope><scope>PPXIY</scope><scope>PQEST</scope><scope>PQGLB</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>7X8</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0001-9985-3035</orcidid><orcidid>https://orcid.org/0000-0001-9279-2903</orcidid></search><sort><creationdate>20180101</creationdate><title>Application of Machine Learning Techniques for Clinical Predictive Modeling: A Cross-Sectional Study on Nonalcoholic Fatty Liver Disease in China</title><author>Yu, Chaohui ; Shen, Zhe ; Li, Youming ; Ma, Han ; Li, You-ming</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c565t-7416aa6a2b6b913e7b9424e6bc6038b010de0e5ec4d0b2bc809d718816d4251d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Adult</topic><topic>Alanine</topic><topic>Alanine transaminase</topic><topic>Alanine Transaminase - blood</topic><topic>Algorithms</topic><topic>Artificial intelligence</topic><topic>Bayesian analysis</topic><topic>Big Data</topic><topic>Biomarkers</topic><topic>Biomedical research</topic><topic>Body mass</topic><topic>Body Mass Index</topic><topic>China</topic><topic>Classification</topic><topic>Colleges &amp; universities</topic><topic>Computer science</topic><topic>Cross-Sectional Studies</topic><topic>Data collection</topic><topic>Data mining</topic><topic>Diabetes</topic><topic>Diagnostic imaging</topic><topic>Diagnostic systems</topic><topic>Fatty liver</topic><topic>Female</topic><topic>gamma-Glutamyltransferase - blood</topic><topic>Hepatitis</topic><topic>Humans</topic><topic>Information systems</topic><topic>Laboratory tests</topic><topic>Learning algorithms</topic><topic>Liver</topic><topic>Liver diseases</topic><topic>Machine Learning</topic><topic>Male</topic><topic>Medical research</topic><topic>Medicine, Experimental</topic><topic>Methods</topic><topic>Middle Aged</topic><topic>Models, Biological</topic><topic>Non-alcoholic Fatty Liver Disease - blood</topic><topic>Non-alcoholic Fatty Liver Disease - pathology</topic><topic>Non-alcoholic Fatty Liver Disease - physiopathology</topic><topic>Pattern recognition</topic><topic>Physical examinations</topic><topic>Prediction models</topic><topic>Public software</topic><topic>Regression models</topic><topic>Screening</topic><topic>Software</topic><topic>Statistical analysis</topic><topic>Statistical methods</topic><topic>Studies</topic><topic>Triglycerides</topic><topic>Triglycerides - blood</topic><topic>Type 2 diabetes</topic><topic>Ultrasound</topic><topic>Uric acid</topic><topic>γ-Glutamyltransferase</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Yu, Chaohui</creatorcontrib><creatorcontrib>Shen, Zhe</creatorcontrib><creatorcontrib>Li, Youming</creatorcontrib><creatorcontrib>Ma, Han</creatorcontrib><creatorcontrib>Li, You-ming</creatorcontrib><collection>الدوريات العلمية والإحصائية - e-Marefa Academic and Statistical Periodicals</collection><collection>معرفة - المحتوى العربي الأكاديمي المتكامل - e-Marefa Academic Complete</collection><collection>Hindawi Publishing Complete</collection><collection>Hindawi Publishing Subscription Journals</collection><collection>Hindawi Publishing Open Access</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Bacteriology Abstracts (Microbiology B)</collection><collection>Biotechnology Research Abstracts</collection><collection>Industrial and Applied Microbiology Abstracts (Microbiology A)</collection><collection>Neurosciences Abstracts</collection><collection>Toxicology Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>Health &amp; Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Technology Collection (ProQuest)</collection><collection>Natural Science Collection (ProQuest)</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ProQuest One Community College</collection><collection>Middle East &amp; Africa Database</collection><collection>ProQuest Central Korea</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>ProQuest Biological Science Collection</collection><collection>Health &amp; Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Algology Mycology and Protozoology Abstracts (Microbiology C)</collection><collection>Biological Science Database</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>ProQuest Central (New)</collection><collection>ProQuest One Academic (New)</collection><collection>Publicly Available Content Database</collection><collection>ProQuest Health &amp; Medical Research Collection</collection><collection>ProQuest One Academic Middle East (New)</collection><collection>ProQuest One Health &amp; Nursing</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Applied &amp; Life Sciences</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>BioMed research international</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Yu, Chaohui</au><au>Shen, Zhe</au><au>Li, Youming</au><au>Ma, Han</au><au>Li, You-ming</au><au>Imazeki, Fumio</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Application of Machine Learning Techniques for Clinical Predictive Modeling: A Cross-Sectional Study on Nonalcoholic Fatty Liver Disease in China</atitle><jtitle>BioMed research international</jtitle><addtitle>Biomed Res Int</addtitle><date>2018-01-01</date><risdate>2018</risdate><volume>2018</volume><issue>2018</issue><spage>1</spage><epage>9</epage><pages>1-9</pages><issn>2314-6133</issn><eissn>2314-6141</eissn><abstract>Background. Nonalcoholic fatty liver disease (NAFLD) is one of the most common chronic liver diseases. Machine learning techniques were introduced to evaluate the optimal predictive clinical model of NAFLD. Methods. A cross-sectional study was performed with subjects who attended a health examination at the First Affiliated Hospital, Zhejiang University. Questionnaires, laboratory tests, physical examinations, and liver ultrasonography were employed. Machine learning techniques were then implemented using the open source software Weka. The tasks included feature selection and classification. Feature selection techniques built a screening model by removing the redundant features. Classification was used to build a prediction model, which was evaluated by the F-measure. 11 state-of-the-art machine learning techniques were investigated. Results. Among the 10,508 enrolled subjects, 2,522 (24%) met the diagnostic criteria of NAFLD. By leveraging a set of statistical testing techniques, BMI, triglycerides, gamma-glutamyl transpeptidase (γGT), the serum alanine aminotransferase (ALT), and uric acid were the top 5 features contributing to NAFLD. A 10-fold cross-validation was used in the classification. According to the results, the Bayesian network model demonstrated the best performance from among the 11 different techniques. It achieved accuracy, specificity, sensitivity, and F-measure scores of up to 83%, 0.878, 0.675, and 0.655, respectively. Compared with logistic regression, the Bayesian network model improves the F-measure score by 9.17%. Conclusion. Novel machine learning techniques may have screening and predictive value for NAFLD.</abstract><cop>Cairo, Egypt</cop><pub>Hindawi Publishing Corporation</pub><pmid>30402478</pmid><doi>10.1155/2018/4304376</doi><tpages>9</tpages><orcidid>https://orcid.org/0000-0001-9985-3035</orcidid><orcidid>https://orcid.org/0000-0001-9279-2903</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2314-6133
ispartof BioMed research international, 2018-01, Vol.2018 (2018), p.1-9
issn 2314-6133
2314-6141
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_6192080
source MEDLINE; Wiley Online Library Open Access; PubMed Central; Alma/SFX Local Collection; PubMed Central Open Access
subjects Adult
Alanine
Alanine transaminase
Alanine Transaminase - blood
Algorithms
Artificial intelligence
Bayesian analysis
Big Data
Biomarkers
Biomedical research
Body mass
Body Mass Index
China
Classification
Colleges & universities
Computer science
Cross-Sectional Studies
Data collection
Data mining
Diabetes
Diagnostic imaging
Diagnostic systems
Fatty liver
Female
gamma-Glutamyltransferase - blood
Hepatitis
Humans
Information systems
Laboratory tests
Learning algorithms
Liver
Liver diseases
Machine Learning
Male
Medical research
Medicine, Experimental
Methods
Middle Aged
Models, Biological
Non-alcoholic Fatty Liver Disease - blood
Non-alcoholic Fatty Liver Disease - pathology
Non-alcoholic Fatty Liver Disease - physiopathology
Pattern recognition
Physical examinations
Prediction models
Public software
Regression models
Screening
Software
Statistical analysis
Statistical methods
Studies
Triglycerides
Triglycerides - blood
Type 2 diabetes
Ultrasound
Uric acid
γ-Glutamyltransferase
title Application of Machine Learning Techniques for Clinical Predictive Modeling: A Cross-Sectional Study on Nonalcoholic Fatty Liver Disease in China
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-18T22%3A42%3A40IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Application%20of%20Machine%20Learning%20Techniques%20for%20Clinical%20Predictive%20Modeling:%20A%20Cross-Sectional%20Study%20on%20Nonalcoholic%20Fatty%20Liver%20Disease%20in%20China&rft.jtitle=BioMed%20research%20international&rft.au=Yu,%20Chaohui&rft.date=2018-01-01&rft.volume=2018&rft.issue=2018&rft.spage=1&rft.epage=9&rft.pages=1-9&rft.issn=2314-6133&rft.eissn=2314-6141&rft_id=info:doi/10.1155/2018/4304376&rft_dat=%3Cgale_pubme%3EA621800480%3C/gale_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2120106324&rft_id=info:pmid/30402478&rft_galeid=A621800480&rfr_iscdi=true