Application of Machine Learning Techniques for Clinical Predictive Modeling: A Cross-Sectional Study on Nonalcoholic Fatty Liver Disease in China
Background. Nonalcoholic fatty liver disease (NAFLD) is one of the most common chronic liver diseases. Machine learning techniques were introduced to evaluate the optimal predictive clinical model of NAFLD. Methods. A cross-sectional study was performed with subjects who attended a health examinatio...
Gespeichert in:
Veröffentlicht in: | BioMed research international 2018-01, Vol.2018 (2018), p.1-9 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 9 |
---|---|
container_issue | 2018 |
container_start_page | 1 |
container_title | BioMed research international |
container_volume | 2018 |
creator | Yu, Chaohui Shen, Zhe Li, Youming Ma, Han Li, You-ming |
description | Background. Nonalcoholic fatty liver disease (NAFLD) is one of the most common chronic liver diseases. Machine learning techniques were introduced to evaluate the optimal predictive clinical model of NAFLD. Methods. A cross-sectional study was performed with subjects who attended a health examination at the First Affiliated Hospital, Zhejiang University. Questionnaires, laboratory tests, physical examinations, and liver ultrasonography were employed. Machine learning techniques were then implemented using the open source software Weka. The tasks included feature selection and classification. Feature selection techniques built a screening model by removing the redundant features. Classification was used to build a prediction model, which was evaluated by the F-measure. 11 state-of-the-art machine learning techniques were investigated. Results. Among the 10,508 enrolled subjects, 2,522 (24%) met the diagnostic criteria of NAFLD. By leveraging a set of statistical testing techniques, BMI, triglycerides, gamma-glutamyl transpeptidase (γGT), the serum alanine aminotransferase (ALT), and uric acid were the top 5 features contributing to NAFLD. A 10-fold cross-validation was used in the classification. According to the results, the Bayesian network model demonstrated the best performance from among the 11 different techniques. It achieved accuracy, specificity, sensitivity, and F-measure scores of up to 83%, 0.878, 0.675, and 0.655, respectively. Compared with logistic regression, the Bayesian network model improves the F-measure score by 9.17%. Conclusion. Novel machine learning techniques may have screening and predictive value for NAFLD. |
doi_str_mv | 10.1155/2018/4304376 |
format | Article |
fullrecord | <record><control><sourceid>gale_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_6192080</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A621800480</galeid><sourcerecordid>A621800480</sourcerecordid><originalsourceid>FETCH-LOGICAL-c565t-7416aa6a2b6b913e7b9424e6bc6038b010de0e5ec4d0b2bc809d718816d4251d3</originalsourceid><addsrcrecordid>eNqNkk9v0zAYxiMEYtPYjTOyxAWJhfm1HSfhgFQFBkgdIG2cLcd503pK7c5Oh_ox9o1xaOmAE77438_PYz9-s-w50DcARXHOKFTnglPBS_koO2YcRC5BwOPDmPOj7DTGG5paBZLW8ml2lA5QJsrqOLufrdeDNXq03hHfk0ttltYhmaMOzroFuUazdPZ2g5H0PpBmsC7hA_kWsLNmtHdILn2HaXnxlsxIE3yM-RWaSTBhV-Om25Kk_WWaGr_0yY1c6HHcknk6HMh7G1FHJNaRJlnrZ9mTXg8RT_f9Sfb94sN18ymff_34uZnNc1PIYsxLAVJrqVkr2xo4lm0tmEDZGkl51VKgHVIs0IiOtqw1Fa27EqoUQSdYAR0_yd7tdNebdoWdQTcGPah1sCsdtsprq_7ecXapFv5OSagZrWgSeLUXCH7KZ1QrGw0Og3boN1Ex4AmDSpYJffkPeuM3IQUyUekPqeRMPFALPaCyrvfJ10yiaiYZVJSKX7ZnO8pMUQfsD1cGqqaiUFNRqH1RJPzFn888wL9LIAGvd0AKv9M_7H_KYWKw1w80MFlAzX8C0ebIEA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2120106324</pqid></control><display><type>article</type><title>Application of Machine Learning Techniques for Clinical Predictive Modeling: A Cross-Sectional Study on Nonalcoholic Fatty Liver Disease in China</title><source>MEDLINE</source><source>Wiley Online Library Open Access</source><source>PubMed Central</source><source>Alma/SFX Local Collection</source><source>PubMed Central Open Access</source><creator>Yu, Chaohui ; Shen, Zhe ; Li, Youming ; Ma, Han ; Li, You-ming</creator><contributor>Imazeki, Fumio</contributor><creatorcontrib>Yu, Chaohui ; Shen, Zhe ; Li, Youming ; Ma, Han ; Li, You-ming ; Imazeki, Fumio</creatorcontrib><description>Background. Nonalcoholic fatty liver disease (NAFLD) is one of the most common chronic liver diseases. Machine learning techniques were introduced to evaluate the optimal predictive clinical model of NAFLD. Methods. A cross-sectional study was performed with subjects who attended a health examination at the First Affiliated Hospital, Zhejiang University. Questionnaires, laboratory tests, physical examinations, and liver ultrasonography were employed. Machine learning techniques were then implemented using the open source software Weka. The tasks included feature selection and classification. Feature selection techniques built a screening model by removing the redundant features. Classification was used to build a prediction model, which was evaluated by the F-measure. 11 state-of-the-art machine learning techniques were investigated. Results. Among the 10,508 enrolled subjects, 2,522 (24%) met the diagnostic criteria of NAFLD. By leveraging a set of statistical testing techniques, BMI, triglycerides, gamma-glutamyl transpeptidase (γGT), the serum alanine aminotransferase (ALT), and uric acid were the top 5 features contributing to NAFLD. A 10-fold cross-validation was used in the classification. According to the results, the Bayesian network model demonstrated the best performance from among the 11 different techniques. It achieved accuracy, specificity, sensitivity, and F-measure scores of up to 83%, 0.878, 0.675, and 0.655, respectively. Compared with logistic regression, the Bayesian network model improves the F-measure score by 9.17%. Conclusion. Novel machine learning techniques may have screening and predictive value for NAFLD.</description><identifier>ISSN: 2314-6133</identifier><identifier>EISSN: 2314-6141</identifier><identifier>DOI: 10.1155/2018/4304376</identifier><identifier>PMID: 30402478</identifier><language>eng</language><publisher>Cairo, Egypt: Hindawi Publishing Corporation</publisher><subject>Adult ; Alanine ; Alanine transaminase ; Alanine Transaminase - blood ; Algorithms ; Artificial intelligence ; Bayesian analysis ; Big Data ; Biomarkers ; Biomedical research ; Body mass ; Body Mass Index ; China ; Classification ; Colleges & universities ; Computer science ; Cross-Sectional Studies ; Data collection ; Data mining ; Diabetes ; Diagnostic imaging ; Diagnostic systems ; Fatty liver ; Female ; gamma-Glutamyltransferase - blood ; Hepatitis ; Humans ; Information systems ; Laboratory tests ; Learning algorithms ; Liver ; Liver diseases ; Machine Learning ; Male ; Medical research ; Medicine, Experimental ; Methods ; Middle Aged ; Models, Biological ; Non-alcoholic Fatty Liver Disease - blood ; Non-alcoholic Fatty Liver Disease - pathology ; Non-alcoholic Fatty Liver Disease - physiopathology ; Pattern recognition ; Physical examinations ; Prediction models ; Public software ; Regression models ; Screening ; Software ; Statistical analysis ; Statistical methods ; Studies ; Triglycerides ; Triglycerides - blood ; Type 2 diabetes ; Ultrasound ; Uric acid ; γ-Glutamyltransferase</subject><ispartof>BioMed research international, 2018-01, Vol.2018 (2018), p.1-9</ispartof><rights>Copyright © 2018 Han Ma et al.</rights><rights>COPYRIGHT 2018 John Wiley & Sons, Inc.</rights><rights>Copyright © 2018 Han Ma et al. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. https://creativecommons.org/licenses/by/4.0</rights><rights>Copyright © 2018 Han Ma et al. 2018</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c565t-7416aa6a2b6b913e7b9424e6bc6038b010de0e5ec4d0b2bc809d718816d4251d3</citedby><cites>FETCH-LOGICAL-c565t-7416aa6a2b6b913e7b9424e6bc6038b010de0e5ec4d0b2bc809d718816d4251d3</cites><orcidid>0000-0001-9985-3035 ; 0000-0001-9279-2903</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC6192080/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC6192080/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,723,776,780,881,27901,27902,53766,53768</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/30402478$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><contributor>Imazeki, Fumio</contributor><creatorcontrib>Yu, Chaohui</creatorcontrib><creatorcontrib>Shen, Zhe</creatorcontrib><creatorcontrib>Li, Youming</creatorcontrib><creatorcontrib>Ma, Han</creatorcontrib><creatorcontrib>Li, You-ming</creatorcontrib><title>Application of Machine Learning Techniques for Clinical Predictive Modeling: A Cross-Sectional Study on Nonalcoholic Fatty Liver Disease in China</title><title>BioMed research international</title><addtitle>Biomed Res Int</addtitle><description>Background. Nonalcoholic fatty liver disease (NAFLD) is one of the most common chronic liver diseases. Machine learning techniques were introduced to evaluate the optimal predictive clinical model of NAFLD. Methods. A cross-sectional study was performed with subjects who attended a health examination at the First Affiliated Hospital, Zhejiang University. Questionnaires, laboratory tests, physical examinations, and liver ultrasonography were employed. Machine learning techniques were then implemented using the open source software Weka. The tasks included feature selection and classification. Feature selection techniques built a screening model by removing the redundant features. Classification was used to build a prediction model, which was evaluated by the F-measure. 11 state-of-the-art machine learning techniques were investigated. Results. Among the 10,508 enrolled subjects, 2,522 (24%) met the diagnostic criteria of NAFLD. By leveraging a set of statistical testing techniques, BMI, triglycerides, gamma-glutamyl transpeptidase (γGT), the serum alanine aminotransferase (ALT), and uric acid were the top 5 features contributing to NAFLD. A 10-fold cross-validation was used in the classification. According to the results, the Bayesian network model demonstrated the best performance from among the 11 different techniques. It achieved accuracy, specificity, sensitivity, and F-measure scores of up to 83%, 0.878, 0.675, and 0.655, respectively. Compared with logistic regression, the Bayesian network model improves the F-measure score by 9.17%. Conclusion. Novel machine learning techniques may have screening and predictive value for NAFLD.</description><subject>Adult</subject><subject>Alanine</subject><subject>Alanine transaminase</subject><subject>Alanine Transaminase - blood</subject><subject>Algorithms</subject><subject>Artificial intelligence</subject><subject>Bayesian analysis</subject><subject>Big Data</subject><subject>Biomarkers</subject><subject>Biomedical research</subject><subject>Body mass</subject><subject>Body Mass Index</subject><subject>China</subject><subject>Classification</subject><subject>Colleges & universities</subject><subject>Computer science</subject><subject>Cross-Sectional Studies</subject><subject>Data collection</subject><subject>Data mining</subject><subject>Diabetes</subject><subject>Diagnostic imaging</subject><subject>Diagnostic systems</subject><subject>Fatty liver</subject><subject>Female</subject><subject>gamma-Glutamyltransferase - blood</subject><subject>Hepatitis</subject><subject>Humans</subject><subject>Information systems</subject><subject>Laboratory tests</subject><subject>Learning algorithms</subject><subject>Liver</subject><subject>Liver diseases</subject><subject>Machine Learning</subject><subject>Male</subject><subject>Medical research</subject><subject>Medicine, Experimental</subject><subject>Methods</subject><subject>Middle Aged</subject><subject>Models, Biological</subject><subject>Non-alcoholic Fatty Liver Disease - blood</subject><subject>Non-alcoholic Fatty Liver Disease - pathology</subject><subject>Non-alcoholic Fatty Liver Disease - physiopathology</subject><subject>Pattern recognition</subject><subject>Physical examinations</subject><subject>Prediction models</subject><subject>Public software</subject><subject>Regression models</subject><subject>Screening</subject><subject>Software</subject><subject>Statistical analysis</subject><subject>Statistical methods</subject><subject>Studies</subject><subject>Triglycerides</subject><subject>Triglycerides - blood</subject><subject>Type 2 diabetes</subject><subject>Ultrasound</subject><subject>Uric acid</subject><subject>γ-Glutamyltransferase</subject><issn>2314-6133</issn><issn>2314-6141</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><sourceid>RHX</sourceid><sourceid>EIF</sourceid><sourceid>BENPR</sourceid><recordid>eNqNkk9v0zAYxiMEYtPYjTOyxAWJhfm1HSfhgFQFBkgdIG2cLcd503pK7c5Oh_ox9o1xaOmAE77438_PYz9-s-w50DcARXHOKFTnglPBS_koO2YcRC5BwOPDmPOj7DTGG5paBZLW8ml2lA5QJsrqOLufrdeDNXq03hHfk0ttltYhmaMOzroFuUazdPZ2g5H0PpBmsC7hA_kWsLNmtHdILn2HaXnxlsxIE3yM-RWaSTBhV-Om25Kk_WWaGr_0yY1c6HHcknk6HMh7G1FHJNaRJlnrZ9mTXg8RT_f9Sfb94sN18ymff_34uZnNc1PIYsxLAVJrqVkr2xo4lm0tmEDZGkl51VKgHVIs0IiOtqw1Fa27EqoUQSdYAR0_yd7tdNebdoWdQTcGPah1sCsdtsprq_7ecXapFv5OSagZrWgSeLUXCH7KZ1QrGw0Og3boN1Ex4AmDSpYJffkPeuM3IQUyUekPqeRMPFALPaCyrvfJ10yiaiYZVJSKX7ZnO8pMUQfsD1cGqqaiUFNRqH1RJPzFn888wL9LIAGvd0AKv9M_7H_KYWKw1w80MFlAzX8C0ebIEA</recordid><startdate>20180101</startdate><enddate>20180101</enddate><creator>Yu, Chaohui</creator><creator>Shen, Zhe</creator><creator>Li, Youming</creator><creator>Ma, Han</creator><creator>Li, You-ming</creator><general>Hindawi Publishing Corporation</general><general>Hindawi</general><general>John Wiley & Sons, Inc</general><general>Hindawi Limited</general><scope>ADJCN</scope><scope>AHFXO</scope><scope>RHU</scope><scope>RHW</scope><scope>RHX</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7QL</scope><scope>7QO</scope><scope>7T7</scope><scope>7TK</scope><scope>7U7</scope><scope>7U9</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>C1K</scope><scope>CCPQU</scope><scope>CWDGH</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>H94</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>LK8</scope><scope>M0S</scope><scope>M1P</scope><scope>M7N</scope><scope>M7P</scope><scope>P5Z</scope><scope>P62</scope><scope>P64</scope><scope>PHGZM</scope><scope>PHGZT</scope><scope>PIMPY</scope><scope>PJZUB</scope><scope>PKEHL</scope><scope>PPXIY</scope><scope>PQEST</scope><scope>PQGLB</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>7X8</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0001-9985-3035</orcidid><orcidid>https://orcid.org/0000-0001-9279-2903</orcidid></search><sort><creationdate>20180101</creationdate><title>Application of Machine Learning Techniques for Clinical Predictive Modeling: A Cross-Sectional Study on Nonalcoholic Fatty Liver Disease in China</title><author>Yu, Chaohui ; Shen, Zhe ; Li, Youming ; Ma, Han ; Li, You-ming</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c565t-7416aa6a2b6b913e7b9424e6bc6038b010de0e5ec4d0b2bc809d718816d4251d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Adult</topic><topic>Alanine</topic><topic>Alanine transaminase</topic><topic>Alanine Transaminase - blood</topic><topic>Algorithms</topic><topic>Artificial intelligence</topic><topic>Bayesian analysis</topic><topic>Big Data</topic><topic>Biomarkers</topic><topic>Biomedical research</topic><topic>Body mass</topic><topic>Body Mass Index</topic><topic>China</topic><topic>Classification</topic><topic>Colleges & universities</topic><topic>Computer science</topic><topic>Cross-Sectional Studies</topic><topic>Data collection</topic><topic>Data mining</topic><topic>Diabetes</topic><topic>Diagnostic imaging</topic><topic>Diagnostic systems</topic><topic>Fatty liver</topic><topic>Female</topic><topic>gamma-Glutamyltransferase - blood</topic><topic>Hepatitis</topic><topic>Humans</topic><topic>Information systems</topic><topic>Laboratory tests</topic><topic>Learning algorithms</topic><topic>Liver</topic><topic>Liver diseases</topic><topic>Machine Learning</topic><topic>Male</topic><topic>Medical research</topic><topic>Medicine, Experimental</topic><topic>Methods</topic><topic>Middle Aged</topic><topic>Models, Biological</topic><topic>Non-alcoholic Fatty Liver Disease - blood</topic><topic>Non-alcoholic Fatty Liver Disease - pathology</topic><topic>Non-alcoholic Fatty Liver Disease - physiopathology</topic><topic>Pattern recognition</topic><topic>Physical examinations</topic><topic>Prediction models</topic><topic>Public software</topic><topic>Regression models</topic><topic>Screening</topic><topic>Software</topic><topic>Statistical analysis</topic><topic>Statistical methods</topic><topic>Studies</topic><topic>Triglycerides</topic><topic>Triglycerides - blood</topic><topic>Type 2 diabetes</topic><topic>Ultrasound</topic><topic>Uric acid</topic><topic>γ-Glutamyltransferase</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Yu, Chaohui</creatorcontrib><creatorcontrib>Shen, Zhe</creatorcontrib><creatorcontrib>Li, Youming</creatorcontrib><creatorcontrib>Ma, Han</creatorcontrib><creatorcontrib>Li, You-ming</creatorcontrib><collection>الدوريات العلمية والإحصائية - e-Marefa Academic and Statistical Periodicals</collection><collection>معرفة - المحتوى العربي الأكاديمي المتكامل - e-Marefa Academic Complete</collection><collection>Hindawi Publishing Complete</collection><collection>Hindawi Publishing Subscription Journals</collection><collection>Hindawi Publishing Open Access</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Bacteriology Abstracts (Microbiology B)</collection><collection>Biotechnology Research Abstracts</collection><collection>Industrial and Applied Microbiology Abstracts (Microbiology A)</collection><collection>Neurosciences Abstracts</collection><collection>Toxicology Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>Health & Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Technology Collection (ProQuest)</collection><collection>Natural Science Collection (ProQuest)</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ProQuest One Community College</collection><collection>Middle East & Africa Database</collection><collection>ProQuest Central Korea</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>ProQuest Biological Science Collection</collection><collection>Health & Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Algology Mycology and Protozoology Abstracts (Microbiology C)</collection><collection>Biological Science Database</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>ProQuest Central (New)</collection><collection>ProQuest One Academic (New)</collection><collection>Publicly Available Content Database</collection><collection>ProQuest Health & Medical Research Collection</collection><collection>ProQuest One Academic Middle East (New)</collection><collection>ProQuest One Health & Nursing</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Applied & Life Sciences</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>BioMed research international</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Yu, Chaohui</au><au>Shen, Zhe</au><au>Li, Youming</au><au>Ma, Han</au><au>Li, You-ming</au><au>Imazeki, Fumio</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Application of Machine Learning Techniques for Clinical Predictive Modeling: A Cross-Sectional Study on Nonalcoholic Fatty Liver Disease in China</atitle><jtitle>BioMed research international</jtitle><addtitle>Biomed Res Int</addtitle><date>2018-01-01</date><risdate>2018</risdate><volume>2018</volume><issue>2018</issue><spage>1</spage><epage>9</epage><pages>1-9</pages><issn>2314-6133</issn><eissn>2314-6141</eissn><abstract>Background. Nonalcoholic fatty liver disease (NAFLD) is one of the most common chronic liver diseases. Machine learning techniques were introduced to evaluate the optimal predictive clinical model of NAFLD. Methods. A cross-sectional study was performed with subjects who attended a health examination at the First Affiliated Hospital, Zhejiang University. Questionnaires, laboratory tests, physical examinations, and liver ultrasonography were employed. Machine learning techniques were then implemented using the open source software Weka. The tasks included feature selection and classification. Feature selection techniques built a screening model by removing the redundant features. Classification was used to build a prediction model, which was evaluated by the F-measure. 11 state-of-the-art machine learning techniques were investigated. Results. Among the 10,508 enrolled subjects, 2,522 (24%) met the diagnostic criteria of NAFLD. By leveraging a set of statistical testing techniques, BMI, triglycerides, gamma-glutamyl transpeptidase (γGT), the serum alanine aminotransferase (ALT), and uric acid were the top 5 features contributing to NAFLD. A 10-fold cross-validation was used in the classification. According to the results, the Bayesian network model demonstrated the best performance from among the 11 different techniques. It achieved accuracy, specificity, sensitivity, and F-measure scores of up to 83%, 0.878, 0.675, and 0.655, respectively. Compared with logistic regression, the Bayesian network model improves the F-measure score by 9.17%. Conclusion. Novel machine learning techniques may have screening and predictive value for NAFLD.</abstract><cop>Cairo, Egypt</cop><pub>Hindawi Publishing Corporation</pub><pmid>30402478</pmid><doi>10.1155/2018/4304376</doi><tpages>9</tpages><orcidid>https://orcid.org/0000-0001-9985-3035</orcidid><orcidid>https://orcid.org/0000-0001-9279-2903</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2314-6133 |
ispartof | BioMed research international, 2018-01, Vol.2018 (2018), p.1-9 |
issn | 2314-6133 2314-6141 |
language | eng |
recordid | cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_6192080 |
source | MEDLINE; Wiley Online Library Open Access; PubMed Central; Alma/SFX Local Collection; PubMed Central Open Access |
subjects | Adult Alanine Alanine transaminase Alanine Transaminase - blood Algorithms Artificial intelligence Bayesian analysis Big Data Biomarkers Biomedical research Body mass Body Mass Index China Classification Colleges & universities Computer science Cross-Sectional Studies Data collection Data mining Diabetes Diagnostic imaging Diagnostic systems Fatty liver Female gamma-Glutamyltransferase - blood Hepatitis Humans Information systems Laboratory tests Learning algorithms Liver Liver diseases Machine Learning Male Medical research Medicine, Experimental Methods Middle Aged Models, Biological Non-alcoholic Fatty Liver Disease - blood Non-alcoholic Fatty Liver Disease - pathology Non-alcoholic Fatty Liver Disease - physiopathology Pattern recognition Physical examinations Prediction models Public software Regression models Screening Software Statistical analysis Statistical methods Studies Triglycerides Triglycerides - blood Type 2 diabetes Ultrasound Uric acid γ-Glutamyltransferase |
title | Application of Machine Learning Techniques for Clinical Predictive Modeling: A Cross-Sectional Study on Nonalcoholic Fatty Liver Disease in China |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-18T22%3A42%3A40IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Application%20of%20Machine%20Learning%20Techniques%20for%20Clinical%20Predictive%20Modeling:%20A%20Cross-Sectional%20Study%20on%20Nonalcoholic%20Fatty%20Liver%20Disease%20in%20China&rft.jtitle=BioMed%20research%20international&rft.au=Yu,%20Chaohui&rft.date=2018-01-01&rft.volume=2018&rft.issue=2018&rft.spage=1&rft.epage=9&rft.pages=1-9&rft.issn=2314-6133&rft.eissn=2314-6141&rft_id=info:doi/10.1155/2018/4304376&rft_dat=%3Cgale_pubme%3EA621800480%3C/gale_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2120106324&rft_id=info:pmid/30402478&rft_galeid=A621800480&rfr_iscdi=true |