Electrocardiogram analysis using a combination of statistical, geometric, and nonlinear heart rate variability features

Abstract Objective The paper addresses a common and recurring problem of electrocardiogram (ECG) classification based on heart rate variability (HRV) analysis. Current understanding of the limits of HRV analysis in diagnosing different cardiac conditions is not complete. Existing research suggests t...

Bibliographic details
Published in: Artificial intelligence in medicine 2011-03, Vol.51 (3), p.175-186
Main authors: Jovic, Alan, Bogunovic, Nikola
Format: Article
Language: eng
Subjects:
Online access: Full text
container_end_page 186
container_issue 3
container_start_page 175
container_title Artificial intelligence in medicine
container_volume 51
creator Jovic, Alan
Bogunovic, Nikola
description Abstract. Objective: The paper addresses a common and recurring problem of electrocardiogram (ECG) classification based on heart rate variability (HRV) analysis. Current understanding of the limits of HRV analysis in diagnosing different cardiac conditions is incomplete. Existing research suggests that a combination of carefully selected linear and nonlinear HRV features should significantly improve accuracy for both binary and multiclass classification problems. The primary goal of this work is to evaluate a proposed combination of HRV features. Secondary objectives are to compare different machine learning algorithms for HRV analysis and to examine the most suitable period T between two consecutively analyzed R-R intervals for nonlinear features.
Methods and material: We extracted 11 features from 5 min of R-R interval recordings: SDNN, RMSSD, pNN20, HRV triangular index (HTI), spatial filling index (SFI), correlation dimension, central tendency measure (CTM), and four approximate entropy features (ApEn1-ApEn4). Analyzed heart conditions included normal heart rhythm, arrhythmia (any), supraventricular arrhythmia, and congestive heart failure. One hundred patient records from six online databases were analyzed, 25 for each condition. Feature vectors were extracted by a platform designed for this purpose, named ECG Chaos Extractor. The vectors were then analyzed by seven clustering and classification algorithms in the Weka system: K-means, expectation maximization (EM), C4.5 decision tree, Bayesian network, artificial neural network (ANN), support vector machines (SVM) and random forest (RF). Four-class and two-class (normal vs. abnormal) classification was performed. The relevance of particular features was evaluated using 1-Rule and the C4.5 decision tree, for classification with individual features and with pairs of features.
Results: The average total classification accuracy of the top three classification methods in the two-class case was: RF 99.7%, ANN 99.1%, SVM 98.9%. In the four-class case the best results were: RF 99.6%, Bayesian network 99.4%, SVM 98.4%. The best overall method was RF. The C4.5 decision tree succeeded in constructing useful classification rules for the two-class case. EM and K-means showed comparable clustering results: around 50% for the four-class case and around 75% for the two-class case. HTI, pNN20, RMSSD, ApEn3, ApEn4 and SFI were shown to be the most relevant fea
doi_str_mv 10.1016/j.artmed.2010.09.005
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_876235119</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0933365710001193</els_id><sourcerecordid>1671490358</sourcerecordid><originalsourceid>FETCH-LOGICAL-c481t-1d33cfdfd8d02d1c7e7188c490f4dc8a0ff2de56d7ce89d46d193eecb24f711d3</originalsourceid><addsrcrecordid>eNqFkk9vFSEUxYnR2Gf1GxjDThed5wVmgNmYNE39kzRxoa4JD-48ec4MFZia9-1l8tqNC7sBAufcC_wOIa8ZbBkw-f6wtalM6Lcc6hb0W4DuCdkwrUTDtYSnZAO9EI2QnTojL3I-AIBqmXxOzjj0GphoN-TP9YiupOhs8iHuk52one14zCHTJYd5Ty11cdqF2ZYQZxoHmktd5hKcHS_oHuOEJQV3UX2eznEew4w20Z91KDTZgvTOpmB3YQzlSAe0ZUmYX5Jngx0zvrqfz8mPj9ffrz43N18_fbm6vGlcq1lpmBfCDX7w2gP3zClUTGvX9jC03mkLw8A9dtIrh7r3rfSsF4hux9tBseo-J29PdW9T_L1gLmYK2eE42hnjko1WkouOVdejSskl8F6xqnz3XyWTitUbik5XaXuSuhRzTjiY2xQmm46GgVkxmoM5YTQrRgO9qRir7c19h2W3nj2YHrhVwYeTAOvf3QVMJruAs0MfUuVpfAyPdfi3gKvgVqa_8Ij5EJdUY1DfYjI3YL6tUVqTxGqI1t8SfwG7cseG</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1671490358</pqid></control><display><type>article</type><title>Electrocardiogram analysis using a combination of statistical, geometric, and nonlinear heart rate variability features</title><source>MEDLINE</source><source>ScienceDirect Journals (5 years ago - present)</source><creator>Jovic, Alan ; Bogunovic, Nikola</creator><creatorcontrib>Jovic, Alan ; Bogunovic, Nikola</creatorcontrib><description>Abstract Objective The paper addresses a common and recurring problem of electrocardiogram (ECG) classification based on heart rate variability (HRV) analysis. Current understanding of the limits of HRV analysis in diagnosing different cardiac conditions is not complete. Existing research suggests that a combination of carefully selected linear and nonlinear HRV features should significantly improve the accuracy for both binary and multiclass classification problems. 
The primary goal of this work is to evaluate a proposed combination of HRV features. Other explored objectives are the comparison of different machine learning algorithms in the HRV analysis and the inspection of the most suitable period T between two consecutively analyzed R-R intervals for nonlinear features. Methods and material We extracted 11 features from 5 min of R-R interval recordings: SDNN, RMSSD, pNN20, HRV triangular index (HTI), spatial filling index (SFI), correlation dimension, central tendency measure (CTM), and four approximate entropy features (ApEn1-ApEn4). Analyzed heart conditions included normal heart rhythm, arrhythmia (any), supraventricular arrhythmia, and congestive heart failure. One hundred patient records from six online databases were analyzed, 25 for each condition. Feature vectors were extracted by a platform designed for this purpose, named ECG Chaos Extractor. The vectors were then analyzed by seven clustering and classification algorithms in the Weka system: K-means, expectation maximization (EM), C4.5 decision tree, Bayesian network, artificial neural network (ANN), support vector machines (SVM) and random forest (RF). Four-class and two-class (normal vs. abnormal) classification was performed. Relevance of particular features was evaluated using 1-Rule and C4.5 decision tree in the cases of individual features classification and classification with features’ pairs. Results Average total classification accuracy obtained for top three classification methods in the two classes’ case was: RF 99.7%, ANN 99.1%, SVM 98.9%. In the four classes’ case the best results were: RF 99.6%, Bayesian network 99.4%, SVM 98.4%. The best overall method was RF. C4.5 decision tree was successful in the construction of useful classification rules for the two classes’ case. EM and K-means showed comparable clustering results: around 50% for the four classes’ case and around 75% for the two classes’ case. 
HTI, pNN20, RMSSD, ApEn3, ApEn4 and SFI were shown to be the most relevant features. HTI in particular appears in most of the top-ranked pairs of features and is the best analyzed feature. The choice of the period T for nonlinear features was shown to be arbitrary. However, a combination of five different periods significantly improved classification accuracy, from 70% for a single period up to 99% for five periods. Conclusions Analysis shows that the proposed combination of 11 linear and nonlinear HRV features gives high classification accuracy when nonlinear features are extracted for five periods. The features’ combination was thoroughly analyzed using several machine learning algorithms. In particular, RF algorithm proved to be highly efficient and accurate in both binary and multiclass classification of HRV records. Interpretable and useful rules were obtained with C4.5 decision tree. Further work in this area should elucidate which features should be extracted for the best classification results for specific types of cardiac disorders.</description><identifier>ISSN: 0933-3657</identifier><identifier>EISSN: 1873-2860</identifier><identifier>DOI: 10.1016/j.artmed.2010.09.005</identifier><identifier>PMID: 20980134</identifier><language>eng</language><publisher>Netherlands: Elsevier B.V</publisher><subject>Algorithms ; C4.5 decision tree ; Classification ; Data Interpretation, Statistical ; Decision trees ; Electrocardiography - methods ; Heart - physiology ; Heart disorder classification ; Heart Rate - physiology ; Heart rate variability ; Humans ; Internal Medicine ; Learning theory ; Neural networks ; Nonlinear analysis ; Nonlinear Dynamics ; Nonlinearity ; Other ; Radio frequencies ; Random forest ; Support vector machines</subject><ispartof>Artificial intelligence in medicine, 2011-03, Vol.51 (3), p.175-186</ispartof><rights>Elsevier B.V.</rights><rights>2010 Elsevier B.V.</rights><rights>Copyright © 2010 Elsevier B.V. 
All rights reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c481t-1d33cfdfd8d02d1c7e7188c490f4dc8a0ff2de56d7ce89d46d193eecb24f711d3</citedby><cites>FETCH-LOGICAL-c481t-1d33cfdfd8d02d1c7e7188c490f4dc8a0ff2de56d7ce89d46d193eecb24f711d3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.artmed.2010.09.005$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,780,784,3548,27923,27924,45994</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/20980134$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Jovic, Alan</creatorcontrib><creatorcontrib>Bogunovic, Nikola</creatorcontrib><title>Electrocardiogram analysis using a combination of statistical, geometric, and nonlinear heart rate variability features</title><title>Artificial intelligence in medicine</title><addtitle>Artif Intell Med</addtitle><description>Abstract Objective The paper addresses a common and recurring problem of electrocardiogram (ECG) classification based on heart rate variability (HRV) analysis. Current understanding of the limits of HRV analysis in diagnosing different cardiac conditions is not complete. Existing research suggests that a combination of carefully selected linear and nonlinear HRV features should significantly improve the accuracy for both binary and multiclass classification problems. The primary goal of this work is to evaluate a proposed combination of HRV features. Other explored objectives are the comparison of different machine learning algorithms in the HRV analysis and the inspection of the most suitable period T between two consecutively analyzed R-R intervals for nonlinear features. 
Methods and material We extracted 11 features from 5 min of R-R interval recordings: SDNN, RMSSD, pNN20, HRV triangular index (HTI), spatial filling index (SFI), correlation dimension, central tendency measure (CTM), and four approximate entropy features (ApEn1-ApEn4). Analyzed heart conditions included normal heart rhythm, arrhythmia (any), supraventricular arrhythmia, and congestive heart failure. One hundred patient records from six online databases were analyzed, 25 for each condition. Feature vectors were extracted by a platform designed for this purpose, named ECG Chaos Extractor. The vectors were then analyzed by seven clustering and classification algorithms in the Weka system: K-means, expectation maximization (EM), C4.5 decision tree, Bayesian network, artificial neural network (ANN), support vector machines (SVM) and random forest (RF). Four-class and two-class (normal vs. abnormal) classification was performed. Relevance of particular features was evaluated using 1-Rule and C4.5 decision tree in the cases of individual features classification and classification with features’ pairs. Results Average total classification accuracy obtained for top three classification methods in the two classes’ case was: RF 99.7%, ANN 99.1%, SVM 98.9%. In the four classes’ case the best results were: RF 99.6%, Bayesian network 99.4%, SVM 98.4%. The best overall method was RF. C4.5 decision tree was successful in the construction of useful classification rules for the two classes’ case. EM and K-means showed comparable clustering results: around 50% for the four classes’ case and around 75% for the two classes’ case. HTI, pNN20, RMSSD, ApEn3, ApEn4 and SFI were shown to be the most relevant features. HTI in particular appears in most of the top-ranked pairs of features and is the best analyzed feature. The choice of the period T for nonlinear features was shown to be arbitrary. 
However, a combination of five different periods significantly improved classification accuracy, from 70% for a single period up to 99% for five periods. Conclusions Analysis shows that the proposed combination of 11 linear and nonlinear HRV features gives high classification accuracy when nonlinear features are extracted for five periods. The features’ combination was thoroughly analyzed using several machine learning algorithms. In particular, RF algorithm proved to be highly efficient and accurate in both binary and multiclass classification of HRV records. Interpretable and useful rules were obtained with C4.5 decision tree. Further work in this area should elucidate which features should be extracted for the best classification results for specific types of cardiac disorders.</description><subject>Algorithms</subject><subject>C4.5 decision tree</subject><subject>Classification</subject><subject>Data Interpretation, Statistical</subject><subject>Decision trees</subject><subject>Electrocardiography - methods</subject><subject>Heart - physiology</subject><subject>Heart disorder classification</subject><subject>Heart Rate - physiology</subject><subject>Heart rate variability</subject><subject>Humans</subject><subject>Internal Medicine</subject><subject>Learning theory</subject><subject>Neural networks</subject><subject>Nonlinear analysis</subject><subject>Nonlinear Dynamics</subject><subject>Nonlinearity</subject><subject>Other</subject><subject>Radio frequencies</subject><subject>Random forest</subject><subject>Support vector 
machines</subject><issn>0933-3657</issn><issn>1873-2860</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2011</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqFkk9vFSEUxYnR2Gf1GxjDThed5wVmgNmYNE39kzRxoa4JD-48ec4MFZia9-1l8tqNC7sBAufcC_wOIa8ZbBkw-f6wtalM6Lcc6hb0W4DuCdkwrUTDtYSnZAO9EI2QnTojL3I-AIBqmXxOzjj0GphoN-TP9YiupOhs8iHuk52one14zCHTJYd5Ty11cdqF2ZYQZxoHmktd5hKcHS_oHuOEJQV3UX2eznEew4w20Z91KDTZgvTOpmB3YQzlSAe0ZUmYX5Jngx0zvrqfz8mPj9ffrz43N18_fbm6vGlcq1lpmBfCDX7w2gP3zClUTGvX9jC03mkLw8A9dtIrh7r3rfSsF4hux9tBseo-J29PdW9T_L1gLmYK2eE42hnjko1WkouOVdejSskl8F6xqnz3XyWTitUbik5XaXuSuhRzTjiY2xQmm46GgVkxmoM5YTQrRgO9qRir7c19h2W3nj2YHrhVwYeTAOvf3QVMJruAs0MfUuVpfAyPdfi3gKvgVqa_8Ij5EJdUY1DfYjI3YL6tUVqTxGqI1t8SfwG7cseG</recordid><startdate>20110301</startdate><enddate>20110301</enddate><creator>Jovic, Alan</creator><creator>Bogunovic, Nikola</creator><general>Elsevier B.V</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>7X8</scope><scope>7QO</scope><scope>FR3</scope><scope>P64</scope></search><sort><creationdate>20110301</creationdate><title>Electrocardiogram analysis using a combination of statistical, geometric, and nonlinear heart rate variability features</title><author>Jovic, Alan ; Bogunovic, Nikola</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c481t-1d33cfdfd8d02d1c7e7188c490f4dc8a0ff2de56d7ce89d46d193eecb24f711d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2011</creationdate><topic>Algorithms</topic><topic>C4.5 decision tree</topic><topic>Classification</topic><topic>Data Interpretation, Statistical</topic><topic>Decision trees</topic><topic>Electrocardiography - methods</topic><topic>Heart - 
physiology</topic><topic>Heart disorder classification</topic><topic>Heart Rate - physiology</topic><topic>Heart rate variability</topic><topic>Humans</topic><topic>Internal Medicine</topic><topic>Learning theory</topic><topic>Neural networks</topic><topic>Nonlinear analysis</topic><topic>Nonlinear Dynamics</topic><topic>Nonlinearity</topic><topic>Other</topic><topic>Radio frequencies</topic><topic>Random forest</topic><topic>Support vector machines</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Jovic, Alan</creatorcontrib><creatorcontrib>Bogunovic, Nikola</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>MEDLINE - Academic</collection><collection>Biotechnology Research Abstracts</collection><collection>Engineering Research Database</collection><collection>Biotechnology and BioEngineering Abstracts</collection><jtitle>Artificial intelligence in medicine</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Jovic, Alan</au><au>Bogunovic, Nikola</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Electrocardiogram analysis using a combination of statistical, geometric, and nonlinear heart rate variability features</atitle><jtitle>Artificial intelligence in medicine</jtitle><addtitle>Artif Intell 
Med</addtitle><date>2011-03-01</date><risdate>2011</risdate><volume>51</volume><issue>3</issue><spage>175</spage><epage>186</epage><pages>175-186</pages><issn>0933-3657</issn><eissn>1873-2860</eissn><abstract>Abstract Objective The paper addresses a common and recurring problem of electrocardiogram (ECG) classification based on heart rate variability (HRV) analysis. Current understanding of the limits of HRV analysis in diagnosing different cardiac conditions is not complete. Existing research suggests that a combination of carefully selected linear and nonlinear HRV features should significantly improve the accuracy for both binary and multiclass classification problems. The primary goal of this work is to evaluate a proposed combination of HRV features. Other explored objectives are the comparison of different machine learning algorithms in the HRV analysis and the inspection of the most suitable period T between two consecutively analyzed R-R intervals for nonlinear features. Methods and material We extracted 11 features from 5 min of R-R interval recordings: SDNN, RMSSD, pNN20, HRV triangular index (HTI), spatial filling index (SFI), correlation dimension, central tendency measure (CTM), and four approximate entropy features (ApEn1-ApEn4). Analyzed heart conditions included normal heart rhythm, arrhythmia (any), supraventricular arrhythmia, and congestive heart failure. One hundred patient records from six online databases were analyzed, 25 for each condition. Feature vectors were extracted by a platform designed for this purpose, named ECG Chaos Extractor. The vectors were then analyzed by seven clustering and classification algorithms in the Weka system: K-means, expectation maximization (EM), C4.5 decision tree, Bayesian network, artificial neural network (ANN), support vector machines (SVM) and random forest (RF). Four-class and two-class (normal vs. abnormal) classification was performed. 
Relevance of particular features was evaluated using 1-Rule and C4.5 decision tree in the cases of individual features classification and classification with features’ pairs. Results Average total classification accuracy obtained for top three classification methods in the two classes’ case was: RF 99.7%, ANN 99.1%, SVM 98.9%. In the four classes’ case the best results were: RF 99.6%, Bayesian network 99.4%, SVM 98.4%. The best overall method was RF. C4.5 decision tree was successful in the construction of useful classification rules for the two classes’ case. EM and K-means showed comparable clustering results: around 50% for the four classes’ case and around 75% for the two classes’ case. HTI, pNN20, RMSSD, ApEn3, ApEn4 and SFI were shown to be the most relevant features. HTI in particular appears in most of the top-ranked pairs of features and is the best analyzed feature. The choice of the period T for nonlinear features was shown to be arbitrary. However, a combination of five different periods significantly improved classification accuracy, from 70% for a single period up to 99% for five periods. Conclusions Analysis shows that the proposed combination of 11 linear and nonlinear HRV features gives high classification accuracy when nonlinear features are extracted for five periods. The features’ combination was thoroughly analyzed using several machine learning algorithms. In particular, RF algorithm proved to be highly efficient and accurate in both binary and multiclass classification of HRV records. Interpretable and useful rules were obtained with C4.5 decision tree. Further work in this area should elucidate which features should be extracted for the best classification results for specific types of cardiac disorders.</abstract><cop>Netherlands</cop><pub>Elsevier B.V</pub><pmid>20980134</pmid><doi>10.1016/j.artmed.2010.09.005</doi><tpages>12</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0933-3657
ispartof Artificial intelligence in medicine, 2011-03, Vol.51 (3), p.175-186
issn 0933-3657
1873-2860
language eng
recordid cdi_proquest_miscellaneous_876235119
source MEDLINE; ScienceDirect Journals (5 years ago - present)
subjects Algorithms
C4.5 decision tree
Classification
Data Interpretation, Statistical
Decision trees
Electrocardiography - methods
Heart - physiology
Heart disorder classification
Heart Rate - physiology
Heart rate variability
Humans
Internal Medicine
Learning theory
Neural networks
Nonlinear analysis
Nonlinear Dynamics
Nonlinearity
Other
Radio frequencies
Random forest
Support vector machines
title Electrocardiogram analysis using a combination of statistical, geometric, and nonlinear heart rate variability features
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-13T01%3A33%3A19IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Electrocardiogram%20analysis%20using%20a%20combination%20of%20statistical,%20geometric,%20and%20nonlinear%20heart%20rate%20variability%20features&rft.jtitle=Artificial%20intelligence%20in%20medicine&rft.au=Jovic,%20Alan&rft.date=2011-03-01&rft.volume=51&rft.issue=3&rft.spage=175&rft.epage=186&rft.pages=175-186&rft.issn=0933-3657&rft.eissn=1873-2860&rft_id=info:doi/10.1016/j.artmed.2010.09.005&rft_dat=%3Cproquest_cross%3E1671490358%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1671490358&rft_id=info:pmid/20980134&rft_els_id=S0933365710001193&rfr_iscdi=true
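
Note on the time-domain features named in the abstract: three of the 11 features (SDNN, RMSSD, pNN20) have widely used textbook definitions, and the following is a minimal Python sketch of those definitions for a list of R-R intervals in milliseconds. It is not the ECG Chaos Extractor implementation, which this record does not describe. The function name `time_domain_hrv`, the use of the sample standard deviation for SDNN, and the example values are assumptions made for illustration only.

```python
import math
import statistics

# Threshold in milliseconds for pNN20 (share of successive differences > 20 ms).
PNN_THRESHOLD_MS = 20.0


def time_domain_hrv(rr_ms):
    """Return SDNN, RMSSD and pNN20 for a sequence of R-R intervals in ms.

    Hypothetical helper: uses common textbook definitions, which may differ
    in detail from those used in the paper.
    """
    if len(rr_ms) < 3:
        raise ValueError("need at least three R-R intervals")

    # Differences between consecutive R-R intervals.
    diffs = [b - a for a, b in zip(rr_ms, rr_ms[1:])]

    # SDNN: standard deviation of all intervals (sample SD is an assumption).
    sdnn = statistics.stdev(rr_ms)

    # RMSSD: root mean square of the successive differences.
    rmssd = math.sqrt(sum(d * d for d in diffs) / len(diffs))

    # pNN20: percentage of successive differences larger than 20 ms in magnitude.
    pnn20 = 100.0 * sum(abs(d) > PNN_THRESHOLD_MS for d in diffs) / len(diffs)

    return {"sdnn": sdnn, "rmssd": rmssd, "pnn20": pnn20}


# Made-up example values, not data from the study.
rr_example = [800, 810, 790, 850, 845]
features = time_domain_hrv(rr_example)
print(features)
```

In a setting like the one the abstract describes, such a function would be applied to each 5 min segment of R-R intervals, with the resulting values forming part of the feature vector passed to a classifier.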