Electrocardiogram analysis using a combination of statistical, geometric, and nonlinear heart rate variability features
Abstract Objective The paper addresses a common and recurring problem of electrocardiogram (ECG) classification based on heart rate variability (HRV) analysis. Current understanding of the limits of HRV analysis in diagnosing different cardiac conditions is not complete. Existing research suggests t...
Gespeichert in:
Veröffentlicht in: | Artificial intelligence in medicine 2011-03, Vol.51 (3), p.175-186 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 186 |
---|---|
container_issue | 3 |
container_start_page | 175 |
container_title | Artificial intelligence in medicine |
container_volume | 51 |
creator | Jovic, Alan Bogunovic, Nikola |
description | Abstract Objective The paper addresses a common and recurring problem of electrocardiogram (ECG) classification based on heart rate variability (HRV) analysis. Current understanding of the limits of HRV analysis in diagnosing different cardiac conditions is not complete. Existing research suggests that a combination of carefully selected linear and nonlinear HRV features should significantly improve the accuracy for both binary and multiclass classification problems. The primary goal of this work is to evaluate a proposed combination of HRV features. Other explored objectives are the comparison of different machine learning algorithms in the HRV analysis and the inspection of the most suitable period T between two consecutively analyzed R-R intervals for nonlinear features. Methods and material We extracted 11 features from 5 min of R-R interval recordings: SDNN, RMSSD, pNN20, HRV triangular index (HTI), spatial filling index (SFI), correlation dimension, central tendency measure (CTM), and four approximate entropy features (ApEn1-ApEn4). Analyzed heart conditions included normal heart rhythm, arrhythmia (any), supraventricular arrhythmia, and congestive heart failure. One hundred patient records from six online databases were analyzed, 25 for each condition. Feature vectors were extracted by a platform designed for this purpose, named ECG Chaos Extractor. The vectors were then analyzed by seven clustering and classification algorithms in the Weka system: K-means, expectation maximization (EM), C4.5 decision tree, Bayesian network, artificial neural network (ANN), support vector machines (SVM) and random forest (RF). Four-class and two-class (normal vs. abnormal) classification was performed. Relevance of particular features was evaluated using 1-Rule and C4.5 decision tree in the cases of individual features classification and classification with features’ pairs. Results Average total classification accuracy obtained for top three classification methods in the two classes’ case was: RF 99.7%, ANN 99.1%, SVM 98.9%. In the four classes’ case the best results were: RF 99.6%, Bayesian network 99.4%, SVM 98.4%. The best overall method was RF. C4.5 decision tree was successful in the construction of useful classification rules for the two classes’ case. EM and K-means showed comparable clustering results: around 50% for the four classes’ case and around 75% for the two classes’ case. HTI, pNN20, RMSSD, ApEn3, ApEn4 and SFI were shown to be the most relevant fea |
doi_str_mv | 10.1016/j.artmed.2010.09.005 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_876235119</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0933365710001193</els_id><sourcerecordid>1671490358</sourcerecordid><originalsourceid>FETCH-LOGICAL-c481t-1d33cfdfd8d02d1c7e7188c490f4dc8a0ff2de56d7ce89d46d193eecb24f711d3</originalsourceid><addsrcrecordid>eNqFkk9vFSEUxYnR2Gf1GxjDThed5wVmgNmYNE39kzRxoa4JD-48ec4MFZia9-1l8tqNC7sBAufcC_wOIa8ZbBkw-f6wtalM6Lcc6hb0W4DuCdkwrUTDtYSnZAO9EI2QnTojL3I-AIBqmXxOzjj0GphoN-TP9YiupOhs8iHuk52one14zCHTJYd5Ty11cdqF2ZYQZxoHmktd5hKcHS_oHuOEJQV3UX2eznEew4w20Z91KDTZgvTOpmB3YQzlSAe0ZUmYX5Jngx0zvrqfz8mPj9ffrz43N18_fbm6vGlcq1lpmBfCDX7w2gP3zClUTGvX9jC03mkLw8A9dtIrh7r3rfSsF4hux9tBseo-J29PdW9T_L1gLmYK2eE42hnjko1WkouOVdejSskl8F6xqnz3XyWTitUbik5XaXuSuhRzTjiY2xQmm46GgVkxmoM5YTQrRgO9qRir7c19h2W3nj2YHrhVwYeTAOvf3QVMJruAs0MfUuVpfAyPdfi3gKvgVqa_8Ij5EJdUY1DfYjI3YL6tUVqTxGqI1t8SfwG7cseG</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1671490358</pqid></control><display><type>article</type><title>Electrocardiogram analysis using a combination of statistical, geometric, and nonlinear heart rate variability features</title><source>MEDLINE</source><source>ScienceDirect Journals (5 years ago - present)</source><creator>Jovic, Alan ; Bogunovic, Nikola</creator><creatorcontrib>Jovic, Alan ; Bogunovic, Nikola</creatorcontrib><description>Abstract Objective The paper addresses a common and recurring problem of electrocardiogram (ECG) classification based on heart rate variability (HRV) analysis. Current understanding of the limits of HRV analysis in diagnosing different cardiac conditions is not complete. Existing research suggests that a combination of carefully selected linear and nonlinear HRV features should significantly improve the accuracy for both binary and multiclass classification problems. The primary goal of this work is to evaluate a proposed combination of HRV features. Other explored objectives are the comparison of different machine learning algorithms in the HRV analysis and the inspection of the most suitable period T between two consecutively analyzed R-R intervals for nonlinear features. Methods and material We extracted 11 features from 5 min of R-R interval recordings: SDNN, RMSSD, pNN20, HRV triangular index (HTI), spatial filling index (SFI), correlation dimension, central tendency measure (CTM), and four approximate entropy features (ApEn1-ApEn4). Analyzed heart conditions included normal heart rhythm, arrhythmia (any), supraventricular arrhythmia, and congestive heart failure. One hundred patient records from six online databases were analyzed, 25 for each condition. Feature vectors were extracted by a platform designed for this purpose, named ECG Chaos Extractor. The vectors were then analyzed by seven clustering and classification algorithms in the Weka system: K-means, expectation maximization (EM), C4.5 decision tree, Bayesian network, artificial neural network (ANN), support vector machines (SVM) and random forest (RF). Four-class and two-class (normal vs. abnormal) classification was performed. Relevance of particular features was evaluated using 1-Rule and C4.5 decision tree in the cases of individual features classification and classification with features’ pairs. Results Average total classification accuracy obtained for top three classification methods in the two classes’ case was: RF 99.7%, ANN 99.1%, SVM 98.9%. In the four classes’ case the best results were: RF 99.6%, Bayesian network 99.4%, SVM 98.4%. The best overall method was RF. C4.5 decision tree was successful in the construction of useful classification rules for the two classes’ case. EM and K-means showed comparable clustering results: around 50% for the four classes’ case and around 75% for the two classes’ case. HTI, pNN20, RMSSD, ApEn3, ApEn4 and SFI were shown to be the most relevant features. HTI in particular appears in most of the top-ranked pairs of features and is the best analyzed feature. The choice of the period T for nonlinear features was shown to be arbitrary. However, a combination of five different periods significantly improved classification accuracy, from 70% for a single period up to 99% for five periods. Conclusions Analysis shows that the proposed combination of 11 linear and nonlinear HRV features gives high classification accuracy when nonlinear features are extracted for five periods. The features’ combination was thoroughly analyzed using several machine learning algorithms. In particular, RF algorithm proved to be highly efficient and accurate in both binary and multiclass classification of HRV records. Interpretable and useful rules were obtained with C4.5 decision tree. Further work in this area should elucidate which features should be extracted for the best classification results for specific types of cardiac disorders.</description><identifier>ISSN: 0933-3657</identifier><identifier>EISSN: 1873-2860</identifier><identifier>DOI: 10.1016/j.artmed.2010.09.005</identifier><identifier>PMID: 20980134</identifier><language>eng</language><publisher>Netherlands: Elsevier B.V</publisher><subject>Algorithms ; C4.5 decision tree ; Classification ; Data Interpretation, Statistical ; Decision trees ; Electrocardiography - methods ; Heart - physiology ; Heart disorder classification ; Heart Rate - physiology ; Heart rate variability ; Humans ; Internal Medicine ; Learning theory ; Neural networks ; Nonlinear analysis ; Nonlinear Dynamics ; Nonlinearity ; Other ; Radio frequencies ; Random forest ; Support vector machines</subject><ispartof>Artificial intelligence in medicine, 2011-03, Vol.51 (3), p.175-186</ispartof><rights>Elsevier B.V.</rights><rights>2010 Elsevier B.V.</rights><rights>Copyright © 2010 Elsevier B.V. All rights reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c481t-1d33cfdfd8d02d1c7e7188c490f4dc8a0ff2de56d7ce89d46d193eecb24f711d3</citedby><cites>FETCH-LOGICAL-c481t-1d33cfdfd8d02d1c7e7188c490f4dc8a0ff2de56d7ce89d46d193eecb24f711d3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.artmed.2010.09.005$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,780,784,3548,27923,27924,45994</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/20980134$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Jovic, Alan</creatorcontrib><creatorcontrib>Bogunovic, Nikola</creatorcontrib><title>Electrocardiogram analysis using a combination of statistical, geometric, and nonlinear heart rate variability features</title><title>Artificial intelligence in medicine</title><addtitle>Artif Intell Med</addtitle><description>Abstract Objective The paper addresses a common and recurring problem of electrocardiogram (ECG) classification based on heart rate variability (HRV) analysis. Current understanding of the limits of HRV analysis in diagnosing different cardiac conditions is not complete. Existing research suggests that a combination of carefully selected linear and nonlinear HRV features should significantly improve the accuracy for both binary and multiclass classification problems. The primary goal of this work is to evaluate a proposed combination of HRV features. Other explored objectives are the comparison of different machine learning algorithms in the HRV analysis and the inspection of the most suitable period T between two consecutively analyzed R-R intervals for nonlinear features. Methods and material We extracted 11 features from 5 min of R-R interval recordings: SDNN, RMSSD, pNN20, HRV triangular index (HTI), spatial filling index (SFI), correlation dimension, central tendency measure (CTM), and four approximate entropy features (ApEn1-ApEn4). Analyzed heart conditions included normal heart rhythm, arrhythmia (any), supraventricular arrhythmia, and congestive heart failure. One hundred patient records from six online databases were analyzed, 25 for each condition. Feature vectors were extracted by a platform designed for this purpose, named ECG Chaos Extractor. The vectors were then analyzed by seven clustering and classification algorithms in the Weka system: K-means, expectation maximization (EM), C4.5 decision tree, Bayesian network, artificial neural network (ANN), support vector machines (SVM) and random forest (RF). Four-class and two-class (normal vs. abnormal) classification was performed. Relevance of particular features was evaluated using 1-Rule and C4.5 decision tree in the cases of individual features classification and classification with features’ pairs. Results Average total classification accuracy obtained for top three classification methods in the two classes’ case was: RF 99.7%, ANN 99.1%, SVM 98.9%. In the four classes’ case the best results were: RF 99.6%, Bayesian network 99.4%, SVM 98.4%. The best overall method was RF. C4.5 decision tree was successful in the construction of useful classification rules for the two classes’ case. EM and K-means showed comparable clustering results: around 50% for the four classes’ case and around 75% for the two classes’ case. HTI, pNN20, RMSSD, ApEn3, ApEn4 and SFI were shown to be the most relevant features. HTI in particular appears in most of the top-ranked pairs of features and is the best analyzed feature. The choice of the period T for nonlinear features was shown to be arbitrary. However, a combination of five different periods significantly improved classification accuracy, from 70% for a single period up to 99% for five periods. Conclusions Analysis shows that the proposed combination of 11 linear and nonlinear HRV features gives high classification accuracy when nonlinear features are extracted for five periods. The features’ combination was thoroughly analyzed using several machine learning algorithms. In particular, RF algorithm proved to be highly efficient and accurate in both binary and multiclass classification of HRV records. Interpretable and useful rules were obtained with C4.5 decision tree. Further work in this area should elucidate which features should be extracted for the best classification results for specific types of cardiac disorders.</description><subject>Algorithms</subject><subject>C4.5 decision tree</subject><subject>Classification</subject><subject>Data Interpretation, Statistical</subject><subject>Decision trees</subject><subject>Electrocardiography - methods</subject><subject>Heart - physiology</subject><subject>Heart disorder classification</subject><subject>Heart Rate - physiology</subject><subject>Heart rate variability</subject><subject>Humans</subject><subject>Internal Medicine</subject><subject>Learning theory</subject><subject>Neural networks</subject><subject>Nonlinear analysis</subject><subject>Nonlinear Dynamics</subject><subject>Nonlinearity</subject><subject>Other</subject><subject>Radio frequencies</subject><subject>Random forest</subject><subject>Support vector machines</subject><issn>0933-3657</issn><issn>1873-2860</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2011</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqFkk9vFSEUxYnR2Gf1GxjDThed5wVmgNmYNE39kzRxoa4JD-48ec4MFZia9-1l8tqNC7sBAufcC_wOIa8ZbBkw-f6wtalM6Lcc6hb0W4DuCdkwrUTDtYSnZAO9EI2QnTojL3I-AIBqmXxOzjj0GphoN-TP9YiupOhs8iHuk52one14zCHTJYd5Ty11cdqF2ZYQZxoHmktd5hKcHS_oHuOEJQV3UX2eznEew4w20Z91KDTZgvTOpmB3YQzlSAe0ZUmYX5Jngx0zvrqfz8mPj9ffrz43N18_fbm6vGlcq1lpmBfCDX7w2gP3zClUTGvX9jC03mkLw8A9dtIrh7r3rfSsF4hux9tBseo-J29PdW9T_L1gLmYK2eE42hnjko1WkouOVdejSskl8F6xqnz3XyWTitUbik5XaXuSuhRzTjiY2xQmm46GgVkxmoM5YTQrRgO9qRir7c19h2W3nj2YHrhVwYeTAOvf3QVMJruAs0MfUuVpfAyPdfi3gKvgVqa_8Ij5EJdUY1DfYjI3YL6tUVqTxGqI1t8SfwG7cseG</recordid><startdate>20110301</startdate><enddate>20110301</enddate><creator>Jovic, Alan</creator><creator>Bogunovic, Nikola</creator><general>Elsevier B.V</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>7X8</scope><scope>7QO</scope><scope>FR3</scope><scope>P64</scope></search><sort><creationdate>20110301</creationdate><title>Electrocardiogram analysis using a combination of statistical, geometric, and nonlinear heart rate variability features</title><author>Jovic, Alan ; Bogunovic, Nikola</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c481t-1d33cfdfd8d02d1c7e7188c490f4dc8a0ff2de56d7ce89d46d193eecb24f711d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2011</creationdate><topic>Algorithms</topic><topic>C4.5 decision tree</topic><topic>Classification</topic><topic>Data Interpretation, Statistical</topic><topic>Decision trees</topic><topic>Electrocardiography - methods</topic><topic>Heart - physiology</topic><topic>Heart disorder classification</topic><topic>Heart Rate - physiology</topic><topic>Heart rate variability</topic><topic>Humans</topic><topic>Internal Medicine</topic><topic>Learning theory</topic><topic>Neural networks</topic><topic>Nonlinear analysis</topic><topic>Nonlinear Dynamics</topic><topic>Nonlinearity</topic><topic>Other</topic><topic>Radio frequencies</topic><topic>Random forest</topic><topic>Support vector machines</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Jovic, Alan</creatorcontrib><creatorcontrib>Bogunovic, Nikola</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>MEDLINE - Academic</collection><collection>Biotechnology Research Abstracts</collection><collection>Engineering Research Database</collection><collection>Biotechnology and BioEngineering Abstracts</collection><jtitle>Artificial intelligence in medicine</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Jovic, Alan</au><au>Bogunovic, Nikola</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Electrocardiogram analysis using a combination of statistical, geometric, and nonlinear heart rate variability features</atitle><jtitle>Artificial intelligence in medicine</jtitle><addtitle>Artif Intell Med</addtitle><date>2011-03-01</date><risdate>2011</risdate><volume>51</volume><issue>3</issue><spage>175</spage><epage>186</epage><pages>175-186</pages><issn>0933-3657</issn><eissn>1873-2860</eissn><abstract>Abstract Objective The paper addresses a common and recurring problem of electrocardiogram (ECG) classification based on heart rate variability (HRV) analysis. Current understanding of the limits of HRV analysis in diagnosing different cardiac conditions is not complete. Existing research suggests that a combination of carefully selected linear and nonlinear HRV features should significantly improve the accuracy for both binary and multiclass classification problems. The primary goal of this work is to evaluate a proposed combination of HRV features. Other explored objectives are the comparison of different machine learning algorithms in the HRV analysis and the inspection of the most suitable period T between two consecutively analyzed R-R intervals for nonlinear features. Methods and material We extracted 11 features from 5 min of R-R interval recordings: SDNN, RMSSD, pNN20, HRV triangular index (HTI), spatial filling index (SFI), correlation dimension, central tendency measure (CTM), and four approximate entropy features (ApEn1-ApEn4). Analyzed heart conditions included normal heart rhythm, arrhythmia (any), supraventricular arrhythmia, and congestive heart failure. One hundred patient records from six online databases were analyzed, 25 for each condition. Feature vectors were extracted by a platform designed for this purpose, named ECG Chaos Extractor. The vectors were then analyzed by seven clustering and classification algorithms in the Weka system: K-means, expectation maximization (EM), C4.5 decision tree, Bayesian network, artificial neural network (ANN), support vector machines (SVM) and random forest (RF). Four-class and two-class (normal vs. abnormal) classification was performed. Relevance of particular features was evaluated using 1-Rule and C4.5 decision tree in the cases of individual features classification and classification with features’ pairs. Results Average total classification accuracy obtained for top three classification methods in the two classes’ case was: RF 99.7%, ANN 99.1%, SVM 98.9%. In the four classes’ case the best results were: RF 99.6%, Bayesian network 99.4%, SVM 98.4%. The best overall method was RF. C4.5 decision tree was successful in the construction of useful classification rules for the two classes’ case. EM and K-means showed comparable clustering results: around 50% for the four classes’ case and around 75% for the two classes’ case. HTI, pNN20, RMSSD, ApEn3, ApEn4 and SFI were shown to be the most relevant features. HTI in particular appears in most of the top-ranked pairs of features and is the best analyzed feature. The choice of the period T for nonlinear features was shown to be arbitrary. However, a combination of five different periods significantly improved classification accuracy, from 70% for a single period up to 99% for five periods. Conclusions Analysis shows that the proposed combination of 11 linear and nonlinear HRV features gives high classification accuracy when nonlinear features are extracted for five periods. The features’ combination was thoroughly analyzed using several machine learning algorithms. In particular, RF algorithm proved to be highly efficient and accurate in both binary and multiclass classification of HRV records. Interpretable and useful rules were obtained with C4.5 decision tree. Further work in this area should elucidate which features should be extracted for the best classification results for specific types of cardiac disorders.</abstract><cop>Netherlands</cop><pub>Elsevier B.V</pub><pmid>20980134</pmid><doi>10.1016/j.artmed.2010.09.005</doi><tpages>12</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0933-3657 |
ispartof | Artificial intelligence in medicine, 2011-03, Vol.51 (3), p.175-186 |
issn | 0933-3657 1873-2860 |
language | eng |
recordid | cdi_proquest_miscellaneous_876235119 |
source | MEDLINE; ScienceDirect Journals (5 years ago - present) |
subjects | Algorithms C4.5 decision tree Classification Data Interpretation, Statistical Decision trees Electrocardiography - methods Heart - physiology Heart disorder classification Heart Rate - physiology Heart rate variability Humans Internal Medicine Learning theory Neural networks Nonlinear analysis Nonlinear Dynamics Nonlinearity Other Radio frequencies Random forest Support vector machines |
title | Electrocardiogram analysis using a combination of statistical, geometric, and nonlinear heart rate variability features |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-13T01%3A33%3A19IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Electrocardiogram%20analysis%20using%20a%20combination%20of%20statistical,%20geometric,%20and%20nonlinear%20heart%20rate%20variability%20features&rft.jtitle=Artificial%20intelligence%20in%20medicine&rft.au=Jovic,%20Alan&rft.date=2011-03-01&rft.volume=51&rft.issue=3&rft.spage=175&rft.epage=186&rft.pages=175-186&rft.issn=0933-3657&rft.eissn=1873-2860&rft_id=info:doi/10.1016/j.artmed.2010.09.005&rft_dat=%3Cproquest_cross%3E1671490358%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1671490358&rft_id=info:pmid/20980134&rft_els_id=S0933365710001193&rfr_iscdi=true |