On the development of conjunctival hyperemia computer-assisted diagnosis tools: influence of feature selection and class imbalance in automatic gradings
Abstract Objective The sudden increase of blood flow in the bulbar conjunctiva, known as hyperemia, is associated to a red hue of variable intensity. Experts measure hyperemia using levels in a grading scale, a procedure that is subjective, non-repeatable and time consuming, thus creating a need for...
Gespeichert in:
Veröffentlicht in: | Artificial intelligence in medicine 2016-07, Vol.71, p.30-42 |
---|---|
Hauptverfasser: | , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 42 |
---|---|
container_issue | |
container_start_page | 30 |
container_title | Artificial intelligence in medicine |
container_volume | 71 |
creator | Sanchez Brea, Maria Luisa Barreira Rodriguez, Noelia Sanchez Marono, Noelia Mosquera Gonzalez, Antonio Garcia-Resua, Carlos Giraldez Fernandez, Maria Jesus |
description | Abstract Objective The sudden increase of blood flow in the bulbar conjunctiva, known as hyperemia, is associated to a red hue of variable intensity. Experts measure hyperemia using levels in a grading scale, a procedure that is subjective, non-repeatable and time consuming, thus creating a need for its automatisation. However, the task is far from straightforward due to data issues such as class imbalance or correlated features. In this paper, we study the specific features of hyperemia and propose various approaches to address these problems in the context of an automatic framework for hyperemia grading. Methodology Oversampling, undersampling and SMOTE approaches were applied in order to tackle the problem of class imbalance. 25 features were computed for each image and regression methods were then used to transform them into a value on the grading scale. The values and relationships among features and experts’ values were analysed, and five feature selection techniques were subsequently studied. Results The lowest mean square error (MSE) for the regression systems trained with individual features is below 0.1 for both scales. Multi-layer perceptron (MLP) obtains the best values, but is less consistent than the random forest (RF) method. When all features are combined, the best results for both scales are achieved with MLP. Correlation based feature selection (CFS) and M5 provide the best results, MSE = 0.108 and MSE = 0.061 respectively. Finally, the class imbalance problem is minimised with the SMOTE approach for both scales (MSE < 0.006). Conclusions Machine learning methods are able to perform an objective assessment of hyperemia grading, removing both intra- and inter-expert subjectivity while providing a gain in computation time. SMOTE and oversampling approaches minimise the class imbalance problem, while feature selection reduces the number of features from 25 to 3-5 without worsening the MSE. As the differences between the system and a human expert are similar to the differences between experts, we can therefore conclude that the system behaves like an expert. |
doi_str_mv | 10.1016/j.artmed.2016.06.004 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1835629795</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>1_s2_0_S0933365716300215</els_id><sourcerecordid>1835629795</sourcerecordid><originalsourceid>FETCH-LOGICAL-c483t-9e55970a96a0d76dd60437051a8280c69eb32789be3f82630c697ceb114f905c3</originalsourceid><addsrcrecordid>eNqNUk2LFDEQbURxx9F_IJKjlx4rne6k40GQxS9Y2IN6DumkejZjdzIm6YH5J_5c08zuxYsLBaFeXr1K6lVVvaawo0D5u8NOxzyj3TUl20EJaJ9UG9oLVjc9h6fVBiRjNeOduKpepHQAANFS_ry6akQHnDZyU_259STfIbF4wikcZ_SZhJGY4A-LN9md9ETuzkeMODtd4Pm4ZIy1TsmljJZYp_c-lITkEKb0njg_Tgt6g6vMiDovEUnCCYtY8ER7S8xUyombBz3plegKvOQw6-wM2Udtnd-nl9WzUU8JX92f2-rn508_rr_WN7dfvl1_vKlN27NcS-w6KUBLrsEKbi2HlgnoqO6bHgyXOLBG9HJANvYNZyskDA6UtqOEzrBt9faie4zh94Ipq9klg1N5GoYlKdqzjjdSyO4RVNqJMmPJH0OFXoAAVqjthWpiSCniqI7RzTqeFQW1Oq0O6uK0Wp1WUKL8cVu9ue-wDOvdQ9GDtYXw4ULAMr2Tw6iScasx1sVihrLB_a_DvwJmct4ZPf3CM6ZDWKIvziiqUqNAfV-3bV02WqYMDe3YXwc40pA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1810870703</pqid></control><display><type>article</type><title>On the development of conjunctival hyperemia computer-assisted diagnosis tools: influence of feature selection and class imbalance in automatic gradings</title><source>MEDLINE</source><source>Elsevier ScienceDirect Journals Complete</source><creator>Sanchez Brea, Maria Luisa ; Barreira Rodriguez, Noelia ; Sanchez Marono, Noelia ; Mosquera Gonzalez, Antonio ; Garcia-Resua, Carlos ; Giraldez Fernandez, Maria Jesus</creator><creatorcontrib>Sanchez Brea, Maria Luisa ; Barreira Rodriguez, Noelia ; Sanchez Marono, Noelia ; Mosquera Gonzalez, Antonio ; Garcia-Resua, Carlos ; Giraldez Fernandez, Maria Jesus</creatorcontrib><description>Abstract Objective The sudden increase of blood flow in the bulbar conjunctiva, known as hyperemia, is associated to a red hue of variable intensity. Experts measure hyperemia using levels in a grading scale, a procedure that is subjective, non-repeatable and time consuming, thus creating a need for its automatisation. However, the task is far from straightforward due to data issues such as class imbalance or correlated features. In this paper, we study the specific features of hyperemia and propose various approaches to address these problems in the context of an automatic framework for hyperemia grading. Methodology Oversampling, undersampling and SMOTE approaches were applied in order to tackle the problem of class imbalance. 25 features were computed for each image and regression methods were then used to transform them into a value on the grading scale. The values and relationships among features and experts’ values were analysed, and five feature selection techniques were subsequently studied. Results The lowest mean square error (MSE) for the regression systems trained with individual features is below 0.1 for both scales. Multi-layer perceptron (MLP) obtains the best values, but is less consistent than the random forest (RF) method. When all features are combined, the best results for both scales are achieved with MLP. Correlation based feature selection (CFS) and M5 provide the best results, MSE = 0.108 and MSE = 0.061 respectively. Finally, the class imbalance problem is minimised with the SMOTE approach for both scales (MSE < 0.006). Conclusions Machine learning methods are able to perform an objective assessment of hyperemia grading, removing both intra- and inter-expert subjectivity while providing a gain in computation time. SMOTE and oversampling approaches minimise the class imbalance problem, while feature selection reduces the number of features from 25 to 3-5 without worsening the MSE. As the differences between the system and a human expert are similar to the differences between experts, we can therefore conclude that the system behaves like an expert.</description><identifier>ISSN: 0933-3657</identifier><identifier>EISSN: 1873-2860</identifier><identifier>DOI: 10.1016/j.artmed.2016.06.004</identifier><identifier>PMID: 27506129</identifier><language>eng</language><publisher>Netherlands: Elsevier B.V</publisher><subject>Assessments ; Automation ; Computation ; Correlation ; Correlation-based feature selection ; Diagnosis, Computer-Assisted ; Evaluation ; Grading ; Humans ; Hyperemia ; Hyperemia grading ; Image processing ; Internal Medicine ; Multi-layer perceptron ; Neural Networks (Computer) ; Other ; Oversampling ; Radial basis function network ; Random forests ; Regression ; Regression Analysis ; Relief ; SMOReg ; SMOTE ; Undersampling</subject><ispartof>Artificial intelligence in medicine, 2016-07, Vol.71, p.30-42</ispartof><rights>Elsevier B.V.</rights><rights>2016 Elsevier B.V.</rights><rights>Copyright © 2016 Elsevier B.V. All rights reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c483t-9e55970a96a0d76dd60437051a8280c69eb32789be3f82630c697ceb114f905c3</citedby><cites>FETCH-LOGICAL-c483t-9e55970a96a0d76dd60437051a8280c69eb32789be3f82630c697ceb114f905c3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S0933365716300215$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3536,27903,27904,65309</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/27506129$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Sanchez Brea, Maria Luisa</creatorcontrib><creatorcontrib>Barreira Rodriguez, Noelia</creatorcontrib><creatorcontrib>Sanchez Marono, Noelia</creatorcontrib><creatorcontrib>Mosquera Gonzalez, Antonio</creatorcontrib><creatorcontrib>Garcia-Resua, Carlos</creatorcontrib><creatorcontrib>Giraldez Fernandez, Maria Jesus</creatorcontrib><title>On the development of conjunctival hyperemia computer-assisted diagnosis tools: influence of feature selection and class imbalance in automatic gradings</title><title>Artificial intelligence in medicine</title><addtitle>Artif Intell Med</addtitle><description>Abstract Objective The sudden increase of blood flow in the bulbar conjunctiva, known as hyperemia, is associated to a red hue of variable intensity. Experts measure hyperemia using levels in a grading scale, a procedure that is subjective, non-repeatable and time consuming, thus creating a need for its automatisation. However, the task is far from straightforward due to data issues such as class imbalance or correlated features. In this paper, we study the specific features of hyperemia and propose various approaches to address these problems in the context of an automatic framework for hyperemia grading. Methodology Oversampling, undersampling and SMOTE approaches were applied in order to tackle the problem of class imbalance. 25 features were computed for each image and regression methods were then used to transform them into a value on the grading scale. The values and relationships among features and experts’ values were analysed, and five feature selection techniques were subsequently studied. Results The lowest mean square error (MSE) for the regression systems trained with individual features is below 0.1 for both scales. Multi-layer perceptron (MLP) obtains the best values, but is less consistent than the random forest (RF) method. When all features are combined, the best results for both scales are achieved with MLP. Correlation based feature selection (CFS) and M5 provide the best results, MSE = 0.108 and MSE = 0.061 respectively. Finally, the class imbalance problem is minimised with the SMOTE approach for both scales (MSE < 0.006). Conclusions Machine learning methods are able to perform an objective assessment of hyperemia grading, removing both intra- and inter-expert subjectivity while providing a gain in computation time. SMOTE and oversampling approaches minimise the class imbalance problem, while feature selection reduces the number of features from 25 to 3-5 without worsening the MSE. As the differences between the system and a human expert are similar to the differences between experts, we can therefore conclude that the system behaves like an expert.</description><subject>Assessments</subject><subject>Automation</subject><subject>Computation</subject><subject>Correlation</subject><subject>Correlation-based feature selection</subject><subject>Diagnosis, Computer-Assisted</subject><subject>Evaluation</subject><subject>Grading</subject><subject>Humans</subject><subject>Hyperemia</subject><subject>Hyperemia grading</subject><subject>Image processing</subject><subject>Internal Medicine</subject><subject>Multi-layer perceptron</subject><subject>Neural Networks (Computer)</subject><subject>Other</subject><subject>Oversampling</subject><subject>Radial basis function network</subject><subject>Random forests</subject><subject>Regression</subject><subject>Regression Analysis</subject><subject>Relief</subject><subject>SMOReg</subject><subject>SMOTE</subject><subject>Undersampling</subject><issn>0933-3657</issn><issn>1873-2860</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqNUk2LFDEQbURxx9F_IJKjlx4rne6k40GQxS9Y2IN6DumkejZjdzIm6YH5J_5c08zuxYsLBaFeXr1K6lVVvaawo0D5u8NOxzyj3TUl20EJaJ9UG9oLVjc9h6fVBiRjNeOduKpepHQAANFS_ry6akQHnDZyU_259STfIbF4wikcZ_SZhJGY4A-LN9md9ETuzkeMODtd4Pm4ZIy1TsmljJZYp_c-lITkEKb0njg_Tgt6g6vMiDovEUnCCYtY8ER7S8xUyombBz3plegKvOQw6-wM2Udtnd-nl9WzUU8JX92f2-rn508_rr_WN7dfvl1_vKlN27NcS-w6KUBLrsEKbi2HlgnoqO6bHgyXOLBG9HJANvYNZyskDA6UtqOEzrBt9faie4zh94Ipq9klg1N5GoYlKdqzjjdSyO4RVNqJMmPJH0OFXoAAVqjthWpiSCniqI7RzTqeFQW1Oq0O6uK0Wp1WUKL8cVu9ue-wDOvdQ9GDtYXw4ULAMr2Tw6iScasx1sVihrLB_a_DvwJmct4ZPf3CM6ZDWKIvziiqUqNAfV-3bV02WqYMDe3YXwc40pA</recordid><startdate>20160701</startdate><enddate>20160701</enddate><creator>Sanchez Brea, Maria Luisa</creator><creator>Barreira Rodriguez, Noelia</creator><creator>Sanchez Marono, Noelia</creator><creator>Mosquera Gonzalez, Antonio</creator><creator>Garcia-Resua, Carlos</creator><creator>Giraldez Fernandez, Maria Jesus</creator><general>Elsevier B.V</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>7QO</scope><scope>8FD</scope><scope>FR3</scope><scope>P64</scope><scope>7SC</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20160701</creationdate><title>On the development of conjunctival hyperemia computer-assisted diagnosis tools: influence of feature selection and class imbalance in automatic gradings</title><author>Sanchez Brea, Maria Luisa ; Barreira Rodriguez, Noelia ; Sanchez Marono, Noelia ; Mosquera Gonzalez, Antonio ; Garcia-Resua, Carlos ; Giraldez Fernandez, Maria Jesus</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c483t-9e55970a96a0d76dd60437051a8280c69eb32789be3f82630c697ceb114f905c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Assessments</topic><topic>Automation</topic><topic>Computation</topic><topic>Correlation</topic><topic>Correlation-based feature selection</topic><topic>Diagnosis, Computer-Assisted</topic><topic>Evaluation</topic><topic>Grading</topic><topic>Humans</topic><topic>Hyperemia</topic><topic>Hyperemia grading</topic><topic>Image processing</topic><topic>Internal Medicine</topic><topic>Multi-layer perceptron</topic><topic>Neural Networks (Computer)</topic><topic>Other</topic><topic>Oversampling</topic><topic>Radial basis function network</topic><topic>Random forests</topic><topic>Regression</topic><topic>Regression Analysis</topic><topic>Relief</topic><topic>SMOReg</topic><topic>SMOTE</topic><topic>Undersampling</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Sanchez Brea, Maria Luisa</creatorcontrib><creatorcontrib>Barreira Rodriguez, Noelia</creatorcontrib><creatorcontrib>Sanchez Marono, Noelia</creatorcontrib><creatorcontrib>Mosquera Gonzalez, Antonio</creatorcontrib><creatorcontrib>Garcia-Resua, Carlos</creatorcontrib><creatorcontrib>Giraldez Fernandez, Maria Jesus</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>Biotechnology Research Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Computer and Information Systems Abstracts</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Artificial intelligence in medicine</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Sanchez Brea, Maria Luisa</au><au>Barreira Rodriguez, Noelia</au><au>Sanchez Marono, Noelia</au><au>Mosquera Gonzalez, Antonio</au><au>Garcia-Resua, Carlos</au><au>Giraldez Fernandez, Maria Jesus</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>On the development of conjunctival hyperemia computer-assisted diagnosis tools: influence of feature selection and class imbalance in automatic gradings</atitle><jtitle>Artificial intelligence in medicine</jtitle><addtitle>Artif Intell Med</addtitle><date>2016-07-01</date><risdate>2016</risdate><volume>71</volume><spage>30</spage><epage>42</epage><pages>30-42</pages><issn>0933-3657</issn><eissn>1873-2860</eissn><abstract>Abstract Objective The sudden increase of blood flow in the bulbar conjunctiva, known as hyperemia, is associated to a red hue of variable intensity. Experts measure hyperemia using levels in a grading scale, a procedure that is subjective, non-repeatable and time consuming, thus creating a need for its automatisation. However, the task is far from straightforward due to data issues such as class imbalance or correlated features. In this paper, we study the specific features of hyperemia and propose various approaches to address these problems in the context of an automatic framework for hyperemia grading. Methodology Oversampling, undersampling and SMOTE approaches were applied in order to tackle the problem of class imbalance. 25 features were computed for each image and regression methods were then used to transform them into a value on the grading scale. The values and relationships among features and experts’ values were analysed, and five feature selection techniques were subsequently studied. Results The lowest mean square error (MSE) for the regression systems trained with individual features is below 0.1 for both scales. Multi-layer perceptron (MLP) obtains the best values, but is less consistent than the random forest (RF) method. When all features are combined, the best results for both scales are achieved with MLP. Correlation based feature selection (CFS) and M5 provide the best results, MSE = 0.108 and MSE = 0.061 respectively. Finally, the class imbalance problem is minimised with the SMOTE approach for both scales (MSE < 0.006). Conclusions Machine learning methods are able to perform an objective assessment of hyperemia grading, removing both intra- and inter-expert subjectivity while providing a gain in computation time. SMOTE and oversampling approaches minimise the class imbalance problem, while feature selection reduces the number of features from 25 to 3-5 without worsening the MSE. As the differences between the system and a human expert are similar to the differences between experts, we can therefore conclude that the system behaves like an expert.</abstract><cop>Netherlands</cop><pub>Elsevier B.V</pub><pmid>27506129</pmid><doi>10.1016/j.artmed.2016.06.004</doi><tpages>13</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0933-3657 |
ispartof | Artificial intelligence in medicine, 2016-07, Vol.71, p.30-42 |
issn | 0933-3657 1873-2860 |
language | eng |
recordid | cdi_proquest_miscellaneous_1835629795 |
source | MEDLINE; Elsevier ScienceDirect Journals Complete |
subjects | Assessments Automation Computation Correlation Correlation-based feature selection Diagnosis, Computer-Assisted Evaluation Grading Humans Hyperemia Hyperemia grading Image processing Internal Medicine Multi-layer perceptron Neural Networks (Computer) Other Oversampling Radial basis function network Random forests Regression Regression Analysis Relief SMOReg SMOTE Undersampling |
title | On the development of conjunctival hyperemia computer-assisted diagnosis tools: influence of feature selection and class imbalance in automatic gradings |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-24T17%3A01%3A28IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=On%20the%20development%20of%20conjunctival%20hyperemia%20computer-assisted%20diagnosis%20tools:%20influence%20of%20feature%20selection%20and%20class%20imbalance%20in%20automatic%20gradings&rft.jtitle=Artificial%20intelligence%20in%20medicine&rft.au=Sanchez%20Brea,%20Maria%20Luisa&rft.date=2016-07-01&rft.volume=71&rft.spage=30&rft.epage=42&rft.pages=30-42&rft.issn=0933-3657&rft.eissn=1873-2860&rft_id=info:doi/10.1016/j.artmed.2016.06.004&rft_dat=%3Cproquest_cross%3E1835629795%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1810870703&rft_id=info:pmid/27506129&rft_els_id=1_s2_0_S0933365716300215&rfr_iscdi=true |