On the development of conjunctival hyperemia computer-assisted diagnosis tools: influence of feature selection and class imbalance in automatic gradings

Abstract Objective The sudden increase of blood flow in the bulbar conjunctiva, known as hyperemia, is associated to a red hue of variable intensity. Experts measure hyperemia using levels in a grading scale, a procedure that is subjective, non-repeatable and time consuming, thus creating a need for...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Artificial intelligence in medicine 2016-07, Vol.71, p.30-42
Hauptverfasser: Sanchez Brea, Maria Luisa, Barreira Rodriguez, Noelia, Sanchez Marono, Noelia, Mosquera Gonzalez, Antonio, Garcia-Resua, Carlos, Giraldez Fernandez, Maria Jesus
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 42
container_issue
container_start_page 30
container_title Artificial intelligence in medicine
container_volume 71
creator Sanchez Brea, Maria Luisa
Barreira Rodriguez, Noelia
Sanchez Marono, Noelia
Mosquera Gonzalez, Antonio
Garcia-Resua, Carlos
Giraldez Fernandez, Maria Jesus
description Abstract Objective The sudden increase of blood flow in the bulbar conjunctiva, known as hyperemia, is associated to a red hue of variable intensity. Experts measure hyperemia using levels in a grading scale, a procedure that is subjective, non-repeatable and time consuming, thus creating a need for its automatisation. However, the task is far from straightforward due to data issues such as class imbalance or correlated features. In this paper, we study the specific features of hyperemia and propose various approaches to address these problems in the context of an automatic framework for hyperemia grading. Methodology Oversampling, undersampling and SMOTE approaches were applied in order to tackle the problem of class imbalance. 25 features were computed for each image and regression methods were then used to transform them into a value on the grading scale. The values and relationships among features and experts’ values were analysed, and five feature selection techniques were subsequently studied. Results The lowest mean square error (MSE) for the regression systems trained with individual features is below 0.1 for both scales. Multi-layer perceptron (MLP) obtains the best values, but is less consistent than the random forest (RF) method. When all features are combined, the best results for both scales are achieved with MLP. Correlation based feature selection (CFS) and M5 provide the best results, MSE = 0.108 and MSE = 0.061 respectively. Finally, the class imbalance problem is minimised with the SMOTE approach for both scales (MSE < 0.006). Conclusions Machine learning methods are able to perform an objective assessment of hyperemia grading, removing both intra- and inter-expert subjectivity while providing a gain in computation time. SMOTE and oversampling approaches minimise the class imbalance problem, while feature selection reduces the number of features from 25 to 3-5 without worsening the MSE. As the differences between the system and a human expert are similar to the differences between experts, we can therefore conclude that the system behaves like an expert.
doi_str_mv 10.1016/j.artmed.2016.06.004
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1835629795</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>1_s2_0_S0933365716300215</els_id><sourcerecordid>1835629795</sourcerecordid><originalsourceid>FETCH-LOGICAL-c483t-9e55970a96a0d76dd60437051a8280c69eb32789be3f82630c697ceb114f905c3</originalsourceid><addsrcrecordid>eNqNUk2LFDEQbURxx9F_IJKjlx4rne6k40GQxS9Y2IN6DumkejZjdzIm6YH5J_5c08zuxYsLBaFeXr1K6lVVvaawo0D5u8NOxzyj3TUl20EJaJ9UG9oLVjc9h6fVBiRjNeOduKpepHQAANFS_ry6akQHnDZyU_259STfIbF4wikcZ_SZhJGY4A-LN9md9ETuzkeMODtd4Pm4ZIy1TsmljJZYp_c-lITkEKb0njg_Tgt6g6vMiDovEUnCCYtY8ER7S8xUyombBz3plegKvOQw6-wM2Udtnd-nl9WzUU8JX92f2-rn508_rr_WN7dfvl1_vKlN27NcS-w6KUBLrsEKbi2HlgnoqO6bHgyXOLBG9HJANvYNZyskDA6UtqOEzrBt9faie4zh94Ipq9klg1N5GoYlKdqzjjdSyO4RVNqJMmPJH0OFXoAAVqjthWpiSCniqI7RzTqeFQW1Oq0O6uK0Wp1WUKL8cVu9ue-wDOvdQ9GDtYXw4ULAMr2Tw6iScasx1sVihrLB_a_DvwJmct4ZPf3CM6ZDWKIvziiqUqNAfV-3bV02WqYMDe3YXwc40pA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1810870703</pqid></control><display><type>article</type><title>On the development of conjunctival hyperemia computer-assisted diagnosis tools: influence of feature selection and class imbalance in automatic gradings</title><source>MEDLINE</source><source>Elsevier ScienceDirect Journals Complete</source><creator>Sanchez Brea, Maria Luisa ; Barreira Rodriguez, Noelia ; Sanchez Marono, Noelia ; Mosquera Gonzalez, Antonio ; Garcia-Resua, Carlos ; Giraldez Fernandez, Maria Jesus</creator><creatorcontrib>Sanchez Brea, Maria Luisa ; Barreira Rodriguez, Noelia ; Sanchez Marono, Noelia ; Mosquera Gonzalez, Antonio ; Garcia-Resua, Carlos ; Giraldez Fernandez, Maria Jesus</creatorcontrib><description>Abstract Objective The sudden increase of blood flow in the bulbar conjunctiva, known as hyperemia, is associated to a red hue of variable intensity. Experts measure hyperemia using levels in a grading scale, a procedure that is subjective, non-repeatable and time consuming, thus creating a need for its automatisation. However, the task is far from straightforward due to data issues such as class imbalance or correlated features. In this paper, we study the specific features of hyperemia and propose various approaches to address these problems in the context of an automatic framework for hyperemia grading. Methodology Oversampling, undersampling and SMOTE approaches were applied in order to tackle the problem of class imbalance. 25 features were computed for each image and regression methods were then used to transform them into a value on the grading scale. The values and relationships among features and experts’ values were analysed, and five feature selection techniques were subsequently studied. Results The lowest mean square error (MSE) for the regression systems trained with individual features is below 0.1 for both scales. Multi-layer perceptron (MLP) obtains the best values, but is less consistent than the random forest (RF) method. When all features are combined, the best results for both scales are achieved with MLP. Correlation based feature selection (CFS) and M5 provide the best results, MSE = 0.108 and MSE = 0.061 respectively. Finally, the class imbalance problem is minimised with the SMOTE approach for both scales (MSE &lt; 0.006). Conclusions Machine learning methods are able to perform an objective assessment of hyperemia grading, removing both intra- and inter-expert subjectivity while providing a gain in computation time. SMOTE and oversampling approaches minimise the class imbalance problem, while feature selection reduces the number of features from 25 to 3-5 without worsening the MSE. As the differences between the system and a human expert are similar to the differences between experts, we can therefore conclude that the system behaves like an expert.</description><identifier>ISSN: 0933-3657</identifier><identifier>EISSN: 1873-2860</identifier><identifier>DOI: 10.1016/j.artmed.2016.06.004</identifier><identifier>PMID: 27506129</identifier><language>eng</language><publisher>Netherlands: Elsevier B.V</publisher><subject>Assessments ; Automation ; Computation ; Correlation ; Correlation-based feature selection ; Diagnosis, Computer-Assisted ; Evaluation ; Grading ; Humans ; Hyperemia ; Hyperemia grading ; Image processing ; Internal Medicine ; Multi-layer perceptron ; Neural Networks (Computer) ; Other ; Oversampling ; Radial basis function network ; Random forests ; Regression ; Regression Analysis ; Relief ; SMOReg ; SMOTE ; Undersampling</subject><ispartof>Artificial intelligence in medicine, 2016-07, Vol.71, p.30-42</ispartof><rights>Elsevier B.V.</rights><rights>2016 Elsevier B.V.</rights><rights>Copyright © 2016 Elsevier B.V. All rights reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c483t-9e55970a96a0d76dd60437051a8280c69eb32789be3f82630c697ceb114f905c3</citedby><cites>FETCH-LOGICAL-c483t-9e55970a96a0d76dd60437051a8280c69eb32789be3f82630c697ceb114f905c3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S0933365716300215$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3536,27903,27904,65309</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/27506129$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Sanchez Brea, Maria Luisa</creatorcontrib><creatorcontrib>Barreira Rodriguez, Noelia</creatorcontrib><creatorcontrib>Sanchez Marono, Noelia</creatorcontrib><creatorcontrib>Mosquera Gonzalez, Antonio</creatorcontrib><creatorcontrib>Garcia-Resua, Carlos</creatorcontrib><creatorcontrib>Giraldez Fernandez, Maria Jesus</creatorcontrib><title>On the development of conjunctival hyperemia computer-assisted diagnosis tools: influence of feature selection and class imbalance in automatic gradings</title><title>Artificial intelligence in medicine</title><addtitle>Artif Intell Med</addtitle><description>Abstract Objective The sudden increase of blood flow in the bulbar conjunctiva, known as hyperemia, is associated to a red hue of variable intensity. Experts measure hyperemia using levels in a grading scale, a procedure that is subjective, non-repeatable and time consuming, thus creating a need for its automatisation. However, the task is far from straightforward due to data issues such as class imbalance or correlated features. In this paper, we study the specific features of hyperemia and propose various approaches to address these problems in the context of an automatic framework for hyperemia grading. Methodology Oversampling, undersampling and SMOTE approaches were applied in order to tackle the problem of class imbalance. 25 features were computed for each image and regression methods were then used to transform them into a value on the grading scale. The values and relationships among features and experts’ values were analysed, and five feature selection techniques were subsequently studied. Results The lowest mean square error (MSE) for the regression systems trained with individual features is below 0.1 for both scales. Multi-layer perceptron (MLP) obtains the best values, but is less consistent than the random forest (RF) method. When all features are combined, the best results for both scales are achieved with MLP. Correlation based feature selection (CFS) and M5 provide the best results, MSE = 0.108 and MSE = 0.061 respectively. Finally, the class imbalance problem is minimised with the SMOTE approach for both scales (MSE &lt; 0.006). Conclusions Machine learning methods are able to perform an objective assessment of hyperemia grading, removing both intra- and inter-expert subjectivity while providing a gain in computation time. SMOTE and oversampling approaches minimise the class imbalance problem, while feature selection reduces the number of features from 25 to 3-5 without worsening the MSE. As the differences between the system and a human expert are similar to the differences between experts, we can therefore conclude that the system behaves like an expert.</description><subject>Assessments</subject><subject>Automation</subject><subject>Computation</subject><subject>Correlation</subject><subject>Correlation-based feature selection</subject><subject>Diagnosis, Computer-Assisted</subject><subject>Evaluation</subject><subject>Grading</subject><subject>Humans</subject><subject>Hyperemia</subject><subject>Hyperemia grading</subject><subject>Image processing</subject><subject>Internal Medicine</subject><subject>Multi-layer perceptron</subject><subject>Neural Networks (Computer)</subject><subject>Other</subject><subject>Oversampling</subject><subject>Radial basis function network</subject><subject>Random forests</subject><subject>Regression</subject><subject>Regression Analysis</subject><subject>Relief</subject><subject>SMOReg</subject><subject>SMOTE</subject><subject>Undersampling</subject><issn>0933-3657</issn><issn>1873-2860</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqNUk2LFDEQbURxx9F_IJKjlx4rne6k40GQxS9Y2IN6DumkejZjdzIm6YH5J_5c08zuxYsLBaFeXr1K6lVVvaawo0D5u8NOxzyj3TUl20EJaJ9UG9oLVjc9h6fVBiRjNeOduKpepHQAANFS_ry6akQHnDZyU_259STfIbF4wikcZ_SZhJGY4A-LN9md9ETuzkeMODtd4Pm4ZIy1TsmljJZYp_c-lITkEKb0njg_Tgt6g6vMiDovEUnCCYtY8ER7S8xUyombBz3plegKvOQw6-wM2Udtnd-nl9WzUU8JX92f2-rn508_rr_WN7dfvl1_vKlN27NcS-w6KUBLrsEKbi2HlgnoqO6bHgyXOLBG9HJANvYNZyskDA6UtqOEzrBt9faie4zh94Ipq9klg1N5GoYlKdqzjjdSyO4RVNqJMmPJH0OFXoAAVqjthWpiSCniqI7RzTqeFQW1Oq0O6uK0Wp1WUKL8cVu9ue-wDOvdQ9GDtYXw4ULAMr2Tw6iScasx1sVihrLB_a_DvwJmct4ZPf3CM6ZDWKIvziiqUqNAfV-3bV02WqYMDe3YXwc40pA</recordid><startdate>20160701</startdate><enddate>20160701</enddate><creator>Sanchez Brea, Maria Luisa</creator><creator>Barreira Rodriguez, Noelia</creator><creator>Sanchez Marono, Noelia</creator><creator>Mosquera Gonzalez, Antonio</creator><creator>Garcia-Resua, Carlos</creator><creator>Giraldez Fernandez, Maria Jesus</creator><general>Elsevier B.V</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>7QO</scope><scope>8FD</scope><scope>FR3</scope><scope>P64</scope><scope>7SC</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20160701</creationdate><title>On the development of conjunctival hyperemia computer-assisted diagnosis tools: influence of feature selection and class imbalance in automatic gradings</title><author>Sanchez Brea, Maria Luisa ; Barreira Rodriguez, Noelia ; Sanchez Marono, Noelia ; Mosquera Gonzalez, Antonio ; Garcia-Resua, Carlos ; Giraldez Fernandez, Maria Jesus</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c483t-9e55970a96a0d76dd60437051a8280c69eb32789be3f82630c697ceb114f905c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Assessments</topic><topic>Automation</topic><topic>Computation</topic><topic>Correlation</topic><topic>Correlation-based feature selection</topic><topic>Diagnosis, Computer-Assisted</topic><topic>Evaluation</topic><topic>Grading</topic><topic>Humans</topic><topic>Hyperemia</topic><topic>Hyperemia grading</topic><topic>Image processing</topic><topic>Internal Medicine</topic><topic>Multi-layer perceptron</topic><topic>Neural Networks (Computer)</topic><topic>Other</topic><topic>Oversampling</topic><topic>Radial basis function network</topic><topic>Random forests</topic><topic>Regression</topic><topic>Regression Analysis</topic><topic>Relief</topic><topic>SMOReg</topic><topic>SMOTE</topic><topic>Undersampling</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Sanchez Brea, Maria Luisa</creatorcontrib><creatorcontrib>Barreira Rodriguez, Noelia</creatorcontrib><creatorcontrib>Sanchez Marono, Noelia</creatorcontrib><creatorcontrib>Mosquera Gonzalez, Antonio</creatorcontrib><creatorcontrib>Garcia-Resua, Carlos</creatorcontrib><creatorcontrib>Giraldez Fernandez, Maria Jesus</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>Biotechnology Research Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Computer and Information Systems Abstracts</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Artificial intelligence in medicine</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Sanchez Brea, Maria Luisa</au><au>Barreira Rodriguez, Noelia</au><au>Sanchez Marono, Noelia</au><au>Mosquera Gonzalez, Antonio</au><au>Garcia-Resua, Carlos</au><au>Giraldez Fernandez, Maria Jesus</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>On the development of conjunctival hyperemia computer-assisted diagnosis tools: influence of feature selection and class imbalance in automatic gradings</atitle><jtitle>Artificial intelligence in medicine</jtitle><addtitle>Artif Intell Med</addtitle><date>2016-07-01</date><risdate>2016</risdate><volume>71</volume><spage>30</spage><epage>42</epage><pages>30-42</pages><issn>0933-3657</issn><eissn>1873-2860</eissn><abstract>Abstract Objective The sudden increase of blood flow in the bulbar conjunctiva, known as hyperemia, is associated to a red hue of variable intensity. Experts measure hyperemia using levels in a grading scale, a procedure that is subjective, non-repeatable and time consuming, thus creating a need for its automatisation. However, the task is far from straightforward due to data issues such as class imbalance or correlated features. In this paper, we study the specific features of hyperemia and propose various approaches to address these problems in the context of an automatic framework for hyperemia grading. Methodology Oversampling, undersampling and SMOTE approaches were applied in order to tackle the problem of class imbalance. 25 features were computed for each image and regression methods were then used to transform them into a value on the grading scale. The values and relationships among features and experts’ values were analysed, and five feature selection techniques were subsequently studied. Results The lowest mean square error (MSE) for the regression systems trained with individual features is below 0.1 for both scales. Multi-layer perceptron (MLP) obtains the best values, but is less consistent than the random forest (RF) method. When all features are combined, the best results for both scales are achieved with MLP. Correlation based feature selection (CFS) and M5 provide the best results, MSE = 0.108 and MSE = 0.061 respectively. Finally, the class imbalance problem is minimised with the SMOTE approach for both scales (MSE &lt; 0.006). Conclusions Machine learning methods are able to perform an objective assessment of hyperemia grading, removing both intra- and inter-expert subjectivity while providing a gain in computation time. SMOTE and oversampling approaches minimise the class imbalance problem, while feature selection reduces the number of features from 25 to 3-5 without worsening the MSE. As the differences between the system and a human expert are similar to the differences between experts, we can therefore conclude that the system behaves like an expert.</abstract><cop>Netherlands</cop><pub>Elsevier B.V</pub><pmid>27506129</pmid><doi>10.1016/j.artmed.2016.06.004</doi><tpages>13</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0933-3657
ispartof Artificial intelligence in medicine, 2016-07, Vol.71, p.30-42
issn 0933-3657
1873-2860
language eng
recordid cdi_proquest_miscellaneous_1835629795
source MEDLINE; Elsevier ScienceDirect Journals Complete
subjects Assessments
Automation
Computation
Correlation
Correlation-based feature selection
Diagnosis, Computer-Assisted
Evaluation
Grading
Humans
Hyperemia
Hyperemia grading
Image processing
Internal Medicine
Multi-layer perceptron
Neural Networks (Computer)
Other
Oversampling
Radial basis function network
Random forests
Regression
Regression Analysis
Relief
SMOReg
SMOTE
Undersampling
title On the development of conjunctival hyperemia computer-assisted diagnosis tools: influence of feature selection and class imbalance in automatic gradings
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-24T17%3A01%3A28IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=On%20the%20development%20of%20conjunctival%20hyperemia%20computer-assisted%20diagnosis%20tools:%20influence%20of%20feature%20selection%20and%20class%20imbalance%20in%20automatic%20gradings&rft.jtitle=Artificial%20intelligence%20in%20medicine&rft.au=Sanchez%20Brea,%20Maria%20Luisa&rft.date=2016-07-01&rft.volume=71&rft.spage=30&rft.epage=42&rft.pages=30-42&rft.issn=0933-3657&rft.eissn=1873-2860&rft_id=info:doi/10.1016/j.artmed.2016.06.004&rft_dat=%3Cproquest_cross%3E1835629795%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1810870703&rft_id=info:pmid/27506129&rft_els_id=1_s2_0_S0933365716300215&rfr_iscdi=true