Cross-validation and robustness of daylight glare metrics
This study evaluates the performance and robustness of 22 established and newly proposed glare prediction metrics. Experimental datasets of daylight-dominated workplaces in office-like test rooms were collected from studies by seven research groups in six different locations (Argentina, Denmark, Ger...
Gespeichert in:
Veröffentlicht in: | Lighting research & technology (London, England : 2001) England : 2001), 2019-11, Vol.51 (7), p.983-1013 |
---|---|
Hauptverfasser: | , , , , , , , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 1013 |
---|---|
container_issue | 7 |
container_start_page | 983 |
container_title | Lighting research & technology (London, England : 2001) |
container_volume | 51 |
creator | Wienold, J Iwata, T Sarey Khanie, M Erell, E Kaftan, E Rodriguez, RG Yamin Garreton, JA Tzempelikos, T Konstantzos, I Christoffersen, J Kuhn, TE Pierson, C Andersen, M |
description | This study evaluates the performance and robustness of 22 established and newly proposed glare prediction metrics. Experimental datasets of daylight-dominated workplaces in office-like test rooms were collected from studies by seven research groups in six different locations (Argentina, Denmark, Germany, Israel, Japan and the United States). The variability in experimental setups, locations and research teams allowed reliable evaluation of the performance and robustness of glare metrics for daylight-dominated workplaces. Independent statistical methods were applied to individual datasets and also to one combined dataset to evaluate the performance and robustness of the 22 glare metrics. As performance and robustness are not established in literature, we defined performance as: (1) the ability of the metric value to describe the glare scale (evaluated by Spearman rank correlation), and (2) the ability of the metric to distinguish between disturbing and non-disturbing situations (evaluated by diagnostic receiver operating characteristic curve analysis tests). Furthermore, we defined robustness as the ability of a metric to deliver meaningful results when applied to different datasets and to fail as few as possible statistical tests. Average Spearman rank correlation coefficients in the range of 0.55–0.60 as well as average prediction rates to distinguish between disturbing and non-disturbing glare of 70–75% for several of the metrics indicate their reliability. The results also show that metrics considering the saturation effect as a main input in their equation perform better and are more robust in daylight-dominated workplaces than purely contrast-based metrics or purely empirical metrics. In this study, the daylight glare probability (DGP) delivered the highest performance amongst the tested metrics and was also found to be the most robust. Future research should aim to optimise the terms of glare equations which combine contrast and saturation effects, such as DGP, PGSV or UGRexp, to achieve metrics that also perform reliably in dimmer lighting conditions than the ones explored in this study. |
doi_str_mv | 10.1177/1477153519826003 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2307362730</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sage_id>10.1177_1477153519826003</sage_id><sourcerecordid>2307362730</sourcerecordid><originalsourceid>FETCH-LOGICAL-c465t-7ddba1b681a86ecf2193f88368cd4a6dda675a179cf9ee72f34ced705bd3fd873</originalsourceid><addsrcrecordid>eNp1kM1Lw0AQxRdRsFbvHgOeo7uZZD-OEtQKBS96Xib7EVPSpO4mQv97E1IQBE8zzPzeG-YRcsvoPWNCPLBcCFZAwZTMOKVwRlbzKKUK5Pmpn_eX5CrGHaVMUoAVUWXoY0y_sW0sDk3fJdjZJPTVGIfOxZj0PrF4bJv6c0jqFoNL9m4IjYnX5MJjG93Nqa7Jx_PTe7lJt28vr-XjNjU5L4ZUWFshq7hkKLkzPmMKvJTApbE5cmuRiwKZUMYr50TmITfOClpUFryVAtbkbvE9hP5rdHHQu34M3XRSZ0AF8EwAnSi6UGb-JzivD6HZYzhqRvUckP4b0CRJF0nE2v2a_sv_AC_BZTo</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2307362730</pqid></control><display><type>article</type><title>Cross-validation and robustness of daylight glare metrics</title><source>Access via SAGE</source><source>Alma/SFX Local Collection</source><creator>Wienold, J ; Iwata, T ; Sarey Khanie, M ; Erell, E ; Kaftan, E ; Rodriguez, RG ; Yamin Garreton, JA ; Tzempelikos, T ; Konstantzos, I ; Christoffersen, J ; Kuhn, TE ; Pierson, C ; Andersen, M</creator><creatorcontrib>Wienold, J ; Iwata, T ; Sarey Khanie, M ; Erell, E ; Kaftan, E ; Rodriguez, RG ; Yamin Garreton, JA ; Tzempelikos, T ; Konstantzos, I ; Christoffersen, J ; Kuhn, TE ; Pierson, C ; Andersen, M</creatorcontrib><description>This study evaluates the performance and robustness of 22 established and newly proposed glare prediction metrics. Experimental datasets of daylight-dominated workplaces in office-like test rooms were collected from studies by seven research groups in six different locations (Argentina, Denmark, Germany, Israel, Japan and the United States). The variability in experimental setups, locations and research teams allowed reliable evaluation of the performance and robustness of glare metrics for daylight-dominated workplaces. Independent statistical methods were applied to individual datasets and also to one combined dataset to evaluate the performance and robustness of the 22 glare metrics. As performance and robustness are not established in literature, we defined performance as: (1) the ability of the metric value to describe the glare scale (evaluated by Spearman rank correlation), and (2) the ability of the metric to distinguish between disturbing and non-disturbing situations (evaluated by diagnostic receiver operating characteristic curve analysis tests). Furthermore, we defined robustness as the ability of a metric to deliver meaningful results when applied to different datasets and to fail as few as possible statistical tests. Average Spearman rank correlation coefficients in the range of 0.55–0.60 as well as average prediction rates to distinguish between disturbing and non-disturbing glare of 70–75% for several of the metrics indicate their reliability. The results also show that metrics considering the saturation effect as a main input in their equation perform better and are more robust in daylight-dominated workplaces than purely contrast-based metrics or purely empirical metrics. In this study, the daylight glare probability (DGP) delivered the highest performance amongst the tested metrics and was also found to be the most robust. Future research should aim to optimise the terms of glare equations which combine contrast and saturation effects, such as DGP, PGSV or UGRexp, to achieve metrics that also perform reliably in dimmer lighting conditions than the ones explored in this study.</description><identifier>ISSN: 1477-1535</identifier><identifier>EISSN: 1477-0938</identifier><identifier>DOI: 10.1177/1477153519826003</identifier><language>eng</language><publisher>London, England: SAGE Publications</publisher><subject>Correlation coefficients ; Datasets ; Daylight ; Diagnostic systems ; Empirical analysis ; Glare ; Performance evaluation ; Robustness ; Saturation ; Statistical analysis ; Statistical methods ; Statistical tests ; Workplaces</subject><ispartof>Lighting research & technology (London, England : 2001), 2019-11, Vol.51 (7), p.983-1013</ispartof><rights>The Chartered Institution of Building Services Engineers 2019</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c465t-7ddba1b681a86ecf2193f88368cd4a6dda675a179cf9ee72f34ced705bd3fd873</citedby><cites>FETCH-LOGICAL-c465t-7ddba1b681a86ecf2193f88368cd4a6dda675a179cf9ee72f34ced705bd3fd873</cites><orcidid>0000-0003-0855-3745 ; 0000-0001-7847-6568 ; 0000-0002-3723-0323</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://journals.sagepub.com/doi/pdf/10.1177/1477153519826003$$EPDF$$P50$$Gsage$$H</linktopdf><linktohtml>$$Uhttps://journals.sagepub.com/doi/10.1177/1477153519826003$$EHTML$$P50$$Gsage$$H</linktohtml><link.rule.ids>315,781,785,21824,27929,27930,43626,43627</link.rule.ids></links><search><creatorcontrib>Wienold, J</creatorcontrib><creatorcontrib>Iwata, T</creatorcontrib><creatorcontrib>Sarey Khanie, M</creatorcontrib><creatorcontrib>Erell, E</creatorcontrib><creatorcontrib>Kaftan, E</creatorcontrib><creatorcontrib>Rodriguez, RG</creatorcontrib><creatorcontrib>Yamin Garreton, JA</creatorcontrib><creatorcontrib>Tzempelikos, T</creatorcontrib><creatorcontrib>Konstantzos, I</creatorcontrib><creatorcontrib>Christoffersen, J</creatorcontrib><creatorcontrib>Kuhn, TE</creatorcontrib><creatorcontrib>Pierson, C</creatorcontrib><creatorcontrib>Andersen, M</creatorcontrib><title>Cross-validation and robustness of daylight glare metrics</title><title>Lighting research & technology (London, England : 2001)</title><description>This study evaluates the performance and robustness of 22 established and newly proposed glare prediction metrics. Experimental datasets of daylight-dominated workplaces in office-like test rooms were collected from studies by seven research groups in six different locations (Argentina, Denmark, Germany, Israel, Japan and the United States). The variability in experimental setups, locations and research teams allowed reliable evaluation of the performance and robustness of glare metrics for daylight-dominated workplaces. Independent statistical methods were applied to individual datasets and also to one combined dataset to evaluate the performance and robustness of the 22 glare metrics. As performance and robustness are not established in literature, we defined performance as: (1) the ability of the metric value to describe the glare scale (evaluated by Spearman rank correlation), and (2) the ability of the metric to distinguish between disturbing and non-disturbing situations (evaluated by diagnostic receiver operating characteristic curve analysis tests). Furthermore, we defined robustness as the ability of a metric to deliver meaningful results when applied to different datasets and to fail as few as possible statistical tests. Average Spearman rank correlation coefficients in the range of 0.55–0.60 as well as average prediction rates to distinguish between disturbing and non-disturbing glare of 70–75% for several of the metrics indicate their reliability. The results also show that metrics considering the saturation effect as a main input in their equation perform better and are more robust in daylight-dominated workplaces than purely contrast-based metrics or purely empirical metrics. In this study, the daylight glare probability (DGP) delivered the highest performance amongst the tested metrics and was also found to be the most robust. Future research should aim to optimise the terms of glare equations which combine contrast and saturation effects, such as DGP, PGSV or UGRexp, to achieve metrics that also perform reliably in dimmer lighting conditions than the ones explored in this study.</description><subject>Correlation coefficients</subject><subject>Datasets</subject><subject>Daylight</subject><subject>Diagnostic systems</subject><subject>Empirical analysis</subject><subject>Glare</subject><subject>Performance evaluation</subject><subject>Robustness</subject><subject>Saturation</subject><subject>Statistical analysis</subject><subject>Statistical methods</subject><subject>Statistical tests</subject><subject>Workplaces</subject><issn>1477-1535</issn><issn>1477-0938</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><recordid>eNp1kM1Lw0AQxRdRsFbvHgOeo7uZZD-OEtQKBS96Xib7EVPSpO4mQv97E1IQBE8zzPzeG-YRcsvoPWNCPLBcCFZAwZTMOKVwRlbzKKUK5Pmpn_eX5CrGHaVMUoAVUWXoY0y_sW0sDk3fJdjZJPTVGIfOxZj0PrF4bJv6c0jqFoNL9m4IjYnX5MJjG93Nqa7Jx_PTe7lJt28vr-XjNjU5L4ZUWFshq7hkKLkzPmMKvJTApbE5cmuRiwKZUMYr50TmITfOClpUFryVAtbkbvE9hP5rdHHQu34M3XRSZ0AF8EwAnSi6UGb-JzivD6HZYzhqRvUckP4b0CRJF0nE2v2a_sv_AC_BZTo</recordid><startdate>20191101</startdate><enddate>20191101</enddate><creator>Wienold, J</creator><creator>Iwata, T</creator><creator>Sarey Khanie, M</creator><creator>Erell, E</creator><creator>Kaftan, E</creator><creator>Rodriguez, RG</creator><creator>Yamin Garreton, JA</creator><creator>Tzempelikos, T</creator><creator>Konstantzos, I</creator><creator>Christoffersen, J</creator><creator>Kuhn, TE</creator><creator>Pierson, C</creator><creator>Andersen, M</creator><general>SAGE Publications</general><general>Sage Publications Ltd</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SP</scope><scope>8FD</scope><scope>FR3</scope><scope>KR7</scope><scope>L7M</scope><orcidid>https://orcid.org/0000-0003-0855-3745</orcidid><orcidid>https://orcid.org/0000-0001-7847-6568</orcidid><orcidid>https://orcid.org/0000-0002-3723-0323</orcidid></search><sort><creationdate>20191101</creationdate><title>Cross-validation and robustness of daylight glare metrics</title><author>Wienold, J ; Iwata, T ; Sarey Khanie, M ; Erell, E ; Kaftan, E ; Rodriguez, RG ; Yamin Garreton, JA ; Tzempelikos, T ; Konstantzos, I ; Christoffersen, J ; Kuhn, TE ; Pierson, C ; Andersen, M</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c465t-7ddba1b681a86ecf2193f88368cd4a6dda675a179cf9ee72f34ced705bd3fd873</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Correlation coefficients</topic><topic>Datasets</topic><topic>Daylight</topic><topic>Diagnostic systems</topic><topic>Empirical analysis</topic><topic>Glare</topic><topic>Performance evaluation</topic><topic>Robustness</topic><topic>Saturation</topic><topic>Statistical analysis</topic><topic>Statistical methods</topic><topic>Statistical tests</topic><topic>Workplaces</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Wienold, J</creatorcontrib><creatorcontrib>Iwata, T</creatorcontrib><creatorcontrib>Sarey Khanie, M</creatorcontrib><creatorcontrib>Erell, E</creatorcontrib><creatorcontrib>Kaftan, E</creatorcontrib><creatorcontrib>Rodriguez, RG</creatorcontrib><creatorcontrib>Yamin Garreton, JA</creatorcontrib><creatorcontrib>Tzempelikos, T</creatorcontrib><creatorcontrib>Konstantzos, I</creatorcontrib><creatorcontrib>Christoffersen, J</creatorcontrib><creatorcontrib>Kuhn, TE</creatorcontrib><creatorcontrib>Pierson, C</creatorcontrib><creatorcontrib>Andersen, M</creatorcontrib><collection>CrossRef</collection><collection>Electronics & Communications Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>Civil Engineering Abstracts</collection><collection>Advanced Technologies Database with Aerospace</collection><jtitle>Lighting research & technology (London, England : 2001)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Wienold, J</au><au>Iwata, T</au><au>Sarey Khanie, M</au><au>Erell, E</au><au>Kaftan, E</au><au>Rodriguez, RG</au><au>Yamin Garreton, JA</au><au>Tzempelikos, T</au><au>Konstantzos, I</au><au>Christoffersen, J</au><au>Kuhn, TE</au><au>Pierson, C</au><au>Andersen, M</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Cross-validation and robustness of daylight glare metrics</atitle><jtitle>Lighting research & technology (London, England : 2001)</jtitle><date>2019-11-01</date><risdate>2019</risdate><volume>51</volume><issue>7</issue><spage>983</spage><epage>1013</epage><pages>983-1013</pages><issn>1477-1535</issn><eissn>1477-0938</eissn><abstract>This study evaluates the performance and robustness of 22 established and newly proposed glare prediction metrics. Experimental datasets of daylight-dominated workplaces in office-like test rooms were collected from studies by seven research groups in six different locations (Argentina, Denmark, Germany, Israel, Japan and the United States). The variability in experimental setups, locations and research teams allowed reliable evaluation of the performance and robustness of glare metrics for daylight-dominated workplaces. Independent statistical methods were applied to individual datasets and also to one combined dataset to evaluate the performance and robustness of the 22 glare metrics. As performance and robustness are not established in literature, we defined performance as: (1) the ability of the metric value to describe the glare scale (evaluated by Spearman rank correlation), and (2) the ability of the metric to distinguish between disturbing and non-disturbing situations (evaluated by diagnostic receiver operating characteristic curve analysis tests). Furthermore, we defined robustness as the ability of a metric to deliver meaningful results when applied to different datasets and to fail as few as possible statistical tests. Average Spearman rank correlation coefficients in the range of 0.55–0.60 as well as average prediction rates to distinguish between disturbing and non-disturbing glare of 70–75% for several of the metrics indicate their reliability. The results also show that metrics considering the saturation effect as a main input in their equation perform better and are more robust in daylight-dominated workplaces than purely contrast-based metrics or purely empirical metrics. In this study, the daylight glare probability (DGP) delivered the highest performance amongst the tested metrics and was also found to be the most robust. Future research should aim to optimise the terms of glare equations which combine contrast and saturation effects, such as DGP, PGSV or UGRexp, to achieve metrics that also perform reliably in dimmer lighting conditions than the ones explored in this study.</abstract><cop>London, England</cop><pub>SAGE Publications</pub><doi>10.1177/1477153519826003</doi><tpages>31</tpages><orcidid>https://orcid.org/0000-0003-0855-3745</orcidid><orcidid>https://orcid.org/0000-0001-7847-6568</orcidid><orcidid>https://orcid.org/0000-0002-3723-0323</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1477-1535 |
ispartof | Lighting research & technology (London, England : 2001), 2019-11, Vol.51 (7), p.983-1013 |
issn | 1477-1535 1477-0938 |
language | eng |
recordid | cdi_proquest_journals_2307362730 |
source | Access via SAGE; Alma/SFX Local Collection |
subjects | Correlation coefficients Datasets Daylight Diagnostic systems Empirical analysis Glare Performance evaluation Robustness Saturation Statistical analysis Statistical methods Statistical tests Workplaces |
title | Cross-validation and robustness of daylight glare metrics |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-11T12%3A12%3A52IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Cross-validation%20and%20robustness%20of%20daylight%20glare%20metrics&rft.jtitle=Lighting%20research%20&%20technology%20(London,%20England%20:%202001)&rft.au=Wienold,%20J&rft.date=2019-11-01&rft.volume=51&rft.issue=7&rft.spage=983&rft.epage=1013&rft.pages=983-1013&rft.issn=1477-1535&rft.eissn=1477-0938&rft_id=info:doi/10.1177/1477153519826003&rft_dat=%3Cproquest_cross%3E2307362730%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2307362730&rft_id=info:pmid/&rft_sage_id=10.1177_1477153519826003&rfr_iscdi=true |