Use of a Multiscale Vision Transformer to Predict Nursing Activities Score From Low-Resolution Thermal Videos in an Intensive Care Unit

Excessive caregiver workload in hospital nurses has been implicated in poorer patient care and increased worker burnout. Measurement of this workload in the intensive care unit (ICU) is often done using the nursing activities score (NAS), but this is usually recorded manually and sporadically. Previ...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE sensors letters 2024-07, Vol.8 (7), p.1-4
Hauptverfasser:	Lee, Isaac YL, Nguyen-Duc, Thanh, Ueno, Ryo, Smith, Jesse, Chan, Peter Y
Format:	Artikel
Sprache:	eng
Schlagworte:	Accuracy Ambient intelligence Biomedical imaging Caregivers Computational modeling Computer vision Deep learning Feasibility studies Intensive care Medical services multiscale vision transformer (MViT) Nurses nursing activities score (NAS) nursing workload monitoring Predictions Sensor applications Sensors thermal imaging Transformers Vectors Video Workload Workloads
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	4
container_issue	7
container_start_page	1
container_title	IEEE sensors letters
container_volume	8
creator	Lee, Isaac YL Nguyen-Duc, Thanh Ueno, Ryo Smith, Jesse Chan, Peter Y
description	Excessive caregiver workload in hospital nurses has been implicated in poorer patient care and increased worker burnout. Measurement of this workload in the intensive care unit (ICU) is often done using the nursing activities score (NAS), but this is usually recorded manually and sporadically. Previous work has made use of ambient intelligence by using computer vision to passively derive caregiver-patient interaction times to monitor staff workload. In this letter, we propose using a multiscale vision transformer (MViT) to passively predict the NAS from low-resolution thermal videos recorded in an ICU. 458 videos were obtained from an ICU in Melbourne, Australia, and used to train an MViT v2 (MViTv2) model using an indirect prediction and a direct prediction method. The indirect method predicted one of eight potentially identifiable NAS activities from the video before inferring the NAS. The direct method predicted the NAS score immediately from the video. The indirect method yielded an average fivefold accuracy of 57.21%, an area under the receiver operating characteristic curve of 0.865, an F1 score of 0.570, and a mean squared error (MSE) of 28.16. The direct method yielded an MSE of 18.16. We also showed that the MViTv2 outperforms similar models, such as R(2 + 1)D and ResNet50-LSTM, under identical settings. This study shows the feasibility of using a MViTv2 to passively predict the NAS in an ICU and monitor staff workload automatically. Our abovementioned results also show an increased accuracy in predicting NAS directly versus predicting NAS indirectly. We hope that our study can provide a direction for future work and further improve the accuracy of passive NAS monitoring.
doi_str_mv	10.1109/LSENS.2024.3408320
format	Article
fullrecord	<record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_proquest_journals_3070779768</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10545536</ieee_id><sourcerecordid>3070779768</sourcerecordid><originalsourceid>FETCH-LOGICAL-c221t-eec3e809a9a15c0743cd6a711210d0f34ba50155fd0ff19b7834a42a928e3c5d3</originalsourceid><addsrcrecordid>eNpNkE1LAzEQhhdRUNQ_IB4CnrdOvprdYymtCrWKtV6XNDurkW1Sk6ziL_Bvu7U9eJoZeJ934MmyCwoDSqG8ni0m88WAARMDLqDgDA6yEyaUzKlQ7PDffpydx_gOALRgCjicZD_LiMQ3RJP7rk02Gt0iebHRekeeg3ax8WGNgSRPHgPW1iQy70K07pWMTLKfNlmMZGF8QDINfk1m_it_wujbLv11vGFY67avrNFHYh3Rjty5hC7aTyRj3XNLZ9NZdtToNuL5fp5my-nkeXybzx5u7sajWW4YoylHNBwLKHWpqTSgBDf1UCtKGYUaGi5WWgKVsumPhpYrVXChBdMlK5AbWfPT7GrXuwn-o8OYqnffBde_rDgoUKpUw6JPsV3KBB9jwKbaBLvW4buiUG2dV3_Oq63zau-8hy53kEXEf4AUUvIh_wXus35m</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3070779768</pqid></control><display><type>article</type><title>Use of a Multiscale Vision Transformer to Predict Nursing Activities Score From Low-Resolution Thermal Videos in an Intensive Care Unit</title><source>IEEE Electronic Library (IEL)</source><creator>Lee, Isaac YL ; Nguyen-Duc, Thanh ; Ueno, Ryo ; Smith, Jesse ; Chan, Peter Y</creator><creatorcontrib>Lee, Isaac YL ; Nguyen-Duc, Thanh ; Ueno, Ryo ; Smith, Jesse ; Chan, Peter Y</creatorcontrib><description>Excessive caregiver workload in hospital nurses has been implicated in poorer patient care and increased worker burnout. Measurement of this workload in the intensive care unit (ICU) is often done using the nursing activities score (NAS), but this is usually recorded manually and sporadically. Previous work has made use of ambient intelligence by using computer vision to passively derive caregiver-patient interaction times to monitor staff workload. In this letter, we propose using a multiscale vision transformer (MViT) to passively predict the NAS from low-resolution thermal videos recorded in an ICU. 458 videos were obtained from an ICU in Melbourne, Australia, and used to train an MViT v2 (MViTv2) model using an indirect prediction and a direct prediction method. The indirect method predicted one of eight potentially identifiable NAS activities from the video before inferring the NAS. The direct method predicted the NAS score immediately from the video. The indirect method yielded an average fivefold accuracy of 57.21%, an area under the receiver operating characteristic curve of 0.865, an F1 score of 0.570, and a mean squared error (MSE) of 28.16. The direct method yielded an MSE of 18.16. We also showed that the MViTv2 outperforms similar models, such as R(2 + 1)D and ResNet50-LSTM, under identical settings. This study shows the feasibility of using a MViTv2 to passively predict the NAS in an ICU and monitor staff workload automatically. Our abovementioned results also show an increased accuracy in predicting NAS directly versus predicting NAS indirectly. We hope that our study can provide a direction for future work and further improve the accuracy of passive NAS monitoring.</description><identifier>ISSN: 2475-1472</identifier><identifier>EISSN: 2475-1472</identifier><identifier>DOI: 10.1109/LSENS.2024.3408320</identifier><identifier>CODEN: ISLECD</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Accuracy ; Ambient intelligence ; Biomedical imaging ; Caregivers ; Computational modeling ; Computer vision ; Deep learning ; Feasibility studies ; Intensive care ; Medical services ; multiscale vision transformer (MViT) ; Nurses ; nursing activities score (NAS) ; nursing workload monitoring ; Predictions ; Sensor applications ; Sensors ; thermal imaging ; Transformers ; Vectors ; Video ; Workload ; Workloads</subject><ispartof>IEEE sensors letters, 2024-07, Vol.8 (7), p.1-4</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2024</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c221t-eec3e809a9a15c0743cd6a711210d0f34ba50155fd0ff19b7834a42a928e3c5d3</cites><orcidid>0009-0006-3351-5539 ; 0000-0003-4578-9394</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10545536$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,776,780,792,27903,27904,54736</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/10545536$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Lee, Isaac YL</creatorcontrib><creatorcontrib>Nguyen-Duc, Thanh</creatorcontrib><creatorcontrib>Ueno, Ryo</creatorcontrib><creatorcontrib>Smith, Jesse</creatorcontrib><creatorcontrib>Chan, Peter Y</creatorcontrib><title>Use of a Multiscale Vision Transformer to Predict Nursing Activities Score From Low-Resolution Thermal Videos in an Intensive Care Unit</title><title>IEEE sensors letters</title><addtitle>LSENS</addtitle><description>Excessive caregiver workload in hospital nurses has been implicated in poorer patient care and increased worker burnout. Measurement of this workload in the intensive care unit (ICU) is often done using the nursing activities score (NAS), but this is usually recorded manually and sporadically. Previous work has made use of ambient intelligence by using computer vision to passively derive caregiver-patient interaction times to monitor staff workload. In this letter, we propose using a multiscale vision transformer (MViT) to passively predict the NAS from low-resolution thermal videos recorded in an ICU. 458 videos were obtained from an ICU in Melbourne, Australia, and used to train an MViT v2 (MViTv2) model using an indirect prediction and a direct prediction method. The indirect method predicted one of eight potentially identifiable NAS activities from the video before inferring the NAS. The direct method predicted the NAS score immediately from the video. The indirect method yielded an average fivefold accuracy of 57.21%, an area under the receiver operating characteristic curve of 0.865, an F1 score of 0.570, and a mean squared error (MSE) of 28.16. The direct method yielded an MSE of 18.16. We also showed that the MViTv2 outperforms similar models, such as R(2 + 1)D and ResNet50-LSTM, under identical settings. This study shows the feasibility of using a MViTv2 to passively predict the NAS in an ICU and monitor staff workload automatically. Our abovementioned results also show an increased accuracy in predicting NAS directly versus predicting NAS indirectly. We hope that our study can provide a direction for future work and further improve the accuracy of passive NAS monitoring.</description><subject>Accuracy</subject><subject>Ambient intelligence</subject><subject>Biomedical imaging</subject><subject>Caregivers</subject><subject>Computational modeling</subject><subject>Computer vision</subject><subject>Deep learning</subject><subject>Feasibility studies</subject><subject>Intensive care</subject><subject>Medical services</subject><subject>multiscale vision transformer (MViT)</subject><subject>Nurses</subject><subject>nursing activities score (NAS)</subject><subject>nursing workload monitoring</subject><subject>Predictions</subject><subject>Sensor applications</subject><subject>Sensors</subject><subject>thermal imaging</subject><subject>Transformers</subject><subject>Vectors</subject><subject>Video</subject><subject>Workload</subject><subject>Workloads</subject><issn>2475-1472</issn><issn>2475-1472</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNpNkE1LAzEQhhdRUNQ_IB4CnrdOvprdYymtCrWKtV6XNDurkW1Sk6ziL_Bvu7U9eJoZeJ934MmyCwoDSqG8ni0m88WAARMDLqDgDA6yEyaUzKlQ7PDffpydx_gOALRgCjicZD_LiMQ3RJP7rk02Gt0iebHRekeeg3ax8WGNgSRPHgPW1iQy70K07pWMTLKfNlmMZGF8QDINfk1m_it_wujbLv11vGFY67avrNFHYh3Rjty5hC7aTyRj3XNLZ9NZdtToNuL5fp5my-nkeXybzx5u7sajWW4YoylHNBwLKHWpqTSgBDf1UCtKGYUaGi5WWgKVsumPhpYrVXChBdMlK5AbWfPT7GrXuwn-o8OYqnffBde_rDgoUKpUw6JPsV3KBB9jwKbaBLvW4buiUG2dV3_Oq63zau-8hy53kEXEf4AUUvIh_wXus35m</recordid><startdate>20240701</startdate><enddate>20240701</enddate><creator>Lee, Isaac YL</creator><creator>Nguyen-Duc, Thanh</creator><creator>Ueno, Ryo</creator><creator>Smith, Jesse</creator><creator>Chan, Peter Y</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SP</scope><scope>8FD</scope><scope>L7M</scope><orcidid>https://orcid.org/0009-0006-3351-5539</orcidid><orcidid>https://orcid.org/0000-0003-4578-9394</orcidid></search><sort><creationdate>20240701</creationdate><title>Use of a Multiscale Vision Transformer to Predict Nursing Activities Score From Low-Resolution Thermal Videos in an Intensive Care Unit</title><author>Lee, Isaac YL ; Nguyen-Duc, Thanh ; Ueno, Ryo ; Smith, Jesse ; Chan, Peter Y</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c221t-eec3e809a9a15c0743cd6a711210d0f34ba50155fd0ff19b7834a42a928e3c5d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Accuracy</topic><topic>Ambient intelligence</topic><topic>Biomedical imaging</topic><topic>Caregivers</topic><topic>Computational modeling</topic><topic>Computer vision</topic><topic>Deep learning</topic><topic>Feasibility studies</topic><topic>Intensive care</topic><topic>Medical services</topic><topic>multiscale vision transformer (MViT)</topic><topic>Nurses</topic><topic>nursing activities score (NAS)</topic><topic>nursing workload monitoring</topic><topic>Predictions</topic><topic>Sensor applications</topic><topic>Sensors</topic><topic>thermal imaging</topic><topic>Transformers</topic><topic>Vectors</topic><topic>Video</topic><topic>Workload</topic><topic>Workloads</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Lee, Isaac YL</creatorcontrib><creatorcontrib>Nguyen-Duc, Thanh</creatorcontrib><creatorcontrib>Ueno, Ryo</creatorcontrib><creatorcontrib>Smith, Jesse</creatorcontrib><creatorcontrib>Chan, Peter Y</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Electronics & Communications Abstracts</collection><collection>Technology Research Database</collection><collection>Advanced Technologies Database with Aerospace</collection><jtitle>IEEE sensors letters</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Lee, Isaac YL</au><au>Nguyen-Duc, Thanh</au><au>Ueno, Ryo</au><au>Smith, Jesse</au><au>Chan, Peter Y</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Use of a Multiscale Vision Transformer to Predict Nursing Activities Score From Low-Resolution Thermal Videos in an Intensive Care Unit</atitle><jtitle>IEEE sensors letters</jtitle><stitle>LSENS</stitle><date>2024-07-01</date><risdate>2024</risdate><volume>8</volume><issue>7</issue><spage>1</spage><epage>4</epage><pages>1-4</pages><issn>2475-1472</issn><eissn>2475-1472</eissn><coden>ISLECD</coden><abstract>Excessive caregiver workload in hospital nurses has been implicated in poorer patient care and increased worker burnout. Measurement of this workload in the intensive care unit (ICU) is often done using the nursing activities score (NAS), but this is usually recorded manually and sporadically. Previous work has made use of ambient intelligence by using computer vision to passively derive caregiver-patient interaction times to monitor staff workload. In this letter, we propose using a multiscale vision transformer (MViT) to passively predict the NAS from low-resolution thermal videos recorded in an ICU. 458 videos were obtained from an ICU in Melbourne, Australia, and used to train an MViT v2 (MViTv2) model using an indirect prediction and a direct prediction method. The indirect method predicted one of eight potentially identifiable NAS activities from the video before inferring the NAS. The direct method predicted the NAS score immediately from the video. The indirect method yielded an average fivefold accuracy of 57.21%, an area under the receiver operating characteristic curve of 0.865, an F1 score of 0.570, and a mean squared error (MSE) of 28.16. The direct method yielded an MSE of 18.16. We also showed that the MViTv2 outperforms similar models, such as R(2 + 1)D and ResNet50-LSTM, under identical settings. This study shows the feasibility of using a MViTv2 to passively predict the NAS in an ICU and monitor staff workload automatically. Our abovementioned results also show an increased accuracy in predicting NAS directly versus predicting NAS indirectly. We hope that our study can provide a direction for future work and further improve the accuracy of passive NAS monitoring.</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/LSENS.2024.3408320</doi><tpages>4</tpages><orcidid>https://orcid.org/0009-0006-3351-5539</orcidid><orcidid>https://orcid.org/0000-0003-4578-9394</orcidid><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	ISSN: 2475-1472
ispartof	IEEE sensors letters, 2024-07, Vol.8 (7), p.1-4
issn	2475-1472 2475-1472
language	eng
recordid	cdi_proquest_journals_3070779768
source	IEEE Electronic Library (IEL)
subjects	Accuracy Ambient intelligence Biomedical imaging Caregivers Computational modeling Computer vision Deep learning Feasibility studies Intensive care Medical services multiscale vision transformer (MViT) Nurses nursing activities score (NAS) nursing workload monitoring Predictions Sensor applications Sensors thermal imaging Transformers Vectors Video Workload Workloads
title	Use of a Multiscale Vision Transformer to Predict Nursing Activities Score From Low-Resolution Thermal Videos in an Intensive Care Unit
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-26T17%3A01%3A10IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Use%20of%20a%20Multiscale%20Vision%20Transformer%20to%20Predict%20Nursing%20Activities%20Score%20From%20Low-Resolution%20Thermal%20Videos%20in%20an%20Intensive%20Care%20Unit&rft.jtitle=IEEE%20sensors%20letters&rft.au=Lee,%20Isaac%20YL&rft.date=2024-07-01&rft.volume=8&rft.issue=7&rft.spage=1&rft.epage=4&rft.pages=1-4&rft.issn=2475-1472&rft.eissn=2475-1472&rft.coden=ISLECD&rft_id=info:doi/10.1109/LSENS.2024.3408320&rft_dat=%3Cproquest_RIE%3E3070779768%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3070779768&rft_id=info:pmid/&rft_ieee_id=10545536&rfr_iscdi=true