Sound-based multiple-equipment activity recognition using convolutional neural networks

Automatically recognizing activities of heavy construction equipment using sound data has recently received considerable attention as a promising research area in construction. Although existing methods are effective, they only focus on tracking the activities of one single piece of equipment. On co...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Automation in construction 2022-03, Vol.135, p.104104, Article 104104
Hauptverfasser: Sherafat, Behnam, Rashidi, Abbas, Asgari, Sadegh
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page 104104
container_title Automation in construction
container_volume 135
creator Sherafat, Behnam
Rashidi, Abbas
Asgari, Sadegh
description Automatically recognizing activities of heavy construction equipment using sound data has recently received considerable attention as a promising research area in construction. Although existing methods are effective, they only focus on tracking the activities of one single piece of equipment. On construction job sites, multiple equipment sound signals are mixed in the environment; Thus, there is a need for a robust method to recognize these activities that are taking place simultaneously. To address this shortcoming, we proposed a multi-label multi-level sound classification method based on Short-Time Fourier Transform (STFT) and Convolutional Neural Network (CNN) that only requires a single-channel off-the-shelf microphone. In addition, we developed a data augmentation method to simulate real-world equipment sound mixtures. We tested the proposed method on both synthetic and real-world equipment sound mixtures. The results of our study showed that this method was effective in identifying activities of multiple pieces of equipment on real construction job sites without the need for separating sound signals in advance. Future studies can focus on other potential applications of sound signal processing in the construction domain, including analyzing engine abnormalities and monitoring environmental performance of the equipment. •This study introduces a novel method for multiple construction equipment activity recognition using off-the-shelf single-channel microphone.•This method utilizes sound-based image signatures to extract specific features for equipment activities.•The method identifies the activities directly without the need for separating sound signals.•Also, a novel data augmentation method is introduced to simulate the real-world sound mixtures when multiple machines performing activities simultaneously.•This method is tested on both synthetic and real-world mixed data and the results are promising.
doi_str_mv 10.1016/j.autcon.2021.104104
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2641056599</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0926580521005550</els_id><sourcerecordid>2641056599</sourcerecordid><originalsourceid>FETCH-LOGICAL-c380t-35fb4cf1d52bf02da3ecb3718f1e59efef41753533be1709c319832daab8bed33</originalsourceid><addsrcrecordid>eNp9kE1LxDAQhoMouK7-Aw8Fz12TpmmbiyCLX7DgQcVjSNPJktptdvOxsv_erPUsDAy8vPPOzIPQNcELgkl12y9kDMqOiwIXJEllqhM0I01d5HXDySmaYV5UOWswO0cX3vcY4xpXfIY-32wcu7yVHrpsE4dgtgPksItmu4ExZFIFszfhkDlQdj2aYOyYRW_GdZYW7u0Qj4ocshGi-23h27ovf4nOtBw8XP31Ofp4fHhfPuer16eX5f0qV7TBIadMt6XSpGNFq3HRSQqqpTVpNAHGQYMuSc0oo7QFUmOuKOENTT7ZNi10lM7RzZS7dXYXwQfR2-jSQV4UVeLAKsZ5cpWTSznrvQMtts5spDsIgsURoejFhFAcEYoJYRq7m8YgfbA34IRXBkYFnUk4guis-T_gB61Lfn0</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2641056599</pqid></control><display><type>article</type><title>Sound-based multiple-equipment activity recognition using convolutional neural networks</title><source>Elsevier ScienceDirect Journals</source><creator>Sherafat, Behnam ; Rashidi, Abbas ; Asgari, Sadegh</creator><creatorcontrib>Sherafat, Behnam ; Rashidi, Abbas ; Asgari, Sadegh</creatorcontrib><description>Automatically recognizing activities of heavy construction equipment using sound data has recently received considerable attention as a promising research area in construction. Although existing methods are effective, they only focus on tracking the activities of one single piece of equipment. On construction job sites, multiple equipment sound signals are mixed in the environment; Thus, there is a need for a robust method to recognize these activities that are taking place simultaneously. To address this shortcoming, we proposed a multi-label multi-level sound classification method based on Short-Time Fourier Transform (STFT) and Convolutional Neural Network (CNN) that only requires a single-channel off-the-shelf microphone. In addition, we developed a data augmentation method to simulate real-world equipment sound mixtures. We tested the proposed method on both synthetic and real-world equipment sound mixtures. The results of our study showed that this method was effective in identifying activities of multiple pieces of equipment on real construction job sites without the need for separating sound signals in advance. Future studies can focus on other potential applications of sound signal processing in the construction domain, including analyzing engine abnormalities and monitoring environmental performance of the equipment. •This study introduces a novel method for multiple construction equipment activity recognition using off-the-shelf single-channel microphone.•This method utilizes sound-based image signatures to extract specific features for equipment activities.•The method identifies the activities directly without the need for separating sound signals.•Also, a novel data augmentation method is introduced to simulate the real-world sound mixtures when multiple machines performing activities simultaneously.•This method is tested on both synthetic and real-world mixed data and the results are promising.</description><identifier>ISSN: 0926-5805</identifier><identifier>EISSN: 1872-7891</identifier><identifier>DOI: 10.1016/j.autcon.2021.104104</identifier><language>eng</language><publisher>Amsterdam: Elsevier B.V</publisher><subject>2D spectrogram ; Abnormalities ; Activity recognition ; Artificial neural networks ; Construction ; Construction equipment ; Construction sites ; Convolutional neural network (CNN) ; Data augmentation ; Deep learning ; Fourier transforms ; Heavy construction ; Heavy equipment ; Mixtures ; Multi-label sound classification ; Neural networks ; Short time Fourier transform (STFT) ; Signal processing ; Sound</subject><ispartof>Automation in construction, 2022-03, Vol.135, p.104104, Article 104104</ispartof><rights>2021</rights><rights>Copyright Elsevier BV Mar 2022</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c380t-35fb4cf1d52bf02da3ecb3718f1e59efef41753533be1709c319832daab8bed33</citedby><cites>FETCH-LOGICAL-c380t-35fb4cf1d52bf02da3ecb3718f1e59efef41753533be1709c319832daab8bed33</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S0926580521005550$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65306</link.rule.ids></links><search><creatorcontrib>Sherafat, Behnam</creatorcontrib><creatorcontrib>Rashidi, Abbas</creatorcontrib><creatorcontrib>Asgari, Sadegh</creatorcontrib><title>Sound-based multiple-equipment activity recognition using convolutional neural networks</title><title>Automation in construction</title><description>Automatically recognizing activities of heavy construction equipment using sound data has recently received considerable attention as a promising research area in construction. Although existing methods are effective, they only focus on tracking the activities of one single piece of equipment. On construction job sites, multiple equipment sound signals are mixed in the environment; Thus, there is a need for a robust method to recognize these activities that are taking place simultaneously. To address this shortcoming, we proposed a multi-label multi-level sound classification method based on Short-Time Fourier Transform (STFT) and Convolutional Neural Network (CNN) that only requires a single-channel off-the-shelf microphone. In addition, we developed a data augmentation method to simulate real-world equipment sound mixtures. We tested the proposed method on both synthetic and real-world equipment sound mixtures. The results of our study showed that this method was effective in identifying activities of multiple pieces of equipment on real construction job sites without the need for separating sound signals in advance. Future studies can focus on other potential applications of sound signal processing in the construction domain, including analyzing engine abnormalities and monitoring environmental performance of the equipment. •This study introduces a novel method for multiple construction equipment activity recognition using off-the-shelf single-channel microphone.•This method utilizes sound-based image signatures to extract specific features for equipment activities.•The method identifies the activities directly without the need for separating sound signals.•Also, a novel data augmentation method is introduced to simulate the real-world sound mixtures when multiple machines performing activities simultaneously.•This method is tested on both synthetic and real-world mixed data and the results are promising.</description><subject>2D spectrogram</subject><subject>Abnormalities</subject><subject>Activity recognition</subject><subject>Artificial neural networks</subject><subject>Construction</subject><subject>Construction equipment</subject><subject>Construction sites</subject><subject>Convolutional neural network (CNN)</subject><subject>Data augmentation</subject><subject>Deep learning</subject><subject>Fourier transforms</subject><subject>Heavy construction</subject><subject>Heavy equipment</subject><subject>Mixtures</subject><subject>Multi-label sound classification</subject><subject>Neural networks</subject><subject>Short time Fourier transform (STFT)</subject><subject>Signal processing</subject><subject>Sound</subject><issn>0926-5805</issn><issn>1872-7891</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><recordid>eNp9kE1LxDAQhoMouK7-Aw8Fz12TpmmbiyCLX7DgQcVjSNPJktptdvOxsv_erPUsDAy8vPPOzIPQNcELgkl12y9kDMqOiwIXJEllqhM0I01d5HXDySmaYV5UOWswO0cX3vcY4xpXfIY-32wcu7yVHrpsE4dgtgPksItmu4ExZFIFszfhkDlQdj2aYOyYRW_GdZYW7u0Qj4ocshGi-23h27ovf4nOtBw8XP31Ofp4fHhfPuer16eX5f0qV7TBIadMt6XSpGNFq3HRSQqqpTVpNAHGQYMuSc0oo7QFUmOuKOENTT7ZNi10lM7RzZS7dXYXwQfR2-jSQV4UVeLAKsZ5cpWTSznrvQMtts5spDsIgsURoejFhFAcEYoJYRq7m8YgfbA34IRXBkYFnUk4guis-T_gB61Lfn0</recordid><startdate>202203</startdate><enddate>202203</enddate><creator>Sherafat, Behnam</creator><creator>Rashidi, Abbas</creator><creator>Asgari, Sadegh</creator><general>Elsevier B.V</general><general>Elsevier BV</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>FR3</scope><scope>JQ2</scope><scope>KR7</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>202203</creationdate><title>Sound-based multiple-equipment activity recognition using convolutional neural networks</title><author>Sherafat, Behnam ; Rashidi, Abbas ; Asgari, Sadegh</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c380t-35fb4cf1d52bf02da3ecb3718f1e59efef41753533be1709c319832daab8bed33</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>2D spectrogram</topic><topic>Abnormalities</topic><topic>Activity recognition</topic><topic>Artificial neural networks</topic><topic>Construction</topic><topic>Construction equipment</topic><topic>Construction sites</topic><topic>Convolutional neural network (CNN)</topic><topic>Data augmentation</topic><topic>Deep learning</topic><topic>Fourier transforms</topic><topic>Heavy construction</topic><topic>Heavy equipment</topic><topic>Mixtures</topic><topic>Multi-label sound classification</topic><topic>Neural networks</topic><topic>Short time Fourier transform (STFT)</topic><topic>Signal processing</topic><topic>Sound</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Sherafat, Behnam</creatorcontrib><creatorcontrib>Rashidi, Abbas</creatorcontrib><creatorcontrib>Asgari, Sadegh</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Civil Engineering Abstracts</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Automation in construction</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Sherafat, Behnam</au><au>Rashidi, Abbas</au><au>Asgari, Sadegh</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Sound-based multiple-equipment activity recognition using convolutional neural networks</atitle><jtitle>Automation in construction</jtitle><date>2022-03</date><risdate>2022</risdate><volume>135</volume><spage>104104</spage><pages>104104-</pages><artnum>104104</artnum><issn>0926-5805</issn><eissn>1872-7891</eissn><abstract>Automatically recognizing activities of heavy construction equipment using sound data has recently received considerable attention as a promising research area in construction. Although existing methods are effective, they only focus on tracking the activities of one single piece of equipment. On construction job sites, multiple equipment sound signals are mixed in the environment; Thus, there is a need for a robust method to recognize these activities that are taking place simultaneously. To address this shortcoming, we proposed a multi-label multi-level sound classification method based on Short-Time Fourier Transform (STFT) and Convolutional Neural Network (CNN) that only requires a single-channel off-the-shelf microphone. In addition, we developed a data augmentation method to simulate real-world equipment sound mixtures. We tested the proposed method on both synthetic and real-world equipment sound mixtures. The results of our study showed that this method was effective in identifying activities of multiple pieces of equipment on real construction job sites without the need for separating sound signals in advance. Future studies can focus on other potential applications of sound signal processing in the construction domain, including analyzing engine abnormalities and monitoring environmental performance of the equipment. •This study introduces a novel method for multiple construction equipment activity recognition using off-the-shelf single-channel microphone.•This method utilizes sound-based image signatures to extract specific features for equipment activities.•The method identifies the activities directly without the need for separating sound signals.•Also, a novel data augmentation method is introduced to simulate the real-world sound mixtures when multiple machines performing activities simultaneously.•This method is tested on both synthetic and real-world mixed data and the results are promising.</abstract><cop>Amsterdam</cop><pub>Elsevier B.V</pub><doi>10.1016/j.autcon.2021.104104</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0926-5805
ispartof Automation in construction, 2022-03, Vol.135, p.104104, Article 104104
issn 0926-5805
1872-7891
language eng
recordid cdi_proquest_journals_2641056599
source Elsevier ScienceDirect Journals
subjects 2D spectrogram
Abnormalities
Activity recognition
Artificial neural networks
Construction
Construction equipment
Construction sites
Convolutional neural network (CNN)
Data augmentation
Deep learning
Fourier transforms
Heavy construction
Heavy equipment
Mixtures
Multi-label sound classification
Neural networks
Short time Fourier transform (STFT)
Signal processing
Sound
title Sound-based multiple-equipment activity recognition using convolutional neural networks
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-05T18%3A27%3A05IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Sound-based%20multiple-equipment%20activity%20recognition%20using%20convolutional%20neural%20networks&rft.jtitle=Automation%20in%20construction&rft.au=Sherafat,%20Behnam&rft.date=2022-03&rft.volume=135&rft.spage=104104&rft.pages=104104-&rft.artnum=104104&rft.issn=0926-5805&rft.eissn=1872-7891&rft_id=info:doi/10.1016/j.autcon.2021.104104&rft_dat=%3Cproquest_cross%3E2641056599%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2641056599&rft_id=info:pmid/&rft_els_id=S0926580521005550&rfr_iscdi=true