Sound-based multiple-equipment activity recognition using convolutional neural networks
Automatically recognizing activities of heavy construction equipment using sound data has recently received considerable attention as a promising research area in construction. Although existing methods are effective, they only focus on tracking the activities of one single piece of equipment. On co...
Gespeichert in:
Veröffentlicht in: | Automation in construction 2022-03, Vol.135, p.104104, Article 104104 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | 104104 |
container_title | Automation in construction |
container_volume | 135 |
creator | Sherafat, Behnam Rashidi, Abbas Asgari, Sadegh |
description | Automatically recognizing activities of heavy construction equipment using sound data has recently received considerable attention as a promising research area in construction. Although existing methods are effective, they only focus on tracking the activities of one single piece of equipment. On construction job sites, multiple equipment sound signals are mixed in the environment; Thus, there is a need for a robust method to recognize these activities that are taking place simultaneously. To address this shortcoming, we proposed a multi-label multi-level sound classification method based on Short-Time Fourier Transform (STFT) and Convolutional Neural Network (CNN) that only requires a single-channel off-the-shelf microphone. In addition, we developed a data augmentation method to simulate real-world equipment sound mixtures. We tested the proposed method on both synthetic and real-world equipment sound mixtures. The results of our study showed that this method was effective in identifying activities of multiple pieces of equipment on real construction job sites without the need for separating sound signals in advance. Future studies can focus on other potential applications of sound signal processing in the construction domain, including analyzing engine abnormalities and monitoring environmental performance of the equipment.
•This study introduces a novel method for multiple construction equipment activity recognition using off-the-shelf single-channel microphone.•This method utilizes sound-based image signatures to extract specific features for equipment activities.•The method identifies the activities directly without the need for separating sound signals.•Also, a novel data augmentation method is introduced to simulate the real-world sound mixtures when multiple machines performing activities simultaneously.•This method is tested on both synthetic and real-world mixed data and the results are promising. |
doi_str_mv | 10.1016/j.autcon.2021.104104 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2641056599</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0926580521005550</els_id><sourcerecordid>2641056599</sourcerecordid><originalsourceid>FETCH-LOGICAL-c380t-35fb4cf1d52bf02da3ecb3718f1e59efef41753533be1709c319832daab8bed33</originalsourceid><addsrcrecordid>eNp9kE1LxDAQhoMouK7-Aw8Fz12TpmmbiyCLX7DgQcVjSNPJktptdvOxsv_erPUsDAy8vPPOzIPQNcELgkl12y9kDMqOiwIXJEllqhM0I01d5HXDySmaYV5UOWswO0cX3vcY4xpXfIY-32wcu7yVHrpsE4dgtgPksItmu4ExZFIFszfhkDlQdj2aYOyYRW_GdZYW7u0Qj4ocshGi-23h27ovf4nOtBw8XP31Ofp4fHhfPuer16eX5f0qV7TBIadMt6XSpGNFq3HRSQqqpTVpNAHGQYMuSc0oo7QFUmOuKOENTT7ZNi10lM7RzZS7dXYXwQfR2-jSQV4UVeLAKsZ5cpWTSznrvQMtts5spDsIgsURoejFhFAcEYoJYRq7m8YgfbA34IRXBkYFnUk4guis-T_gB61Lfn0</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2641056599</pqid></control><display><type>article</type><title>Sound-based multiple-equipment activity recognition using convolutional neural networks</title><source>Elsevier ScienceDirect Journals</source><creator>Sherafat, Behnam ; Rashidi, Abbas ; Asgari, Sadegh</creator><creatorcontrib>Sherafat, Behnam ; Rashidi, Abbas ; Asgari, Sadegh</creatorcontrib><description>Automatically recognizing activities of heavy construction equipment using sound data has recently received considerable attention as a promising research area in construction. Although existing methods are effective, they only focus on tracking the activities of one single piece of equipment. On construction job sites, multiple equipment sound signals are mixed in the environment; Thus, there is a need for a robust method to recognize these activities that are taking place simultaneously. To address this shortcoming, we proposed a multi-label multi-level sound classification method based on Short-Time Fourier Transform (STFT) and Convolutional Neural Network (CNN) that only requires a single-channel off-the-shelf microphone. In addition, we developed a data augmentation method to simulate real-world equipment sound mixtures. We tested the proposed method on both synthetic and real-world equipment sound mixtures. The results of our study showed that this method was effective in identifying activities of multiple pieces of equipment on real construction job sites without the need for separating sound signals in advance. Future studies can focus on other potential applications of sound signal processing in the construction domain, including analyzing engine abnormalities and monitoring environmental performance of the equipment.
•This study introduces a novel method for multiple construction equipment activity recognition using off-the-shelf single-channel microphone.•This method utilizes sound-based image signatures to extract specific features for equipment activities.•The method identifies the activities directly without the need for separating sound signals.•Also, a novel data augmentation method is introduced to simulate the real-world sound mixtures when multiple machines performing activities simultaneously.•This method is tested on both synthetic and real-world mixed data and the results are promising.</description><identifier>ISSN: 0926-5805</identifier><identifier>EISSN: 1872-7891</identifier><identifier>DOI: 10.1016/j.autcon.2021.104104</identifier><language>eng</language><publisher>Amsterdam: Elsevier B.V</publisher><subject>2D spectrogram ; Abnormalities ; Activity recognition ; Artificial neural networks ; Construction ; Construction equipment ; Construction sites ; Convolutional neural network (CNN) ; Data augmentation ; Deep learning ; Fourier transforms ; Heavy construction ; Heavy equipment ; Mixtures ; Multi-label sound classification ; Neural networks ; Short time Fourier transform (STFT) ; Signal processing ; Sound</subject><ispartof>Automation in construction, 2022-03, Vol.135, p.104104, Article 104104</ispartof><rights>2021</rights><rights>Copyright Elsevier BV Mar 2022</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c380t-35fb4cf1d52bf02da3ecb3718f1e59efef41753533be1709c319832daab8bed33</citedby><cites>FETCH-LOGICAL-c380t-35fb4cf1d52bf02da3ecb3718f1e59efef41753533be1709c319832daab8bed33</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S0926580521005550$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65306</link.rule.ids></links><search><creatorcontrib>Sherafat, Behnam</creatorcontrib><creatorcontrib>Rashidi, Abbas</creatorcontrib><creatorcontrib>Asgari, Sadegh</creatorcontrib><title>Sound-based multiple-equipment activity recognition using convolutional neural networks</title><title>Automation in construction</title><description>Automatically recognizing activities of heavy construction equipment using sound data has recently received considerable attention as a promising research area in construction. Although existing methods are effective, they only focus on tracking the activities of one single piece of equipment. On construction job sites, multiple equipment sound signals are mixed in the environment; Thus, there is a need for a robust method to recognize these activities that are taking place simultaneously. To address this shortcoming, we proposed a multi-label multi-level sound classification method based on Short-Time Fourier Transform (STFT) and Convolutional Neural Network (CNN) that only requires a single-channel off-the-shelf microphone. In addition, we developed a data augmentation method to simulate real-world equipment sound mixtures. We tested the proposed method on both synthetic and real-world equipment sound mixtures. The results of our study showed that this method was effective in identifying activities of multiple pieces of equipment on real construction job sites without the need for separating sound signals in advance. Future studies can focus on other potential applications of sound signal processing in the construction domain, including analyzing engine abnormalities and monitoring environmental performance of the equipment.
•This study introduces a novel method for multiple construction equipment activity recognition using off-the-shelf single-channel microphone.•This method utilizes sound-based image signatures to extract specific features for equipment activities.•The method identifies the activities directly without the need for separating sound signals.•Also, a novel data augmentation method is introduced to simulate the real-world sound mixtures when multiple machines performing activities simultaneously.•This method is tested on both synthetic and real-world mixed data and the results are promising.</description><subject>2D spectrogram</subject><subject>Abnormalities</subject><subject>Activity recognition</subject><subject>Artificial neural networks</subject><subject>Construction</subject><subject>Construction equipment</subject><subject>Construction sites</subject><subject>Convolutional neural network (CNN)</subject><subject>Data augmentation</subject><subject>Deep learning</subject><subject>Fourier transforms</subject><subject>Heavy construction</subject><subject>Heavy equipment</subject><subject>Mixtures</subject><subject>Multi-label sound classification</subject><subject>Neural networks</subject><subject>Short time Fourier transform (STFT)</subject><subject>Signal processing</subject><subject>Sound</subject><issn>0926-5805</issn><issn>1872-7891</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><recordid>eNp9kE1LxDAQhoMouK7-Aw8Fz12TpmmbiyCLX7DgQcVjSNPJktptdvOxsv_erPUsDAy8vPPOzIPQNcELgkl12y9kDMqOiwIXJEllqhM0I01d5HXDySmaYV5UOWswO0cX3vcY4xpXfIY-32wcu7yVHrpsE4dgtgPksItmu4ExZFIFszfhkDlQdj2aYOyYRW_GdZYW7u0Qj4ocshGi-23h27ovf4nOtBw8XP31Ofp4fHhfPuer16eX5f0qV7TBIadMt6XSpGNFq3HRSQqqpTVpNAHGQYMuSc0oo7QFUmOuKOENTT7ZNi10lM7RzZS7dXYXwQfR2-jSQV4UVeLAKsZ5cpWTSznrvQMtts5spDsIgsURoejFhFAcEYoJYRq7m8YgfbA34IRXBkYFnUk4guis-T_gB61Lfn0</recordid><startdate>202203</startdate><enddate>202203</enddate><creator>Sherafat, Behnam</creator><creator>Rashidi, Abbas</creator><creator>Asgari, Sadegh</creator><general>Elsevier B.V</general><general>Elsevier BV</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>FR3</scope><scope>JQ2</scope><scope>KR7</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>202203</creationdate><title>Sound-based multiple-equipment activity recognition using convolutional neural networks</title><author>Sherafat, Behnam ; Rashidi, Abbas ; Asgari, Sadegh</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c380t-35fb4cf1d52bf02da3ecb3718f1e59efef41753533be1709c319832daab8bed33</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>2D spectrogram</topic><topic>Abnormalities</topic><topic>Activity recognition</topic><topic>Artificial neural networks</topic><topic>Construction</topic><topic>Construction equipment</topic><topic>Construction sites</topic><topic>Convolutional neural network (CNN)</topic><topic>Data augmentation</topic><topic>Deep learning</topic><topic>Fourier transforms</topic><topic>Heavy construction</topic><topic>Heavy equipment</topic><topic>Mixtures</topic><topic>Multi-label sound classification</topic><topic>Neural networks</topic><topic>Short time Fourier transform (STFT)</topic><topic>Signal processing</topic><topic>Sound</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Sherafat, Behnam</creatorcontrib><creatorcontrib>Rashidi, Abbas</creatorcontrib><creatorcontrib>Asgari, Sadegh</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Civil Engineering Abstracts</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Automation in construction</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Sherafat, Behnam</au><au>Rashidi, Abbas</au><au>Asgari, Sadegh</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Sound-based multiple-equipment activity recognition using convolutional neural networks</atitle><jtitle>Automation in construction</jtitle><date>2022-03</date><risdate>2022</risdate><volume>135</volume><spage>104104</spage><pages>104104-</pages><artnum>104104</artnum><issn>0926-5805</issn><eissn>1872-7891</eissn><abstract>Automatically recognizing activities of heavy construction equipment using sound data has recently received considerable attention as a promising research area in construction. Although existing methods are effective, they only focus on tracking the activities of one single piece of equipment. On construction job sites, multiple equipment sound signals are mixed in the environment; Thus, there is a need for a robust method to recognize these activities that are taking place simultaneously. To address this shortcoming, we proposed a multi-label multi-level sound classification method based on Short-Time Fourier Transform (STFT) and Convolutional Neural Network (CNN) that only requires a single-channel off-the-shelf microphone. In addition, we developed a data augmentation method to simulate real-world equipment sound mixtures. We tested the proposed method on both synthetic and real-world equipment sound mixtures. The results of our study showed that this method was effective in identifying activities of multiple pieces of equipment on real construction job sites without the need for separating sound signals in advance. Future studies can focus on other potential applications of sound signal processing in the construction domain, including analyzing engine abnormalities and monitoring environmental performance of the equipment.
•This study introduces a novel method for multiple construction equipment activity recognition using off-the-shelf single-channel microphone.•This method utilizes sound-based image signatures to extract specific features for equipment activities.•The method identifies the activities directly without the need for separating sound signals.•Also, a novel data augmentation method is introduced to simulate the real-world sound mixtures when multiple machines performing activities simultaneously.•This method is tested on both synthetic and real-world mixed data and the results are promising.</abstract><cop>Amsterdam</cop><pub>Elsevier B.V</pub><doi>10.1016/j.autcon.2021.104104</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0926-5805 |
ispartof | Automation in construction, 2022-03, Vol.135, p.104104, Article 104104 |
issn | 0926-5805 1872-7891 |
language | eng |
recordid | cdi_proquest_journals_2641056599 |
source | Elsevier ScienceDirect Journals |
subjects | 2D spectrogram Abnormalities Activity recognition Artificial neural networks Construction Construction equipment Construction sites Convolutional neural network (CNN) Data augmentation Deep learning Fourier transforms Heavy construction Heavy equipment Mixtures Multi-label sound classification Neural networks Short time Fourier transform (STFT) Signal processing Sound |
title | Sound-based multiple-equipment activity recognition using convolutional neural networks |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-05T18%3A27%3A05IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Sound-based%20multiple-equipment%20activity%20recognition%20using%20convolutional%20neural%20networks&rft.jtitle=Automation%20in%20construction&rft.au=Sherafat,%20Behnam&rft.date=2022-03&rft.volume=135&rft.spage=104104&rft.pages=104104-&rft.artnum=104104&rft.issn=0926-5805&rft.eissn=1872-7891&rft_id=info:doi/10.1016/j.autcon.2021.104104&rft_dat=%3Cproquest_cross%3E2641056599%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2641056599&rft_id=info:pmid/&rft_els_id=S0926580521005550&rfr_iscdi=true |