YOLO-SGC: A Dangerous Driving Behavior Detection Method With Multiscale Spatial-Channel Feature Aggregation

In intelligent transportation system, it is significant to detect drivers' dangerous driving behaviors accurately and in real time. However, current fatigue driving detection methods only focus on facial expressions or hand movements and ignore the impact of global behavior, resulting in poor d...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE sensors journal 2024-11, Vol.24 (21), p.36044-36056
Hauptverfasser:	Li, Ruijie, Yu, Changdong, Qin, Xiangrong, An, Xin, Zhao, Jinpeng, Chuai, Wenhui, Liu, Baisheng
Format:	Artikel
Sprache:	eng
Schlagworte:	Accuracy Adaptation models Dangerous driving detection Fatigue Feature extraction Monitoring multiscale feature capture QIN_dataset Real-time systems spatial-channel attention Vehicles YOLO
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	36056
container_issue	21
container_start_page	36044
container_title	IEEE sensors journal
container_volume	24
creator	Li, Ruijie Yu, Changdong Qin, Xiangrong An, Xin Zhao, Jinpeng Chuai, Wenhui Liu, Baisheng
description	In intelligent transportation system, it is significant to detect drivers' dangerous driving behaviors accurately and in real time. However, current fatigue driving detection methods only focus on facial expressions or hand movements and ignore the impact of global behavior, resulting in poor detection results in complex scenes. To solve this problem, we propose a novel dangerous driving detection method called YOLO-SGC. This method builds on the YOLOv8 framework, enhancing it with the addition of spatial and channel reconstruction convolution (SCConv) and a global attention mechanism (GAM) to efficiently capture multiscale features based on spatial-channel information. Additionally, we leverage the cross-scale partial connections (CSPCs) method to optimize the spatial pyramid pooling fast (SPPF), expanding the model's field of view. These innovations significantly enhance the algorithm's ability to express features at various scales, thereby improving the model's capacity to recognize multiscale driving behaviors. In addition, we construct a new dataset of real driving scenarios called QIN_Dataset, which contains 34 000 images of real driving scenes from frontal and side angles. Finally, we validated YOLO-SGC and other mainstream methods on the public datasets and our QIN_Dataset. The results show that compared with other mainstream models, YOLO-SGC has a significant improvement in the average accuracy (overall increasing 1%~42.7% in mAP50 and 1.1%~55.6% in mAP50-95) while maintaining a high detection speed. This demonstrate that YOLO-SGC is an effective and practical solution for vision-based driver dangerous behaviors detection.
doi_str_mv	10.1109/JSEN.2024.3457686
format	Article
fullrecord	<record><control><sourceid>crossref_RIE</sourceid><recordid>TN_cdi_crossref_primary_10_1109_JSEN_2024_3457686</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10680977</ieee_id><sourcerecordid>10_1109_JSEN_2024_3457686</sourcerecordid><originalsourceid>FETCH-LOGICAL-c637-2ec01becf72e17820ee673bb445ffcf4906291f53d3ea1cf354039c917a8284f3</originalsourceid><addsrcrecordid>eNpNkLtOwzAYhS0EEqXwAEgMfoEUX-OEraQXQC0dWgmYItf9nRhCUjlOJd4eonZgOmc43xk-hG4pGVFK0vuX9fR1xAgTIy6kipP4DA2olElElUjO-85JJLh6v0RXbftJCE2VVAP09bFarKL1PHvAYzzRdQG-6Vo88e7g6gI_QqkPrvF4AgFMcE2NlxDKZoffXCjxsquCa42uAK_3OjhdRVmp6xoqPAMdOg94XBQeCt2j1-jC6qqFm1MO0WY23WRP0WI1f87Gi8jEXEUMDKFbMFYxoCphBCBWfLsVQlprrEhJzFJqJd9x0NRYLgXhqUmp0glLhOVDRI-3xjdt68Hme---tf_JKcl7WXkvK-9l5SdZf8zdkXEA8G8fJyRViv8CG_tmbA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>YOLO-SGC: A Dangerous Driving Behavior Detection Method With Multiscale Spatial-Channel Feature Aggregation</title><source>IEEE Electronic Library (IEL)</source><creator>Li, Ruijie ; Yu, Changdong ; Qin, Xiangrong ; An, Xin ; Zhao, Jinpeng ; Chuai, Wenhui ; Liu, Baisheng</creator><creatorcontrib>Li, Ruijie ; Yu, Changdong ; Qin, Xiangrong ; An, Xin ; Zhao, Jinpeng ; Chuai, Wenhui ; Liu, Baisheng</creatorcontrib><description>In intelligent transportation system, it is significant to detect drivers' dangerous driving behaviors accurately and in real time. However, current fatigue driving detection methods only focus on facial expressions or hand movements and ignore the impact of global behavior, resulting in poor detection results in complex scenes. To solve this problem, we propose a novel dangerous driving detection method called YOLO-SGC. This method builds on the YOLOv8 framework, enhancing it with the addition of spatial and channel reconstruction convolution (SCConv) and a global attention mechanism (GAM) to efficiently capture multiscale features based on spatial-channel information. Additionally, we leverage the cross-scale partial connections (CSPCs) method to optimize the spatial pyramid pooling fast (SPPF), expanding the model's field of view. These innovations significantly enhance the algorithm's ability to express features at various scales, thereby improving the model's capacity to recognize multiscale driving behaviors. In addition, we construct a new dataset of real driving scenarios called QIN_Dataset, which contains 34 000 images of real driving scenes from frontal and side angles. Finally, we validated YOLO-SGC and other mainstream methods on the public datasets and our QIN_Dataset. The results show that compared with other mainstream models, YOLO-SGC has a significant improvement in the average accuracy (overall increasing 1%~42.7% in mAP50 and 1.1%~55.6% in mAP50-95) while maintaining a high detection speed. This demonstrate that YOLO-SGC is an effective and practical solution for vision-based driver dangerous behaviors detection.</description><identifier>ISSN: 1530-437X</identifier><identifier>EISSN: 1558-1748</identifier><identifier>DOI: 10.1109/JSEN.2024.3457686</identifier><identifier>CODEN: ISJEAZ</identifier><language>eng</language><publisher>IEEE</publisher><subject>Accuracy ; Adaptation models ; Dangerous driving detection ; Fatigue ; Feature extraction ; Monitoring ; multiscale feature capture ; QIN_dataset ; Real-time systems ; spatial-channel attention ; Vehicles ; YOLO</subject><ispartof>IEEE sensors journal, 2024-11, Vol.24 (21), p.36044-36056</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c637-2ec01becf72e17820ee673bb445ffcf4906291f53d3ea1cf354039c917a8284f3</cites><orcidid>0009-0009-1208-4399 ; 0000-0002-5759-4589</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10680977$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,796,27924,27925,54758</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/10680977$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Li, Ruijie</creatorcontrib><creatorcontrib>Yu, Changdong</creatorcontrib><creatorcontrib>Qin, Xiangrong</creatorcontrib><creatorcontrib>An, Xin</creatorcontrib><creatorcontrib>Zhao, Jinpeng</creatorcontrib><creatorcontrib>Chuai, Wenhui</creatorcontrib><creatorcontrib>Liu, Baisheng</creatorcontrib><title>YOLO-SGC: A Dangerous Driving Behavior Detection Method With Multiscale Spatial-Channel Feature Aggregation</title><title>IEEE sensors journal</title><addtitle>JSEN</addtitle><description>In intelligent transportation system, it is significant to detect drivers' dangerous driving behaviors accurately and in real time. However, current fatigue driving detection methods only focus on facial expressions or hand movements and ignore the impact of global behavior, resulting in poor detection results in complex scenes. To solve this problem, we propose a novel dangerous driving detection method called YOLO-SGC. This method builds on the YOLOv8 framework, enhancing it with the addition of spatial and channel reconstruction convolution (SCConv) and a global attention mechanism (GAM) to efficiently capture multiscale features based on spatial-channel information. Additionally, we leverage the cross-scale partial connections (CSPCs) method to optimize the spatial pyramid pooling fast (SPPF), expanding the model's field of view. These innovations significantly enhance the algorithm's ability to express features at various scales, thereby improving the model's capacity to recognize multiscale driving behaviors. In addition, we construct a new dataset of real driving scenarios called QIN_Dataset, which contains 34 000 images of real driving scenes from frontal and side angles. Finally, we validated YOLO-SGC and other mainstream methods on the public datasets and our QIN_Dataset. The results show that compared with other mainstream models, YOLO-SGC has a significant improvement in the average accuracy (overall increasing 1%~42.7% in mAP50 and 1.1%~55.6% in mAP50-95) while maintaining a high detection speed. This demonstrate that YOLO-SGC is an effective and practical solution for vision-based driver dangerous behaviors detection.</description><subject>Accuracy</subject><subject>Adaptation models</subject><subject>Dangerous driving detection</subject><subject>Fatigue</subject><subject>Feature extraction</subject><subject>Monitoring</subject><subject>multiscale feature capture</subject><subject>QIN_dataset</subject><subject>Real-time systems</subject><subject>spatial-channel attention</subject><subject>Vehicles</subject><subject>YOLO</subject><issn>1530-437X</issn><issn>1558-1748</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNpNkLtOwzAYhS0EEqXwAEgMfoEUX-OEraQXQC0dWgmYItf9nRhCUjlOJd4eonZgOmc43xk-hG4pGVFK0vuX9fR1xAgTIy6kipP4DA2olElElUjO-85JJLh6v0RXbftJCE2VVAP09bFarKL1PHvAYzzRdQG-6Vo88e7g6gI_QqkPrvF4AgFMcE2NlxDKZoffXCjxsquCa42uAK_3OjhdRVmp6xoqPAMdOg94XBQeCt2j1-jC6qqFm1MO0WY23WRP0WI1f87Gi8jEXEUMDKFbMFYxoCphBCBWfLsVQlprrEhJzFJqJd9x0NRYLgXhqUmp0glLhOVDRI-3xjdt68Hme---tf_JKcl7WXkvK-9l5SdZf8zdkXEA8G8fJyRViv8CG_tmbA</recordid><startdate>20241101</startdate><enddate>20241101</enddate><creator>Li, Ruijie</creator><creator>Yu, Changdong</creator><creator>Qin, Xiangrong</creator><creator>An, Xin</creator><creator>Zhao, Jinpeng</creator><creator>Chuai, Wenhui</creator><creator>Liu, Baisheng</creator><general>IEEE</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0009-0009-1208-4399</orcidid><orcidid>https://orcid.org/0000-0002-5759-4589</orcidid></search><sort><creationdate>20241101</creationdate><title>YOLO-SGC: A Dangerous Driving Behavior Detection Method With Multiscale Spatial-Channel Feature Aggregation</title><author>Li, Ruijie ; Yu, Changdong ; Qin, Xiangrong ; An, Xin ; Zhao, Jinpeng ; Chuai, Wenhui ; Liu, Baisheng</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c637-2ec01becf72e17820ee673bb445ffcf4906291f53d3ea1cf354039c917a8284f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Accuracy</topic><topic>Adaptation models</topic><topic>Dangerous driving detection</topic><topic>Fatigue</topic><topic>Feature extraction</topic><topic>Monitoring</topic><topic>multiscale feature capture</topic><topic>QIN_dataset</topic><topic>Real-time systems</topic><topic>spatial-channel attention</topic><topic>Vehicles</topic><topic>YOLO</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Li, Ruijie</creatorcontrib><creatorcontrib>Yu, Changdong</creatorcontrib><creatorcontrib>Qin, Xiangrong</creatorcontrib><creatorcontrib>An, Xin</creatorcontrib><creatorcontrib>Zhao, Jinpeng</creatorcontrib><creatorcontrib>Chuai, Wenhui</creatorcontrib><creatorcontrib>Liu, Baisheng</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><jtitle>IEEE sensors journal</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Li, Ruijie</au><au>Yu, Changdong</au><au>Qin, Xiangrong</au><au>An, Xin</au><au>Zhao, Jinpeng</au><au>Chuai, Wenhui</au><au>Liu, Baisheng</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>YOLO-SGC: A Dangerous Driving Behavior Detection Method With Multiscale Spatial-Channel Feature Aggregation</atitle><jtitle>IEEE sensors journal</jtitle><stitle>JSEN</stitle><date>2024-11-01</date><risdate>2024</risdate><volume>24</volume><issue>21</issue><spage>36044</spage><epage>36056</epage><pages>36044-36056</pages><issn>1530-437X</issn><eissn>1558-1748</eissn><coden>ISJEAZ</coden><abstract>In intelligent transportation system, it is significant to detect drivers' dangerous driving behaviors accurately and in real time. However, current fatigue driving detection methods only focus on facial expressions or hand movements and ignore the impact of global behavior, resulting in poor detection results in complex scenes. To solve this problem, we propose a novel dangerous driving detection method called YOLO-SGC. This method builds on the YOLOv8 framework, enhancing it with the addition of spatial and channel reconstruction convolution (SCConv) and a global attention mechanism (GAM) to efficiently capture multiscale features based on spatial-channel information. Additionally, we leverage the cross-scale partial connections (CSPCs) method to optimize the spatial pyramid pooling fast (SPPF), expanding the model's field of view. These innovations significantly enhance the algorithm's ability to express features at various scales, thereby improving the model's capacity to recognize multiscale driving behaviors. In addition, we construct a new dataset of real driving scenarios called QIN_Dataset, which contains 34 000 images of real driving scenes from frontal and side angles. Finally, we validated YOLO-SGC and other mainstream methods on the public datasets and our QIN_Dataset. The results show that compared with other mainstream models, YOLO-SGC has a significant improvement in the average accuracy (overall increasing 1%~42.7% in mAP50 and 1.1%~55.6% in mAP50-95) while maintaining a high detection speed. This demonstrate that YOLO-SGC is an effective and practical solution for vision-based driver dangerous behaviors detection.</abstract><pub>IEEE</pub><doi>10.1109/JSEN.2024.3457686</doi><tpages>13</tpages><orcidid>https://orcid.org/0009-0009-1208-4399</orcidid><orcidid>https://orcid.org/0000-0002-5759-4589</orcidid></addata></record>
fulltext	fulltext_linktorsrc
identifier	ISSN: 1530-437X
ispartof	IEEE sensors journal, 2024-11, Vol.24 (21), p.36044-36056
issn	1530-437X 1558-1748
language	eng
recordid	cdi_crossref_primary_10_1109_JSEN_2024_3457686
source	IEEE Electronic Library (IEL)
subjects	Accuracy Adaptation models Dangerous driving detection Fatigue Feature extraction Monitoring multiscale feature capture QIN_dataset Real-time systems spatial-channel attention Vehicles YOLO
title	YOLO-SGC: A Dangerous Driving Behavior Detection Method With Multiscale Spatial-Channel Feature Aggregation
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-19T13%3A44%3A12IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=YOLO-SGC:%20A%20Dangerous%20Driving%20Behavior%20Detection%20Method%20With%20Multiscale%20Spatial-Channel%20Feature%20Aggregation&rft.jtitle=IEEE%20sensors%20journal&rft.au=Li,%20Ruijie&rft.date=2024-11-01&rft.volume=24&rft.issue=21&rft.spage=36044&rft.epage=36056&rft.pages=36044-36056&rft.issn=1530-437X&rft.eissn=1558-1748&rft.coden=ISJEAZ&rft_id=info:doi/10.1109/JSEN.2024.3457686&rft_dat=%3Ccrossref_RIE%3E10_1109_JSEN_2024_3457686%3C/crossref_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=10680977&rfr_iscdi=true