YOLO-SGC: A Dangerous Driving Behavior Detection Method With Multiscale Spatial-Channel Feature Aggregation

In intelligent transportation systems, it is important to detect drivers' dangerous driving behaviors accurately and in real time. However, current fatigue driving detection methods focus only on facial expressions or hand movements and ignore the impact of global behavior, resulting in poor d...

Detailed description

Bibliographic details
Published in: IEEE sensors journal 2024-11, Vol.24 (21), p.36044-36056
Main authors: Li, Ruijie, Yu, Changdong, Qin, Xiangrong, An, Xin, Zhao, Jinpeng, Chuai, Wenhui, Liu, Baisheng
Format: Article
Language: eng
Subjects:
container_end_page 36056
container_issue 21
container_start_page 36044
container_title IEEE sensors journal
container_volume 24
creator Li, Ruijie
Yu, Changdong
Qin, Xiangrong
An, Xin
Zhao, Jinpeng
Chuai, Wenhui
Liu, Baisheng
description In intelligent transportation systems, it is important to detect drivers' dangerous driving behaviors accurately and in real time. However, current fatigue driving detection methods focus only on facial expressions or hand movements and ignore the impact of global behavior, resulting in poor detection results in complex scenes. To solve this problem, we propose a novel dangerous driving detection method called YOLO-SGC. This method builds on the YOLOv8 framework, enhancing it with spatial and channel reconstruction convolution (SCConv) and a global attention mechanism (GAM) to efficiently capture multiscale features based on spatial-channel information. Additionally, we use the cross-scale partial connections (CSPCs) method to optimize the spatial pyramid pooling fast (SPPF) module, expanding the model's field of view. These changes enhance the algorithm's ability to express features at various scales, thereby improving the model's capacity to recognize multiscale driving behaviors. In addition, we construct a new dataset of real driving scenarios called QIN_Dataset, which contains 34 000 images of real driving scenes from frontal and side angles. Finally, we validated YOLO-SGC and other mainstream methods on public datasets and our QIN_Dataset. The results show that, compared with other mainstream models, YOLO-SGC achieves a significant improvement in average accuracy (overall increases of 1% to 42.7% in mAP50 and 1.1% to 55.6% in mAP50-95) while maintaining a high detection speed. This demonstrates that YOLO-SGC is an effective and practical solution for vision-based detection of dangerous driver behaviors.
doi_str_mv 10.1109/JSEN.2024.3457686
format Article
fullrecord <record><control><sourceid>crossref_RIE</sourceid><recordid>TN_cdi_crossref_primary_10_1109_JSEN_2024_3457686</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10680977</ieee_id><sourcerecordid>10_1109_JSEN_2024_3457686</sourcerecordid><originalsourceid>FETCH-LOGICAL-c637-2ec01becf72e17820ee673bb445ffcf4906291f53d3ea1cf354039c917a8284f3</originalsourceid><addsrcrecordid>eNpNkLtOwzAYhS0EEqXwAEgMfoEUX-OEraQXQC0dWgmYItf9nRhCUjlOJd4eonZgOmc43xk-hG4pGVFK0vuX9fR1xAgTIy6kipP4DA2olElElUjO-85JJLh6v0RXbftJCE2VVAP09bFarKL1PHvAYzzRdQG-6Vo88e7g6gI_QqkPrvF4AgFMcE2NlxDKZoffXCjxsquCa42uAK_3OjhdRVmp6xoqPAMdOg94XBQeCt2j1-jC6qqFm1MO0WY23WRP0WI1f87Gi8jEXEUMDKFbMFYxoCphBCBWfLsVQlprrEhJzFJqJd9x0NRYLgXhqUmp0glLhOVDRI-3xjdt68Hme---tf_JKcl7WXkvK-9l5SdZf8zdkXEA8G8fJyRViv8CG_tmbA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>YOLO-SGC: A Dangerous Driving Behavior Detection Method With Multiscale Spatial-Channel Feature Aggregation</title><source>IEEE Electronic Library (IEL)</source><creator>Li, Ruijie ; Yu, Changdong ; Qin, Xiangrong ; An, Xin ; Zhao, Jinpeng ; Chuai, Wenhui ; Liu, Baisheng</creator><creatorcontrib>Li, Ruijie ; Yu, Changdong ; Qin, Xiangrong ; An, Xin ; Zhao, Jinpeng ; Chuai, Wenhui ; Liu, Baisheng</creatorcontrib><description>In intelligent transportation system, it is significant to detect drivers' dangerous driving behaviors accurately and in real time. However, current fatigue driving detection methods only focus on facial expressions or hand movements and ignore the impact of global behavior, resulting in poor detection results in complex scenes. To solve this problem, we propose a novel dangerous driving detection method called YOLO-SGC. 
This method builds on the YOLOv8 framework, enhancing it with the addition of spatial and channel reconstruction convolution (SCConv) and a global attention mechanism (GAM) to efficiently capture multiscale features based on spatial-channel information. Additionally, we leverage the cross-scale partial connections (CSPCs) method to optimize the spatial pyramid pooling fast (SPPF), expanding the model's field of view. These innovations significantly enhance the algorithm's ability to express features at various scales, thereby improving the model's capacity to recognize multiscale driving behaviors. In addition, we construct a new dataset of real driving scenarios called QIN_Dataset, which contains 34 000 images of real driving scenes from frontal and side angles. Finally, we validated YOLO-SGC and other mainstream methods on the public datasets and our QIN_Dataset. The results show that compared with other mainstream models, YOLO-SGC has a significant improvement in the average accuracy (overall increasing 1%~42.7% in mAP50 and 1.1%~55.6% in mAP50-95) while maintaining a high detection speed. 
This demonstrate that YOLO-SGC is an effective and practical solution for vision-based driver dangerous behaviors detection.</description><identifier>ISSN: 1530-437X</identifier><identifier>EISSN: 1558-1748</identifier><identifier>DOI: 10.1109/JSEN.2024.3457686</identifier><identifier>CODEN: ISJEAZ</identifier><language>eng</language><publisher>IEEE</publisher><subject>Accuracy ; Adaptation models ; Dangerous driving detection ; Fatigue ; Feature extraction ; Monitoring ; multiscale feature capture ; QIN_dataset ; Real-time systems ; spatial-channel attention ; Vehicles ; YOLO</subject><ispartof>IEEE sensors journal, 2024-11, Vol.24 (21), p.36044-36056</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c637-2ec01becf72e17820ee673bb445ffcf4906291f53d3ea1cf354039c917a8284f3</cites><orcidid>0009-0009-1208-4399 ; 0000-0002-5759-4589</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10680977$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,796,27924,27925,54758</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/10680977$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Li, Ruijie</creatorcontrib><creatorcontrib>Yu, Changdong</creatorcontrib><creatorcontrib>Qin, Xiangrong</creatorcontrib><creatorcontrib>An, Xin</creatorcontrib><creatorcontrib>Zhao, Jinpeng</creatorcontrib><creatorcontrib>Chuai, Wenhui</creatorcontrib><creatorcontrib>Liu, Baisheng</creatorcontrib><title>YOLO-SGC: A Dangerous Driving Behavior Detection Method With Multiscale Spatial-Channel Feature Aggregation</title><title>IEEE sensors journal</title><addtitle>JSEN</addtitle><description>In intelligent transportation system, it is significant to detect drivers' dangerous driving behaviors accurately 
and in real time. However, current fatigue driving detection methods only focus on facial expressions or hand movements and ignore the impact of global behavior, resulting in poor detection results in complex scenes. To solve this problem, we propose a novel dangerous driving detection method called YOLO-SGC. This method builds on the YOLOv8 framework, enhancing it with the addition of spatial and channel reconstruction convolution (SCConv) and a global attention mechanism (GAM) to efficiently capture multiscale features based on spatial-channel information. Additionally, we leverage the cross-scale partial connections (CSPCs) method to optimize the spatial pyramid pooling fast (SPPF), expanding the model's field of view. These innovations significantly enhance the algorithm's ability to express features at various scales, thereby improving the model's capacity to recognize multiscale driving behaviors. In addition, we construct a new dataset of real driving scenarios called QIN_Dataset, which contains 34 000 images of real driving scenes from frontal and side angles. Finally, we validated YOLO-SGC and other mainstream methods on the public datasets and our QIN_Dataset. The results show that compared with other mainstream models, YOLO-SGC has a significant improvement in the average accuracy (overall increasing 1%~42.7% in mAP50 and 1.1%~55.6% in mAP50-95) while maintaining a high detection speed. 
This demonstrate that YOLO-SGC is an effective and practical solution for vision-based driver dangerous behaviors detection.</description><subject>Accuracy</subject><subject>Adaptation models</subject><subject>Dangerous driving detection</subject><subject>Fatigue</subject><subject>Feature extraction</subject><subject>Monitoring</subject><subject>multiscale feature capture</subject><subject>QIN_dataset</subject><subject>Real-time systems</subject><subject>spatial-channel attention</subject><subject>Vehicles</subject><subject>YOLO</subject><issn>1530-437X</issn><issn>1558-1748</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNpNkLtOwzAYhS0EEqXwAEgMfoEUX-OEraQXQC0dWgmYItf9nRhCUjlOJd4eonZgOmc43xk-hG4pGVFK0vuX9fR1xAgTIy6kipP4DA2olElElUjO-85JJLh6v0RXbftJCE2VVAP09bFarKL1PHvAYzzRdQG-6Vo88e7g6gI_QqkPrvF4AgFMcE2NlxDKZoffXCjxsquCa42uAK_3OjhdRVmp6xoqPAMdOg94XBQeCt2j1-jC6qqFm1MO0WY23WRP0WI1f87Gi8jEXEUMDKFbMFYxoCphBCBWfLsVQlprrEhJzFJqJd9x0NRYLgXhqUmp0glLhOVDRI-3xjdt68Hme---tf_JKcl7WXkvK-9l5SdZf8zdkXEA8G8fJyRViv8CG_tmbA</recordid><startdate>20241101</startdate><enddate>20241101</enddate><creator>Li, Ruijie</creator><creator>Yu, Changdong</creator><creator>Qin, Xiangrong</creator><creator>An, Xin</creator><creator>Zhao, Jinpeng</creator><creator>Chuai, Wenhui</creator><creator>Liu, Baisheng</creator><general>IEEE</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0009-0009-1208-4399</orcidid><orcidid>https://orcid.org/0000-0002-5759-4589</orcidid></search><sort><creationdate>20241101</creationdate><title>YOLO-SGC: A Dangerous Driving Behavior Detection Method With Multiscale Spatial-Channel Feature Aggregation</title><author>Li, Ruijie ; Yu, Changdong ; Qin, Xiangrong ; An, Xin ; Zhao, Jinpeng ; Chuai, Wenhui ; Liu, 
Baisheng</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c637-2ec01becf72e17820ee673bb445ffcf4906291f53d3ea1cf354039c917a8284f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Accuracy</topic><topic>Adaptation models</topic><topic>Dangerous driving detection</topic><topic>Fatigue</topic><topic>Feature extraction</topic><topic>Monitoring</topic><topic>multiscale feature capture</topic><topic>QIN_dataset</topic><topic>Real-time systems</topic><topic>spatial-channel attention</topic><topic>Vehicles</topic><topic>YOLO</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Li, Ruijie</creatorcontrib><creatorcontrib>Yu, Changdong</creatorcontrib><creatorcontrib>Qin, Xiangrong</creatorcontrib><creatorcontrib>An, Xin</creatorcontrib><creatorcontrib>Zhao, Jinpeng</creatorcontrib><creatorcontrib>Chuai, Wenhui</creatorcontrib><creatorcontrib>Liu, Baisheng</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><jtitle>IEEE sensors journal</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Li, Ruijie</au><au>Yu, Changdong</au><au>Qin, Xiangrong</au><au>An, Xin</au><au>Zhao, Jinpeng</au><au>Chuai, Wenhui</au><au>Liu, Baisheng</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>YOLO-SGC: A Dangerous Driving Behavior Detection Method With Multiscale Spatial-Channel Feature Aggregation</atitle><jtitle>IEEE sensors 
journal</jtitle><stitle>JSEN</stitle><date>2024-11-01</date><risdate>2024</risdate><volume>24</volume><issue>21</issue><spage>36044</spage><epage>36056</epage><pages>36044-36056</pages><issn>1530-437X</issn><eissn>1558-1748</eissn><coden>ISJEAZ</coden><abstract>In intelligent transportation system, it is significant to detect drivers' dangerous driving behaviors accurately and in real time. However, current fatigue driving detection methods only focus on facial expressions or hand movements and ignore the impact of global behavior, resulting in poor detection results in complex scenes. To solve this problem, we propose a novel dangerous driving detection method called YOLO-SGC. This method builds on the YOLOv8 framework, enhancing it with the addition of spatial and channel reconstruction convolution (SCConv) and a global attention mechanism (GAM) to efficiently capture multiscale features based on spatial-channel information. Additionally, we leverage the cross-scale partial connections (CSPCs) method to optimize the spatial pyramid pooling fast (SPPF), expanding the model's field of view. These innovations significantly enhance the algorithm's ability to express features at various scales, thereby improving the model's capacity to recognize multiscale driving behaviors. In addition, we construct a new dataset of real driving scenarios called QIN_Dataset, which contains 34 000 images of real driving scenes from frontal and side angles. Finally, we validated YOLO-SGC and other mainstream methods on the public datasets and our QIN_Dataset. The results show that compared with other mainstream models, YOLO-SGC has a significant improvement in the average accuracy (overall increasing 1%~42.7% in mAP50 and 1.1%~55.6% in mAP50-95) while maintaining a high detection speed. 
This demonstrate that YOLO-SGC is an effective and practical solution for vision-based driver dangerous behaviors detection.</abstract><pub>IEEE</pub><doi>10.1109/JSEN.2024.3457686</doi><tpages>13</tpages><orcidid>https://orcid.org/0009-0009-1208-4399</orcidid><orcidid>https://orcid.org/0000-0002-5759-4589</orcidid></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1530-437X
ispartof IEEE sensors journal, 2024-11, Vol.24 (21), p.36044-36056
issn 1530-437X
1558-1748
language eng
recordid cdi_crossref_primary_10_1109_JSEN_2024_3457686
source IEEE Electronic Library (IEL)
subjects Accuracy
Adaptation models
Dangerous driving detection
Fatigue
Feature extraction
Monitoring
multiscale feature capture
QIN_dataset
Real-time systems
spatial-channel attention
Vehicles
YOLO
title YOLO-SGC: A Dangerous Driving Behavior Detection Method With Multiscale Spatial-Channel Feature Aggregation
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-19T13%3A44%3A12IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=YOLO-SGC:%20A%20Dangerous%20Driving%20Behavior%20Detection%20Method%20With%20Multiscale%20Spatial-Channel%20Feature%20Aggregation&rft.jtitle=IEEE%20sensors%20journal&rft.au=Li,%20Ruijie&rft.date=2024-11-01&rft.volume=24&rft.issue=21&rft.spage=36044&rft.epage=36056&rft.pages=36044-36056&rft.issn=1530-437X&rft.eissn=1558-1748&rft.coden=ISJEAZ&rft_id=info:doi/10.1109/JSEN.2024.3457686&rft_dat=%3Ccrossref_RIE%3E10_1109_JSEN_2024_3457686%3C/crossref_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=10680977&rfr_iscdi=true