Streaming Object Detection on Fisheye Cameras for Automatic Parking

Fisheye cameras are widely employed in automatic parking, and the video stream object detection (VSOD) of the fisheye camera is a fundamental perception function to ensure the safe operation of vehicles. In past research work, the difference between the output of the deep learning model and the actu...

Bibliographic details
Main authors: Yan, Yixiong; Cheng, Liangzhu; Li, Yongxu; Tuo, Xinjuan
Format: Article
Language: eng
creator Yan, Yixiong
Cheng, Liangzhu
Li, Yongxu
Tuo, Xinjuan
description Fisheye cameras are widely employed in automatic parking, and the video stream object detection (VSOD) of the fisheye camera is a fundamental perception function to ensure the safe operation of vehicles. In past research work, the difference between the output of the deep learning model and the actual situation at the current moment due to the existence of delay of the perception system is generally ignored. But the environment will inevitably change within the delay time which may cause a potential safety hazard. In this paper, we propose a real-time detection framework equipped with a dual-flow perception module (dynamic and static flows) that can predict the future and alleviate the time-lag problem. Meanwhile, we use a new scheme to evaluate latency and accuracy. The standard bounding box is unsuitable for the object in fisheye camera images due to the strong radial distortion of the fisheye camera and the primary detection objects of parking perception are vehicles and pedestrians, so we adopt the rotate bounding box and propose a new periodic angle loss function to regress the angle of the box, which is the simple and accurate representation method of objects. The instance segmentation ground truth is used to supervise the training. Experiments demonstrate the effectiveness of our approach. Code is released at: https://gitee.com/hiyanyx/fisheye-streaming-perception.
doi_str_mv 10.48550/arxiv.2305.14713
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2305_14713</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2305_14713</sourcerecordid><originalsourceid>FETCH-LOGICAL-a673-539cc72b5257cc4918f606293788e2517fbcbbd7bf0a60045cf50ce695abec63</originalsourceid><addsrcrecordid>eNotj0FOwzAQRb1hgQoHYIUvkGDHGTtZVoECUqUilX00no6LgTTIMYjenlCQvvRW_0lPiCutyroBUDeYvuNXWRkFpa6dNuei2-bEOMTDXm78K1OWt5xnxPEg563i9MJHlh0OnHCSYUxy-ZnHAXMk-YTpbX5eiLOA7xNf_nMhtqu75-6hWG_uH7vlukDrTAGmJXKVhwocUd3qJlhlq9a4puEKtAuevN85HxRapWqgAIrYtoCeyZqFuP6zniL6jxQHTMf-N6Y_xZgffNBEUw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Streaming Object Detection on Fisheye Cameras for Automatic Parking</title><source>arXiv.org</source><creator>Yan, Yixiong ; Cheng, Liangzhu ; Li, Yongxu ; Tuo, Xinjuan</creator><creatorcontrib>Yan, Yixiong ; Cheng, Liangzhu ; Li, Yongxu ; Tuo, Xinjuan</creatorcontrib><description>Fisheye cameras are widely employed in automatic parking, and the video stream object detection (VSOD) of the fisheye camera is a fundamental perception function to ensure the safe operation of vehicles. In past research work, the difference between the output of the deep learning model and the actual situation at the current moment due to the existence of delay of the perception system is generally ignored. But the environment will inevitably change within the delay time which may cause a potential safety hazard. In this paper, we propose a real-time detection framework equipped with a dual-flow perception module (dynamic and static flows) that can predict the future and alleviate the time-lag problem. Meanwhile, we use a new scheme to evaluate latency and accuracy. 
The standard bounding box is unsuitable for the object in fisheye camera images due to the strong radial distortion of the fisheye camera and the primary detection objects of parking perception are vehicles and pedestrians, so we adopt the rotate bounding box and propose a new periodic angle loss function to regress the angle of the box, which is the simple and accurate representation method of objects. The instance segmentation ground truth is used to supervise the training. Experiments demonstrate the effectiveness of our approach. Code is released at: https://gitee.com/hiyanyx/fisheye-streaming-perception.</description><identifier>DOI: 10.48550/arxiv.2305.14713</identifier><language>eng</language><subject>Computer Science - Artificial Intelligence ; Computer Science - Computer Vision and Pattern Recognition</subject><creationdate>2023-05</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2305.14713$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2305.14713$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Yan, Yixiong</creatorcontrib><creatorcontrib>Cheng, Liangzhu</creatorcontrib><creatorcontrib>Li, Yongxu</creatorcontrib><creatorcontrib>Tuo, Xinjuan</creatorcontrib><title>Streaming Object Detection on Fisheye Cameras for Automatic Parking</title><description>Fisheye cameras are widely employed in automatic parking, and the video stream object detection (VSOD) of the fisheye camera is a fundamental perception function to ensure the safe operation of vehicles. 
In past research work, the difference between the output of the deep learning model and the actual situation at the current moment due to the existence of delay of the perception system is generally ignored. But the environment will inevitably change within the delay time which may cause a potential safety hazard. In this paper, we propose a real-time detection framework equipped with a dual-flow perception module (dynamic and static flows) that can predict the future and alleviate the time-lag problem. Meanwhile, we use a new scheme to evaluate latency and accuracy. The standard bounding box is unsuitable for the object in fisheye camera images due to the strong radial distortion of the fisheye camera and the primary detection objects of parking perception are vehicles and pedestrians, so we adopt the rotate bounding box and propose a new periodic angle loss function to regress the angle of the box, which is the simple and accurate representation method of objects. The instance segmentation ground truth is used to supervise the training. Experiments demonstrate the effectiveness of our approach. 
Code is released at: https://gitee.com/hiyanyx/fisheye-streaming-perception.</description><subject>Computer Science - Artificial Intelligence</subject><subject>Computer Science - Computer Vision and Pattern Recognition</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotj0FOwzAQRb1hgQoHYIUvkGDHGTtZVoECUqUilX00no6LgTTIMYjenlCQvvRW_0lPiCutyroBUDeYvuNXWRkFpa6dNuei2-bEOMTDXm78K1OWt5xnxPEg563i9MJHlh0OnHCSYUxy-ZnHAXMk-YTpbX5eiLOA7xNf_nMhtqu75-6hWG_uH7vlukDrTAGmJXKVhwocUd3qJlhlq9a4puEKtAuevN85HxRapWqgAIrYtoCeyZqFuP6zniL6jxQHTMf-N6Y_xZgffNBEUw</recordid><startdate>20230524</startdate><enddate>20230524</enddate><creator>Yan, Yixiong</creator><creator>Cheng, Liangzhu</creator><creator>Li, Yongxu</creator><creator>Tuo, Xinjuan</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20230524</creationdate><title>Streaming Object Detection on Fisheye Cameras for Automatic Parking</title><author>Yan, Yixiong ; Cheng, Liangzhu ; Li, Yongxu ; Tuo, Xinjuan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a673-539cc72b5257cc4918f606293788e2517fbcbbd7bf0a60045cf50ce695abec63</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Computer Science - Artificial Intelligence</topic><topic>Computer Science - Computer Vision and Pattern Recognition</topic><toplevel>online_resources</toplevel><creatorcontrib>Yan, Yixiong</creatorcontrib><creatorcontrib>Cheng, Liangzhu</creatorcontrib><creatorcontrib>Li, Yongxu</creatorcontrib><creatorcontrib>Tuo, Xinjuan</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Yan, Yixiong</au><au>Cheng, Liangzhu</au><au>Li, Yongxu</au><au>Tuo, 
Xinjuan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Streaming Object Detection on Fisheye Cameras for Automatic Parking</atitle><date>2023-05-24</date><risdate>2023</risdate><abstract>Fisheye cameras are widely employed in automatic parking, and the video stream object detection (VSOD) of the fisheye camera is a fundamental perception function to ensure the safe operation of vehicles. In past research work, the difference between the output of the deep learning model and the actual situation at the current moment due to the existence of delay of the perception system is generally ignored. But the environment will inevitably change within the delay time which may cause a potential safety hazard. In this paper, we propose a real-time detection framework equipped with a dual-flow perception module (dynamic and static flows) that can predict the future and alleviate the time-lag problem. Meanwhile, we use a new scheme to evaluate latency and accuracy. The standard bounding box is unsuitable for the object in fisheye camera images due to the strong radial distortion of the fisheye camera and the primary detection objects of parking perception are vehicles and pedestrians, so we adopt the rotate bounding box and propose a new periodic angle loss function to regress the angle of the box, which is the simple and accurate representation method of objects. The instance segmentation ground truth is used to supervise the training. Experiments demonstrate the effectiveness of our approach. Code is released at: https://gitee.com/hiyanyx/fisheye-streaming-perception.</abstract><doi>10.48550/arxiv.2305.14713</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2305.14713
language eng
recordid cdi_arxiv_primary_2305_14713
source arXiv.org
subjects Computer Science - Artificial Intelligence
Computer Science - Computer Vision and Pattern Recognition
title Streaming Object Detection on Fisheye Cameras for Automatic Parking
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-24T19%3A33%3A29IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Streaming%20Object%20Detection%20on%20Fisheye%20Cameras%20for%20Automatic%20Parking&rft.au=Yan,%20Yixiong&rft.date=2023-05-24&rft_id=info:doi/10.48550/arxiv.2305.14713&rft_dat=%3Carxiv_GOX%3E2305_14713%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true