Clickable Object Detection Network for a Wide Range of Mobile Screen Resolutions

Recently, as the development cycle of applications has been shortened, it is important to develop rapid and accurate application testing technology. Since application testing requires a lot of cost, mobile GUI component detection technology using deep learning is essential to prevent the use of expe...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE access 2022, Vol.10, p.115051-115060
Hauptverfasser: Kang, Boseon, Jo, Minseok, Jeong, Chang-Sung
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Recently, as the development cycle of applications has been shortened, it is important to develop rapid and accurate application testing technology. Since application testing requires a lot of cost, mobile GUI component detection technology using deep learning is essential to prevent the use of expensive human resources. In this paper, we shall propose a Clickable Object Detection Network (CODNet) for mobile component detection in a wide range of mobile screen resolutions. CODNet consists of three modules: feature extraction, deconvolution and prediction modules in order to provide performance improvement and scalability. Feature extraction module uses squeeze and excitation blocks to efficiently extract features by changing the ratio of the input image to 1:2 most close to that of mobile screen. Deconvolution module provides feature map of various sizes by upsampling feature map through top-down pathway and lateral connections. Prediction module selects an anchor size most suitable for the mobile environment using Anchor Transfer block among the set of anchor candidates obtained through the analysis of mobile dataset. Moreover, we shall show that our model achieves competitive performance in mean average precision on our dataset compared to the other models, and object detection performance is improved by building a new mobile screen dataset which consists of data collected from various resolutions and operating systems.
ISSN:2169-3536
2169-3536
DOI:10.1109/ACCESS.2022.3202222