STATNet: One-stage coal-gangue detector based on deep learning algorithm for real industrial application

•Propose a one-stage object detection model for accurate coal-gangue detection.•Use Swin-transformer as backbone to extract global multi-scale information.•Improve feature pyramid to facilitate effective cross-scale feature fusion.•Introduce task-aligned head to mitigate classification-localization...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Energy and AI 2024-09, Vol.17, p.100388, Article 100388
Hauptverfasser: Zhang, Kefei, Wang, Teng, Yang, Xiaolin, Xu, Liang, Thé, Jesse, Tan, Zhongchao, Yu, Hesheng
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:•Propose a one-stage object detection model for accurate coal-gangue detection.•Use Swin-transformer as backbone to extract global multi-scale information.•Improve feature pyramid to facilitate effective cross-scale feature fusion.•Introduce task-aligned head to mitigate classification-localization misalignments.•Our STATNet model outperforms SOTA baseline models in real industrial dataset. Coal-gangue object detection has attracted substantial attention because it is the core of realizing vision-based intelligent and green coal separation. However, most existing studies have been focused on laboratory datasets and prioritized model lightweight. This makes the coal-gangue object detection challenging to adapt to the complex and harsh scenes of real production environments. Therefore, our project collected and labeled image datasets of coal and gangue under real production conditions from a coal preparation plant. We then designed a one-stage object model, named STATNet, following the “backbone-neck-head” architecture with the aim of enhancing the detection accuracy under industrial coal preparation scenarios. The proposed model utilizes Swin Transformer as backbone module to extract multi-scale features, improved path augmentation feature pyramid network (iPAFPN) as neck module to enrich feature fusion, and task-aligned head (TAH) as head module to mitigate conflicts and misalignments between classification and localization tasks. Experimental results on a real-world industrial dataset demonstrate that the proposed STATNet model achieves an impressive AP50 of 89.27 %, significantly surpassing several state-of-the-art baseline models by 2.02 % to 5.58 %. Additionally, it exhibits stronger robustness in resisting image corruption and perturbation. These findings demonstrate its promising prospects in practical coal and gangue separation applications. [Display omitted]
ISSN:2666-5468
2666-5468
DOI:10.1016/j.egyai.2024.100388