Structured Knowledge Distillation for Dense Prediction

In this work, we consider transferring the structure information from large networks to compact ones for dense prediction tasks in computer vision. Previous knowledge distillation strategies used for dense prediction tasks often directly borrow the distillation scheme for image classification and pe...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on pattern analysis and machine intelligence 2023-06, Vol.45 (6), p.7035-7049
Hauptverfasser:	Liu, Yifan, Shu, Changyong, Wang, Jingdong, Shen, Chunhua
Format:	Artikel
Sprache:	eng
Schlagworte:	adversarial training Computer vision dense prediction Distillation Estimation Image classification Image segmentation Knowledge engineering knowledge transferring Networks Object detection Object recognition Semantic segmentation Semantics Structured knowledge distillation Task analysis Training
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	In this work, we consider transferring the structure information from large networks to compact ones for dense prediction tasks in computer vision. Previous knowledge distillation strategies used for dense prediction tasks often directly borrow the distillation scheme for image classification and perform knowledge distillation for each pixel separately , leading to sub-optimal performance. Here we propose to distill structured knowledge from large networks to compact networks, taking into account the fact that dense prediction is a structured prediction problem. Specifically, we study two structured distillation schemes: i ) pair-wise distillation that distills the pair-wise similarities by building a static graph; and ii ) holistic distillation that uses adversarial training to distill holistic knowledge. The effectiveness of our knowledge distillation approaches is demonstrated by experiments on three dense prediction tasks: semantic segmentation, depth estimation and object detection. Code is available at https://git.io/StructKD .
ISSN:	0162-8828 1939-3539 2160-9292
DOI:	10.1109/TPAMI.2020.3001940