From detection to understanding: A survey on representation learning for human-object interaction

Human-Object Interaction (HOI) detection is a critical topic in the visual understanding field. With the development of deep learning models, the research of HOI detection has been profoundly reshaped. Deep convolutional neural networks increased the object recognition accuracy of static images and...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Neurocomputing (Amsterdam) 2023-07, Vol.543, p.126243, Article 126243
Hauptverfasser:	Luo, Tianlun, Guan, Steven, Yang, Rui, Smith, Jeremy
Format:	Artikel
Sprache:	eng
Schlagworte:	Computer vision Convolutional neural networks Human-object interaction Object detection Representation learning Transformers
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Human-Object Interaction (HOI) detection is a critical topic in the visual understanding field. With the development of deep learning models, the research of HOI detection has been profoundly reshaped. Deep convolutional neural networks increased the object recognition accuracy of static images and induced a detection-based HOI detection stream. The detection-based models resolve the HOI detection problem from a classification perspective. Another stream of HOI detection methods seeks a deeper understanding of the information shown in images, and they are named HOI understanding methods in this survey paper. HOI understanding methods usually acquire external linguistic data to enable the deep models to learn more about the images. Additionally, some of the HOI understanding methods exploit graph neural networks (GNN) to increase the inference accuracy of the model.
ISSN:	0925-2312 1872-8286
DOI:	10.1016/j.neucom.2023.126243