RelTrans: An Enhancing Offline Reinforcement Learning Model for the Complex Hand Gesture Decision-Making Task

As wearable devices gain popularity, gesture recognition technology is becoming increasingly vital. Merely identifying gesture categories is insufficient for devices operating in complex environments. A significant challenge lies in enabling devices to autonomously and efficiently perform gesture re...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on consumer electronics 2024-02, Vol.70 (1), p.3762-3769
Hauptverfasser:	Chen, Xiangwei, Zeng, Zhixia, Xiao, Ruliang, Rida, Imad, Zhang, Shi
Format:	Artikel
Sprache:	eng
Schlagworte:	Adaptation models Computer Science Constraint modelling data analysis Data exchange Data models Decision making Deep learning Distillation Gesture recognition hand gesture recognition Machine learning offline reinforcement learning Reinforcement learning Sequences Source code Task analysis Task complexity Transformers Wearable technology
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	As wearable devices gain popularity, gesture recognition technology is becoming increasingly vital. Merely identifying gesture categories is insufficient for devices operating in complex environments. A significant challenge lies in enabling devices to autonomously and efficiently perform gesture recognition tasks, particularly in complex decision-making. Addressing this, this paper introduces an implicit relationship constraint-based offline reinforcement learning model, termed the Relationship Transformer Guided Generative Policy Network (RelTrans), designed for complex gesture decision-making tasks. The model includes an Implicit Constraint-Constructing Network (ICCN) that uses immediate rewards, unbound by predefined reward values, to extract relationship data for guiding the Generative Policy Network (GPN) in predicting action sequences. Additionally, it integrates a knowledge distillation-based Soft-Bias loss function, which not only allows the GPN to leverage ICCN's implicit constraints but also controls its self-generalization, ensuring effective information exchange and coordinated network updates. These advancements enable the model to comprehend and adapt to higher-level decision-making and reasoning across varying environmental conditions, enhancing the agent and applicability of gesture recognition technology in a broad spectrum of application areas. Extensive experimentation across 19 subtasks in the D4RL offline benchmark suite demonstrates that RelTrans matches or surpasses the performance of previous state-of-the-art approaches in various autonomous decision-making tasks. Our code is open source and available at: https://github.com/Aoudsung/RelTrans .
ISSN:	0098-3063 1558-4127
DOI:	10.1109/TCE.2024.3360211