Towards autonomous neuroprosthetic control using Hebbian reinforcement learning

Objective. Our goal was to design an adaptive neuroprosthetic controller that could learn the mapping from neural states to prosthetic actions and automatically adjust adaptation using only a binary evaluative feedback as a measure of desirability undesirability of performance. Approach. Hebbian rei...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Journal of neural engineering 2013-12, Vol.10 (6), p.066005-15
Hauptverfasser:	Mahmoudi, Babak, Pohlmeyer, Eric A, Prins, Noeline W, Geng, Shijia, Sanchez, Justin C
Format:	Artikel
Sprache:	eng
Schlagworte:	actor-critic Adaptation adaptive control Algorithms Animals Artificial Intelligence - standards associative learning autonomous brain-machine interface Callithrix Control systems Control theory Controllers Feedback Hebbian learning Learning Learning - physiology neural decoding Neural Prostheses - standards neuroprosthetic Policies Random Allocation Reinforcement Reinforcement (Psychology) reinforcement learning
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Objective. Our goal was to design an adaptive neuroprosthetic controller that could learn the mapping from neural states to prosthetic actions and automatically adjust adaptation using only a binary evaluative feedback as a measure of desirability undesirability of performance. Approach. Hebbian reinforcement learning (HRL) in a connectionist network was used for the design of the adaptive controller. The method combines the efficiency of supervised learning with the generality of reinforcement learning. The convergence properties of this approach were studied using both closed-loop control simulations and open-loop simulations that used primate neural data from robot-assisted reaching tasks. Main results. The HRL controller was able to perform classification and regression tasks using its episodic and sequential learning modes, respectively. In our experiments, the HRL controller quickly achieved convergence to an effective control policy, followed by robust performance. The controller also automatically stopped adapting the parameters after converging to a satisfactory control policy. Additionally, when the input neural vector was reorganized, the controller resumed adaptation to maintain performance. Significance. By estimating an evaluative feedback directly from the user, the HRL control algorithm may provide an efficient method for autonomous adaptation of neuroprosthetic systems. This method may enable the user to teach the controller the desired behavior using only a simple feedback signal.
ISSN:	1741-2560 1741-2552
DOI:	10.1088/1741-2560/10/6/066005