Digital human video identification model training method and system based on reinforcement learning

The invention discloses a digital human video identification model training method and system based on reinforcement learning, and belongs to the technical field of artificial intelligence. Defining an action space and a value index of each action according to a noise adding behavior type, and initi...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
1. Verfasser:	XUE HONGYANG
Format:	Patent
Sprache:	chi ; eng
Schlagworte:	CALCULATING COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS COMPUTING COUNTING PHYSICS
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	The invention discloses a digital human video identification model training method and system based on reinforcement learning, and belongs to the technical field of artificial intelligence. Defining an action space and a value index of each action according to a noise adding behavior type, and initializing a reinforcement learning model of a valued network and playback buffer; sampling positive and negative sample pairs from the positive and negative sample set, obtaining to-be-added noise by using the value network and adding the to-be-added noise into the sample pairs to obtain new positive and negative sample pairs, obtaining return values of the added noise of the new positive and negative sample pairs by using the identification model, constructing a tetrad and storing the tetrad into playback buffer; when the number of the tetrads reaches a threshold value, updating the value network, and emptying the playback buffer; using a value network to disturb the positive and negative sample set to obtain a new