User information collection method and device based on reinforcement learning model

The embodiment of the invention provides a user information collection method and device based on a reinforcement learning model, the reinforcement learning model comprises a strategy network, the method comprises the following steps: in a conversation process with a target user, a current environme...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	ZHANG TIANYI, LIU DANDAN, CAO LIN, SHU HUIZHEN, ZHANG XIAOXU
Format:	Patent
Sprache:	chi ; eng
Schlagworte:	CALCULATING COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS COMPUTING COUNTING DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FORADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORYOR FORECASTING PURPOSES PHYSICS SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE,COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTINGPURPOSES, NOT OTHERWISE PROVIDED FOR
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	The embodiment of the invention provides a user information collection method and device based on a reinforcement learning model, the reinforcement learning model comprises a strategy network, the method comprises the following steps: in a conversation process with a target user, a current environment state is acquired, the current environment state is determined at least based on corresponding previous N rounds of conversation content, and each round of conversation content comprises N rounds of conversation content; historical information collection questions and historical user feedback of the target user; the current environment state is input into a strategy network, Q values corresponding to all alternative information collection questions in an alternative question set in the current environment state are obtained, the alternative question set is determined based on historical user feedback and a preset knowledge base, and the preset knowledge base comprises the mapping relation between specified types