METHOD AND APPARATUS OF GENERATING QUESTION-ANSWER LEARNING MODEL THROUGH REINFORCEMENT LEARNING

The present invention relates to a method for generating a question-and-answer learning model through reinforcement learning in a device that generates an answer to a question, wherein the method comprises: a step of sampling, at a first agent, a latent variable in an arbitrary paragraph; a step of...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	KIBONG SUNG, KIM DONG HWAN, JEONG WOO TAE
Format:	Patent
Sprache:	eng ; kor
Schlagworte:	CALCULATING COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS COMPUTING COUNTING ELECTRIC DIGITAL DATA PROCESSING PHYSICS
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	The present invention relates to a method for generating a question-and-answer learning model through reinforcement learning in a device that generates an answer to a question, wherein the method comprises: a step of sampling, at a first agent, a latent variable in an arbitrary paragraph; a step of extracting a dataset of questions and answers from the paragraph based on the latent variable; a step of determining, at a second agent, whether or not to apply the extracted question and answer dataset to a question-and-answer learning model that generates an answer to an arbitrary question; and a step of applying a performance change value of the question and answer model to the first agent and the second agent as a reward. Therefore, the present invention is capable of improving a performance of the question-and-answer learning model. 본 발명은 질의에 대한 응답을 생성하는 장치에서, 강화 학습을 통한 질의응답 모델을 운용하는 방법에 대한 것으로, 제 1 에이전트에서, 임의의 문단에서 잠재변수(Latent variable)를 샘플링하는 단계; 상기 잠재변수를 기초로 상기 문단으로부터 질의 및 응답의 데이터 셋을 추출하는 단계; 제 2 에이전트에서, 추출된 질의 및 응답의 데이터 셋을 임의의 질의에 대한 응답을 생성하는 질의응답 모델의 학습에 적용할지 여부를 결정하는 단계; 및 상기 질의응답 모델의 성능의 변경값을 상기 제 1 에이전트 및 상기 제 2 에이전트에 리워드로 적용하는 단계를 포함하는 것을 특징으로 한다.