VARIABLE SELECTION METHOD, VARIABLE SELECTION PROGRAM, AND VARIABLE SELECTION SYSTEM

To provide a variable selection method for machine learning with which it is possible to improve prediction accuracy, etc.SOLUTION: The present invention is a variable selection method for selecting a specific variable group used in machine learning from all variable groups constituting the mother d...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: YUNOKI TAKAHIRO, YOGO YASUHIRO, MIYAZAKI IZURU
Format: Patent
Sprache:eng ; jpn
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:To provide a variable selection method for machine learning with which it is possible to improve prediction accuracy, etc.SOLUTION: The present invention is a variable selection method for selecting a specific variable group used in machine learning from all variable groups constituting the mother data group concerned. The variable selection method comprises: an information amount calculation step for calculating, using the mother data group, respectively, the mutual information amount of a variable pair that is a combination of one or more first variables extracted from all variable groups and each second variable, unlike the first variables, extracted from all variable groups; and a verification step for performing, with regard to second variables extracted as many as prescribed selection counts in decreasing order of magnitude of mutual information amount or a selected variable group composed of said variable pair, learning model creation and performance evaluation using a selected data group pertaining to the selected variable group extracted from the mother data group. The specific variable group is selected on the basis of a relationship between the selected number of selected variable groups and the performance of the learning model.SELECTED DRAWING: Figure 5 【課題】予測精度等の向上を図れる機械学習用の変数選定方法を提供する。【解決手段】本発明は、対象となる母データ群を構成する全変数群から機械学習に用いる特定変数群を選定する変数選定方法である。変数選定方法は、全変数群から抽出される1以上の第1変数と第1変数と異なり全変数群から抽出される各第2変数との組合せである変数対の相互情報量をそれぞれ母データ群を用いて算出する情報量算出ステップと、相互情報量の大きい方から降順に所定の選択数だけ抽出した第2変数または該変数対からなる選択変数群について、母データ群から抽出される選択変数群に係る選択データ群を用いて学習モデルの作成と性能評価を行う検証ステップとを備える。特定変数群は、選択変数群の選択数と学習モデルの性能との関係に基づいて選定される。【選択図】図5