Neural network systems with an integrated coefficient of variation-based feature selection for stock price and trend prediction

Stock market forecasting has been a subject of interest for many researchers; the essential market analyses can be integrated with historical stock market data to derive a set of features. It is crucial to select features with useful information about the specific aspect. In this article, we propose...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Expert systems with applications 2023-06, Vol.219, p.119527, Article 119527
Hauptverfasser: Chaudhari, Kinjal, Thakkar, Ankit
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Stock market forecasting has been a subject of interest for many researchers; the essential market analyses can be integrated with historical stock market data to derive a set of features. It is crucial to select features with useful information about the specific aspect. In this article, we propose coefficient of variation (CV)-based feature selection for stock prediction. The unitless statistical method, CV, is widely used to obtain variability among data distributions. We calculate CV for each feature and integrate an existing method, k-means algorithm, as well as proposed methods, median range and top-M, to select a set of features with specific characteristics such as features belonging to the largest cluster, the defined range, and with the highest CV values, respectively. We apply the set of selected features to models such as backpropagation neural network (BPNN), long short-term memory (LSTM), gated recurrent unit (GRU), and convolutional neural network (CNN) for stock price and trend prediction. We demonstrate the applicability of our proposed approach using five of the existing feature selection methods, namely, correlation coefficient, Chi2, mutual information, principal component analysis, and variance threshold; comparison indicates remarkable performance enhancement using several accuracy-based, as well as error-based, metrics and the same is statistically supported using Wilcoxon signed-rank test. •Statistical method, coefficient of variation (CV) is proposed for feature selection.•k-means algorithm, median range, and top-M are applied on the CV values of features.•Selected features are applied to neural network for stock price and trend prediction.•Comparative analysis with five existing feature selection methods is demonstrated.•Statistical significance using Wilcoxon signed-rank test is provided.
ISSN:0957-4174
1873-6793
DOI:10.1016/j.eswa.2023.119527