Neural network systems with an integrated coefficient of variation-based feature selection for stock price and trend prediction
Stock market forecasting has been a subject of interest for many researchers; the essential market analyses can be integrated with historical stock market data to derive a set of features. It is crucial to select features with useful information about the specific aspect. In this article, we propose...
Gespeichert in:
Veröffentlicht in: | Expert systems with applications 2023-06, Vol.219, p.119527, Article 119527 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Stock market forecasting has been a subject of interest for many researchers; the essential market analyses can be integrated with historical stock market data to derive a set of features. It is crucial to select features with useful information about the specific aspect. In this article, we propose coefficient of variation (CV)-based feature selection for stock prediction. The unitless statistical method, CV, is widely used to obtain variability among data distributions. We calculate CV for each feature and integrate an existing method, k-means algorithm, as well as proposed methods, median range and top-M, to select a set of features with specific characteristics such as features belonging to the largest cluster, the defined range, and with the highest CV values, respectively. We apply the set of selected features to models such as backpropagation neural network (BPNN), long short-term memory (LSTM), gated recurrent unit (GRU), and convolutional neural network (CNN) for stock price and trend prediction. We demonstrate the applicability of our proposed approach using five of the existing feature selection methods, namely, correlation coefficient, Chi2, mutual information, principal component analysis, and variance threshold; comparison indicates remarkable performance enhancement using several accuracy-based, as well as error-based, metrics and the same is statistically supported using Wilcoxon signed-rank test.
•Statistical method, coefficient of variation (CV) is proposed for feature selection.•k-means algorithm, median range, and top-M are applied on the CV values of features.•Selected features are applied to neural network for stock price and trend prediction.•Comparative analysis with five existing feature selection methods is demonstrated.•Statistical significance using Wilcoxon signed-rank test is provided. |
---|---|
ISSN: | 0957-4174 1873-6793 |
DOI: | 10.1016/j.eswa.2023.119527 |