Healthcare cost prediction: Leveraging fine-grain temporal patterns
[Display omitted] •We investigated leveraging patients’ temporal data for healthcare cost prediction.•Patients’ temporal data were extracted and represented in fine-grain form.•Novel spike detection features proposed to extract temporal patterns.•The proposed approach outperformed baselines and esta...
Gespeichert in:
Veröffentlicht in: | Journal of biomedical informatics 2019-03, Vol.91, p.103113-103113, Article 103113 |
---|---|
Hauptverfasser: | , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | [Display omitted]
•We investigated leveraging patients’ temporal data for healthcare cost prediction.•Patients’ temporal data were extracted and represented in fine-grain form.•Novel spike detection features proposed to extract temporal patterns.•The proposed approach outperformed baselines and established literature methods.
To design and assess a method to leverage individuals’ temporal data for predicting their healthcare cost. To achieve this goal, we first used patients’ temporal data in their fine-grain form as opposed to coarse-grain form. Second, we devised novel spike detection features to extract temporal patterns that improve the performance of cost prediction. Third, we evaluated the effectiveness of different types of temporal features based on cost information, visit information and medical information for the prediction task.
We used three years of medical and pharmacy claims data from 2013 to 2016 from a healthcare insurer, where the first two years were used to build the model to predict the costs in the third year. To prepare the data for modeling and prediction, the time series data of cost, visit and medical information were extracted in the form of fine-grain features (i.e., segmenting each time series into a sequence of consecutive windows and representing each window by various statistics such as sum). Then, temporal patterns of the time series were extracted and added to fine-grain features using a novel set of spike detection features (i.e., the fluctuation of data points). Gradient Boosting was applied on the final set of extracted features. Moreover, the contribution of each type of data (i.e., cost, visit and medical) was assessed. We benchmarked the proposed predictors against extant methods including those that used coarse-grain features which represent each time series with various statistics such as sum and the most recent portion of the values in the entire series. All prediction performances were measured in terms of Mean Absolute Percentage Error (MAPE).
Gradient Boosting applied on fine-grain predictors outperformed coarse-grain predictors with a MAPE of 3.02 versus 8.14 (p |
---|---|
ISSN: | 1532-0464 1532-0480 |
DOI: | 10.1016/j.jbi.2019.103113 |