Establishment of a Predictive Model of GvHD-Free, Relapse-Free Survival after Allogeneic Hematopoietic Stem Cell Transplantation Using a Machine Learning Algorithm

Background: GvHD-free, relapse-free survival (GRFS) is a useful composite endpoint that measures survival without relapse or significant morbidity after allogeneic hematopoietic stem cell transplantation (allo-HSCT). We aimed to establish a predictive model of GRFS after allo-HSCT, using a machine l...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Blood 2019-11, Vol.134 (Supplement_1), p.3337-3337
Hauptverfasser: Kanda, Junya, Arai, Yasuyuki, Kondo, Tadakazu, Ishikawa, Takayuki, Ueda, Yasunori, Imada, Kazunori, Akasaka, Takashi, Yonezawa, Akihito, Yago, Kazuhiro, Nohgawa, Masaharu, Anzai, Naoyuki, Moriguchi, Toshinori, Kitano, Toshiyuki, Itoh, Mitsuru, Arima, Nobuyoshi, Takeoka, Tomoharu, Watanabe, Mitsumasa, Hirata, Hirokazu, Asagoe, Kosuke, Miyanishi, Masanori, Miyatsuka, Isao, An, Le My, Takaori-Kondo, Akifumi
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Background: GvHD-free, relapse-free survival (GRFS) is a useful composite endpoint that measures survival without relapse or significant morbidity after allogeneic hematopoietic stem cell transplantation (allo-HSCT). We aimed to establish a predictive model of GRFS after allo-HSCT, using a machine learning algorithm within the Kyoto Stem Cell Transplantation Group (KSCTG), a multi-institutional joint research group of 17 transplantation centers in Japan. Methods: Consecutive patients with hematologic diseases who had undergone their first allo-HSCT between 1986 and 2016, were included in the study (n = 2,565). The primary endpoint was GRFS, and the secondary endpoints were: overall survival (OS), relapse, non-relapse mortality, and grades II-IV acute GvHD. Data were randomly split into the training set (70% of the dataset) and the validation set (30% of the dataset). The model was trained and tested using a 5-fold cross validation on the training set. A stacking ensemble of multiple machine learning algorithms was applied to develop a model. Predictions were first generated using different machine learning algorithms such as AdaBoost, Random Survival Forest, and Dynamic-DeepHit. A metamodel was subsequently trained using these predictions as inputs, to generate the final prediction (Figure 1). All models were trained using 28 input variables containing information on the patient and transplant characteristics. We used Cox Proportional-Hazards (Cox-PH) model as the benchmark case and evaluated the models' performance using the Harrell's Concordance Index (C-index). The model's final prediction can be directly used as a risk score for each patient. We validated this risk score in the validation set using the competing risk hazard model. Due to the small size of the validation set (N=770), we only classified the final risk scores into 2 groups as follows: high risk (above median) and low risk (below median) for each risk category. Results: The median age of the patients at transplantation was 48 years (range from 7 to 76). The sources of stem cells were related bone marrow (BM) in 650 patients, related peripheral blood (PB) in 382, unrelated BM in 978, unrelated PB in 24, and cord blood in 522. Myeloablative conditioning was administered in 1,293 patients and reduced-intensity conditioning in 949 patients. Refined disease risk indexes were low in 310 patients, intermediate in 1,197 patients, high in 545 patients, and very high in 368 patients. Median follow-u
ISSN:0006-4971
1528-0020
DOI:10.1182/blood-2019-123241