A comprehensive model to predict severe acute graft-versus-host disease in acute leukemia patients after haploidentical hematopoietic stem cell transplantation

Shen, Meng-Zhu; Hong, Shen-Da; Lou, Rui; Chen, Rui-Ze; Zhang, Xiao-Hui; Xu, Lan-Ping; Wang, Yu; Yan, Chen-Hua; Chen, Huan; Chen, Yu-Hong; Han, Wei; Wang, Feng-Rong; Wang, Jing-Zhi; Liu, Kai-Yan; Huang, Xiao-Jun; Mo, Xiao-Dong

doi:10.1186/s40164-022-00278-x

Research
Open access
Published: 03 May 2022

A comprehensive model to predict severe acute graft-versus-host disease in acute leukemia patients after haploidentical hematopoietic stem cell transplantation

Meng-Zhu Shen¹^na1,
Shen-Da Hong²^na1,
Rui Lou¹^na1,
Rui-Ze Chen^1,3^na1,
Xiao-Hui Zhang¹,
Lan-Ping Xu¹,
Yu Wang¹,
Chen-Hua Yan¹,
Huan Chen¹,
Yu-Hong Chen¹,
Wei Han¹,
Feng-Rong Wang¹,
Jing-Zhi Wang¹,
Kai-Yan Liu¹,
Xiao-Jun Huang^1,4,5 &
…
Xiao-Dong Mo^1,5

Experimental Hematology & Oncology volume 11, Article number: 25 (2022) Cite this article

3123 Accesses
21 Citations
Metrics details

Abstract

Background

Acute graft-versus-host disease (aGVHD) remains the major cause of early mortality after haploidentical related donor (HID) hematopoietic stem cell transplantation (HSCT). We aimed to establish a comprehensive model which could predict severe aGVHD after HID HSCT.

Methods

Consecutive 470 acute leukemia patients receiving HID HSCT according to the protocol registered at https://clinicaltrials.gov (NCT03756675) were enrolled, 70% of them (n = 335) were randomly selected as training cohort and the remains 30% (n = 135) were used as validation cohort.

Results

The equation was as follows: Probability (grade III–IV aGVHD) = $\frac{1}{{1 + \exp \left( { - \,{\text{Y}}} \right)}}$, where Y = –0.0288 × (age) + 0.7965 × (gender) + 0.8371 × (CD3 + /CD14 + cells ratio in graft) + 0.5829 × (donor/recipient relation) − 0.0089 × (CD8 + cell counts in graft) − 2.9046. The threshold of probability was 0.057392 which helped separate patients into high- and low-risk groups. The 100-day cumulative incidence of grade III–IV aGVHD in the low- and high-risk groups was 4.1% (95% CI 1.9–6.3%) versus 12.8% (95% CI 7.4–18.2%) (P = 0.001), 3.2% (95% CI 1.2–5.1%) versus 10.6% (95% CI 4.7–16.5%) (P = 0.006), and 6.1% (95% CI 1.3–10.9%) versus 19.4% (95% CI 6.3–32.5%) (P = 0.017), respectively, in total, training, and validation cohort. The rates of grade III–IV skin and gut aGVHD in high-risk group were both significantly higher than those of low-risk group. This model could also predict grade II–IV and grade I–IV aGVHD.

Conclusions

We established a model which could predict the development of severe aGVHD in HID HSCT recipients.

Introdution

Allogeneic hematopoietic stem cell transplantation (allo-HSCT) is the most important curative method for acute leukemia (AL), which can significantly improve the long-term survival [1, 2]. Human leukocyte antigen (HLA) haploidentical related donors (HIDs) have become one of the most important donors, which accounted for the proportion at 42% among allo-HSCT from family donors in Europe [3], and accounted for the proportion at 60% among all of the allo-HSCT in China [4].

Although many strategies [e.g., antithymocyte globulin (ATG) and post-transplant cyclophosphamide (PTCy)] are used to prevent acute graft-versus-host disease (aGVHD), it is still inevitable [5]. Only half of aGVHD patients could achieve durable responses to initial corticosteroid therapy [6], and there is no standard therapy for steroid refractory aGVHD and the survival among these patients is poor [7]. Thus, severe aGVHD remains the major cause of early mortality after HID HSCT [8,9,10]. An early-warning method for severe aGVHD can help to provide risk-stratification directed prophylaxis for aGVHD and significantly improve the survival of patients receiving HID HSCT.

Several demographic and transplant characteristics, such as patient age, underlying disease (e.g., chronic myeloid leukemia), comorbidities before allo-HSCT, donor/recipient gender mismatching (i.e., female donor/male recipient combination), donor and recipient cytomegalovirus (CMV) serostatus, donor type (i.e., HLA‐non‐identical donors), HLA disparity, and GVHD prophylaxis methods are reported as important risk factors for aGVHD [11, 12]. Particularly, donor/recipient relation [i.e., collateral relative donors (CRDs) [13] and maternal donors (MDs)] [14, 15] is associated with aGVHD after HID HSCT with ATG or PTCY for GVHD prophylaxis.

In addition, graft composition may be associated with aGVHD after allo-HSCT. For example, the CD4+/CD8+ T cells ratio in granulocyte colony-stimulating factor (G-CSF)-mobilized bone marrow (G-BM) [16] or the CD3+/CD14+ cells ratio in G-CSF-primed peripheral blood (G-PB) [17] can predict aGVHD after HID HSCT. However, most of the studies only reported the risk factors for aGVHD, and there was no comprehensive model which included the characteristics of demographic, disease, transplant, and graft composition for aGVHD prediction.

Thus, in the present study, we aimed to establish a comprehensive model which could predict the severe aGVHD in patients receiving HID HSCT with ATG for GVHD prophylaxis.

Patients and methods

Study design

Consecutive AL patients receiving HID HSCT between January 21, 2020 and May 31, 2021 at Peking University, Institute of Hematology (PUIH) were enrolled. The end point of the last follow-up for all survivors was November 11, 2021. A total of 67 patients had been previously reported by Ma et al. [18], and all of them were further followed-up. All patients were treated according to the protocol registered at https://clinicaltrials.gov (NCT03756675). Informed consent was obtained from all patients or their guardians. The study was conducted in accordance with the Declaration of Helsinki, and the protocol was approved by the Institutional Review Board of Peking University People’s Hospital.

Transplant regimens

Major conditioning regimen consisted of cytarabine, busulfan, cyclophosphamide, and semustine [19, 20]. Twelve patients received total body irradiation (TBI)-based conditioning regimen. G-PB harvests were administered to the recipients on the same day of collection [18]. ATG, cyclosporine A, mycophenolate mofetil, and short-term methotrexate were administered to prevent GVHD. Particularly, patients with CRDs or MDs could receive low dose cyclophosphamide after transplantation based on ATG for GVHD prophylaxis (Additional file 1: Additional methods) [21].

Evaluation of graft composition

The methods for graft composition evaluation were showed in Additional file 1: Additional methods [16, 22].

Definitions

The definitions for disease risk index (DRI), engraftment, aGVHD, relapse, mortality, and survival were showed in Additional file 1: Additional methods [23,24,25].

Building machine learning models

Our method consisted of three steps: selecting features, building models, and finding the optimal threshold (Fig. 1 and Additional file 1: Additional methods).

Backward feature selection strategy

We randomly selected 70% of the entire population (n = 335) as the training cohort, the remains 30% were used as validation cohort (n = 135). For primary outcome (i.e., grade III-IV aGVHD), the model building steps were performed in the training cohort and validated in the validation cohort. The sensitivity, specificity, area under curve score, and accuracy score were identified in both the training and validation cohort.

We used feature selection techniques to select the predictive variables (Additional file 1: Additional methods) [26]. By doing this, we could reduce the complexity of machine learning model, while also improve the generalizability. We set age and gender to be obligate variables in the machine learning model. For other variables, we selected top-3 significant variables using backward feature selection strategy. In detail, we started with all variables including age and gender. At each iteration, we removed the least significant variable (variable with the highest P-value) except age and gender. Aside from the involved variables, we also added an extra constant variate to make the feature selection more robust. The selection was realized using generalized linear models with binomial exponential family distribution of statsmodels v0.13.0 statistical models module with Python 3.8 based on anaconda3 development platform [27].

Building models

We used generalized linear models with binomial exponential family distribution to realize logistic regression models, which were equivalent models. Aside from the selected variables, we added an extra constant variate for the predicted model to make the machine learning models stronger. We used statsmodels v0.13.0 with Python 3.8 to build the models based on anaconda3 development platform. The model parameters were set to be the defaults [28,29,30].

Finding the optimal threshold

Logistic regression model produced values between 0 and 1, which could be treated as the probabilities to be positive prediction. We needed to determine the threshold of output positive predictions (1) or negative predictions (0). In detail, we drew Receiver Operating Characteristic (ROC) curves [31] and calculated the g-mean for each threshold [32]. The best threshold corresponded to the largest g-mean. The g-mean was calculated as sqrt [tpr × (1 − fpr)], where tpr represented true positive rate, fpr represented false positive rate, under a given threshold.

Evaluation for model

ROC-AUC was defined as the area under the curve of the true positive rate versus the false positive rate at various thresholds ranging from zero to one. Confusion matrix was a summary table of predictions. In this paper, the confusion matrix was of two-by-two shape. The diagonal showed the count values of correct predictions, while the others showed the count values of incorrect predictions. Besides, we also normalized the count values by the number of True Label (Outcome) or the number of Predicted Label (Prediction). To better visualize the matrix, we colored the values with Blues colorbar.

Statistical methods

In the present study, the primary outcome was grade III to IV aGVHD. The secondary outcomes included grade II to IV aGVHD, grade I to IV aGVHD, relapse, non-relapse mortality (NRM), leukemia-free survival (LFS), and overall survival (OS).

Mann–Whitney U-test was used to compare continuous variables, χ² and Fisher’s exact tests were used for categorical variables. The Kaplan–Meier method was used to estimate the probability of LFS and OS. Competing risk analyses were performed to calculate the cumulative incidence of aGVHD, relapse, and NRM [33]. Testing was two-sided at the P < 0.05 level. Statistical analysis was performed on SPSS 22.0 software (SPSS, Chicago, IL), and R software (version 4.0.0) (http://www.r-project.org).

Results

Patient characteristics

A total of 470 patients were enrolled, and the characteristics were all comparable between training and validation cohort (Table 1). All patients achieved neutrophil engraftment and the median time from HSCT to neutrophil engraftment was 12 days (range 9–28) days. Four hundred and fifty-eight (97.4%) patients achieved platelet engraftment and the median time from HSCT to platelet engraftment was 13 days (range 7–144) days, respectively.

Table 1 Patient characteristics

Full size table

Two hundred and sixty-six (56.6%), 129 (27.4%), and 33 (7.0%) patients experienced grade I to IV aGVHD, grade II to IV aGVHD, and grade III to IV aGVHD after allo-HSCT, respectively. The median time from HSCT to aGVHD was 20 days (range 8–99) days. The cumulative incidence of grade I to IV aGVHD, grade II to IV aGVHD, and grade III to IV aGVHD at 100 days after HID HSCT was 56.5% (95% CI 52.0–61.0%), 27.3% (95% CI 23.3–31.3%), and 6.8% (95% CI 4.5–9.1%), respectively.

Thirty-eight (8.1%) patients experienced relapse, and 16 (3.4%) patients died of NRM. Four hundred and forty-nine patients survived until the last follow-up, and the median duration of follow-up was 200 days (range 52 to 509) days. The probabilities of relapse, NRM, LFS, and OS at 100 days after HID HSCT were 2.8% (95% CI 1.3–4.3%), 1.5% (95% CI 0.4–2.6%), 95.7% (95% CI 93.9–97.6%), and 97.8% (95% CI 96.5–99.2%), respectively.

Predicted model for grade III to IV aGVHD (model 1)

A predictive model for grade III-IV aGVHD was developed (Additional file 1: Additional methods, Table S1 and Fig. S1), and the equation was as follows:

$${\text{Probability}}\left( {{\text{grade III}}{-}{\text{IV aGVHD}}} \right) = \frac{1}{{1{ } + {\text{ exp}}\left( { - {\text{Y}}} \right)}}$$

where, Y = − 0.0288 × (age) + 0.7965 × (gender) + 0.8371 × (CD3 + /CD14 + cells ratio in graft) + 0.5829 × (donor/recipient relation) − 0.0089 × (CD8 + cell counts in graft) − 2.9046. Particularly, donor/recipient relation included immediate relative donors (IRDs) other than MDs (value = 0), MDs (value = 1), and CRDs (value = 2). Gender included male (value = 0) and female (value = 1). The age (years), CD8 + cell counts (× 10⁶/kg), CD3+/CD14+ cells ratio in graft used actual numerical value (Additional file 1: Table S1). The threshold of probability was 0.057392 and the g-mean was 0.682. Patients were separated into low- and high-risk groups by the threshold.

In the training cohort, the sensitivity, specificity, area under curve score, and accuracy score were 0.632, 0.680, 0.685, and 0.678, respectively. ROC curve for the model and confusion matrix is shown in Fig. 2A and Additional file 1: Table S2. In the validation cohort, the sensitivity, specificity, area under curve score, and accuracy score were 0.500, 0.760, 0.673, and 0.733, respectively. ROC curve for the model and confusion matrix is shown in Fig. 2B and Additional file 1: Table S3.

Verifying the predicted model in validation and total cohort

The 100-day cumulative incidence of grade III-IV aGVHD in the low- and high-risk groups was 4.1% (95% CI 1.9–6.3%) versus 12.8% (95% CI 7.4–18.2%) (P = 0.001), respectively, in total cohort (Fig. 3A).

The 100-day cumulative incidence of grade III-IV aGVHD in the low- and high-risk groups was 3.2% (95% CI 1.2–5.1%) versus 10.6% (95% CI 4.7–16.5%) with P = 0.006 and 6.1% (95% CI 1.3–10.9%) versus 19.4% (95% CI 6.3–32.5%) with P = 0.017, respectively, in training cohort (Fig. 3B) and validation cohort (Fig. 3C). The 100-day cumulative incidence of grade III-IV aGVHD in the low- and high-risk groups was 4.9% (95% CI 2.1–7.7%) versus 11.1% (95% CI 5.2–17.0%) with P = 0.033 and 2.1% (95% CI 0.0–4.9%) versus 18.8% (95% CI 5.0–32.5%) with P < 0.001, respectively, in patients with HCT-CI scores of 0 (Additional file 1: Fig. S2) and ≥ 1 (Additional file 1: Fig. S3).

The rates of grade III to IV skin and gut aGVHD in low-risk group were both significantly lower than those of high-risk group (skin: 4.4% vs. 12.8%, P = 0.001; gut: 1.6% vs. 4.7%, P = 0.045) (Fig. 3D).

Validation of the predicted model in grade II to IV aGVHD

In the total population, the 100-day cumulative incidence of grade II to IV aGVHD in the low-risk group and high-risk group was 21.5% (95% CI 17.0–26.0%) and 39.6% (95% CI 31.7–47.5%), respectively (P < 0.001, Fig. 4A). The rates of grade II to IV skin and gut aGVHD in the low-risk group were both significantly lower than those of high-risk group (skin: 25.5% vs. 35.6%, P = 0.025; gut: 7.5% vs. 18.8%, P < 0.001) (Fig. 4B).

Validation of the predicted model in grade I to IV aGVHD

In total population, the 100-day cumulative incidence of grade I to IV aGVHD in the low-risk group and high-risk group was 51.5% (95% CI 46.0–57.0%) and 67.1% (95% CI 59.5–74.7%), respectively (P = 0.001, Fig. 4C). The rates of grade I to IV skin, gut, and liver aGVHD in the low-risk group were all significantly lower than those of high-risk group (skin: 44.5% vs. 60.4%, P = 0.001; gut: 15.9% vs. 30.2%, P < 0.001; liver: 1.9% vs. 5.4%, P = 0.038) (Fig. 4D).

Validation of the predicted model in other clinical outcomes after HSCT

In total population, the probabilities of relapse, NRM, LFS, and OS at 100 days after HID HSCT were all comparable between the low- and high-risk groups in the total population (Additional file 1: Fig. S4).

Discussion

In the present study, we established a predicted model for grade III to IV aGVHD including patient age, gender, donor/recipient relation, CD8+ T cell count, and CD3+/CD14+ cells ratio in the graft in training cohort, which was verified in validation and total cohorts. To the best of our knowledge, we firstly established a comprehensive model which can effectively predict severe aGVHD in HID HSCT recipients with ATG for GVHD prophylaxis.

Although some studies reported several risk factors of aGVHD, most of them did not integrate these factors and single factor may not provide comprehensive prediction for aGVHD. For example, Yahng et al. [34] reported that CD8+ cell counts in G-PB were associated with the occurrence of severe aGVHD after haplo-HSCT, which was not supported by the study of Liu et al. [17] In addition, MDs showed a higher risk of aGVHD compared with other IRDs in patients receiving ATG [14] or PTCY [15] for GVHD prophylaxis. In addition, we observed that the risk of aGVHD in CRDs group was as high as that of MDs group [13]. However, some authors reported that MDs did not increase the risk of aGVHD in patients using TCD protocol [35]. In the present study, the predictive model created by machine learning models is more accurate and reliable because it can eliminate the influence of selection bias in choosing variables. It also accounts for interaction and confounding factors, which cannot be completely adjusted for or eliminated using conventional statistics.

Compared with the traditional logistic regression model, the method proposed in this paper has several improvements. First, this method adds a feature selection step [26, 27]. We propose a backward feature selection strategy based on multi-factor analysis. This strategy is in a step-wise manner, which can ensure the stability of the feature selection process, and makes the model more generalizable. Second, in the model optimization process, we add a penalty function of the regularization term. It can reduce the risk of overfitting the training data, and further make the model more generalizable. Third, we consider the imbalance of positive and negative samples of the data when outputting the final prediction results. Hence, the traditional threshold of 0.5 is not directly used. Instead, we calculate the optimal threshold based on g-means index from the ROC curve [31, 32].

According to the theory of machine learning, adding more variables increases the capacity and performance upper bound of the predictive model [36, 37], but also increases the complexity of the predictive model. Additionally, many variables may make a model too difficult to clinically apply. Thus, obligate variables seem to be a balanced approach [38, 39]. Age and gender are the most common obligate variables because they are easy to acquire in the real world and adding them usually does not increase the clinical burden [40,41,42]. Hence, we extracted “age” and "gender" as the factors in our predictive model of III to IV aGVHD.

We observed that our predict model was associated with grade III to IV and grade II to IV gut aGVHD after HID HSCT, which suggested that routine GVHD prophylaxis methods were not sufficient to prevent severe gut aGVHD in high-risk patients. Severe gut aGVHD is difficult to treat and is the greatest cause of GVHD-related mortality [43]. Thus, our predicted model could help to direct more intense prophylaxis for gut aGVHD in high-risk patients after HID HSCT with ATG for GVHD prophylaxis.

The present study had some limitations. First, the model was not associated with the development of grade III to IV liver aGVHD after HID HSCT, which might be due to the small sample of severe liver aGVHD in the present study. However, we observed that the rate of grade I to IV liver aGVHD in high-risk group was higher than that of low-risk group. Second, although we verified the model successfully in the validation cohort, this was a single-center study and the sample of validation cohort was relatively small. Third, ATG was administered to prevent GVHD in this research, but ATG is contained in 94 per cent of conditioning regimens for HID HSCT in China. Thus, the predicted value of our model should be further confirmed in patients receiving HID HSCT with PTCY for GVHD prophylaxis and in those receiving identical sibling or unrelated donor HSCT. Thus, the model should be further evaluated by independent cohorts in multicenter studies. Lastly, we did not monitor plasma cytokines (e.g., interleukin [IL]-2) and biomarkers (e.g., ST2, REG3α, TNFR1, and IL-2Rα) [44, 45], which may further improve the efficacy of our predicted model.

Conclusions

We established a comprehensive model which could predict the development of severe aGVHD in HID HSCT recipients. This was the first predicted model for severe aGVHD which can be popularized easily, can help to provide risk-stratification directed aGVHD prophylaxis, and may further decrease the risk of severe aGVHD in HID HSCT recipients. In future, prospective, multicenter studies can further confirm the efficacy of our predicted model.

Availability of data and materials

The datasets generated during the analysis of the current study are available from the corresponding author on reasonable request.

References

Zhang XH, Chen J, Han MZ, Huang H, Jiang EL, Jiang M, et al. The consensus from The Chinese Society of Hematology on indications, conditioning regimens and donor selection for allogeneic hematopoietic stem cell transplantation: 2021 update. J Hematol Oncol. 2021;14(1):145.
Article PubMed PubMed Central Google Scholar
Xu L, Chen H, Chen J, Han M, Huang H, Lai Y, et al. The consensus on indications, conditioning regimen, and donor selection of allogeneic hematopoietic cell transplantation for hematological diseases in China-recommendations from the Chinese Society of Hematology. J Hematol Oncol. 2018;11(1):33.
Article PubMed PubMed Central Google Scholar
Passweg JR, Baldomero H, Chabannon C, Basak GW, de la Cámara R, Corbacioglu S, et al. Hematopoietic cell transplantation and cellular therapy survey of the EBMT: monitoring of activities and trends over 30 years. Bone Marrow Transplant. 2021;56(7):1651–64.
Article PubMed PubMed Central Google Scholar
Xu LP, Lu PH, Wu DP, Sun ZM, Liu QF, Han MZ, et al. Hematopoietic stem cell transplantation activity in China 2019: a report from the Chinese Blood and Marrow Transplantation Registry Group. Bone Marrow Transplant. 2021;56(12):2940–47.
Article PubMed PubMed Central Google Scholar
Ringdén O, Labopin M, Sadeghi B, Mailhol A, Beelen D, Fløisand Y, et al. What is the outcome in patients with acute leukaemia who survive severe acute graft-versus-host disease? J Intern Med. 2018;283(2):166–77.
Article PubMed Google Scholar
Martin PJ, Rizzo JD, Wingard JR, Ballen K, Curtin PT, Cutler C, et al. First- and second-line systemic treatment of acute graft-versus-host disease: recommendations of the American Society of Blood and Marrow Transplantation. Biol Blood Marrow Transplant. 2012;18(8):1150–63.
Article PubMed PubMed Central Google Scholar
Penack O, Marchetti M, Ruutu T, Aljurf M, Bacigalupo A, Bonifazi F, et al. Prophylaxis and management of graft versus host disease after stem-cell transplantation for haematological malignancies: updated consensus recommendations of the European society for blood and marrow transplantation. Lancet Haematol. 2020;7(2):e157–67.
Article PubMed Google Scholar
Yeshurun M, Weisdorf D, Rowe JM, Tallman MS, Zhang MJ, Wang HL, et al. The impact of the graft-versus-leukemia effect on survival in acute lymphoblastic leukemia. Blood Adv. 2019;3(4):670–80.
Article PubMed PubMed Central Google Scholar
Yu J, Parasuraman S, Shah A, Weisdorf D. Mortality, length of stay and costs associated with acute graft-versus-host disease during hospitalization for allogeneic hematopoietic stem cell transplantation. Curr Med Res Opin. 2019;35(6):983–8.
Article PubMed Google Scholar
Modi A, Rybicki L, Majhail NS, Mossad SB. Severity of acute gastrointestinal graft-vs-host disease is associated with incidence of bloodstream infection after adult allogeneic hematopoietic stem cell transplantation. Transplant Infect Dis. 2020;22(1):e13217.
Article Google Scholar
Blume KG, Thomas ED. Thomas’ hematopoietic cell transplantation. 5th ed. Amsterdam: Wiley; 2016.
Google Scholar
Maziarz R, Slater S. Blood and marrow transplant handbook comprehensive guide for patient care comprehensive guide for patient care. Berlin: Springer; 2021.
Book Google Scholar
Mo X-D, Zhang Y-Y, Zhang X-H, Xu L-P, Wang Y, Yan C-H, et al. The role of collateral related donors in haploidentical hematopoietic stem cell transplantation. Sci Bull. 2018;63(20):1376–82.
Article Google Scholar
Wang Y, Chang YJ, Xu LP, Liu KY, Liu DH, Zhang XH, et al. Who is the best donor for a related HLA haplotype-mismatched transplant? Blood. 2014;124(6):843–50.
Article CAS PubMed Google Scholar
Kongtim P, Ciurea SO. Who is the best donor for haploidentical stem cell transplantation? Semin Hematol. 2019;56(3):194–200.
Article PubMed Google Scholar
Luo XH, Chang YJ, Xu LP, Liu DH, Liu KY, Huang XJ. The impact of graft composition on clinical outcomes in unmanipulated HLA-mismatched/haploidentical hematopoietic SCT. Bone Marrow Transplant. 2009;43(1):29–36.
Article CAS PubMed Google Scholar
Liu DH, Zhao XS, Chang YJ, Liu YK, Xu LP, Chen H, et al. The impact of graft composition on clinical outcomes in pediatric patients undergoing unmanipulated HLA-mismatched/haploidentical hematopoietic stem cell transplantation. Pediatr Blood Cancer. 2011;57(1):135–41.
Article PubMed Google Scholar
Ma YR, Zhang X, Xu L, Wang Y, Yan C, Chen H, et al. G-CSF-primed peripheral blood stem cell haploidentical transplantation could achieve satisfactory clinical outcomes for acute leukemia patients in the first complete remission: a registered study. Front Oncol. 2021;11:631625.
Article PubMed PubMed Central Google Scholar
Wang Y, Liu QF, Lin R, Yang T, Huang XJ. Optimizing antithymocyte globulin dosing in haploidentical hematopoietic cell transplantation: long-term follow-up of a multicenter, randomized controlled trial. Sci Bull. 2021. https://doi.org/10.2139/ssrn.3798561.
Article Google Scholar
Wang Y, Liu QF, Xu LP, Liu KY, Zhang XH, Ma X, et al. Haploidentical vs identical-sibling transplant for AML in remission: a multicenter, prospective study. Blood. 2015;125(25):3956–62.
Article CAS PubMed Google Scholar
Wang Y, Wu DP, Liu QF, Xu LP, Liu KY, Zhang XH, et al. Low-dose post-transplant cyclophosphamide and anti-thymocyte globulin as an effective strategy for GVHD prevention in haploidentical patients. J Hematol Oncol. 2019;12(1):88.
Article PubMed PubMed Central CAS Google Scholar
Liu Y, Chen S, Yu H. Standardization and quality control in flow cytometric enumeration of CD34(+) cells. Zhongguo Shi Yan Xue Ye Xue Za Zhi. 2000;8(4):302–6.
PubMed Google Scholar
Armand P, Kim HT, Logan BR, Wang Z, Alyea EP, Kalaycio ME, et al. Validation and refinement of the disease risk index for allogeneic stem cell transplantation. Blood. 2014;123(23):3664–71.
Article CAS PubMed PubMed Central Google Scholar
Mo XD, Zhang XH, Xu LP, Wang Y, Yan CH, Chen H, et al. Disease risk comorbidity index for patients receiving haploidentical allogeneic hematopoietic transplantation. Engineering. 2021;7(2):162–9.
Article Google Scholar
Harris AC, Young R, Devine S, Hogan WJ, Ayuk F, Bunworasate U, et al. International, multicenter standardization of acute graft-versus-host disease clinical data collection: a report from the Mount Sinai acute GVHD international consortium. Biol Blood Marrow Transplant. 2016;22(1):4–10.
Article PubMed Google Scholar
Guyon I, Andre E. An introduction to variable and feature selection. J Mach Learn Res. 2003;3:1157–82.
Google Scholar
Nelder JA, Wedderburn RWM. Generalized linear models. J Royal Stat Soc Ser A. 1972;135(3):370–84.
Article Google Scholar
Hosmer DWJ, Lemeshow SL. Applied logistic regression. Hoboken: Wiley; 1989.
Google Scholar
Seabold S, Perktold J. Statsmodels: econometric and statistical modeling with python. In: proceedings of the 9th python in science conference. 2010;57: 61.
Hastie T. The elements of statistical learning: data mining, inference, and prediction. Berlin: Springer; 2009.
Book Google Scholar
Zweig MH, Campbell G. Receiver-operating characteristic (ROC) plots: a fundamental evaluation tool in clinical medicine. Clin Chem. 1993;39(4):561–77.
Article CAS PubMed Google Scholar
Guo H, Liu H, Wu C, Zhi W, Xiao Y, She W. Logistic discrimination based on G-mean and F-measure for imbalanced problem. J Intell Fuzzy Syst. 2016;31(3):1155–66.
Article Google Scholar
Gooley TA, Leisenring W, Crowley J, Storer BE. Estimation of failure probabilities in the presence of competing risks: new representations of old estimators. Stat Med. 1999;18(6):695–706.
Article CAS PubMed Google Scholar
Yahng SA, Kim JH, Jeon YW, Yoon JH, Shin SH, Lee SE, et al. A well-tolerated regimen of 800 cGy TBI-fludarabine-busulfan-ATG for reliable engraftment after unmanipulated haploidentical peripheral blood stem cell transplantation in adult patients with acute myeloid leukemia. Biol Blood Marrow Transplant. 2015;21(1):119–29.
Article CAS PubMed Google Scholar
Stern M, Ruggeri L, Mancusi A, Bernardo ME, de Angelis C, Bucher C, et al. Survival after T cell-depleted haploidentical stem cell transplantation is improved using the mother as donor. Blood. 2008;112(7):2990–5.
Article CAS PubMed PubMed Central Google Scholar
Blumer A, Ehrenfeucht A, Haussler D, et al. Learnability and the Vapnik-Chervonenkis dimension. J ACM. 1989;36(4):929–65.
Article Google Scholar
Abu-Mostafa YS. The Vapnik-Chervonenkis dimension: information versus complexity in learning. Neural Comput. 1989;1(3):312–7.
Article Google Scholar
Mitchell TM. The discipline of machine learning. Pittsburgh: Carnegie Mellon University, School of Computer Science, Machine Learning Department; 2006.
Google Scholar
Han J, Pei J, Kamber M. Data mining: concepts and techniques. Hoboken: Elsevier; 2011.
Google Scholar
Rajkomar A, Oren E, Chen K, Dai AM, Hajaj N, Hardt M, et al. Scalable and accurate deep learning with electronic health records. NPJ Digit Med. 2018;1:18.
Article PubMed PubMed Central Google Scholar
Nemati S, Holder A, Razmi F, Stanley MD, Clifford GD, Buchman TG. An interpretable machine learning model for accurate prediction of sepsis in the ICU. Crit Care Med. 2018;46(4):547–53.
Article PubMed PubMed Central Google Scholar
Zoabi Y, Deri-Rozov S, Shomron N. Machine learning-based prediction of COVID-19 diagnosis based on symptoms. NPJ Digit Med. 2021;4(1):3.
Article PubMed PubMed Central Google Scholar
Naymagon S, Naymagon L, Wong SY, Ko HM, Renteria A, Levine J, et al. Acute graft-versus-host disease of the gut: considerations for the gastroenterologist. Nat Rev Gastroenterol Hepatol. 2017;14(12):711–26.
Article PubMed PubMed Central Google Scholar
Hartwell MJ, Özbek U, Holler E, Renteria AS, Major-Monfried H, Reddy P, et al. An early-biomarker algorithm predicts lethal graft-versus-host disease and survival. JCI Insight. 2017;2(3):e89798.
Article PubMed PubMed Central Google Scholar
Levine JE, Braun TM, Harris AC, Holler E, Taylor A, Miller H, et al. A prognostic score for acute graft-versus-host disease based on biomarkers: a multicentre study. Lancet Haematol. 2015;2(1):e21–9.
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This work was supported by the Foundation for Innovative Research Groups of the National Natural Science Foundation of China (Grant Number 81621001), the CAMS Innovation Fund for Medical Sciences (CIFMS) (Grant Number 2019-I2M-5-034), the Program of the National Natural Science Foundation of China (Grant Number 82170208), the Key Program of the National Natural Science Foundation of China (Grant Number 81930004), and the Fundamental Research Funds for the Central Universities.

Funding

This work was supported by the Program of the National Natural Science Foundation of China (Grant Number 82170208), the Foundation for Innovative Research Groups of the National Natural Science Foundation of China (Grant Number 81621001), the CAMS Innovation Fund for Medical Sciences (CIFMS) (Grant Number 2019-I2M-5-034), the Key Program of the National Natural Science Foundation of China (Grant Number 81930004), and the Fundamental Research Funds for the Central Universities.

Author information

Meng-Zhu Shen, Shen-Da Hong, Rui Lou, Rui-Ze Chen contributed equally to this manuscript

Authors and Affiliations

Peking University People’s Hospital, Peking University Institute of Hematology, National Clinical Research Center for Hematologic Disease, Beijing Key Laboratory of Hematopoietic Stem Cell Transplantation, No. 11 Xizhimen South Street, Xicheng District, Beijing, 100044, China
Meng-Zhu Shen, Rui Lou, Rui-Ze Chen, Xiao-Hui Zhang, Lan-Ping Xu, Yu Wang, Chen-Hua Yan, Huan Chen, Yu-Hong Chen, Wei Han, Feng-Rong Wang, Jing-Zhi Wang, Kai-Yan Liu, Xiao-Jun Huang & Xiao-Dong Mo
National Institute of Health Data Science at Peking University, Peking University Health Science Center, Beijing, 100191, China
Shen-Da Hong
Department of Hematology, Jiangsu Province Hospital, The First Affiliated Hospital of Nanjing Medical University, Jiangsu, 210036, China
Rui-Ze Chen
Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, 100044, China
Xiao-Jun Huang
Research Unit of Key Technique for Diagnosis and Treatments of Hematologic Malignancies, Chinese Academy of Medical Sciences, Beijing, 2019RU029, China
Xiao-Jun Huang & Xiao-Dong Mo

Authors

Meng-Zhu Shen
View author publications
You can also search for this author in PubMed Google Scholar
Shen-Da Hong
View author publications
You can also search for this author in PubMed Google Scholar
Rui Lou
View author publications
You can also search for this author in PubMed Google Scholar
Rui-Ze Chen
View author publications
You can also search for this author in PubMed Google Scholar
Xiao-Hui Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Lan-Ping Xu
View author publications
You can also search for this author in PubMed Google Scholar
Yu Wang
View author publications
You can also search for this author in PubMed Google Scholar
Chen-Hua Yan
View author publications
You can also search for this author in PubMed Google Scholar
Huan Chen
View author publications
You can also search for this author in PubMed Google Scholar
Yu-Hong Chen
View author publications
You can also search for this author in PubMed Google Scholar
Wei Han
View author publications
You can also search for this author in PubMed Google Scholar
Feng-Rong Wang
View author publications
You can also search for this author in PubMed Google Scholar
Jing-Zhi Wang
View author publications
You can also search for this author in PubMed Google Scholar
Kai-Yan Liu
View author publications
You can also search for this author in PubMed Google Scholar
Xiao-Jun Huang
View author publications
You can also search for this author in PubMed Google Scholar
Xiao-Dong Mo
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

X-JH and X-DM contributed to the study conception and design. Material preparation, data collection and analysis were performed by all authors. The first draft of the manuscript was written by X-DM and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Xiao-Dong Mo.

Ethics declarations

Ethics approval and consent to participate

The study was conducted in accordance with the Declaration of Helsinki, and the protocol was approved by the Institutional Review Board of Peking University People’s Hospital. Informed consent was obtained from all individual participants or their guardians included in the study.

Consent for publication

Not applicable.

Competing interests

The authors have no relevant financial or non-financial interests to disclose.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1.

Additional methods, Additional tables S1–S3 and additional figures S1–S4.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Shen, MZ., Hong, SD., Lou, R. et al. A comprehensive model to predict severe acute graft-versus-host disease in acute leukemia patients after haploidentical hematopoietic stem cell transplantation. Exp Hematol Oncol 11, 25 (2022). https://doi.org/10.1186/s40164-022-00278-x

Download citation

Received: 07 February 2022
Accepted: 11 April 2022
Published: 03 May 2022
DOI: https://doi.org/10.1186/s40164-022-00278-x

A comprehensive model to predict severe acute graft-versus-host disease in acute leukemia patients after haploidentical hematopoietic stem cell transplantation

Abstract

Background

Methods

Results

Conclusions

Introdution

Patients and methods

Study design

Transplant regimens

Evaluation of graft composition

Definitions

Building machine learning models

Backward feature selection strategy

Building models

Finding the optimal threshold

Evaluation for model

Statistical methods

Results

Patient characteristics

Predicted model for grade III to IV aGVHD (model 1)

Verifying the predicted model in validation and total cohort

Validation of the predicted model in grade II to IV aGVHD

Validation of the predicted model in grade I to IV aGVHD

Validation of the predicted model in other clinical outcomes after HSCT

Discussion

Conclusions

Availability of data and materials

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher's Note

Supplementary Information

Additional file 1.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Experimental Hematology & Oncology

Contact us