Machine-learning model for predicting oliguria in critically ill patients

Yamao, Yasuo; Oami, Takehiko; Yamabe, Jun; Takahashi, Nozomi; Nakada, Taka-aki

doi:10.1038/s41598-024-51476-y

Download PDF

Article
Open access
Published: 11 January 2024

Machine-learning model for predicting oliguria in critically ill patients

Yasuo Yamao¹,
Takehiko Oami¹,
Jun Yamabe²,
Nozomi Takahashi¹ &
…
Taka-aki Nakada¹

Scientific Reports volume 14, Article number: 1054 (2024) Cite this article

2118 Accesses
23 Altmetric
Metrics details

Subjects

Abstract

This retrospective cohort study aimed to develop and evaluate a machine-learning algorithm for predicting oliguria, a sign of acute kidney injury (AKI). To this end, electronic health record data from consecutive patients admitted to the intensive care unit (ICU) between 2010 and 2019 were used and oliguria was defined as a urine output of less than 0.5 mL/kg/h. Furthermore, a light-gradient boosting machine was used for model development. Among the 9,241 patients who participated in the study, the proportions of patients with urine output < 0.5 mL/kg/h for 6 h and with AKI during the ICU stay were 27.4% and 30.2%, respectively. The area under the curve (AUC) values provided by the prediction algorithm for the onset of oliguria at 6 h and 72 h using 28 clinically relevant variables were 0.964 (a 95% confidence interval (CI) of 0.963–0.965) and 0.916 (a 95% CI of 0.914–0.918), respectively. The Shapley additive explanation analysis for predicting oliguria at 6 h identified urine values, severity scores, serum creatinine, oxygen partial pressure, fibrinogen/fibrin degradation products, interleukin-6, and peripheral temperature as important variables. Thus, this study demonstrates that a machine-learning algorithm can accurately predict oliguria onset in ICU patients, suggesting the importance of oliguria in the early diagnosis and optimal management of AKI.

Machine learning algorithm to predict mortality in critically ill patients with sepsis-associated acute kidney injury

Article Open access 30 March 2023

Machine learning for early discrimination between transient and persistent acute kidney injury in critically ill patients with sepsis

Article Open access 12 October 2021

Prediction of persistent acute kidney injury in postoperative intensive care unit patients using integrated machine learning: a retrospective cohort study

Article Open access 12 October 2022

Introduction

Acute kidney injury (AKI), which is defined as a rapid increase in serum creatinine or decrease in urine output, is one of the leading causes of complications during intensive care unit (ICU) admission, resulting in persistent organ dysfunction and increased mortality^1,2,3,4. Although early detection of AKI and prompt intervention can improve the prognosis of critically ill patients, none of the procedures, including close monitoring of vital signs, blood tests, and urine analysis, provides promising solutions. Serum creatinine, which is used to diagnose AKI, has a reduced reliability for the early detection of the pathology^5,6,7,8,9.

A recent study showed that oliguria, which is generally defined as a urine output of less than 0.5 mL/kg/h over 6 h, was associated with 90-day mortality irrespective of elevations in serum creatinine levels, exhibiting important diagnostic implications for oliguria in AKI management¹⁰. In addition, a prospective observational study showed that continuous monitoring of urine output could identify more patients with AKI earlier than serum creatinine alone¹¹. Therefore, management of AKI with accurate prediction of oliguria may be a promising strategy to understand the complex pathophysiology of critical illnesses better.

With the advancement of artificial intelligence, a substantial number of prediction models using machine-learning algorithms have demonstrated high accuracy in predicting mortality and clinical outcomes in ICU patients, including the detection of AKI^12,13,14. In terms of oliguria, a previous study reported a machine-learning approach for predicting urine output in patients with sepsis after fluid administration¹⁵. However, the accuracy of a machine-learning model in predicting oliguria in a general ICU setting remains underexplored.

Therefore, we hypothesized that a machine-learning model could predict the onset of oliguria in patients admitted to an ICU. This study aimed to develop a machine-learning algorithm for predicting oliguria in patients at 6 and 72 h, from an arbitrary period, during their ICU stay and to evaluate the accuracy of the developed algorithm using a large database from a single-center surgical/medical mixed ICU.

Results

Characteristics of the cohort

Of the 14,105 patients screened, 4,745 patients without documented body weight and 119 patients on maintenance dialysis were excluded; therefore, only 9,241 patients were included in the study (Supplementary File: Fig. S1). No significant differences were observed between the training and test data. In the entire cohort, the proportions of patients with urine output < 0.5 mL/kg/h for 6 h and with AKI during their ICU stay were 27.4% and 30.2%, respectively (Table 1).

Table 1 Patient characteristics and outcomes in the training and test cohorts.

Full size table

Selection of variables

We used hourly variables and baseline information to develop a sequential machine-learning model to predict oliguria (Fig. 1). From 1,018 variables, 28 variables were selected from a clinical perspective and included in the reduced dataset (Supplementary File: Table S1). Using the light-gradient boosting machine (LightGBM) classifier, we compared the accuracies of the models using all selected variables (Fig. 2). Although the area under the curve (AUC) values for predicting oliguria were comparable between the two methods, the computation time was much longer for the 1,018-variable dataset (56.3 s) than for the 28-variable dataset (7.5 s). The top 50 important variables in the 1,018-variable dataset overlapped with approximately 40% of the selected variables (Supplementary File: Fig. S2). To improve the efficiency and comparative accuracy of the algorithm, we used the 28-variable dataset for further analysis.

Prediction of oliguria

The AUC of the model to predict oliguria at 6 h using the selected variables was 0.964 (with a 95% confidence interval [CI] of 0.963–0.965). To verify the accuracy of the model, a fivefold cross-validation was implemented with an AUC of 0.920 (95% CI 0.918–0.922). The following Shapley additive explanation (SHAP) values are important variables for predicting oliguria at 6 h: the urine values, sequential organ failure assessment (SOFA) score, serum creatinine, oxygen partial pressure (pO₂), fibrinogen/fibrin degradation products (FDP), interleukin (IL)-6, peripheral temperature, creatinine kinase, and total bilirubin (Fig. 3A). The SHAP individual force plots (Fig. 3B,C) show the SHAP values for two patients with and without oliguria. In the first patient, a greater urine volume decreased the probability of oliguria occurrence, whereas the elevation of lactate dehydrogenase (LDH) and FDP increased the probability of oliguria (Fig. 3B). Consequently, the model predicted a lower probability of oliguria occurrence at 6 h. In the second patient, a lower urine volume and higher acute physiology and chronic health evaluation (APACHE) II increased the probability of oliguria occurrence, whereas normal levels of LDH, uric acid (UA), and IL-6 decreased the probability of oliguria. Based on this information, the model predicted an increased risk of oliguria at 6 h (Fig. 3C). The prediction model for the onset of oliguria at 72 h still showed high accuracy (AUC of 0.916 [95% CI 0.914–0.918]). The important variables for predicting oliguria at 72 h based on the SHAP values overlapped with those at 6 h except for urea nitrogen and platelets (Supplementary File: Fig. S3). After analyzing the same dataset using the different computer setting as a sensitivity analysis, we obtained the same results as the primary analysis.

Subgroup analyses

Next, we analyzed the accuracy of the models in predicting the onset of oliguria at 6 h according to sex, age (≤ 65 and > 66 years), and furosemide administration (Fig. 4A). In this context, the male group was more accurate (AUC = 0.965 [95% CI 0.964–0.967]) than the female group (AUC = 0.946 [95% CI 0.943–0.949]; mean absolute error [MAE] = 0.026). In the age comparison, there was a relatively small difference (MAE = 0.006) between the group younger than 65 years old (AUC = 0.958 [95% CI, 0.958–0.959]) and the group older than 66 years (AUC = 0.962 [95% CI, 0.960–0.964]) (Fig. 4B). Finally, the accuracy of the prediction model was higher for the non-furosemide group (AUC = 0.966 [95% CI 0.964–0.967]) than for the furosemide group (AUC = 0.953 [95% CI 0.951–0.955]), with a greater difference at a later prediction time point (MAE = 0.050) (Fig. 4C).

Discussion

This study developed a machine-learning model with a high AUC (> 0.96) for predicting the onset of oliguria at 6 h in critically ill patients. Among the 28 clinically relevant variables used for the prediction model, urine values, the SOFA score, serum creatinine, pO₂, FDP, IL-6, and peripheral temperature were listed as important variables.

Over the past decade, machine learning algorithms for predicting oliguria in critically ill patients have been underexplored, despite oliguria being one of the key components of AKI that leads to increased mortality in such patients^1,16,17,18. Although several studies have repeatedly verified the precision of a machine-learning model to predict AKI in critically ill patients with an AUC range of 0.74 to 0.93^{19,20,21,22,23,24,25,26}, only one study reported a machine-learning approach to predict oliguria for the next 4 h in patients with sepsis after fluid resuscitation using 47 clinical values, with an AUC of 0.86¹⁵. Our machine-learning model exhibited a high AUC (> 0.90) for predicting oliguria in critically ill patients between 6 and 72 h. The high accuracy of our model and its capability to predict oliguria over longer periods and fact that the AUC remained the same even after reducing the variables in the model development are a testament to the novel contributions of this study. This high accuracy may be attributed to the large sample size of > 10,000 patients, resulting in abundant training data. In addition, our method of predicting the onset of oliguria from an arbitrary time may have improved the accuracy by increasing the number of training datasets. Although we built the model based on 28 clinically relevant variables, its high overlap with the top-listed variables in the 1,018-value dataset would support the plausibility of using the selected variables for the prediction model. Because oliguria could identify more patients with AKI earlier than serum creatinine alone and is associated with poor outcomes in critically ill patients, our model would be useful for the early detection of patients with AKI and for improving the prognosis of the population through better management and early intervention^10,11,27,28.

The SHAP analysis identified urine values, the SOFA score, serum creatinine, pO₂, FDP, IL-6, and peripheral temperature as significant predictors of oliguria. Previous studies using SHAP values identified significant factors that increased the risk of ICU-acquired AKI, including a higher body mass index on admission, the presence of chronic kidney disease, congestive heart failure, coagulation and bleeding disorders, and cardiac arrhythmias, in addition to renal function represented by an elevation in blood urea nitrogen or serum creatinine, but not IL-6^19,24. Because IL-6 is one of the representative cytokines induced by overwhelming systemic inflammation, several studies have reported the significance of blood IL-6 levels for early detection of multiple organ dysfunction^29,30,31. The high weight of IL-6 in the development of the predictive model verified the essential role of this cytokine in the progression of organ dysfunction. IL-6 gained wide recognition in the cytokine storm associated with COVID-19 and can now be measured using rapid kits using immunoassays. Moreover, it has emergency approval in the United States³². In a previous study investigating the association between AKI and oliguria in critically ill patients, the SOFA score was higher in patients with oliguria than in those without oliguria in the AKI cohort³³. In addition, a multicenter study identified the APACHE II score as a predictor in patients with acute renal failure, requiring dialysis³⁴. These findings support our hypothesis that severity scores play an important role in the early detection of oliguria³⁵. In summary, these important variables can provide insights into the complex pathophysiology of AKI.

In the subgroup analyses, the accuracy of the model in the female and furosemide groups decreased over time compared with that in the other groups, with relatively high accuracy (AUC = 0.86) even at 72 h after the time of observation. A previous study showed that males developed AKI in the ICU more frequently than females due to the protective function of estrogen, which includes proliferative and anti-apoptotic effects on proximal tubular cells. Therefore, physiological differences between the sexes may have affected the accuracy of the model^36,37. Additionally, the administration of furosemide, which is arbitrarily timed by a physician, has a direct effect on urine output; therefore, it is conceivable that this subjective intervention could bias the accuracy of the model.

However, this study had the following limitations. First, this retrospective single-center study model may cause uncertainty when applied to different settings although we confirmed the accuracy of the cross-validation methods. Therefore, a prospective study in a different setting is required for future clinical applications. Second, we reduced the number of variables in this study from a clinical perspective; however, we may have missed important variables that affect prediction accuracy. Third, the SHAP analysis included both plausible and uninterpretable variables to predict outcomes. A mechanistic study investigating the significance of these uninterpretable values may reveal their role in the onset of oliguria and AKI. Future research should include an external validation through multicenter studies to verify the prediction accuracy. A prospective investigation is also warranted to evaluate the effects of model application on clinical outcomes.

In conclusion, this study demonstrated that a machine-learning model could predict the onset of oliguria in critically ill patients with high accuracy. Future investigations can focus on validating the accuracy of the prediction model for the early detection of AKI in patients.