Research Article

Albumin and Machine Learning Model for Predicting 28-day All-cause Mortality in ICU Patients with Lung Cancer

DOI:

10.3791/70067

April 3rd, 2026

In This Article

Summary

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

This protocol details the extraction of ICU clinical data from the MIMIC-IV database and the stepwise application of survival analysis and interpretable machine-learning models to evaluate serum albumin as a predictor of 28-day mortality in lung cancer patients.

Abstract

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

Serum albumin reflects nutritional status, systemic inflammation, and disease burden. However, its prognostic significance in critically ill patients with lung cancer remains unclear. This study aimed to investigate the association between serum albumin and 28-day all-cause mortality in intensive care unit (ICU) patients with lung cancer and to assess its predictive value in machine learning (ML) models. This retrospective cohort study included 1,274 adult ICU patients with lung cancer from the Medical Information Mart for Intensive Care IV (MIMIC-IV) database. Multivariable Cox proportional hazards models evaluated the association between serum albumin and 28-day mortality. Additional analyses included restricted cubic splines, Kaplan–Meier survival curves, and subgroup analyses. ML classifiers were developed using the least absolute shrinkage and selection operator (LASSO) and Boruta for feature selection. Logistic regression was selected as the optimal model, with SHapley Additive Explanations (SHAP) values used for interpretation. Among 1,274 patients, the 28-day mortality rate was 26.9%. Lower serum albumin levels were independently associated with higher 28-day mortality. The hazard ratio for each 1 g/dL increase in serum albumin was 0.729 (95% CI, 0.597–0.891; p = 0.002). An inverse association was observed across serum albumin quartiles (p for trend <0.001), and Kaplan–Meier analysis showed lower mortality among patients with albumin ≥2.9 g/dL (log-rank p < 0.0001). The logistic regression model achieved an AUC of 0.767 in the validation cohort, indicating good discrimination and calibration. SHAP analysis identified serum albumin as a key predictor of 28-day mortality. Lower serum albumin was independently associated with increased short-term mortality in ICU patients with lung cancer. Serum albumin may serve as a practical biomarker for early risk stratification in this population. ML models incorporating albumin demonstrated strong predictive performance and interpretability.

Introduction

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

Lung cancer continues to rank among the most common and deadliest forms of malignancy worldwide. As reported by the Global Cancer Observatory (GLOBOCAN 2020), lung cancer represents around 11.4% of all newly identified cancer cases and remains the primary cause of cancer-related deaths, contributing to roughly 18% of cancer fatalities worldwide1. Although significant progress has been made in early diagnostic techniques, chemotherapeutic regimens, and targeted treatment approaches, the overall 5-year survival rate for lung cancer persists below 20%, predominantly owing to its late-stage presentation and highly aggressive nature

Access restricted. Please log in or start a trial to view this content.

Protocol

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

Database access and software environment
Obtain authorized access to the Medical Information Mart for Intensive Care IV database (MIMIC-IV, v3.1) after completing the required training on ethical data use (Certification ID: 69002811). According to the Measures for Ethical Review of Life Science and Medical Research Involving Human Subjects (February 18, 2023, China), Article 32 stipulates that research using legally obtained public data, data generated without interfering with public behavior, or anonymized information is exempt from ethics review. Consistent with these provisions, this study was exempt from institutional ethical approval. The d....

Access restricted. Please log in or start a trial to view this content.

Results

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

Baseline characteristics
A total of 1,274 patients with lung cancer were identified and extracted from the MIMIC-IV database (Figure 1). The median age of the cohort was 69 years (IQR: 61–77), and 51.18% were male. Overall, 343 patients (26.92%) died within 28 days of ICU admission. Baseline characteristics stratified by 28-day survival status are presented in Table 1. Compared with survivors, non-survivors exhibited significantly higher severity of illn.......

Access restricted. Please log in or start a trial to view this content.

Discussion

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

In this study, we examined the association between serum albumin levels and 28-day all-cause mortality among ICU patients with lung cancer, using data from the MIMIC-IV database. Lower serum albumin levels were independently associated with a higher risk of 28-day mortality, even after adjustment for a wide range of clinical covariates. This association remained consistent across various analytical approaches, including Cox proportional hazards models and ML methods. Subgroup analyses further demonstrated the stability o.......

Access restricted. Please log in or start a trial to view this content.

Disclosures

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

The authors declare no competing interests.

Acknowledgements

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

We sincerely appreciate the significant contributions made by all the authors towards this study; their invaluable efforts have been instrumental in its success. This research was funded by the National Natural Science Foundation of China(82174415, 82405441), Science and technology innovation project of Chinese Academy of Traditional Chinese Medicine(CI2021A01818, C12021A03307, C12021A05054) and High level Traditional Chinese Medicine Hospital Construction Project of Wangjing Hospital, Chinese Academy of Chinese Medical Sciences Clinical Evidence based Research Special Project for Traditional Chinese Medicine(WJYY-XZKT-2023-25).

....

Access restricted. Please log in or start a trial to view this content.

Materials

List of materials used in this article
NameCompanyCatalog NumberComments
Medical Information Mart for Intensive Care IV (MIMIC-IV) DatabaseMassachusetts Institute of Technologyhttp: //mimic.physionet.org/
Navicat Premium (v17)PremiumSoft CyberTech Ltd.
PostgreSQL (v17.1.11)PostgreSQL Global Development Group
Python (v3.10)Python Software Foundation
R Statistical Software (v4.2.3)R Foundation for Statistical Computing
SHapley Additive exPlanations (SHAP)Open-source software

Reprints and Permissions

Request permission to reuse the text or figures of this JoVE article

Request Permission

Tags

Serum AlbuminLung CancerICU Patients28 Day MortalityMachine LearningLogistic RegressionCox Proportional HazardsSHAP AnalysisRisk StratificationFeature Selection

Related Articles