$$\rightleftharpoonup{xx}$$
$$\longleftharp{xx}$$,
$$\longrightharp{xx}$$,
Basic characteristics of the study population
This meta-analysis included 17 high-quality studies evaluating malnutrition in patients with Crohn’s disease, as summarized in Supplementary Table 1. The studies were published between 2009 and 2025, with a median sample size of 502 patients (range: 175–773). Study designs included prospective cohort, retrospective cohort, case-control, cross-sectional, and mixed-methods studies, with mixed-methods studies accounting for 17.6% (3/17) of the included studies.
Definitions of malnutrition varied among studies. Body mass index (BMI) <18.5 kg/m2 was used in 35.3% (6/17) of studies, Malnutrition Universal Screening Tool (MUST) score ≥2 in 23.5% (4/17), European Society for Clinical Nutrition and Metabolism (ESPEN) 2015 criteria in 17.6% (3/17), Global Leadership Initiative on Malnutrition (GLIM) 2019 criteria in 11.8% (2/17), and Subjective Global Assessment (SGA) Grade B/C in 5.9% (1/17). In contrast, the study cohort used the ESPEN 2015 criteria exclusively to ensure consistency of outcome definition.
The reported prevalence of malnutrition ranged from 21.5% to 43.8% across the included studies. Demographic characteristics also demonstrated heterogeneity, with mean age ranging from approximately 36 to 55 years, male proportion ranging from 46% to 60%, and mean disease duration ranging from approximately 5 to 12 years. Predictor analysis identified elevated C-reactive protein (CRP >10 mg/L) and history of intestinal resection as common risk factors for malnutrition, with pooled ORs of 4.72 (95% CI: 3.21–6.95) and 6.17 (95% CI: 2.35–16.18), respectively. Most studies (76.5%, 13/17) adjusted for potential confounding variables, including disease activity, biologic use, and smoking history.
Overall, substantial heterogeneity was observed across studies in geographic region, study period, methodology, and patient characteristics, underscoring the need for standardized diagnostic criteria and multicenter collaborative studies. Importantly, the pooled prevalence derived from the meta-analysis (32.3%) was based on heterogeneous malnutrition definitions, whereas the prevalence observed in the study cohort (42.5% in the training cohort and 40.4% in the validation cohort) was determined exclusively using the ESPEN 2015 criteria. Therefore, direct comparison of prevalence estimates should be interpreted cautiously, as differences may reflect variation in diagnostic frameworks rather than true differences in malnutrition risk.
Meta-analysis results
The forest plot summarizes the prevalence of malnutrition and corresponding 95% confidence intervals (CIs) among patients with Crohn’s disease across the included studies (Figure 1). The vertical dashed line represents the pooled prevalence estimate (0.323). Several studies reported prevalence rates exceeding 0.40, indicating a substantial malnutrition burden, whereas other studies reported prevalence rates between 0.30 and 0.40 with relatively wide confidence intervals, reflecting increased uncertainty. A smaller number of studies reported prevalence rates below 0.30.

Figure 1: Forest plot of pooled malnutrition prevalence and 95% confidence intervals in patients with Crohn’s disease. This forest plot summarizes the results of a meta-analysis of 17 high-quality studies evaluating the prevalence of malnutrition in adult patients with Crohn’s disease. Horizontal lines represent the 95% confidence intervals (CIs) for malnutrition prevalence in each study, and solid dots indicate the corresponding point estimates. The vertical dashed line represents the pooled malnutrition prevalence (0.323) across all included studies. Studies are stratified into high (≥0.4), moderate (0.3–0.4), and low (<0.3) malnutrition prevalence subgroups, illustrating variability in reported prevalence while confirming malnutrition as a common complication in Crohn’s disease. Error bars denote 95% CIs. Please click here to view a larger version of this figure.
Despite variability among studies, the pooled prevalence estimate of 32.3% indicates that malnutrition is a common clinical concern in patients with Crohn’s disease. The forest plot additionally demonstrates heterogeneity in effect sizes among studies. Five predictive factors were selected for model development based on statistical significance (P <0.05) and clinical applicability in CD management.
Characteristics of the 17 included studies are summarized in Supplementary Table 1, with corresponding citations18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34.
Contribution of individual studies to pooled estimates and variable selection for the prediction model
Each of the 17 included studies contributed individual odds ratios (ORs) and corresponding 95% confidence intervals (CIs) for candidate malnutrition risk factors in patients with Crohn’s disease. Pooled effect sizes were calculated using fixed-effects or random-effects models according to heterogeneity assessment using the I2 statistic. Random-effects models were applied when substantial heterogeneity was present (I2 >50%), whereas fixed-effects models were used for homogeneous datasets (I2 ≤50%) (Supplementary Table 2).
Only variables with statistically significant pooled effect sizes (P <0.05) were considered candidate predictive variables. Additional selection criteria included clinical availability in routine CD management and completeness of data within the IBDCD database. This screening process resulted in the selection of five predictive variables for model development: elevated CRP, small bowel involvement, biologic use, history of intestinal resection, and low BMI. Variables with non-significant pooled effects or incomplete cohort data were excluded.
Detailed study-level ORs and corresponding 95% CIs for each predictive factor are provided in Supplementary Table 3.
Baseline characteristics of the Crohn’s disease malnutrition cohort
Baseline characteristics of the training cohort (n = 520) and validation cohort (n = 280) are summarized in Table 1. No statistically significant differences were observed between cohorts for any baseline variable (P >0.05).
| Variable | Training Cohort | Validation Cohort | P-value |
| Age (years) | 32.6 ± 9.3 | 32.8 ± 9.6 | 0.706 |
| Gender (Female) | 226 (43.5%) | 124 (44.3%) | 0.881 |
| Gender (Male) | 294 (56.5%) | 156 (55.7%) | 0.881 |
| Lesion Location (LL1) | 181 (34.8%) | 96 (34.3%) | 0.944 |
| Lesion Location (LL2) | 193 (37.1%) | 100 (35.7%) | 0.753 |
| Lesion Location (LL3) | 146 (28.1%) | 84 (30.0%) | 0.623 |
| Active Disease (0) | 168 (32.3%) | 98 (35.0%) | 0.489 |
| Active Disease (1) | 352 (67.7%) | 182 (65.0%) | 0.489 |
| Biologic Use (0) | 317 (61.0%) | 160 (57.1%) | 0.33 |
| Biologic Use (1) | 203 (39.0%) | 120 (42.9%) | 0.33 |
| History of intestinal resection (0) | 362 (69.6%) | 198 (70.7%) | 0.808 |
| History of intestinal resection (1) | 158 (30.4%) | 82 (29.3%) | 0.808 |
| BMI (kg/m²) | 20.5 ± 2.5 | 20.6 ± 2.5 | 0.459 |
| Malnutrition (ESPEN) (0) | 299 (57.5%) | 167 (59.6%) | 0.609 |
| Malnutrition (ESPEN) (1) | 221 (42.5%) | 113 (40.4%) | 0.609 |
| Risk Score | 54.0 ± 31.9 | 52.0 ± 30.7 | 0.412 |
Table 1: Baseline characteristics of the Crohn’s disease training and validation cohorts. Values are presented as mean ± standard deviation for continuous variables and n (%) for categorical variables. P-values were calculated using independent-samples t-tests for continuous variables and χ2 tests for categorical variables. L1 = ileal disease; L2 = colonic disease; L3 = ileocolonic disease according to the Montreal classification for Crohn’s disease. Malnutrition was diagnosed strictly according to the ESPEN 2015 criteria, with low BMI (<18.5 kg/m2) considered a supportive rather than definitive indicator. No statistically significant differences were observed between the training and validation cohorts for any baseline variable (P > 0.05), indicating good baseline comparability and minimizing confounding due to cohort heterogeneity. Risk score represents the cumulative raw nomogram score derived from the sum of individual variable scores (Active Disease: 100 points; History of Intestinal Resection: 90.8 points; L3: 51.1 points; Biologic Use: 41.2 points; BMI: 42.0 points), with a theoretical maximum score of 325.1. The observed risk score range in the training cohort was 4.2–148.6. The scores presented in this table correspond to the raw cumulative score before scaling.
The mean age was 32.6 ± 9.3 years in the training cohort and 32.8 ± 9.6 years in the validation cohort (P = 0.706). Mean BMI was 20.5 ± 2.5 kg/m2 in the training cohort and 20.6 ± 2.5 kg/m2 in the validation cohort (P = 0.459). Mean risk scores were 54.0 ± 31.9 and 52.0 ± 30.7 in the training and validation cohorts, respectively (P = 0.412).
No statistically significant differences were identified for categorical variables. Female patients accounted for 43.5% (226/520) of the training cohort and 44.3% (124/280) of the validation cohort (P = 0.881). The distributions of lesion locations (L1, L2, and L3) were comparable between cohorts, with P values of 0.944, 0.753, and 0.623, respectively. Similarly, no significant differences were observed in disease activity status, biologic use, history of intestinal resection, or ESPEN-defined malnutrition status (all P >0.05).
Multivariable logistic regression analysis for prediction of high-risk malnutrition in Crohn’s disease
Multivariable logistic regression was performed to evaluate predictors of high-risk malnutrition outcomes (Table 2). Lesion location classified as L2 demonstrated a significant negative association with high-risk malnutrition (β = −8.184, OR = 0.000, 95% CI: 0.000–0.002, P < 0.001). The corresponding nomogram score was 0.0 points.
| Variable | β | OR | 95% CI | P-value | Nomogram Score |
| Lesion Location (L2) | -8.184 | 0 | (0.000-0.002) | <0.001 | 0 |
| Lesion Location (L3) | 0.789 | 2.2 | (0.675-7.170) | 0.191 | 51.1 |
| Active Disease | 2.501 | 12.182 | (4.850-30.580) | <0.001 | 100 |
| Biologic Use | -0.942 | 0.39 | (0.150-1.012) | 0.053 | 41.2 |
| History of intestinal resection | 1.82 | 6.17 | (2.350-16.180) | 0.001 | 90.8 |
| BMI | -0.801 | 0.449 | (0.340-0.593) | <0.001 | 42 |
Table 2: Multivariable logistic regression analysis for prediction of high-risk malnutrition in Crohn’s disease. This table summarizes the results of the multivariable logistic regression model for malnutrition risk prediction, including regression coefficient (β), odds ratio (OR), 95% confidence interval (CI), P-value, and corresponding nomogram score for each predictive variable. Variables included lesion location according to the Montreal classification (L2 and L3), active disease status, biologic use, history of intestinal resection, and BMI. A P-value <0.001 indicates a statistically significant predictive effect, whereas the near-significant P-value for biologic use (P = 0.053) suggests a potential protective trend. The extremely negative β coefficient and near-zero OR observed for L2 reflect quasi-complete separation within the dataset because only 8.3% (18/221) of malnourished patients demonstrated isolated colonic involvement. This estimate should therefore be interpreted cautiously. Sensitivity analysis excluding L2 did not significantly alter model discrimination (AUC change <0.02), supporting overall model stability.
Lesion location classified as L3 demonstrated β = 0.789, OR = 2.200, 95% CI: 0.675–7.170, and P = 0.191, indicating no statistically significant association with high-risk malnutrition. The corresponding nomogram score was 51.1 points. Active disease status demonstrated a strong positive association with high-risk malnutrition (β = 2.501, OR = 12.182, 95% CI: 4.850–30.580, P < 0.001). The corresponding nomogram score was 85.0 points. Biologic use demonstrated β = −0.942, OR = 0.390, 95% CI: 0.150–1.012, and P = 0.053, suggesting a potential protective trend that did not reach statistical significance. The corresponding nomogram score was 41.2 points.
History of intestinal resection was identified as a significant predictor of high-risk malnutrition, with β = 1.820, OR = 6.170, 95% CI: 2.350–16.180, and P = 0.001. The corresponding nomogram score was 72.5 points. BMI demonstrated a significant negative association with high-risk malnutrition (β = −0.801, OR = 0.449, 95% CI: 0.340–0.593, P < 0.001). The corresponding nomogram score was 42.0 points.
The extremely large negative coefficient and near-zero OR observed for lesion location L2 suggest quasi-complete separation within the dataset, potentially attributable to the low proportion of L2 lesions among malnourished patients (8.3%, 18/221). This finding should therefore be interpreted cautiously, as it may overestimate the apparent protective effect of L2 lesions. Sensitivity analysis excluding L2 lesion data showed minimal change in model discrimination (AUC change <0.02), supporting the model's overall stability despite this sparse data pattern.
Nomogram analysis for prediction of high-risk malnutrition in Crohn’s disease
A nomogram was constructed to predict high-risk malnutrition in patients with Crohn’s disease (Figure 2). The nomogram presents scaled scores ranging from 0 to 100 for each predictive variable, whereas the theoretical raw total risk score ranged from 0 to 325.1 before scaling.

Figure 2: Nomogram for predicting high-risk malnutrition in patients with Crohn’s disease. This clinically applicable nomogram was constructed using a stacked model combining multivariable logistic regression and machine learning algorithms, including random forests and gradient-boosted decision trees. Variable scores are displayed for each predictive factor, including Active Disease (100 points), History of Intestinal Resection (90.8 points), Lesion Location L3 (51.1 points), Biologic Use (41.2 points), and BMI (42.0 points). For visual clarity, the total nomogram score is scaled to 0–100; however, the actual cumulative raw risk score ranges from 0 to 325.1. Based on the scaled score, patients are stratified into low-risk (≤20 points), moderate-risk (21–40 points), and high-risk (>40 points) categories for individualized nutritional intervention. Please click here to view a larger version of this figure.
Among predictive variables, active disease demonstrated the highest nomogram score (100.0 points), indicating the greatest contribution to high-risk malnutrition prediction. The history of intestinal resection also demonstrated a high contribution, with a score of 90.8 points. Lesion location L3 scored 51.1 points, biologic use scored 41.2 points, and BMI scored 42.0 points, indicating moderate predictive contributions. The L2 lesion location demonstrated the lowest score (0 points).
Risk stratification thresholds were calibrated using percentile distributions of training cohort risk scores. Decision curve analysis demonstrated clinically meaningful net benefit for moderate-risk (21–40 points) and high-risk (>40 points) stratification categories.
Receiver operating characteristic curve analysis of the malnutrition prediction model in Crohn’s disease
Receiver operating characteristic (ROC) curve analysis was performed to evaluate model discrimination in the training and validation cohorts (Figure 3). The area under the curve (AUC) was 0.987 (95% CI: 0.978–0.996) in the training cohort and 0.967 (95% CI: 0.945–0.989) in the validation cohort.

Figure 3: Receiver operating characteristic (ROC) curves of the malnutrition prediction model in the training and validation cohorts. The ROC curves of the malnutrition prediction model are shown for the training cohort (blue) and validation cohort (red; non-overlapping subset). The x-axis represents 1-specificity (false-positive rate), and the y-axis represents sensitivity (true-positive rate). The area under the curve (AUC) was calculated using the trapezoidal method, with 95% confidence intervals estimated using 1,000 bootstrap resamples. The AUC was 0.987 (95% CI: 0.978–0.996) for the training cohort and 0.967 (95% CI: 0.945–0.989) for the validation cohort. The rapid rise of both curves toward the upper-left corner indicates high discriminative ability of the model. Please click here to view a larger version of this figure.
Both ROC curves rose sharply toward the upper-left corner, indicating strong discriminatory ability for distinguishing malnourished from non-malnourished patients. Although the AUC was slightly lower in the validation cohort, discrimination remained high (>0.95), indicating stable model performance in the hold-out validation dataset.
Despite strong performance metrics, these findings should be interpreted cautiously, as exceptionally high AUC values may indicate overfitting. Further validation in larger, multicenter, and multi-ethnic cohorts is required to confirm the model's robustness and generalizability.
Calibration curve analysis of the high-risk malnutrition prediction model in Crohn’s disease
Calibration curves were used to evaluate agreement between predicted and observed probabilities of high-risk malnutrition (Figure 4). Calibration curves for both the training cohort and validation cohort were closely aligned with the ideal calibration line.

Figure 4: Calibration curves for the malnutrition prediction model. The calibration curves evaluate agreement between predicted and observed probabilities of malnutrition. The x-axis represents predicted probability derived from the model, and the y-axis represents the observed proportion of malnourished patients. The black dashed line indicates ideal calibration. Calibration slopes were 0.98 for the training cohort and 0.94 for the validation cohort. Corresponding Brier scores were 0.087 and 0.102, respectively, where slopes approaching 1 and Brier scores <0.25 indicate good calibration performance. The curves closely align with the ideal calibration line at predicted probabilities >0.4, indicating good calibration performance in high-risk patients. Please click here to view a larger version of this figure.
The calibration slope was 0.98 (95% CI: 0.95–1.01) in the training cohort and 0.94 (95% CI: 0.89–0.99) in the validation cohort. Minor deviations from the ideal calibration line were observed at low predicted probabilities, whereas close agreement was observed at moderate and high predicted probabilities (>0.4).
The Brier score was 0.087 (95% CI: 0.072–0.102) in the training cohort and 0.102 (95% CI: 0.085–0.119) in the validation cohort, indicating acceptable prediction error. No statistically significant difference in calibration performance was observed between cohorts (P = 0.32).
Decision curve analysis of the malnutrition prediction model in Crohn’s disease
Decision curve analysis (DCA) was performed to evaluate the clinical utility of the prediction model across a range of threshold probabilities (Figure 5). A positive net benefit was observed in both cohorts within the clinically relevant probability range of 0.10–0.60.

Figure 5: Decision curve analysis (DCA) of the malnutrition prediction model. Decision curve analysis was performed to evaluate the clinical utility of the malnutrition prediction model. The x-axis represents the threshold probability for identifying a patient as being at risk of malnutrition (range: 0.0–1.0), and the y-axis represents net benefit. Net benefit was calculated as follows: (true positives / N) – (false positives / N) × (threshold / [1 – threshold]). The model net benefit is shown for the training (blue) and validation (red) cohorts and compared with two reference strategies: “treat all” (gray) and “treat none” (black). Positive net benefit was observed across threshold probabilities of 0.10–0.60, supporting the clinical utility of the model for guiding nutritional intervention decisions. Please click here to view a larger version of this figure.
At a threshold probability of 0.20, the net benefit was 0.42 in the training cohort and 0.39 in the validation cohort. At a threshold probability of 0.40, the net benefit was 0.38 and 0.35 in the training and validation cohorts, respectively. Net benefit decreased substantially at threshold probabilities >0.60, and greater fluctuation was observed within the validation cohort at threshold probabilities between 0.80 and 1.00.
Overall, the model demonstrated clinically meaningful net benefit within threshold probabilities of 0.10–0.60, supporting its potential utility for guiding nutritional risk intervention in patients with Crohn’s disease.
DATA AVAILABILITY:
Key characteristics of all 17 included studies, including author, publication year, country, sample size, study design, and malnutrition diagnostic criteria, are summarized in Supplementary Table 1. Supplementary Table 2 contains the complete “Basic Study Information” worksheet used for data extraction, including study characteristics, study design, quality assessment scores, extracted effect sizes, and raw meta-analysis data for all included studies. The datasets generated and analyzed during the current study are provided in Supplementary File 3.
Supplementary Table 1: Characteristics of the 17 included studies in the meta-analysis. Supplementary Table 1 summarizes the characteristics of the 17 studies included in the meta-analysis. For each study, the table reports the first author, publication year, country, study design, total sample size (N_Total), number of malnourished patients (N_Malnourished), malnutrition diagnostic criteria applied (e.g., MUST score ≥2, BMI <18.5 kg/m2, GLIM criteria, ESPEN criteria, or SGA Grade B/C), and reported malnutrition prevalence presented as effect size with corresponding 95% confidence interval. The studies represent diverse geographic regions and clinical settings and, collectively, provide the evidence base for predictive variable selection in the malnutrition risk model for Crohn’s disease.Please click here to download this file.
Supplementary Table 2: Basic Study Information worksheet for included studies in the meta-analysis. Supplementary Table 2 contains the complete “Basic Study Information” worksheet used for data extraction. The worksheet includes study characteristics, study design, quality assessment scores, extracted effect sizes, and raw meta-analysis data for all included studies.Please click here to download this file.
Supplementary File 3. Datasets used in the present study. Please click here to download this file.