$$\rightleftharpoonup{xx}$$
$$\longleftharp{xx}$$,
$$\longrightharp{xx}$$,
Data
The CHARLS database is based on a survey of health, economic, and social activities in China33. All the CHARLSs received ethical approval from the Institutional Review Board (IRB) of Peking University. The IRB approval number for the main household survey (including anthropometrics) was IRB0000105211015. In the current study, the authors applied the 2015 survey. The sample consisted of 21,095 participants. Participants aged ≤45 years or with missing age information were excluded. Participants with missing data for the three core variables, recurrent falls, functional limitation score, and knee osteoarthritis, were then excluded. Finally, participants with missing covariate data were further excluded to construct a complete-case analytical dataset. All the materials and tools used for this study are listed in the Table of Materials.
Key variable indicators
Recurrent falls
Fall was a participant self-reported outcome34, which was assessed on the basis of "Have you experienced any falls since your last visit?" The participants provided binary responses, indicating either "yes" or "no." In the present analysis, participants who answered “yes” were classified as having recurrent falls, and those who answered “no” were classified as not having recurrent falls.
Knee osteoarthritis
The outcome of symptomatic knee osteoarthritis was determined if the participant answered "yes" to the first of the following questions and "knee" to the second question35 : “Are you often troubled with any body pain?” and “On what part of your body do you feel pain? Please list all parts of the body you are currently feeling pain in.” Thus, knee osteoarthritis in this study was operationalized using self-reported knee pain information from CHARLS 2015. This definition was used because, owing to the data-collection design of CHARLS 2015, radiographic assessment and physician-diagnosed knee osteoarthritis were not available in the dataset used for this analysis.
Functional limitations
Functional limitations were assessed using activities of daily living (ADL) and instrumental activities of daily living (IADL) items from CHARLS 201536. ADL items included difficulties in dressing, bathing, eating, getting in and out of bed, toileting, and controlling urination and defecation. IADL items included difficulties in housework, cooking, shopping, managing money, and taking medication. Each item was scored on a four-point scale from 1, indicating no difficulty, to 4, indicating inability to perform the activity. The 11 ADL and IADL items were summed to generate a functional limitation score ranging from 11 to 44. Higher scores indicated more severe functional limitations. The functional limitation score was treated as a continuous mediator in the mediation analysis. In addition, ADL disability and IADL disability were constructed as binary covariates. ADL disability was coded as present if the participant reported being unable to perform at least one ADL activity. IADL disability was coded as present if the participant reported being unable to perform at least one IADL activity. These two binary variables were included as covariates in the regression models, whereas the summed functional limitation score was used as the mediator.
Covariates
The covariates included sex, age, drinking status, smoking status, body mass index category, night sleep duration, depression status, cognition score, ADL disability, and IADL disability. Sex was categorized as male or female. Age was analyzed as a continuous variable. Drinking status was coded as yes if participants reported drinking alcoholic beverages, either more than once per month or less than once per month, and no if they reported no alcohol consumption. Smoking status was coded as yes if participants reported a history of smoking and no otherwise. Body mass index was categorized as underweight, normal, overweight, or obese. Night sleep duration was categorized as <6 h or ≥6 h.
Depression status was assessed using the 10-item depressive symptom scale from CHARLS 201537. The items asked whether, during the past week, participants were bothered by things that usually did not bother them, had difficulty concentrating, felt depressed, felt that everything was an effort, felt hopeful about the future, felt fearful, had restless sleep, felt happy, felt lonely, or could not get going. Each item was originally scored on a four-point frequency scale from 1 to 4. For the eight negatively worded items, responses were converted to scores from 0 to 3 by subtracting 1 from the original response. The two positively worded items, feeling hopeful about the future and feeling happy, were reverse-scored. The total depression score ranged from 0 to 30, with higher scores indicating more severe depressive symptoms. Participants were categorized into depression score <10 and ≥10 groups, with a score ≥10 indicating a higher depressive symptom burden.
Cognition score was calculated using cognition-related items from CHARLS 201538. These items included correct identification of the year, month, date, day of the week, and season; five serial subtraction tasks; and a figure-drawing task. For the serial subtraction task, participants were asked to subtract 7 from 100 in successive steps, and correct answers to the five steps were scored separately. Correct responses were scored as 1 and incorrect responses as 0. The total cognition score was calculated by summing all item scores, with a possible range from 0 to 11. Higher scores indicated better cognitive performance.
Statistical analysis
First, a complete-case analytical dataset was constructed by excluding participants younger than 45 years and those with missing data for recurrent falls, functional limitation score, knee osteoarthritis, or covariates included in the regression models. The characteristics of the study population were summarized as means and standard deviations for continuous variables and frequencies and percentages for categorical variables. Baseline characteristics were compared between female and male participants using Student’s t-test for continuous variables and chi-square tests for categorical variables. Spearman correlation analysis was performed to examine the correlations among recurrent falls, functional limitation score, and knee osteoarthritis.
Because knee osteoarthritis was treated as a binary outcome, multivariable logistic regression was used to examine the association between recurrent falls and knee osteoarthritis. The functional limitation score was treated as a continuous mediator; therefore, linear regression was used to evaluate the association between recurrent falls and the functional limitation score. Model 1 included recurrent falls as the main independent variable and adjusted for sex, age, drinking status, smoking status, body mass index category, night sleep duration, depression status, cognition score, ADL disability, and IADL disability. Model 2 included the same covariates as Model 1 and additionally included the functional limitation score.
The mediation analysis was conducted using a bootstrap mediation framework based on a linear mediator model and a logistic outcome model. In this model, recurrent falls were defined as the independent variable, functional limitation score as the mediator, and knee osteoarthritis as the dependent variable. The mediation analysis estimated the indirect effect, direct effect, total effect, and proportion mediated. Path a represented the association between recurrent falls and functional limitation score, and path b represented the association between functional limitation score and knee osteoarthritis after adjustment for recurrent falls and covariates. The direct effect represented the association between recurrent falls and knee osteoarthritis after functional limitation score was included in the model.
Bootstrap resampling with 5,000 simulations was used to estimate the total, direct, and indirect effects and their 95% confidence intervals. The authors used 5,000 bootstrap simulations to improve the stability of confidence interval estimation for the indirect effect, which is often non-normally distributed. The mediation effect was considered statistically significant when the 95% bootstrap confidence interval did not include zero. Statistical significance was defined as a two-sided p value <0.0539.
All statistical analyses were conducted using R software version 4.4.1 on Windows 10 x64. The main R packages used included tidyverse version 2.0.0, dplyr version 1.1.4, tableone version 0.13.2, psych version 2.5.6, mediation version 4.5.1, and ggplot2 version 3.4.4. Data extraction and variable construction were performed using prespecified R scripts according to the CHARLS 2015 variable dictionary and questionnaire documentation. Reproducibility was ensured by using prespecified scripts and recording the R session information. Statistical analyses were performed by the study authors, and the analytical outputs were checked against the generated tables and figures. In addition, the statistical workflow, model selection, and main analytical outputs were reviewed by a professor with expertise in statistical methodology from the author’s university to further ensure the appropriateness and reliability of the analyses.