$$\rightleftharpoonup{xx}$$
$$\longleftharp{xx}$$,
$$\longrightharp{xx}$$,
In this study, five TNF inhibitors identified from real-world pharmacovigilance data were integrated with Mendelian randomization analyses to evaluate whether TNF pathway biomarkers are causally associated with hip fracture risk. The findings consistently indicated that no significant direct causal association between TNF-α and hip fracture risk was identified, and additional analyses of sTNFR1 and sTNFR2 yielded concordant results. These results support the value of combining spontaneous-report safety data with genetic causal inference when randomized trials are unavailable or impractical. Beyond these empirical findings, this study provides a reproducible protocol framework for integrating spontaneous-report pharmacovigilance data with genetically informed causal inference. In this workflow, FAERS is used to characterize real-world adverse event reporting patterns, whereas Mendelian randomization is used to assess whether a biomarker or pathway is likely to have an independent causal effect on the clinical outcome. Therefore, this workflow is intended not only to evaluate a specific TNF-related research question but also to demonstrate a generalizable methodological strategy for safety-signal assessment and causal triangulation when randomized trials are unavailable, infeasible, or ethically difficult to conduct.
Compared with conventional cohort and case-control studies, this combined workflow is less dependent on long-term prospective follow-up and can rapidly generate real-world safety evidence from post-marketing data48. However, observational studies may provide richer clinical information, including disease severity, medication duration, comorbidities, and confounder adjustment49. Compared with randomized controlled trials, the FAERS-Mendelian randomization workflow is more feasible for rare or long-latency outcomes such as hip fracture, although it cannot replace trial-level evidence when randomized evidence is available50. Compared with pharmacovigilance analysis alone, the addition of Mendelian randomization provides a genetically informed causal component that helps distinguish a reporting signal from a potentially causal biological pathway32. Compared with mendelian randomization alone, the inclusion of FAERS adds real-world post-marketing context, including drug-specific reporting patterns and safety surveillance information51. Therefore, the main value of this workflow lies in methodological triangulation rather than in replacing any single established design.
Several common failure points should be considered when applying this workflow. In the FAERS component, inconsistent raw data structures across quarterly files are a frequent source of error. Column names, file formats, and table structures may vary across reporting periods, which may lead to import errors or incorrect table merging52. This can be addressed by checking raw headers before merging, removing empty trailing columns, and using flexible merging functions that allow non-identical columns. Another common problem is inconsistent drug naming, because drugs may be reported using generic names, brand names, active ingredients, abbreviations, or spelling variants53. Therefore, drug names and active ingredient fields should be standardized before matching, preferably using uppercase conversion, punctuation removal, and a predefined generic-name dictionary. Duplicate reporting is another critical issue. If duplicate CASEID records are not removed, disproportionality estimates may be inflated54. A transparent deduplication rule should therefore be applied, such as retaining the report with the latest FDA receipt date and, if tied, retaining the highest PRIMARYID.
In the Mendelian randomization component, common failure points include too few available instrumental variables, weak instrument bias, linkage disequilibrium among selected SNPs, allele mismatch between exposure and outcome datasets, ambiguous palindromic SNPs, incomplete outcome SNP matching, horizontal pleiotropy, sample overlap, and ancestry mismatch55,56. These issues may lead to unstable or biased causal estimates. To reduce these risks, investigators should prespecify the SNP selection threshold, perform LD clumping, calculate F-statistics, remove weak instruments, harmonize effect alleles carefully, exclude ambiguous palindromic SNPs when strand orientation cannot be determined, and conduct heterogeneity and pleiotropy tests. If too few SNPs remain after filtering, a relaxed threshold may be considered, but the decision should be justified and accompanied by sensitivity analyses.
This workflow can be modified according to the characteristics of the data source, exposure, outcome, and study objective. If the target adverse event is rare, investigators may broaden the adverse-event definition by including related MedDRA Preferred Terms or Standardized MedDRA Queries, while also performing sensitivity analyses using both narrow and broad definitions. If the target drug has multiple synonyms, biosimilars, or active ingredients, a curated drug dictionary should be constructed before querying FAERS. If the research question focuses on a drug class rather than a single agent, both drug-specific and pooled analyses should be reported. If reporter type, age group, sex, country, or reporting year is central to the research question, stratified disproportionality analyses may be added when the sample size is sufficient.
The Mendelian randomization component should also be adapted to the available genetic data. If GWAS data for the exact biomarker are unavailable, proxy traits, soluble receptors, cis-protein quantitative trait loci, or pathway-related biomarkers may be considered, but the biological interpretation should be limited accordingly. If exposure and outcome GWAS datasets differ in ancestry, ancestry-specific analyses should be prioritized to reduce population stratification57. If only a small number of SNPs are available, MR-Egger may be unstable, and greater emphasis should be placed on inverse variance weighted analysis, weighted median analysis, leave-one-out analysis, and biological plausibility of individual instruments58. If outcome data are binary, causal estimates should be interpreted on the odds-ratio scale. If sample overlap is suspected, this should be acknowledged and assessed through alternative outcome datasets or sensitivity analyses when possible59.
Descriptive analysis of the data revealed that a higher proportion of hip fracture reports was observed in women than in men. This pattern may be related to sex differences in bone mineral density, hormonal changes after menopause, and age-related skeletal fragility60,61,62. In particular, the present study revealed that hip fracture accounted for the largest proportion of people over 65 years of age when the vast majority of women stopped menstruating, with a significant decline in estrogen levels, abnormal bone metabolism, etc., leading to a decrease in bone density, which further validates that the risk of hip fracture may be greater in women than in men63,64,65. The reporting population is mostly medicine-related, and although the participation of professionals can improve the accuracy and standardization of reporting, the lack of public participation may lead to some unreported side effects and potential drug safety hazards due to low public awareness and sensitivity to adverse drug reactions. In the future, more attention needs to be given to improving ADR monitoring, especially how to increase public participation, which can be accomplished by increasing public education, improving communication channels, and setting up a convenient reporting mechanism66,67. The analysis of the reporting countries reveals that most of the reporting countries are in the United States, which suggests that international cooperation and information sharing may need to be strengthened in the future. The number of ADR reports in the United States may reflect higher reporting activity, broader utilization of the FAERS system, or stronger pharmacovigilance infrastructure, but globally, ensuring drug safety still requires close collaboration among countries68,69. By strengthening the integration and sharing of global adverse drug reaction data and promoting the standardization of international drug safety assessments, drug safety risks can be better prevented and controlled and public health can be improved70.
According to existing studies, TNF-α is an important mediator of inflammatory bone remodeling and may directly promote osteoclast activity, bone resorption and skeletal fragility. However, the presence of these biological effects does not necessarily indicate that circulating TNF-α levels alone constitute an independent direct causal determinant of hip fracture risk in the general population71,72,73. For example, studies have shown that the expression level of IL-1β, a more common inflammatory factor, may change during fracture74,75. The upregulation of IL-1β further activates osteoclasts and increases bone resorption, which has the potential to lead to an increased risk of hip fracture76,77,78. Moreover, TNF-α can enhance the inflammatory response by inducing the release of IL-1β from immune cells such as macrophages and monocytes79,80,81. In addition, TNF-α can also increase IL-1β expression by activating the NF-κB pathway, which in turn promotes IL-1β transcription82,83. These findings suggest that changes in circulating TNF-α levels alone may not constitute an independent direct determinant of hip fracture risk. However, TNF-α may still contribute to skeletal fragility indirectly through regulation of IL-1β expression and broader inflammatory pathways. Moreover, the increased risk of fracture is more likely related to changes in hormone levels, age, and long-term use of medications such as steroids rather than to a single factor such as TNF-α alone84,85,86. This framework may also help explain why TNF inhibitors have been associated with reduced fracture risk in inflammatory diseases such as rheumatoid arthritis and ankylosing spondylitis87. The protective effects of TNF inhibitors may reflect multiple mechanisms beyond direct blockade of TNF-α alone, including improved disease control, reduced chronic systemic inflammation, lower glucocorticoid exposure, improved physical function, and attenuation of inflammation-related bone loss88,89. Therefore, treatment benefits observed in specific inflammatory populations should not be directly interpreted as evidence that TNF-α alone is a universal direct causal determinant of hip fracture risk.
The present findings have important practical implications. No significant direct causal association was identified between TNF pathway biomarkers and hip fracture risk, suggesting that circulating TNF-α, sTNFR1, and sTNFR2 may have limited value as independent predictors or direct intervention targets for hip fracture in clinical practice. This result indicates that fracture risk assessment should not rely excessively on a single inflammatory biomarker, but should instead continue to focus on established determinants such as age, sex, bone mineral density, hormonal status, falls, comorbidities, and medication exposure. In addition, these findings suggest that the clinical benefits of TNF inhibitors observed in certain inflammatory diseases may reflect broader anti-inflammatory and disease-control effects rather than a direct one-to-one causal relationship between TNF-α and hip fracture. Therefore, the present study provides a more cautious basis for interpreting TNF-related biomarkers in fracture research and may help guide future studies toward broader inflammatory networks and multifactorial risk mechanisms. From a workflow perspective, this protocol also provides a structured approach for evaluating whether post-marketing safety concerns are supported by genetically informed causal evidence. It may be particularly useful for hypothesis prioritization, safety signal interpretation, and mechanistic triangulation.
The present study also has the following limitations. First, potential confounding introduced by underlying autoimmune diseases and concomitant glucocorticoid use among patients treated with TNF-α inhibitors could not be fully assessed. Because FAERS does not comprehensively capture indications, disease severity, cumulative glucocorticoid exposure, or co-medication information, subgroup and sensitivity analyses were not feasible90. However, the complementary Mendelian randomization analyses were conducted using genetic variants as instrumental variables for TNF-α, sTNFR1, and sTNFR2 levels. As genetic variants are randomly allocated at conception and remain largely stable throughout life, this design is less susceptible to conventional confounding factors and reverse causation91. Second, FAERS reports data from a variety of sources, and the quality of the data may vary depending on the professional competence of the reporter, the attention given to the report, and the level of detail in the report. Therefore, this study standardized the data via the MedDRA lexicon version 26.1 and validated Mendelian randomization. Finally, FAERS data rely mainly on the reporting of adverse drug reactions, which often do not fully establish the causal relationship between drugs and adverse reactions; therefore, this paper used four algorithms for the correction and testing of FAERS data, and Mendelian randomization was used for further validation with the aim of strengthening the reliability and stability of the results. Nevertheless, the FAERS and Mendelian randomization components should be interpreted as complementary rather than interchangeable. FAERS provides real-world safety-surveillance evidence, whereas Mendelian randomization provides genetically informed causal inference under specific instrumental-variable assumptions.
This workflow can be adapted to other drug classes, adverse events, biomarkers, and disease outcomes. In pharmacovigilance research, it can be applied to other spontaneous reporting systems or extended by integrating electronic health records, claims databases, disease registries, or hospital-based adverse event monitoring systems. In causal inference research, the Mendelian randomization component can be extended to multivariable Mendelian randomization, two-step Mendelian randomization, drug-target Mendelian randomization, cis-Mendelian randomization, colocalization analysis, or phenome-wide Mendelian randomization to investigate pathway-specific mechanisms. The workflow may also serve as a screening framework for prioritizing safety signals that require prospective validation, mechanistic experiments, or enhanced clinical surveillance. Future studies could further improve this protocol by automating FAERS data cleaning, standardizing drug dictionaries, embedding reproducibility checkpoints, and developing open-source pipelines that automatically generate workflow-verification outputs.
In summary, this study did not identify a significant direct causal association between TNF pathway biomarkers and hip fracture risk. More importantly, it provides a reproducible FAERS Mendelian randomization workflow for integrating post-marketing safety surveillance with genetic causal inference. The value of this protocol lies in its ability to evaluate safety signals, test biological plausibility, identify common failure points, and guide methodological adaptation across different data environments and research questions.