$$\rightleftharpoonup{xx}$$
$$\longleftharp{xx}$$,
$$\longrightharp{xx}$$,
The key findings of the proposed study are presented here, encompassing various analyses conducted to elucidate the molecular mechanisms underlying NAFLD and MI.
Identification of DEGs
In the GSE89632 dataset, 76 up-regulated and 20 down-regulated genes were identified as NAFLD-DEGs (Figure 2B,D), while the GSE66360 dataset revealed 118 up-regulated and 8 down-regulated genes as MI-DEGs (Figure 2C,E). Subsequently, co-DEGs were derived from the overlap between NAFLD-DEGs and MI-DEGs, identifying 13 up-regulated DEGs and 1 down-regulated DEG among the co-DEGs (Figure 2A).
Identifying DEGs unique to NAFLD and MI, as well as those common to both disorders, provides valuable insights into the molecular processes underlying these illnesses. Understanding the distinct and shared gene expression patterns linked to both conditions helps clarify the pathophysiology of NAFLD and MI and may identify potential molecular targets for therapeutic intervention. Further investigation into the co-DEGs could reveal critical regulators or pathways essential to the interaction between these two disorders, facilitating the development of targeted therapies aimed at mitigating the negative health outcomes associated with NAFLD and MI.
Enrichment analysis
GO analysis revealed that NAFLD DEGs were primarily enriched in the following biological processes (BP): fat cell differentiation, response to lipopolysaccharide, and multicellular organism processes (p < 0.05, Figure 3A). For cellular components (CC), NAFLD DEGs were mainly associated with the phosphatidylinositol 3-kinase complex, secretory granule lumen, and cytoplasmic vesicle lumen (p < 0.05, Figure 3A). In terms of molecular function (MF), the key activities included phosphatidylinositol-3-kinase regulator activity, DNA-binding transcription activator activity (RNA polymerase-specific), and general DNA-binding transcription activator activity (p < 0.05, Figure 3A).
For MI-DEGs, the BP analysis showed enrichment in leukocyte chemotaxis, myeloid leukocyte activation, and cell chemotaxis (p < 0.05, Figure 3B). The CC analysis highlighted enrichment in tertiary granules, ficolin-1-rich granules, and secretory granule lumen (p < 0.05, Figure 3B). MF analysis revealed enrichment in pattern recognition receptor activity, RAGE receptor binding, and immune receptor activity (p < 0.05, Figure 3B).
KEGG enrichment analysis indicated that NAFLD-DEGs were significantly enriched in the following pathways: IL-17, AGE-RAGE, TNF, osteoclast differentiation, malaria, rheumatoid arthritis, Chagas disease, and JAK-STAT signaling pathways (p < 0.05, Figure 3C). MI-DEGs were enriched in IL-17, TNF, lipid metabolism and atherosclerosis, toll-like receptor signaling, C-type lectin receptor signaling, legionellosis, rheumatoid arthritis, and NF-κB signaling pathways (p < 0.05, Figure 3D). Disease Ontology (DO) enrichment analysis demonstrated that the top three DEGs for both NAFLD and MI were significantly associated with atherosclerosis, arteriosclerotic cardiovascular disease, and arteriosclerosis (p < 0.05, Figure 3E-F).
PPI network diagrams analysis
In the PPI network diagram for NAFLD, 92 nodes and 250 edges were identified, reflecting a complex network of interactions among proteins associated with this condition. Similarly, the PPI network diagram for MI revealed 117 nodes and 587 edges, indicating a more intricate network characteristic of the molecular landscape of myocardial infarction. Additionally, the PPI network diagrams for co-DEGs identified 14 nodes and 21 edges, highlighting potential key interactions among proteins shared between NAFLD and MI (Figure 4A-F).
PPI networks are valuable for constructing and analyzing therapeutic targets and provide important insights into the molecular connections underlying NAFLD and MI. The unique network topologies observed in NAFLD and MI emphasize the distinct pathophysiological mechanisms involved in these diseases. Moreover, the discovery of shared connections between co-DEGs suggests potential shared pathways or regulatory networks that mediate the interaction between MI and NAFLD. Further research into these connections could reveal novel molecular targets for intervention and therapy development, potentially offering new treatment options for these complex illnesses.
Candidate hub DEGs screening
The SVM-RFE algorithm17 was employed to screen for potential crucial DEGs, resulting in the identification of 6 genes in GSE89632 and 4 genes in GSE66360 (Figure 5A,B). Similarly, the LASSO algorithm19 was used to identify candidate crucial DEGs, yielding 4 genes in GSE89632 and 5 genes in GSE66360 (Figure 5C,D). Notably, a Venn diagram depicted the intersection of the hub DEG (THBS1) identified by SVM-RFE, LASSO, and co-PPI network analyses, underscoring its significance in both NAFLD and MI pathogenesis (Figure 5F).
These advanced machine learning algorithms provided a robust framework for identifying key DEGs associated with MI and NAFLD and offered valuable insights into their potential as therapeutic or diagnostic targets. The identification of THBS1 as a hub gene through multiple analytical techniques highlights its importance in the molecular pathways underlying both diseases. Further research into the functional significance of THBS1 and its interactions within the co-PPI network could elucidate novel mechanisms linking NAFLD and MI, potentially paving the way for targeted therapies and precision medicine strategies.
Diagnostic value of one hub DEGs
ROC analysis was conducted to assess the diagnostic efficiency of the identified hub DEGs in both NAFLD and MI datasets. As depicted in Figure 6A,B, THBS1 demonstrated high discriminatory power, with an AUC of 0.981 (95% CI: 0.949-1.000) in NAFLD patients and 0.900 (95% CI: 0.834-0.956) in MI patients. The strong performance of THBS1 as a diagnostic biomarker highlights its potential usefulness in differentiating between NAFLD, MI, and healthy controls.
These results suggest that THBS1 could serve as a valuable marker for disease state, offering clinicians a useful tool for risk assessment and early identification. To facilitate the integration of THBS1 into clinical practice and improve patient care and management, further validation studies are needed to confirm its diagnostic accuracy across diverse patient populations and clinical settings.
Immune infiltration analysis
ssGSEA was conducted to analyze 29 immune-related genes between NAFLD/MI and the control group (CON). Compared to the CON group, immune cell infiltration was significantly increased in the NAFLD and MI groups, particularly for CCR, MHC class I, neutrophils, parainflammation, and Tfh cells (p < 0.05, Figure 7A,B). Conversely, immune cell infiltration decreased notably in the NAFLD and MI groups relative to the CON group, especially for CD8+ T cells, cytolytic activity, and tumor-infiltrating lymphocytes (TIL) (p < 0.05, Figure 7C,D).
These results suggest dysregulation of immune cell infiltration in both NAFLD and MI, highlighting the complex immunological landscape associated with these conditions. The observed changes in immune cell composition provide valuable insights into the immunological processes underlying the development and pathophysiology of these diseases. Further understanding of these immune characteristics could reveal new therapeutic targets for intervention and support the development of individualized treatment plans based on the immunological profiles of patients with NAFLD and MI.

Figure 1: The design sequence of the medical decision support system for identifying THBS1. Please click here to view a larger version of this figure.

Figure 2: Venn diagrams, volcano diagrams, and heat maps. (A) Venn diagram for GSE89632 and GSE66360. (B) A volcano diagram of differentially expressed genes (DEGs) in GSE89632. (C) A volcano diagram of DEGs in GSE66360. (D) A heat map diagram of DEGs in GSE89632. (E) A heat map diagram of DEGs in GSE66360. Please click here to view a larger version of this figure.

Figure 3: Enrichment analysis of differentially expressed genes (DEGs). (A) The top five NALFD- DEGs enriched gene ontology (GO) in biological process (BP), cellular component (CC), and molecular function (MF). (B) The top five MI-DEGs enriched GO in BP, CC and MF. (C) The top eight NALFD-DEGs enriched Kyoto encyclopedia of genes and genomes (KEGG) signaling pathways. (D) The top eight MI-DEGs enriched KEGG signaling pathways. (E) The top five NALFD-DEGs enriched disease ontology (DO). (F) The top five MI-DEGs enriched DO. Please click here to view a larger version of this figure.

Figure 4: Protein-protein interaction (PPI) network diagrams. (A) A PPI network diagram of non-alcoholic fatty liver disease (NAFLD) obtained from the String online platform. (B) A PPI network diagram of NAFLD obtained from Cytoscape. (C) A PPI network diagram of myocardial infarction (MI) obtained from String online platform. (D) A PPI network diagram of MI obtained Cytoscape. (E) A PPI network diagram of protein interactions. (F) A PPI network diagram of IL1B and FOS. Please click here to view a larger version of this figure.

Figure 5: Candidate hub DEGs screening. (A) Six crucial DEGs obtained by SVM-RFE in NAFLD. (B) Four crucial DEGs obtained by SVM-RFE in myocardial infarction (MI). (C) Four crucial DEGs obtained by LASSO in NAFLD. (D) Five crucial DEGs obtained by LASSO in MI. (E) Only one DEG acquired from the above results in the Venn diagram. Please click here to view a larger version of this figure.

Figure 6: Diagnostic value of one hub differentially expressed genes (DEGs). (A) ROC curves in NAFLD. (B) ROC curves in MI. Please click here to view a larger version of this figure.

Figure 7: Distribution of infiltrating immune cells and correlation between THBS1 and infiltrating immune cells in non-alcoholic fatty liver disease (NAFLD) and myocardial infarction (MI). (A) Distribution of infiltrating immune cells in NAFLD. (B) Distribution of infiltrating immune cells in MI. (C) Correlation between THBS1 and infiltrating immune cells in NAFLD. (D) Correlation between THBS1 and infiltrating immune cells in MI. Please click here to view a larger version of this figure.
| Abbreviation | Nomenclature |
| AUC | Area Under Curve |
| BP | Biological Process |
| CC | Cellular Component |
| CON | Controls |
| DEG | Differentially Expressed Genes |
| DO | Disease Ontology |
| GO | Gene Ontology |
| KEGG | Kyoto Encyclopedia of Genes and Genomes |
| LASSO | Least Absolute Shrinkage and Selection Operator |
| MF | Molecular Function |
| MI | Myocardial Infarction |
| NAFLD | Non-Alcoholic Fatty Liver Disease |
| PPI | Protein-Protein Interaction |
| ROC | Receiver Operator Characteristic Curves |
| ssGSEA | single-sample Gene Set Enrichment Analysis |
| SVM-RFE | Support Vector Machine-Recursive Feature Elimination |
| THBS1 | Thrombospondin 1 |
Table 1: Simulation parameters used in the experimentation.