$$\rightleftharpoonup{xx}$$
$$\longleftharp{xx}$$,
$$\longrightharp{xx}$$,
Transcriptional subgroups in atopic dermatitis
RNA-seq data from 266 AD patient samples were analyzed to investigate transcriptional heterogeneity within the disease. After quality control and correction for batch effects across multiple studies, unsupervised consensus clustering revealed two distinct molecular subgroups (Figure 1A). Cluster stability and optimal cluster number were assessed using the cumulative distribution function (CDF) plot (Figure 1B), delta area plot (Figure 1C), and consensus matrix heatmap (Figure 1D). Together, these results support the existence of two robust transcriptional subtypes in AD, reflecting underlying genetic heterogeneity.
Differentially expressed genes between AD subgroups
The t-SNE plot based on the normalized gene expression matrix further validated the transcriptional subgroups identified by consensus clustering. The t-SNE plot revealed two well-separated clusters, each corresponding to one of the previously defined subgroups (Figure 2A), supporting the presence of distinct molecular profiles among AD patients. Differential expression between the two subgroups was then analyzed using DESeq2, with a threshold of adjusted p < 0.01 and |log₂ fold change| > 1. The resulting volcano plot (Figure 2B) showed differentially expressed genes (DEGs), indicating strong transcriptional divergence. The top 10 up-regulated genes are ABHD2, ADAR, ADCY3, ADCY9, ADD1, ADIPOR2, AFF1, AGFG1, AGRN and AHNAK in cluster 1 and C2orf68, CTTN, GPR108, HERPUD1, LRPAP1, MAP1LC3B2, NKIRAS2, NR1H2, PDE5D and PMPCA in cluster 2 (Figure 2C).
AD subgroup associated gene set
Gene Set Enrichment Analysis (GSEA) revealed distinct functional enrichment profiles between the two transcriptomic clusters (Figure 3A). Cluster 1 exhibited significant enrichment for pathways involved in cell signaling and adhesion, including focal adhesion (Figure 3B) and MAPK signaling pathway (Figure 3C), suggesting an active state characterized by enhanced cell-cell-extracellular matrix interactions and proliferation. In contrast, Cluster 2 showed strong enrichment for oxidative phosphorylation (Figure 3D) and proteasome function (Figure 3E), suggesting an active oxidative and proteolytic metabolic phenotype.
Co-expressed genes in AD molecular groups
To identify co-expression modules associated with transcriptomic subtypes, WGCNA was performed after data preprocessing. Outlier samples were first identified and removed based on hierarchical clustering of sample distances to ensure the robustness of downstream network construction (Figure 4A). A soft-thresholding power was then selected using the scale-free topology criterion, with a power of 6 chosen to achieve a scale-free R2 > 0.85 (Figure 4B). Gene modules were identified through hierarchical clustering and dynamic tree cutting, followed by eigengene clustering to merge closely related modules (Figure 4C and Figure 4D). The resulting gene network was visualized using a heatmap of topological overlap, confirming the presence of distinct gene co-expression patterns (Figure 4E). Module-trait relationship analysis revealed a strong and significant correlation between the MEyellow module (Ngene = 743) and the molecular sub-group (Figure 4F).
Functional enrichment of the AD subgroup associated mitochondrial genes
GO and KEGG enrichment analysis was then performed on the intersected genes (N = 85) among DEGs, genes in the MEyellow module, and a list of mitochondrial proteins from MitoCarta3.0 (Figure 5A) to investigate the functional roles of mitochondrial-associated genes driving the transcriptional differences between clusters. KEGG pathway analysis identified enrichment in oxidative phosphorylation and metabolic pathways (Figure 5B). GO enrichment analysis revealed significant overrepresentation of terms associated with mitochondrial function, including proton motive force-driven mitochondrial ATP synthesis, respiratory chain complex, and NADH dehydrogenase activity (Figure 5C), suggesting that these genes are mainly associated with mitochondrial energy metabolism and energy regulation.
Hub mitochondrial genes in AD molecular differentiation
To identify the key genes in the 85 mitochondrial transcriptome-subgroup-associated genes, a PPI network was constructed (Figure 6A). Based on the ranking in each of the seven topological measures (See methods), the top 30 genes were selected, and their intersections were analyzed and visualized in an UpSet plot (Figure 6B). This analysis resulted in four hub genes consistently identified as central nodes based on all ranking criteria (BAD, BOLA1, CHCHD5, ISOC2). Their expression profiles were examined across the two transcriptomic clusters and found that all four hub genes were significantly upregulated in Cluster 1 compared to Cluster 2 (Figure 6C). Pairwise gene expression correlation analysis showed positive correlations among all four genes, indicating coordinated regulation, with CHCHD5 and ISOC2 exhibiting the strongest correlation (Figure 6D). Furthermore, a classification model we built using the expression of these four genes demonstrated robust discriminatory power between the two clusters, with the ROC curve showing an area under the curve (AUC) > 0.7 (Figure 6E). Additionally, transcription factor (TF) regulatory network analysis was performed to study the regulatory mechanisms governing the expression of the four identified hub genes. All known and predicted transcription factors that potentially regulate these hub genes were queried from hTFtarget, and the results were integrated and visualized as a transcriptional regulatory network (Figure 7). In the TF-hub gene regulatory network, BAD had the highest number of TFs, and ATF3, BRD2, BRD4, and CEBPA each interacted with all four hub genes, suggesting a shared regulatory mechanism.
Comparison of immune cell infiltration between AD subgroups
To investigate the immunological landscape associated with the transcriptomic subgroups, immune cell infiltration analysis using CIBERSORT was performed, which estimates the relative proportions of 22 immune cell types based on bulk transcriptome data (Figure 8). Among the immune subsets, regulatory T cells (Tregs) were found to be significantly more abundant in Cluster 1, suggesting an immunosuppressive microenvironment potentially associated with mitochondrial activity and signaling pathways upregulated in this group. In contrast, follicular helper T cells were significantly enriched in Cluster 2, indicating a potentially more active adaptive immune response in this subgroup.
DATA AVAILABILITY:
The transcriptomic data analyzed in this study are publicly available in the Gene Expression Omnibus (GEO) repository under accession numbers GSE121212, GSE157194, GSE193309, and GSE277961 (https://www.ncbi.nlm.nih.gov/geo/).

Figure 1: Consensus clustering of atopic dermatitis lesion samples based on transcriptomic profiles. (A) Heatmap and hierarchical clustering of the consensus matrix across atopic dermatitis (AD) samples. (B) Consensus cumulative distribution function (CDF) plot used to determine the optimal number of clusters (k = 2–10). (C) Delta area plot showing the relative change in the area under the CDF curve for each k. (D) Consensus cluster assignment for k = 2. Each column represents an individual sample, and colors indicate cluster membership (Cluster 1, red; Cluster 2, teal). Please click here to view a larger version of this figure.

Figure 2: Differential gene expression of atopic dermatitis molecular subtypes. (A) t-distributed stochastic neighbor embedding (t-SNE) plot of AD samples. Each point represents a sample projected into two dimensions and is coloured according to cluster assignment. (B) Volcano plot of differentially expressed genes (DEGs) between Cluster 1 and Cluster 2. Each point represents a gene, plotted by log2 fold change (x-axis) and −log10 adjusted p-value (y-axis). Red and blue points indicate significantly upregulated genes in Cluster 1 and Cluster 2, respectively, while grey points indicate non-significant genes. (C) Heatmap of the top 10 highly expressed genes in Cluster 1 and Cluster 2. Please click here to view a larger version of this figure.

Figure 3: Gene set enrichment analysis of atopic dermatitis molecular subtypes. (A) Two-sided bar plot showing GSEA results between Cluster 1 and Cluster 2, with the top significantly enriched pathways. Pathways enriched in Cluster 1 are shown on the right, and those enriched in Cluster 2 are shown on the left. (B–E) Representative enrichment plots for focal adhesion, MAPK signaling pathway, oxidative phosphorylation, and proteasome pathways. Please click here to view a larger version of this figure.

Figure 4: Weighted gene co-expression network analysis of atopic dermatitis samples. (A) Sample clustering dendrogram based on gene expression profiles. (B) Scale-free topology fit index and mean connectivity across soft-thresholding powers (1–30). (C) Clustering and heatmap of module eigengenes, with colors indicating pairwise correlations. (D) Hierarchical clustering dendrogram showing genes grouped into co-expressed modules. (E) Heatmap of the topological overlap matrix (TOM), representing co-expression similarity between gene pairs. (F) Heatmap of module–trait relationships, showing correlations between module eigengenes and clinical traits. Correlation coefficients are displayed within each cell, and color intensity indicates the strength and direction of the correlation (red, positive; blue, negative). Please click here to view a larger version of this figure.

Figure 5: Gene ontology and KEGG pathway enrichment of subtype-associated mitochondrial genes. (A) Venn diagram showing overlap among DEGs, subtype-associated module genes, and mitochondrial genes. (B) Bubble plot of the top 20 enriched KEGG pathways of intersecting genes. (C) Top 10 enriched Gene Ontology (GO) terms for Biological Process (BP), Cellular Component (CC), and Molecular Function (MF). All enrichment analyses were performed using an adjusted p-value < 0.05 (false discovery rate) as the significance threshold. Please click here to view a larger version of this figure.

Figure 6: Protein–protein interaction analysis and identification of hub genes. (A) Protein–protein interaction (PPI) network of 85 intersecting genes. Nodes represent proteins, and edges indicate predicted or experimentally validated interactions from the STRING database. (B) UpSet plot showing intersections among the top 30 ranked genes based on seven network centrality measures. (C) Boxplots showing expression levels of four hub genes in Cluster 1 and Cluster 2. (D) Pairwise correlation analysis among the four hub genes. (E) Receiver operating characteristic (ROC) curve showing classification performance, with sensitivity plotted against specificity. The area under the curve (AUC) indicates overall accuracy. Statistical significance in panel (C) was assessed using the Wilcoxon rank-sum test (*p < 0.05). Please click here to view a larger version of this figure.

Figure 7: Regulatory network of hub genes. Red circles represent hub genes and blue circles represent associated transcription factors (TFs). Edges indicate regulatory interactions. The size of each hub gene node reflects the number of interacting TFs. Please click here to view a larger version of this figure.

Figure 8: Comparison of immune cell infiltration between atopic dermatitis subgroups. Boxplots showing the estimated proportions of 22 immune cell types in each cluster (Cluster 1, red; Cluster 2, teal). Asterisks (*) indicate statistically significant differences between clusters, assessed using the Wilcoxon rank-sum test with false discovery rate correction for multiple comparisons. Please click here to view a larger version of this figure.