本研究利用TCGA-LUAD和GEO GSE115002转录组数据,鉴定了肺腺癌的诊断和预后生物标志物。 B3GNT3、 FERMT1和 SPP1 上调,将肿瘤与正常组织区分开来。这些基因与上皮-间充质转变和免疫抑制有关。结合基因表达与TNM阶段的命名图显示出可靠的预测价值。
| Name | Company | Catalog Number | Comments |
|---|---|---|---|
| Publicly Available Datasets | TCGA-LUAD Dataset | The Cancer Genome Atlas (TCGA) Portal (https://portal.gdc.cancer.gov/); 535 LUAD tumor samples, 59 adjacent normal lung tissue samples (RNA-sequencing count/FPKM values + clinical data: survival, TNM staging) | Transcriptomic and clinical data for differential expression, survival, and nomogram analysis; primary study cohort |
| GSE115002 Dataset | Gene Expression Omnibus (GEO) (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE115002); Agilent microarray, 52 LUAD tumor tissues, 52 matched adjacent normal lung tissues (treatment-naïve primary tumors) | Independent validation cohort for differential expression, diagnostic performance, and immune infiltration analysis | |
| Bioinformatics Software & Programming Environment | R Programming Language | Version 4.1 | Core platform for all transcriptomic, statistical, and graphical analyses |
| R Packages (Differential Expression) | DESeq2, limma | DESeq2: TCGA RNA-seq raw count differential expression analysis; limma: GSE115002 microarray normalization and differential expression analysis (Benjamini–Hochberg FDR correction) | |
| R Packages (Diagnostic Analysis) | pROC | Construction of ROC curves, calculation of AUC (95% CI), optimal cutoff determination (Youden’s index) for diagnostic performance assessment | |
| R Packages (Survival Analysis) | survival, survminer | Kaplan–Meier survival curve generation, log-rank test, univariate/multivariate Cox proportional hazards regression (HR + 95% CI); patient stratification by median gene expression | |
| R Packages (Functional Enrichment) | clusterProfiler, fgsea | clusterProfiler: GO (BP/CC/MF) and KEGG pathway enrichment analysis (adjusted P < 0.05); fgsea: GSEA for MSigDB Hallmark/KEGG gene sets (FDR < 0.25) | |
| R Packages (Nomogram Construction & Validation) | rms | Development of prognostic nomogram (integration of gene expression + TNM stage); Harrell’s C-index calculation, bootstrap resampling (1000 repetitions) for bias correction, calibration plot generation | |
| R Packages (Statistical & Visualization) | ggplot2, ComplexHeatmap, corrplot | Generation of volcano plots, bubble plots (enrichment), heatmaps (immune infiltration correlation), scatter plots (gene co-expression); Pearson/Spearman correlation analysis | |
| Bioinformatics Databases & Tools (Network/Immune Analysis) | STRING Database | Confidence score > 0.7 | Construction of protein–protein interaction (PPI) networks for B3GNT3/FERMT1/SPP1 and first-degree interactors |
| Cytoscape | - | Visualization of PPI and gene co-expression networks (edge weighting by correlation strength, hub gene identification) | |
| Immune Deconvolution Algorithm | CIBERSORT | Estimation of immune cell infiltration abundance (M2 macrophages, CD8+ T cells, neutrophils, NK cells, etc.) in LUAD samples; correlation with candidate gene expression | |
| Other Tools | Microsoft Office/LaTeX | - | Manuscript preparation, figure assembly, and table formatting; statistical result compilation |
Request permission to reuse the text or figures of this JoVE article
Request Permission