Method Article

A Verified Workflow for MiRNA-Seq Data Processing and Bioinformatics Analysis Using R

DOI:

10.3791/68760

October 24th, 2025

In This Article

Summary

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

Here, we present a protocol to analyze miRNA-Seq data using R. The workflow empowers researchers to explore miRNA regulated networks, and their significance in diverse biological and clinical questions. This work intends to serve as a practical guide for both novice and experienced researchers in the field of miRNA bioinformatics.

Abstract

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

MicroRNAs (miRNAs) are critical post-transcriptional regulators that influence a wide range of physiological and pathological processes. With the advancement of high-throughput sequencing technologies, miRNA-Seq has emerged as a powerful tool for profiling miRNA expression patterns. However, reliable interpretation of such data requires a standardized and reproducible analysis pipeline. Here, we present a verified workflow for miRNA-Seq data processing and bioinformatics analysis using R. This protocol encompasses all essential steps, including raw data preprocessing, quality control, alignment, quantification, normalization, differential expression analysis, target prediction, functional enrichment, and regulatory network construction. Designed for flexibility and transparency, the workflow integrates widely adopted R packages and supports species-specific annotation and modular customization. Additionally, users are guided to conduct downstream biological interpretation by leveraging curated databases and visualization tools such as Cytoscape. This protocol not only supports robust statistical analysis but also enables meaningful insights into miRNA-mRNA interactions and their roles in disease mechanisms. It is particularly well-suited for both novice and experienced researchers conducting miRNA biomarker discovery, disease modeling, or integrative multi-omics studies.

Introduction

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

MicroRNAs (miRNAs) are short non-coding RNA molecules that significantly influence gene expression by acting at the post-transcriptional stage1. They typically function by binding to complementary sequences in the 3' untranslated regions (UTRs) of target messenger RNAs (mRNAs), leading to mRNA degradation or translational repression1. Over the past two decades, miRNAs have been increasingly recognized as central regulators of various biological processes, including cell proliferation, differentiation, apoptosis, immune responses, and organ development2. Moreover, dysregulation of miRNA expression ....

Access restricted. Please log in or start a trial to view this content.

Protocol

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

NOTE: Materials with software links are listed in the Table of Materials.

1. Prepare RNA samples and sequence libraries

NOTE: Perform RNA extraction and sequencing outside this computational workflow. There is more than one way to analyze miRNA-sequencing data. This section provides the context of one practical.

  1. Extract total RNA: Extract total RNA from the biological samples using a kit optimized for small RNA isolation (e.g., miRNA isolation kit). Follow the manufacturer's protocol carefully. Ensure that you use RNase-free consumables and keep samples on ice to ....

Access restricted. Please log in or start a trial to view this content.

Results

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

We downloaded the microRNA expression matrix from GSE133530 and conducted differential expression analysis directly. We provided an example analytical R script for the dataset in Supplementary File 1. The dataset performed global miRNA profiling on 16 renal cysts of different sizes (minimal cysts: less than 1-5 mL, n = 10; medium cysts: between 10-25 mL, n = 4; large cysts: greater than 50 mL, n = 4) and minimally cystic tissue (MCT, n = 7, including 1 replicate) from fou.......

Access restricted. Please log in or start a trial to view this content.

Discussion

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

The analysis of miRNA-Seq data presents distinct challenges due to the small size and redundancy of reads, making rigorous quality control and preprocessing critical. One of the most important steps in the workflow is adapter trimming. Because miRNAs are approximately 22 nucleotides long, adapter sequences can easily dominate the reads if not properly removed. Failure to perform accurate trimming can result in misalignment and inflation of false-positive reads. Similarly, quality filtering should be implemented before al.......

Access restricted. Please log in or start a trial to view this content.

Disclosures

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

The authors declare no competing interests.

Acknowledgements

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,

We acknowledge the funding agencies and collaborators supporting this project. Shanghai Scientific and technological innovation action plan (22Y11905500, 24142201800), Institutional Project of PLA Navy No.905 Hospital (2024Q021), Youth Research Project of Changning District Health Committee (2024QN29) and Research Project of Naval Medical University (2024QN040).

....

Access restricted. Please log in or start a trial to view this content.

Materials

List of materials used in this article
NameCompanyCatalog NumberComments
Agilent-021827 Human miRNA MicroarrayAgilent/A commercial array for profiling microRNAs of human samples
BowtieJohns Hopkins Universityhttp://bowtie-bio.sourceforge.net/index.shtmlA software tool for aligning sequencing reads to long reference sequences
clusterProfiler (R package)Bioconductorhttps://bioconductor.org/packages/clusterProfiler/An R package designed for functional enrichment analysis and visualization of high-throughput biological data.
CutadaptOpen Sourcehttps://cutadapt.readthedocs.ioA command-line tool that removes adapter sequences, primers, poly-A tails and other unwanted fragments from high-throughput sequencing reads.
CytoscapeCytoscape Consortiumhttps://cytoscape.org/An open-source software platform designed for the visualization and analysis of complex biological networks.
DESeq2 (R package)Bioconductorhttps://bioconductor.org/packages/DESeq2/An R package designed for differential gene expression analysis of count data
EnhancedVolcano (R package)Bioconductorhttps://bioconductor.org/packages/EnhancedVolcano/ An R package designed to create publication-quality volcano plots.
FastQCBabraham Bioinformaticshttps://www.bioinformatics.babraham.ac.uk/projects/fastqc/An open-source quality-control tool for high-throughput sequencing data.
featureCountsSubread / SourceForgehttp://subread.sourceforge.net/A program used for counting reads mapped to genomic features
HTSeq-countPython Packagehttps://htseq.readthedocs.ioA command-line tool that counts how many aligned high-throughput sequencing reads overlap genomic features such as genes or exons. I
Illumina Human v2 MicroRNA expression beadchipIllumina /A commercial array for profiling microRNAs of human samples
multiMiR (R package)Bioconductorhttps://bioconductor.org/packages/multiMiR/An R package that provides the largest integrated collection of predicted and experimentally validated microRNA–target interactions together with their associations to diseases and drugs.
org.Hs.eg.db (R package)Bioconductorhttps://bioconductor.org/packages/org.Hs.eg.db/An annotation package designed for human (Homo sapiens) genomics research.
R SoftwareR Projecthttps://www.r-project.org/An open-source Project for Statistical Computing
RstudioPosit PBC/An integrated development environment helps to be more productive with R and Python
SAMtoolsOpen Sourcehttp://www.htslib.org/A software package for manipulating next-generation sequencing (NGS) data.

References

Loading...
$$\rightleftharpoonup{xx}$$ $$\longleftharp{xx}$$, $$\longrightharp{xx}$$,
  1. Hsu, P. W., et al. miRNAMap: genomic maps of microRNA genes and their target genes in mammalian genomes. Nucleic Acids Res. 34 (Database issue), D135-D139 (2006).
  2. Fragiadaki, M. Lessons from....

Access restricted. Please log in or start a trial to view this content.

Reprints and Permissions

Request permission to reuse the text or figures of this JoVE article

Request Permission

Tags

MiRNA SeqMiRNA ExpressionData ProcessingBioinformatics AnalysisDifferential ExpressionTarget PredictionFunctional EnrichmentRegulatory NetworkR PackagesCytoscape Visualization

Related Articles