-1::1
Simple Hit Counter
Skip to content

Products

Solutions

×
×
Sign In

EN

EN - EnglishCN - 简体中文DE - DeutschES - EspañolKR - 한국어IT - ItalianoFR - FrançaisPT - Português do BrasilPL - PolskiHE - עִבְרִיתRU - РусскийJA - 日本語TR - TürkçeAR - العربية
Sign In Start Free Trial

RESEARCH

JoVE Journal

Peer reviewed scientific video journal

Behavior
Biochemistry
Bioengineering
Biology
Cancer Research
Chemistry
Developmental Biology
View All
JoVE Encyclopedia of Experiments

Video encyclopedia of advanced research methods

Biological Techniques
Biology
Cancer Research
Immunology
Neuroscience
Microbiology
JoVE Visualize

Visualizing science through experiment videos

EDUCATION

JoVE Core

Video textbooks for undergraduate courses

Analytical Chemistry
Anatomy and Physiology
Biology
Calculus
Cell Biology
Chemistry
Civil Engineering
Electrical Engineering
View All
JoVE Science Education

Visual demonstrations of key scientific experiments

Advanced Biology
Basic Biology
Chemistry
View All
JoVE Lab Manual

Videos of experiments for undergraduate lab courses

Biology
Chemistry

BUSINESS

JoVE Business

Video textbooks for business education

Accounting
Finance
Macroeconomics
Marketing
Microeconomics

OTHERS

JoVE Quiz

Interactive video based quizzes for formative assessments

Authors

Teaching Faculty

Librarians

K12 Schools

Biopharma

Products

RESEARCH

JoVE Journal

Peer reviewed scientific video journal

JoVE Encyclopedia of Experiments

Video encyclopedia of advanced research methods

JoVE Visualize

Visualizing science through experiment videos

EDUCATION

JoVE Core

Video textbooks for undergraduates

JoVE Science Education

Visual demonstrations of key scientific experiments

JoVE Lab Manual

Videos of experiments for undergraduate lab courses

BUSINESS

JoVE Business

Video textbooks for business education

OTHERS

JoVE Quiz

Interactive video based quizzes for formative assessments

Solutions

Authors
Teaching Faculty
Librarians
<<<<<<< HEAD
K12 Schools
Biopharma
=======
K12 Schools
>>>>>>> dee1fd4 (fixed header link)

Language

English

EN

English

CN

简体中文

DE

Deutsch

ES

Español

KR

한국어

IT

Italiano

FR

Français

PT

Português do Brasil

PL

Polski

HE

עִבְרִית

RU

Русский

JA

日本語

TR

Türkçe

AR

العربية

    Menu

    JoVE Journal

    Behavior

    Biochemistry

    Bioengineering

    Biology

    Cancer Research

    Chemistry

    Developmental Biology

    Engineering

    Environment

    Genetics

    Immunology and Infection

    Medicine

    Neuroscience

    Menu

    JoVE Encyclopedia of Experiments

    Biological Techniques

    Biology

    Cancer Research

    Immunology

    Neuroscience

    Microbiology

    Menu

    JoVE Core

    Analytical Chemistry

    Anatomy and Physiology

    Biology

    Calculus

    Cell Biology

    Chemistry

    Civil Engineering

    Electrical Engineering

    Introduction to Psychology

    Mechanical Engineering

    Medical-Surgical Nursing

    View All

    Menu

    JoVE Science Education

    Advanced Biology

    Basic Biology

    Chemistry

    Clinical Skills

    Engineering

    Environmental Sciences

    Physics

    Psychology

    View All

    Menu

    JoVE Lab Manual

    Biology

    Chemistry

    Menu

    JoVE Business

    Accounting

    Finance

    Macroeconomics

    Marketing

    Microeconomics

Start Free Trial
Loading...
Home
JoVE Journal
Biology
An Integrated Approach for Microprotein Identification and Sequence Analysis
An Integrated Approach for Microprotein Identification and Sequence Analysis
JoVE Journal
Biology
A subscription to JoVE is required to view this content.  Sign in or start your free trial.
JoVE Journal Biology
An Integrated Approach for Microprotein Identification and Sequence Analysis

An Integrated Approach for Microprotein Identification and Sequence Analysis

Full Text
3,936 Views
09:37 min
July 12, 2022

DOI: 10.3791/63841-v

Omar Brito-Estrada*1, Keira R. Hassel*1, Catherine A. Makarewich1,2

1The Heart Institute, Division of Molecular Cardiovascular Biology,Cincinnati Children's Hospital Medical Center, 2Department of Pediatrics,University of Cincinnati College of Medicine

Overview

This study outlines a protocol for analyzing genomic regions to assess microprotein-coding potential using PhyloCSF on the UCSC Genome Browser. The methodology is designed to be user-friendly for researchers of various backgrounds, highlighting its accessibility without prior bioinformatics training.

Key Study Components

Research Area

  • Genomic analysis
  • Microprotein identification
  • Bioinformatics tools

Background

  • Microproteins are small proteins that may have important biological functions.
  • Identifying microprotein-coding regions can enhance our understanding of non-coding genomic areas.
  • The UCSC Genome Browser is a widely used tool for genomic research.

Methods Used

  • Protocol using PhyloCSF to analyze genomic sequences
  • UCSC Genome Browser as the primary platform
  • Tools for sequence alignment and conservation analysis

Main Results

  • Successful identification of conserved open reading frames in various species.
  • Demonstrated high sequence conservation in the microprotein mitoregulin.
  • Negative scoring in long noncoding RNA hot air indicates correct annotation.

Conclusions

  • The study provides a practical approach to identify microproteins, bridging bioinformatics and molecular biology.
  • These findings expand the available resources for genetic research and enhance the understanding of protein-coding potential.

Frequently Asked Questions

What is the purpose of using PhyloCSF?
PhyloCSF is used to identify conserved short open reading frames with microprotein-coding potential within genomic regions.
Can researchers without bioinformatics training follow this protocol?
Yes, the methods described are user-friendly and accessible for researchers of all backgrounds.
Which browser is used for the analysis?
The UCSC Genome Browser is utilized for navigating genomic sequences.
What does a high phyloCSF score indicate?
A high phyloCSF score indicates a region of high sequence conservation, suggesting potential protein-coding ability.
What are microproteins?
Microproteins are small proteins that might have essential roles in various biological processes.
How can sequence alignment contribute to this analysis?
Sequence alignment helps in comparing and identifying conservation among different species, enhancing the validation of findings.

The protocol described here provides detailed instructions on how to analyze genomic regions of interest for microprotein-coding potential using PhyloCSF on the user-friendly UCSC Genome Browser. Additionally, several tools and resources are recommended to further investigate sequence characteristics of identified microproteins to gain insight into their putative functions.

The protocol described here provides detailed instructions on analyzing genomic regions of interest for protein-coding potential using phyloCSF on the user-friendly UCSC genome browser. PhloCSF can effectively identify conserved short open reading frames with micro protein-coding potential in genomic regions that are currently annotated as non-coding. The methods described here are easily used and can be implemented by investigators of all backgrounds without prior training or expertise in bioiformatics or comparative genomics.

To begin, open an internet browser window and navigate to the University of California Santa Cruz or UCSC genome browser. Under the our tools heading, select the track hubs option. In the public hubs tab type phyloCSF into the search terms box.

Then, click on the search public hubs button. Connect to phyloCSF by clicking on the connect button for the hub name phyloCSF. After clicking on connect, wait to redirect to the UCSC genome browser gateway page.

To query a different species, select the species of interest under the browse or select species heading by clicking on the appropriate icon, or type the species into the text box that says, enter species common name or assembly ID.Using the dropdown menu choose the assembly to search under defined position heading then enter the position gene symbol or search terms in the position or search term box and click on go to navigate to a gene of interest on the genome browser. If the search resulted in multiple matches wait to be redirected to a page that requires the selection of a position of interest, then click on the appropriate gene of interest. After navigating to the UCSC genome browser select the blast-like alignment tool or blat under the our tools heading to query a specific DNA or protein sequence.

Alternatively, hover the cursor over the tools tab and select the blat option or follow the given link. Using the dropdown menu select the species, genome, and assembly of interest. Then, define the query type, paste the sequence of interest into the blat search genome text box and click submit.

Next, click on the browser link under the actions heading to navigate to the genomic region of interest. Visually scan the genomic area of interest for positively scoring phyloCSF regions. Use the zoom feature to magnify regions of interest to examine sequence characteristics and search for the start and stop codons.

To zoom in manually hold the shift key and click and hold the mouse button while dragging along the region of interest. Alternatively, use the zoom in and zoom out buttons at the top of the page to navigate. Zoom in until the nucleotide or base sequence is visible.

Visually scan the nuclear tide sequence near the beginning and end of the positively scoring phyloCSF regions to identify punitive start and stop codons. Hover the mouse cursor over the view heading at the top of the page and click on the in other genomes convert option, then define the genome of interest using the dropdown menu below the new genome heading. Select the genomic assembly of interest under the new assembly heading and click the submit button.

Once the browser returns a list of regions in the new assembly with similarity. Click on the chromosome position link to navigate to the homologous region of interest. Follow the navigational strategies described earlier to analyze the sequence.

To navigate to the gene description page, click on the gene of interest in the gen code track on the UCSC genome browser. Under the sequence and links to tools and databases heading click on the link in the table that reads other species faster. Click on the boxes associated with the species of interest to select them.

Then, click on submit. Copy and paste the sequences appearing at the bottom of the page in faster format into a word processing document. Next, open a second browser window and navigate to the clustal omega multiple sequence alignment tool on the European Bioinformatics Institute website.

Paste the sequence files on the clipboard into the box in step one that reads sequences in any supported format. Scroll to the bottom of the page and click on submit. Observe below the aligned results for symbols that indicate the degree of conservation of each amino acid.

To view the amino acid properties and color click on the show colors link directly above the sequences to color the amino acids according to their properties. Then copy and paste the sequence alignment into a word processing or slide show program to generate a figure or illustration file. To view other outputs from the clustal omega results page, click on the tabs guide tree or phylo genetic tree.

Finally, click on the results viewer's tab for options to view the sequence information using jalview or to access direct links to mview and simple phylogeny. A representative phyloCSF analysis of the mitoregulin gene indicates a region of high sequence conservation corresponding to a validated micro protein. The complete mitoregulin coding sequence is contained within exon one and scores very highly on the phyloCSF minus one track.

A conserved start codon can be observed at the beginning of the positively scoring region in the phyloCSF minus one track. The positively scoring region in the first exon of mitoregulin begins directly over a start codon and terminates at the stop codon. The multiple sequence alignment of the micro protein mitoregulin for eight different species is shown here.

The analysis of the long noncoding RNA hot air showed a negative score throughout the entire gene across all six tracks indicating a lack of sequence conservation and supporting that hot air is correctly annotated as a noncoding RNA. PhyloCSF analysis of the mouse one, eight, one, zero, zero, five, eight, I 24 rike gene showed that a conserved open reading frame spans three exons and the positive phyloCSF score jumps from the plus two track in exon one to the plus three track in exon two, and then back to the plus two track in exon three. PhyloCSF analysis of the meet one gene locus was also effectively used to identify multiple distinct coding open reading frames within a single RNA molecule.

It is important to note that while a positive phyloCSF score is highly suggestive of micro protein-coding capacity this line of evidence cannot stand alone and must be experimentally validated. Once a period of micro protein has been identified the amino acid sequence can be analyzed for conserved domains or sequence characteristics to provide insight into its function. PhyloCSF has been effectively used to identify novel micro proteins in genomic regions previously thought to be non-coding and will continue to be a helpful tool in future micro protein identification studies.

View the full transcript and gain access to thousands of scientific videos

Sign In Start Free Trial

Explore More Videos

Microprotein IdentificationSequence AnalysisPhyloCSFUCSC Genome BrowserProtein-coding PotentialGenomic RegionsBioinformaticsComparative GenomicsOpen Reading FramesConserved RegionsAlignment ToolSequence CharacteristicsGene NavigationBlast-like AlignmentAssembly Selection

Related Videos

Digital Microfluidics for Automated Proteomic Processing

10:55

Digital Microfluidics for Automated Proteomic Processing

Related Videos

12.9K Views

Identification of Protein Complexes in Escherichia coli using Sequential Peptide Affinity Purification in Combination with Tandem Mass Spectrometry

14:58

Identification of Protein Complexes in Escherichia coli using Sequential Peptide Affinity Purification in Combination with Tandem Mass Spectrometry

Related Videos

48.8K Views

Probing High-density Functional Protein Microarrays to Detect Protein-protein Interactions

08:07

Probing High-density Functional Protein Microarrays to Detect Protein-protein Interactions

Related Videos

8.4K Views

An Aquatic Microbial Metaproteomics Workflow: From Cells to Tryptic Peptides Suitable for Tandem Mass Spectrometry-based Analysis

08:09

An Aquatic Microbial Metaproteomics Workflow: From Cells to Tryptic Peptides Suitable for Tandem Mass Spectrometry-based Analysis

Related Videos

9.2K Views

Resolving Affinity Purified Protein Complexes by Blue Native PAGE and Protein Correlation Profiling

09:35

Resolving Affinity Purified Protein Complexes by Blue Native PAGE and Protein Correlation Profiling

Related Videos

14.4K Views

Combining Chemical Cross-linking and Mass Spectrometry of Intact Protein Complexes to Study the Architecture of Multi-subunit Protein Assemblies

10:01

Combining Chemical Cross-linking and Mass Spectrometry of Intact Protein Complexes to Study the Architecture of Multi-subunit Protein Assemblies

Related Videos

20.3K Views

Split-BioID — Proteomic Analysis of Context-specific Protein Complexes in Their Native Cellular Environment

09:02

Split-BioID — Proteomic Analysis of Context-specific Protein Complexes in Their Native Cellular Environment

Related Videos

20.4K Views

Detection of Protein Ubiquitination Sites by Peptide Enrichment and Mass Spectrometry

11:54

Detection of Protein Ubiquitination Sites by Peptide Enrichment and Mass Spectrometry

Related Videos

10.2K Views

Evaluation of Protein–Protein Interactions using an On-Membrane Digestion Technique

07:07

Evaluation of Protein–Protein Interactions using an On-Membrane Digestion Technique

Related Videos

7.1K Views

Identification of Antibacterial Immunity Proteins in Escherichia coli using MALDI-TOF-TOF-MS/MS and Top-Down Proteomic Analysis

09:26

Identification of Antibacterial Immunity Proteins in Escherichia coli using MALDI-TOF-TOF-MS/MS and Top-Down Proteomic Analysis

Related Videos

3.6K Views

JoVE logo
Contact Us Recommend to Library
Research
  • JoVE Journal
  • JoVE Encyclopedia of Experiments
  • JoVE Visualize
Business
  • JoVE Business
Education
  • JoVE Core
  • JoVE Science Education
  • JoVE Lab Manual
  • JoVE Quizzes
Solutions
  • Authors
  • Teaching Faculty
  • Librarians
  • K12 Schools
  • Biopharma
About JoVE
  • Overview
  • Leadership
Others
  • JoVE Newsletters
  • JoVE Help Center
  • Blogs
  • Site Maps
Contact Us Recommend to Library
JoVE logo

Copyright © 2026 MyJoVE Corporation. All rights reserved

Privacy Terms of Use Policies
WeChat QR code