Translate this page to:
In JoVE (1)
Other Publications (31)
- Chemistry & Biology
- Proceedings of the National Academy of Sciences of the United States of America
- Bioinformatics (Oxford, England)
- Genome Biology
- Current Opinion in Structural Biology
- Molecular Cell
- Genes & Development
- Bioinformatics (Oxford, England)
- Bioinformatics (Oxford, England)
- PLoS Genetics
- Nature
- Nature
- Genetics
- Proceedings of the National Academy of Sciences of the United States of America
- Current Biology : CB
- Bioinformatics (Oxford, England)
- Cell
- Molecular Cell
- PLoS Genetics
- Science (New York, N.Y.)
- The EMBO Journal
- Genome Biology
- Science (New York, N.Y.)
- Science (New York, N.Y.)
- Nature
- PLoS Biology
- Genes & Development
- Science (New York, N.Y.)
- Genome Research
- Nature
- Genome Biology
Automatic Translation
This translation into Hindi was automatically generated.
English Version | Other Languages
Articles by Richard E. Green in JoVE
प्राइमर एक्सटेंशन कैद: भारी अपमानित डीएनए स्रोत से लक्षित अनुक्रम रिट्रीवल
Adrian W. Briggs, Jeffrey M. Good, Richard E. Green, Johannes Krause, Tomislav Maricic, Udo Stenzel, Svante Pääbo
Max-Planck Institute for Evolutionary Anthropology, Leipzig
हम लक्षित प्राचीन डीएनए अनुक्रम पुनर्प्राप्ति, जो हम पांच Neandertal व्यक्तियों की पूरी mitochondrial जीनोम के पुनर्निर्माण के लिए इस्तेमाल किया की एक विधि प्रस्तुत करते हैं. वर्तमान दिन मनुष्यों के साथ इन दृश्यों की तुलना से पता चलता है कि Neandertals एक लंबी अवधि कम प्रभावी जनसंख्या के आकार का था.
Other articles by Richard E. Green on PubMed
Sulfotransferases and Sulfatases in Mycobacteria
Chemistry & Biology. Jul, 2002 | Pubmed ID: 12144918
Analysis of the genomes of M. tuberculosis, M. leprae, M. smegmatis, and M. avium has revealed a large family of genes homologous to known sulfotransferases. Despite reports detailing a suite of sulfated glycolipids in many mycobacteria, a corresponding family of sulfotransferase genes remains uncharacterized. Here, a sequence-based analysis of newly discovered mycobacterial sulfotransferase genes, named stf1-stf10, is presented. Interestingly, two sulfotransferase genes are highly similar to mammalian sulfotransferases, increasing the list of mycobacterial eukaryotic-like protein families. The sulfotransferases join an equally complex family of mycobacterial sulfatases: a large family of sulfatase genes has been found in all of the mycobacterial genomes examined. As sulfated molecules are common mediators of cell-cell interactions, the sulfotransferases and sulfatases may be involved in regulating host-pathogen interactions.
Evidence for the Widespread Coupling of Alternative Splicing and Nonsense-mediated MRNA Decay in Humans
Proceedings of the National Academy of Sciences of the United States of America. Jan, 2003 | Pubmed ID: 12502788
To better understand the role of alternative splicing, we conducted a large-scale analysis of reliable alternative isoforms of known human genes. Each isoform was classified according to its splice pattern and supporting evidence. We found that one-third of the alternative transcripts examined contain premature termination codons, and most persist even after rigorous filtering by multiple methods. These transcripts are apparent targets of nonsense-mediated mRNA decay (NMD), a surveillance mechanism that selectively degrades nonsense mRNAs. Several of these transcripts are from genes for which alternative splicing is known to regulate protein expression by generating alternate isoforms that are differentially subjected to NMD. We propose that regulated unproductive splicing and translation (RUST), through the coupling of alternative splicing and NMD, may be a pervasive, underappreciated means of regulating protein expression.
Widespread Predicted Nonsense-mediated MRNA Decay of Alternatively-spliced Transcripts of Human Normal and Disease Genes
Bioinformatics (Oxford, England). 2003 | Pubmed ID: 12855447
We have recently shown that a third of reliably-inferred alternative mRNA isoforms are candidates for nonsense-mediated mRNA decay (NMD), an mRNA surveillance system (Lewis et al., 2003; PROC: Natl Acad. Sci. USA, 100, 189-192). Rather than being translated to yield protein, these transcripts are expected to be degraded and may be subject to regulated unproductive splicing and translation (RUST). Our initial experimental studies are consistent with these predictions and suggest an unappreciated role for NMD in several human diseases.
An Unappreciated Role for RNA Surveillance
Genome Biology. 2004 | Pubmed ID: 14759258
Nonsense-mediated mRNA decay (NMD) is a eukaryotic mRNA surveillance mechanism that detects and degrades mRNAs with premature termination codons (PTC+ mRNAs). In mammals, a termination codon is recognized as premature if it lies more than about 50 nucleotides upstream of the final intron position. More than a third of reliably inferred alternative splicing events in humans have been shown to result in PTC+ mRNA isoforms. As the mechanistic details of NMD have only recently been elucidated, we hypothesized that many PTC+ isoforms may have been cloned, characterized and deposited in the public databases, even though they would be targeted for degradation in vivo.
The Evolving Roles of Alternative Splicing
Current Opinion in Structural Biology. Jun, 2004 | Pubmed ID: 15193306
Alternative splicing is now commonly thought to affect more than half of all human genes. Recent studies have investigated not only the scope but also the biological impact of alternative splicing on a large scale, revealing that its role in generating proteome diversity may be augmented by a role in regulation. For instance, protein function can be regulated by the removal of interaction or localization domains by alternative splicing. Alternative splicing can also regulate gene expression by splicing transcripts into unproductive mRNAs targeted for degradation. To fully understand the scope of alternative splicing, we must also determine how many of the predicted splice variants represent functional forms. Comparisons of alternative splicing between human and mouse genes show that predominant splice variants are usually conserved, but rare variants are less commonly shared. Evolutionary conservation of splicing patterns suggests functional importance and provides insight into the evolutionary history of alternative splicing.
Genome-wide Analysis Reveals an Unexpected Function for the Drosophila Splicing Factor U2AF50 in the Nuclear Export of Intronless MRNAs
Molecular Cell. Jun, 2004 | Pubmed ID: 15200955
The protein factor U2AF is an essential component required for pre-mRNA splicing. Mutations identified in the S. pombe large U2AF subunit were used to engineer transgenic Drosophila carrying temperature-sensitive U2AF large subunit alleles. Mutant recombinant U2AF heterodimers showed reduced polypyrimidine tract RNA binding at elevated temperatures. Genome-wide RNA profiling comparing wild-type and mutant strains identified more than 400 genes differentially expressed in the dU2AF50 mutant flies grown at the restrictive temperature. Surprisingly, almost 40% of the downregulated genes lack introns. Microarray analyses revealed that nuclear export of a large number of intronless mRNAs is impaired in Drosophila-cultured cells RNAi knocked down for dU2AF50. Immunopurification of nuclear RNP complexes showed that dU2AF50 associates with intronless mRNAs. These results reveal an unexpected role for the splicing factor dU2AF50 in the nuclear export of intronless mRNAs.
Global Analysis of Positive and Negative Pre-mRNA Splicing Regulators in Drosophila
Genes & Development. Jun, 2005 | Pubmed ID: 15937219
To gain insight into splicing regulation, we developed a microarray to assay all annotated alternative splicing events in Drosophila melanogaster and identified the alternative splice events controlled by four splicing regulators: dASF/SF2, B52/SRp55, hrp48, and PSI. The number of events controlled by each of these factors was found to be highly variable: dASF/SF2 strongly affects >300 splicing events, whereas PSI strongly affects only 43 events. Pairwise analysis also revealed many instances of splice site usage affected by multiple factors and provides the framework to understand the network controlling the alternatively spliced mRNA isoforms that compose the Drosophila transcriptome.
Statistical Evaluation of Pairwise Protein Sequence Comparison with the Bayesian Bootstrap
Bioinformatics (Oxford, England). Oct, 2005 | Pubmed ID: 16105900
Protein sequence comparison methods are routinely used to infer the intricate network of evolutionary relationships found within the rapidly growing library of protein sequences, and thereby to predict the structure and function of uncharacterized proteins. In the present study, we detail an improved statistical benchmark of pairwise protein sequence comparison algorithms. We use bootstrap resampling techniques to determine standard statistical errors and to estimate the confidence of our conclusions. We show that the underlying structure within benchmark databases causes Efron's standard, non-parametric bootstrap to be biased. Consequently, the standard bootstrap underpredicts average performance when used in the context of evaluating sequence comparison methods. We have developed, as an alternative, an unbiased statistical evaluation based on the Bayesian bootstrap, a resampling method operationally similar to the standard bootstrap.
Pairwise Alignment Incorporating Dipeptide Covariation
Bioinformatics (Oxford, England). Oct, 2005 | Pubmed ID: 16123116
MOTIVATION: Standard algorithms for pairwise protein sequence alignment make the simplifying assumption that amino acid substitutions at neighboring sites are uncorrelated. This assumption allows implementation of fast algorithms for pairwise sequence alignment, but it ignores information that could conceivably increase the power of remote homolog detection. We examine the validity of this assumption by constructing extended substitution matrices that encapsulate the observed correlations between neighboring sites, by developing an efficient and rigorous algorithm for pairwise protein sequence alignment that incorporates these local substitution correlations and by assessing the ability of this algorithm to detect remote homologies. RESULTS: Our analysis indicates that local correlations between substitutions are not strong on the average. Furthermore, incorporating local substitution correlations into pairwise alignment did not lead to a statistically significant improvement in remote homology detection. Therefore, the standard assumption that individual residues within protein sequences evolve independently of neighboring positions appears to be an efficient and appropriate approximation.
Functionality of Intergenic Transcription: an Evolutionary Comparison
PLoS Genetics. Oct, 2006 | Pubmed ID: 17040132
Although a large proportion of human transcription occurs outside the boundaries of known genes, the functional significance of this transcription remains unknown. We have compared the expression patterns of known genes as well as intergenic transcripts within the ENCODE regions between humans and chimpanzees in brain, heart, testis, and lymphoblastoid cell lines. We find that intergenic transcripts show patterns of tissue-specific conservation of their expression, which are comparable to exonic transcripts of known genes. This suggests that intergenic transcripts are subject to functional constraints that restrict their rate of evolutionary change as well as putative positive selection to an extent comparable to that of classical protein-coding genes. In brain and testis, we find that part of this intergenic transcription is caused by widespread use of alternative promoters. Further, we find that about half of the expression differences between humans and chimpanzees are due to intergenic transcripts.
Analysis of One Million Base Pairs of Neanderthal DNA
Nature. Nov, 2006 | Pubmed ID: 17108958
Neanderthals are the extinct hominid group most closely related to contemporary humans, so their genome offers a unique opportunity to identify genetic changes specific to anatomically fully modern humans. We have identified a 38,000-year-old Neanderthal fossil that is exceptionally free of contamination from modern human DNA. Direct high-throughput sequencing of a DNA extract from this fossil has thus far yielded over one million base pairs of hominoid nuclear DNA sequences. Comparison with the human and chimpanzee genomes reveals that modern human and Neanderthal DNA sequences diverged on average about 500,000 years ago. Existing technology and fossil resources are now sufficient to initiate a Neanderthal genome-sequencing effort.
Unproductive Splicing of SR Genes Associated with Highly Conserved and Ultraconserved DNA Elements
Nature. Apr, 2007 | Pubmed ID: 17361132
The human and mouse genomes share a number of long, perfectly conserved nucleotide sequences, termed ultraconserved elements. Whereas these regions can act as transcriptional enhancers when upstream of genes, those within genes are less well understood. In particular, the function of ultraconserved elements that overlap alternatively spliced exons of genes encoding RNA-binding proteins is unknown. Here we report that in every member of the human SR family of splicing regulators, highly or ultraconserved elements are alternatively spliced, either as alternative 'poison cassette exons' containing early in-frame stop codons, or as alternative introns in the 3' untranslated region. These alternative splicing events target the resulting messenger RNAs for degradation by means of an RNA surveillance pathway called nonsense-mediated mRNA decay. Mouse orthologues of the human SR proteins exhibit the same unproductive splicing patterns. Three SR proteins have been previously shown to direct splicing of their own transcripts, and one of these is known to autoregulate its expression by coupling alternative splicing with decay; our results suggest that unproductive splicing is important for regulation of the entire SR family. We find that unproductive splicing associated with conserved regions has arisen independently in different SR genes, suggesting that splicing factors may readily acquire this form of regulation.
The Joint Allele-frequency Spectrum in Closely Related Species
Genetics. Sep, 2007 | Pubmed ID: 17603120
We develop the theory for computing the joint frequency spectra of alleles in two closely related species. We allow for arbitrary population growth in both species after they had a common ancestor. We focus on the case in which a single chromosome is sequenced from one of the species. We use classical diffusion theory to show that, if the ancestral species was at equilibrium under mutation and drift and a chromosome from one of the descendant species carries the derived allele, the frequency spectrum in the other species is uniform, independently of the demographic history of both species. We also predict the expected densities of segregating and fixed sites when the chromosome from the other species carries the ancestral allele. We compare the predictions of our model with the site-frequency spectra of SNPs in the four HapMap populations of humans when the nucleotide present in the Neanderthal DNA sequence is ancestral or derived, using the chimp genome as the outgroup.
Patterns of Damage in Genomic DNA Sequences from a Neandertal
Proceedings of the National Academy of Sciences of the United States of America. Sep, 2007 | Pubmed ID: 17715061
High-throughput direct sequencing techniques have recently opened the possibility to sequence genomes from Pleistocene organisms. Here we analyze DNA sequences determined from a Neandertal, a mammoth, and a cave bear. We show that purines are overrepresented at positions adjacent to the breaks in the ancient DNA, suggesting that depurination has contributed to its degradation. We furthermore show that substitutions resulting from miscoding cytosine residues are vastly overrepresented in the DNA sequences and drastically clustered in the ends of the molecules, whereas other substitutions are rare. We present a model where the observed substitution patterns are used to estimate the rate of deamination of cytosine residues in single- and double-stranded portions of the DNA, the length of single-stranded ends, and the frequency of nicks. The results suggest that reliable genome sequences can be obtained from Pleistocene organisms.
The Derived FOXP2 Variant of Modern Humans Was Shared with Neandertals
Current Biology : CB. Nov, 2007 | Pubmed ID: 17949978
Although many animals communicate vocally, no extant creature rivals modern humans in language ability. Therefore, knowing when and under what evolutionary pressures our capacity for language evolved is of great interest. Here, we find that our closest extinct relatives, the Neandertals, share with modern humans two evolutionary changes in FOXP2, a gene that has been implicated in the development of speech and language. We furthermore find that in Neandertals, these changes lie on the common modern human haplotype, which previously was shown to have been subject to a selective sweep. These results suggest that these genetic changes and the selective sweep predate the common ancestor (which existed about 300,000-400,000 years ago) of modern human and Neandertal populations. This is in contrast to more recent age estimates of the selective sweep based on extant human diversity data. Thus, these results illustrate the usefulness of retrieving direct genetic information from ancient remains for understanding recent human evolution.
PatMaN: Rapid Alignment of Short Sequences to Large Databases
Bioinformatics (Oxford, England). Jul, 2008 | Pubmed ID: 18467344
We present a tool suited for searching for many short nucleotide sequences in large databases, allowing for a predefined number of gaps and mismatches. The commandline-driven program implements a non-deterministic automata matching algorithm on a keyword tree of the search strings. Both queries with and without ambiguity codes can be searched. Search time is short for perfect matches, and retrieval time rises exponentially with the number of edits allowed. AVAILABILITY: The C++ source code for PatMaN is distributed under the GNU General Public License and has been tested on the GNU/Linux operating system. It is available from http://bioinf.eva.mpg.de/patman. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
A Complete Neandertal Mitochondrial Genome Sequence Determined by High-throughput Sequencing
Cell. Aug, 2008 | Pubmed ID: 18692465
A complete mitochondrial (mt) genome sequence was reconstructed from a 38,000 year-old Neandertal individual with 8341 mtDNA sequences identified among 4.8 Gb of DNA generated from approximately 0.3 g of bone. Analysis of the assembled sequence unequivocally establishes that the Neandertal mtDNA falls outside the variation of extant human mtDNAs, and allows an estimate of the divergence date between the two mtDNA lineages of 660,000 +/- 140,000 years. Of the 13 proteins encoded in the mtDNA, subunit 2 of cytochrome c oxidase of the mitochondrial electron transport chain has experienced the largest number of amino acid substitutions in human ancestors since the separation from Neandertals. There is evidence that purifying selection in the Neandertal mtDNA was reduced compared with other primate lineages, suggesting that the effective population size of Neandertals was small.
Genome-wide Analysis of Alternative Pre-mRNA Splicing and RNA-binding Specificities of the Drosophila HnRNP A/B Family Members
Molecular Cell. Feb, 2009 | Pubmed ID: 19250905
Heterogeneous nuclear ribonucleoproteins (hnRNPs) have been traditionally seen as proteins packaging RNA nonspecifically into ribonucleoprotein particles (RNPs), but evidence suggests specific cellular functions on discrete target pre-mRNAs. Here we report genome-wide analysis of alternative splicing patterns regulated by four Drosophila homologs of the mammalian hnRNP A/B family (hrp36, hrp38, hrp40, and hrp48). Analysis of the global RNA-binding distributions of each protein revealed both small and extensively bound regions on target transcripts. A significant subset of RNAs were bound and regulated by more than one hnRNP protein, revealing a combinatorial network of interactions. In vitro RNA-binding site selection experiments (SELEX) identified distinct binding motif specificities for each protein, which were overrepresented in their respective regulated and bound transcripts. These results indicate that individual heterogeneous ribonucleoproteins have specific affinities for overlapping, but distinct, populations of target pre-mRNAs controlling their patterns of RNA processing.
Genome-wide Identification of Alternative Splice Forms Down-regulated by Nonsense-mediated MRNA Decay in Drosophila
PLoS Genetics. Jun, 2009 | Pubmed ID: 19543372
Alternative mRNA splicing adds a layer of regulation to the expression of thousands of genes in Drosophila melanogaster. Not all alternative splicing results in functional protein; it can also yield mRNA isoforms with premature stop codons that are degraded by the nonsense-mediated mRNA decay (NMD) pathway. This coupling of alternative splicing and NMD provides a mechanism for gene regulation that is highly conserved in mammals. NMD is also active in Drosophila, but its effect on the repertoire of alternative splice forms has been unknown, as has the mechanism by which it recognizes targets. Here, we have employed a custom splicing-sensitive microarray to globally measure the effect of alternative mRNA processing and NMD on Drosophila gene expression. We have developed a new algorithm to infer the expression change of each mRNA isoform of a gene based on the microarray measurements. This method is of general utility for interpreting splicing-sensitive microarrays and high-throughput sequence data. Using this approach, we have identified a high-confidence set of 45 genes where NMD has a differential effect on distinct alternative isoforms, including numerous RNA-binding and ribosomal proteins. Coupled alternative splicing and NMD decrease expression of these genes, which may in turn have a downstream effect on expression of other genes. The NMD-affected genes are enriched for roles in translation and mitosis, perhaps underlying the previously observed role of NMD factors in cell cycle progression. Our results have general implications for understanding the NMD mechanism in fly. Most notably, we found that the NMD-target mRNAs had significantly longer 3' untranslated regions (UTRs) than the nontarget isoforms of the same genes, supporting a role for 3' UTR length in the recognition of NMD targets in fly.
Targeted Retrieval and Analysis of Five Neandertal MtDNA Genomes
Science (New York, N.Y.). Jul, 2009 | Pubmed ID: 19608918
Analysis of Neandertal DNA holds great potential for investigating the population history of this group of hominins, but progress has been limited due to the rarity of samples and damaged state of the DNA. We present a method of targeted ancient DNA sequence retrieval that greatly reduces sample destruction and sequencing demands and use this method to reconstruct the complete mitochondrial DNA (mtDNA) genomes of five Neandertals from across their geographic range. We find that mtDNA genetic diversity in Neandertals that lived 38,000 to 70,000 years ago was approximately one-third of that in contemporary modern humans. Together with analyses of mtDNA protein evolution, these data suggest that the long-term effective population size of Neandertals was smaller than that of modern humans and extant great apes.
The Neandertal Genome and Ancient DNA Authenticity
The EMBO Journal. Sep, 2009 | Pubmed ID: 19661919
Recent advances in high-thoughput DNA sequencing have made genome-scale analyses of genomes of extinct organisms possible. With these new opportunities come new difficulties in assessing the authenticity of the DNA sequences retrieved. We discuss how these difficulties can be addressed, particularly with regard to analyses of the Neandertal genome. We argue that only direct assays of DNA sequence positions in which Neandertals differ from all contemporary humans can serve as a reliable means to estimate human contamination. Indirect measures, such as the extent of DNA fragmentation, nucleotide misincorporations, or comparison of derived allele frequencies in different fragment size classes, are unreliable. Fortunately, interim approaches based on mtDNA differences between Neandertals and current humans, detection of male contamination through Y chromosomal sequences, and repeated sequencing from the same fossil to detect autosomal contamination allow initial large-scale sequencing of Neandertal genomes. This will result in the discovery of fixed differences in the nuclear genome between Neandertals and current humans that can serve as future direct assays for contamination. For analyses of other fossil hominins, which may become possible in the future, we suggest a similar 'boot-strap' approach in which interim approaches are applied until sufficient data for more definitive direct assays are acquired.
Computational Challenges in the Analysis of Ancient DNA
Genome Biology. 2010 | Pubmed ID: 20441577
High-throughput sequencing technologies have opened up a new avenue for studying extinct organisms. Here we identify and quantify biases introduced by particular characteristics of ancient DNA samples. These analyses demonstrate the importance of closely related genomic sequence for correctly identifying and classifying bona fide endogenous DNA fragments. We show that more accurate genome divergence estimates from ancient DNA sequence can be attained using at least two outgroup genomes and appropriate filtering.
A Draft Sequence of the Neandertal Genome
Science (New York, N.Y.). May, 2010 | Pubmed ID: 20448178
Neandertals, the closest evolutionary relatives of present-day humans, lived in large parts of Europe and western Asia before disappearing 30,000 years ago. We present a draft sequence of the Neandertal genome composed of more than 4 billion nucleotides from three individuals. Comparisons of the Neandertal genome to the genomes of five present-day humans from different parts of the world identify a number of genomic regions that may have been affected by positive selection in ancestral modern humans, including genes involved in metabolism and in cognitive and skeletal development. We show that Neandertals shared more genetic variants with present-day humans in Eurasia than with present-day humans in sub-Saharan Africa, suggesting that gene flow from Neandertals into the ancestors of non-Africans occurred before the divergence of Eurasian groups from each other.
Targeted Investigation of the Neandertal Genome by Array-based Sequence Capture
Science (New York, N.Y.). May, 2010 | Pubmed ID: 20448179
It is now possible to perform whole-genome shotgun sequencing as well as capture of specific genomic regions for extinct organisms. However, targeted resequencing of large parts of nuclear genomes has yet to be demonstrated for ancient DNA. Here we show that hybridization capture on microarrays can successfully recover more than a megabase of target regions from Neandertal DNA even in the presence of approximately 99.8% microbial DNA. Using this approach, we have sequenced approximately 14,000 protein-coding positions inferred to have changed on the human lineage since the last common ancestor shared with chimpanzees. By generating the sequence of one Neandertal and 50 present-day humans at these positions, we have identified 88 amino acid substitutions that have become fixed in humans since our divergence from the Neandertals.
Genetic History of an Archaic Hominin Group from Denisova Cave in Siberia
Nature. Dec, 2010 | Pubmed ID: 21179161
Using DNA extracted from a finger bone found in Denisova Cave in southern Siberia, we have sequenced the genome of an archaic hominin to about 1.9-fold coverage. This individual is from a group that shares a common origin with Neanderthals. This population was not involved in the putative gene flow from Neanderthals into Eurasians; however, the data suggest that it contributed 4-6% of its genetic material to the genomes of present-day Melanesians. We designate this hominin population 'Denisovans' and suggest that it may have been widespread in Asia during the Late Pleistocene epoch. A tooth found in Denisova Cave carries a mitochondrial genome highly similar to that of the finger bone. This tooth shares no derived morphological features with Neanderthals or modern humans, further indicating that Denisovans have an evolutionary history distinct from Neanderthals and modern humans.
Genomic DNA Sequences from Mastodon and Woolly Mammoth Reveal Deep Speciation of Forest and Savanna Elephants
PLoS Biology. 2010 | Pubmed ID: 21203580
To elucidate the history of living and extinct elephantids, we generated 39,763 bp of aligned nuclear DNA sequence across 375 loci for African savanna elephant, African forest elephant, Asian elephant, the extinct American mastodon, and the woolly mammoth. Our data establish that the Asian elephant is the closest living relative of the extinct mammoth in the nuclear genome, extending previous findings from mitochondrial DNA analyses. We also find that savanna and forest elephants, which some have argued are the same species, are as or more divergent in the nuclear genome as mammoths and Asian elephants, which are considered to be distinct genera, thus resolving a long-standing debate about the appropriate taxonomic classification of the African elephants. Finally, we document a much larger effective population size in forest elephants compared with the other elephantid taxa, likely reflecting species differences in ancient geographic structure and range and differences in life history traits such as variance in male reproductive success.
Evolution of a Tissue-specific Splicing Network
Genes & Development. Mar, 2011 | Pubmed ID: 21406555
Alternative splicing of precursor mRNA (pre-mRNA) is a strategy employed by most eukaryotes to increase transcript and proteomic diversity. Many metazoan splicing factors are members of multigene families, with each member having different functions. How these highly related proteins evolve unique properties has been unclear. Here we characterize the evolution and function of a new Drosophila splicing factor, termed LS2 (Large Subunit 2), that arose from a gene duplication event of dU2AF(50), the large subunit of the highly conserved heterodimeric general splicing factor U2AF (U2-associated factor). The quickly evolving LS2 gene has diverged from the splicing-promoting, ubiquitously expressed dU2AF(50) such that it binds a markedly different RNA sequence, acts as a splicing repressor, and is preferentially expressed in testes. Target transcripts of LS2 are also enriched for performing testes-related functions. We therefore propose a path for the evolution of a new splicing factor in Drosophila that regulates specific pre-mRNAs and contributes to transcript diversity in a tissue-specific manner.
The Shaping of Modern Human Immune Systems by Multiregional Admixture with Archaic Humans
Science (New York, N.Y.). Oct, 2011 | Pubmed ID: 21868630
Whole genome comparisons identified introgression from archaic to modern humans. Our analysis of highly polymorphic human leukocyte antigen (HLA) class I, vital immune system components subject to strong balancing selection, shows how modern humans acquired the HLA-B*73 allele in west Asia through admixture with archaic humans called Denisovans, a likely sister group to the Neandertals. Virtual genotyping of Denisovan and Neandertal genomes identified archaic HLA haplotypes carrying functionally distinctive alleles that have introgressed into modern Eurasian and Oceanian populations. These alleles, of which several encode unique or strong ligands for natural killer cell receptors, now represent more than half the HLA alleles of modern Eurasians and also appear to have been later introduced into Africans. Thus, adaptive introgression of archaic alleles has significantly shaped modern human immune systems.
Assemblathon 1: a Competitive Assessment of De Novo Short Read Assembly Methods
Genome Research. Dec, 2011 | Pubmed ID: 21926179
Low-cost short read sequencing technology has revolutionized genomics, though it is only just becoming practical for the high-quality de novo assembly of a novel large genome. We describe the Assemblathon 1 competition, which aimed to comprehensively assess the state of the art in de novo assembly methods when applied to current sequencing technologies. In a collaborative effort, teams were asked to assemble a simulated Illumina HiSeq data set of an unknown, simulated diploid genome. A total of 41 assemblies from 17 different groups were received. Novel haplotype aware assessments of coverage, contiguity, structure, base calling, and copy number were made. We establish that within this benchmark: (1) It is possible to assemble the genome to a high level of coverage and accuracy, and that (2) large differences exist between the assemblies, suggesting room for further improvements in current methods. The simulated benchmark, including the correct answer, the assemblies, and the code that was used to evaluate the assemblies is now public and freely available from http://www.assemblathon.org/.
The Developmental Transcriptome of Drosophila Melanogaster
Nature. Mar, 2011 | Pubmed ID: 21179090
Drosophila melanogaster is one of the most well studied genetic model organisms; nonetheless, its genome still contains unannotated coding and non-coding genes, transcripts, exons and RNA editing sites. Full discovery and annotation are pre-requisites for understanding how the regulation of transcription, splicing and RNA editing directs the development of this complex organism. Here we used RNA-Seq, tiling microarrays and cDNA sequencing to explore the transcriptome in 30 distinct developmental stages. We identified 111,195 new elements, including thousands of genes, coding and non-coding transcripts, exons, splicing and editing events, and inferred protein isoforms that previously eluded discovery using established experimental, prediction and conservation-based approaches. These data substantially expand the number of known transcribed elements in the Drosophila genome and provide a high-resolution view of transcriptome dynamics throughout development.
Sequencing Three Crocodilian Genomes to Illuminate the Evolution of Archosaurs and Amniotes
Genome Biology. Jan, 2012 | Pubmed ID: 22293439
ABSTRACT: The International Crocodilian Genomes Working Group (ICGWG) will sequence and assemble the American alligator (Alligator mississippiensis), saltwater crocodile (Crocodylus porosus) and Indian gharial (Gavialis gangeticus) genomes. The status of these projects and our planned analyses are described.
