Abstract
Analyses of ancient DNA typically involve sequencing the surviving short oligonucleotides and aligning to genome assemblies from related, modern species. Here, we report that skin from a female woolly mammoth (Mammuthus primigenius) that died 52,000 years ago retained its ancient genome architecture. We use PaleoHi-C to map chromatin contacts and assemble its genome, yielding 28 chromosome-length scaffolds. Chromosome territories, compartments, loops, Barr bodies, and inactive X chromosome (Xi) superdomains persist. The active and inactive genome compartments in mammoth skin more closely resemble Asian elephant skin than other elephant tissues. Our analyses uncover new biology. Differences in compartmentalization reveal genes whose transcription was potentially altered in mammoths vs. elephants. Mammoth Xi has a tetradic architecture, not bipartite like human and mouse. We hypothesize that, shortly after this mammoth's death, the sample spontaneously freeze-dried in the Siberian cold, leading to a glass transition that preserved subfossils of ancient chromosomes at nanometer scale.
Subjects
Genome, Mammoths, Skin, Animals, Mammoths/genetics, Genome/genetics, Female, Elephants/genetics, Chromatin/genetics, Fossils, Ancient DNA/analysis, Mice, Humans, X Chromosome/genetics
Abstract
A number of species have recently recovered from near-extinction. Although these species have avoided the immediate extinction threat, their long-term viability remains precarious due to the potential genetic consequences of population declines, which are poorly understood on a timescale beyond a few generations. Woolly mammoths (Mammuthus primigenius) became isolated on Wrangel Island around 10,000 years ago and persisted for over 200 generations before becoming extinct around 4,000 years ago. To study the evolutionary processes leading up to the mammoths' extinction, we analyzed 21 Siberian woolly mammoth genomes. Our results show that the population recovered quickly from a severe bottleneck and remained demographically stable during the ensuing six millennia. We find that mildly deleterious mutations gradually accumulated, whereas highly deleterious mutations were purged, suggesting ongoing inbreeding depression that lasted for hundreds of generations. The time-lag between demographic and genetic recovery has wide-ranging implications for conservation management of recently bottlenecked populations.
Subjects
Biological Extinction, Genome, Mammoths, Mutation, Animals, Mammoths/genetics, Genome/genetics, Siberia, Phylogeny, Molecular Evolution, Time Factors
Abstract
We sequenced and assembled using multiple long-read sequencing technologies the genomes of chimpanzee, bonobo, gorilla, orangutan, gibbon, macaque, owl monkey, and marmoset. We identified 1,338,997 lineage-specific fixed structural variants (SVs) disrupting 1,561 protein-coding genes and 136,932 regulatory elements, including the most complete set of human-specific fixed differences. We estimate that 819.47 Mbp or ~27% of the genome has been affected by SVs across primate evolution. We identify 1,607 structurally divergent regions wherein recurrent structural variation contributes to creating SV hotspots where genes are recurrently lost (e.g., CARD, C4, and OLAH gene families) and additional lineage-specific genes are generated (e.g., CKAP2, VPS36, ACBD7, and NEK5 paralogs), becoming targets of rapid chromosomal diversification and positive selection (e.g., RGPD gene family). High-fidelity long-read sequencing has made these dynamic regions of the genome accessible for sequence-level analyses within and between primate species.
Subjects
Genome, Primates, Animals, Humans, Base Sequence, Primates/classification, Primates/genetics, Biological Evolution, DNA Sequence Analysis, Genomic Structural Variation
Abstract
DNA-editing enzymes perform chemical reactions on DNA nucleobases. These reactions can change the genetic identity of the modified base or modulate gene expression. Interest in DNA-editing enzymes has burgeoned in recent years due to the advent of clustered regularly interspaced short palindromic repeat-associated (CRISPR-Cas) systems, which can be used to direct their DNA-editing activity to specific genomic loci of interest. In this review, we showcase DNA-editing enzymes that have been repurposed or redesigned and developed into programmable base editors. These include deaminases, glycosylases, methyltransferases, and demethylases. We highlight the astounding degree to which these enzymes have been redesigned, evolved, and refined and present these collective engineering efforts as a paragon for future efforts to repurpose and engineer other families of enzymes. Collectively, base editors derived from these DNA-editing enzymes facilitate programmable point mutation introduction and gene expression modulation by targeted chemical modification of nucleobases.
Subjects
CRISPR-Cas Systems, Gene Editing, CRISPR-Associated Protein 9/genetics, Genome, DNA/genetics, DNA/metabolism
Abstract
Nonhuman primates provide unique evolutionary and comparative insight into the human phenotype. Genome assemblies are now available for nearly half of the species in the primate order, expanding our understanding of genetic variation within and between species and making important contributions to evolutionary biology, evolutionary anthropology, and human genetics.
Subjects
Genetic Variation, Genome, Primates, Animals, Humans, Biological Evolution, Genome/genetics, Genomics, Primates/genetics
Abstract
A ubiquitous feature of eukaryotic transcriptional regulation is cooperative self-assembly between transcription factors (TFs) and DNA cis-regulatory motifs. It is thought that this strategy enables specific regulatory connections to be formed in gene networks between otherwise weakly interacting, low-specificity molecular components. Here, using synthetic gene circuits constructed in yeast, we find that high regulatory specificity can emerge from cooperative, multivalent interactions among artificial zinc-finger-based TFs. We show that circuits "wired" using the strategy of cooperative TF assembly are effectively insulated from aberrant misregulation of the host cell genome. As we demonstrate in experiments and mathematical models, this mechanism is sufficient to rescue circuit-driven fitness defects, resulting in genetic and functional stability of circuits in long-term continuous culture. Our naturally inspired approach offers a simple, generalizable means for building high-fidelity, evolutionarily robust gene circuits that can be scaled to a wide range of host organisms and applications.
Subjects
Gene Regulatory Networks, Transcription Factors, Transcription Factors/genetics, Saccharomyces cerevisiae/genetics, Genome
Abstract
Today's genomics workflows typically require alignment to a reference sequence, which limits discovery. We introduce a unifying paradigm, SPLASH (Statistically Primary aLignment Agnostic Sequence Homing), which directly analyzes raw sequencing data, using a statistical test to detect a signature of regulation: sample-specific sequence variation. SPLASH detects many types of variation and can be efficiently run at scale. We show that SPLASH identifies complex mutation patterns in SARS-CoV-2, discovers regulated RNA isoforms at the single-cell level, detects the vast sequence diversity of adaptive immune receptors, and uncovers biology in non-model organisms undocumented in their reference genomes: geographic and seasonal variation and diatom association in eelgrass, an oceanic plant impacted by climate change, and tissue-specific transcripts in octopus. SPLASH is a unifying approach to genomic analysis that enables expansive discovery without metadata or references.
Subjects
Algorithms, Genomics, Genome, RNA Sequence Analysis, Humans, HLA Antigens/genetics, Single-Cell Analysis
Abstract
Systematic evaluation of the impact of genetic variants is critical for the study and treatment of human physiology and disease. While specific mutations can be introduced by genome engineering, we still lack scalable approaches that are applicable to the important setting of primary cells, such as blood and immune cells. Here, we describe the development of massively parallel base-editing screens in human hematopoietic stem and progenitor cells. Such approaches enable functional screens for variant effects across any hematopoietic differentiation state. Moreover, they allow for rich phenotyping through single-cell RNA sequencing readouts and separately for characterization of editing outcomes through pooled single-cell genotyping. We efficiently design improved leukemia immunotherapy approaches, comprehensively identify non-coding variants modulating fetal hemoglobin expression, define mechanisms regulating hematopoietic differentiation, and probe the pathogenicity of uncharacterized disease-associated variants. These strategies will advance effective and high-throughput variant-to-function mapping in human hematopoiesis to identify the causes of diverse diseases.
Subjects
Gene Editing, Hematopoietic Stem Cells, Humans, Cell Differentiation, CRISPR-Cas Systems, Genome, Hematopoiesis, Hematopoietic Stem Cells/metabolism, Genetic Engineering, Single-Cell Analysis
Abstract
Snakes are a remarkable squamate lineage with unique morphological adaptations, especially those related to the evolution of vertebrate skeletons, organs, and sensory systems. To clarify the genetic underpinnings of snake phenotypes, we assembled and analyzed 14 de novo genomes from 12 snake families. We also investigated the genetic basis of the morphological characteristics of snakes using functional experiments. We identified genes, regulatory elements, and structural variations that have potentially contributed to the evolution of limb loss, an elongated body plan, asymmetrical lungs, sensory systems, and digestive adaptations in snakes. We identified some of the genes and regulatory elements that might have shaped the evolution of vision, the skeletal system and diet in blind snakes, and thermoreception in infrared-sensitive snakes. Our study provides insights into the evolution and development of snakes and vertebrates.
Subjects
Genome, Snakes, Animals, Snakes/genetics, Physiological Adaptation, Acclimatization, Molecular Evolution, Phylogeny, Biological Evolution
Abstract
Functional genomic strategies have become fundamental for annotating gene function and regulatory networks. Here, we combined functional genomics with proteomics by quantifying protein abundances in a genome-scale knockout library in Saccharomyces cerevisiae, using data-independent acquisition mass spectrometry. We find that global protein expression is driven by a complex interplay of (1) general biological properties, including translation rate, protein turnover, the formation of protein complexes, growth rate, and genome architecture, followed by (2) functional properties, such as the connectivity of a protein in genetic, metabolic, and physical interaction networks. Moreover, we show that functional proteomics complements current gene annotation strategies through the assessment of proteome profile similarity, protein covariation, and reverse proteome profiling. Thus, our study reveals principles that govern protein expression and provides a genome-spanning resource for functional annotation.
Subjects
Proteome, Proteomics, Proteomics/methods, Proteome/metabolism, Genomics/methods, Genome, Saccharomyces cerevisiae/genetics, Saccharomyces cerevisiae/metabolism
Abstract
Antarctic krill (Euphausia superba) is Earth's most abundant wild animal, and its enormous biomass is vital to the Southern Ocean ecosystem. Here, we report a 48.01-Gb chromosome-level Antarctic krill genome, whose large genome size appears to have resulted from inter-genic transposable element expansions. Our assembly reveals the molecular architecture of the Antarctic krill circadian clock and uncovers expanded gene families associated with molting and energy metabolism, providing insights into adaptations to the cold and highly seasonal Antarctic environment. Population-level genome re-sequencing from four geographical sites around the Antarctic continent reveals no clear population structure but highlights natural selection associated with environmental variables. An apparent drastic reduction in krill population size 10 mya and a subsequent rebound 100 thousand years ago coincides with climate change events. Our findings uncover the genomic basis of Antarctic krill adaptations to the Southern Ocean and provide valuable resources for future Antarctic research.
Subjects
Euphausiacea, Genome, Animals, Circadian Clocks/genetics, Ecosystem, Euphausiacea/genetics, Euphausiacea/physiology, Genomics, DNA Sequence Analysis, DNA Transposable Elements, Biological Evolution, Physiological Adaptation
Abstract
A generic level of chromatin organization generated by the interplay between cohesin and CTCF suffices to limit promiscuous interactions between regulatory elements, but a lineage-specific chromatin assembly that supersedes these constraints is required to configure the genome to guide gene expression changes that drive faithful lineage progression. Loss-of-function approaches in B cell precursors show that IKAROS assembles interactions across megabase distances in preparation for lymphoid development. Interactions emanating from IKAROS-bound enhancers override CTCF-imposed boundaries to assemble lineage-specific regulatory units built on a backbone of smaller invariant topological domains. Gain of function in epithelial cells confirms IKAROS' ability to reconfigure chromatin architecture at multiple scales. Although the compaction of the Igκ locus required for genome editing represents a function of IKAROS unique to lymphocytes, the more general function to preconfigure the genome to support lineage-specific gene expression and suppress activation of extra-lineage genes provides a paradigm for lineage restriction.
Subjects
Chromatin, Genome, B-Lymphocytes/metabolism, CCCTC-Binding Factor/metabolism, Chromatin/metabolism, Chromatin Assembly and Disassembly, Humans, Animals, Mice
Abstract
The fidelity of genetic information is essential for cellular function and viability. DNA double-strand breaks (DSBs) pose a significant threat to genome integrity, necessitating efficient repair mechanisms. While the predominant repair strategies are usually accurate, paradoxically, error-prone pathways also exist. This review explores recent advances and our understanding of microhomology-mediated end joining (MMEJ), an intrinsically mutagenic DSB repair pathway conserved across organisms. Central to MMEJ is the activity of DNA polymerase theta (Polθ), a specialized polymerase that fuels MMEJ mutagenicity. We examine the molecular intricacies underlying MMEJ activity and discuss its function during mitosis, where the activity of Polθ emerges as a last-ditch effort to resolve persistent DSBs, especially when homologous recombination is compromised. We explore the promising therapeutic applications of targeting Polθ in cancer treatment and genome editing. Lastly, we discuss the evolutionary consequences of MMEJ, highlighting its delicate balance between protecting genome integrity and driving genomic diversity.
Subjects
Double-Stranded DNA Breaks, DNA End-Joining Repair, Humans, Animals, Molecular Evolution, DNA-Directed DNA Polymerase/metabolism, DNA-Directed DNA Polymerase/genetics, Genome/genetics, DNA Polymerase theta
Abstract
CRISPR-Cas systems are host-encoded pathways that protect microbes from viral infection using an adaptive RNA-guided mechanism. Using genome-resolved metagenomics, we find that CRISPR systems are also encoded in diverse bacteriophages, where they occur as divergent and hypercompact anti-viral systems. Bacteriophage-encoded CRISPR systems belong to all six known CRISPR-Cas types, though some lack crucial components, suggesting alternate functional roles or host complementation. We describe multiple new Cas9-like proteins and 44 families related to type V CRISPR-Cas systems, including the Casλ RNA-guided nuclease family. Among the most divergent of the new enzymes identified, Casλ recognizes double-stranded DNA using a uniquely structured CRISPR RNA (crRNA). The Casλ-RNA-DNA structure determined by cryoelectron microscopy reveals a compact bilobed architecture capable of inducing genome editing in mammalian, Arabidopsis, and hexaploid wheat cells. These findings reveal a new source of CRISPR-Cas enzymes in phages and highlight their value as genome editors in plant and human cells.
Subjects
Bacteriophages, CRISPR-Cas Systems, Animals, Humans, Cryoelectron Microscopy, Gene Editing, Genome, Bacteriophages/genetics, DNA, RNA, Mammals/genetics
Abstract
Incomplete lineage sorting (ILS) makes ancestral genetic polymorphisms persist during rapid speciation events, inducing incongruences between gene trees and species trees. ILS has complicated phylogenetic inference in many lineages, including hominids. However, we lack empirical evidence that ILS leads to incongruent phenotypic variation. Here, we performed phylogenomic analyses to show that the South American monito del monte is the sister lineage of all Australian marsupials, although over 31% of its genome is closer to the Diprotodontia than to other Australian groups due to ILS during ancient radiation. Pervasive conflicting phylogenetic signals across the whole genome are consistent with some of the morphological variation among extant marsupials. We detected hundreds of genes that experienced stochastic fixation during ILS, encoding the same amino acids in non-sister species. Using functional experiments, we confirm how ILS may have directly contributed to hemiplasy in morphological traits that were established during rapid marsupial speciation ca. 60 mya.
Subjects
Marsupials, Animals, Australia, Molecular Evolution, Genetic Speciation, Genome, Marsupials/genetics, Phenotype, Phylogeny
Abstract
The precise genetic origins of the first Neolithic farming populations in Europe and Southwest Asia, as well as the processes and the timing of their differentiation, remain largely unknown. Demogenomic modeling of high-quality ancient genomes reveals that the early farmers of Anatolia and Europe emerged from a multiphase mixing of a Southwest Asian population with a strongly bottlenecked western hunter-gatherer population after the last glacial maximum. Moreover, the ancestors of the first farmers of Europe and Anatolia went through a period of extreme genetic drift during their westward range expansion, contributing highly to their genetic distinctiveness. This modeling elucidates the demographic processes at the root of the Neolithic transition and leads to a spatial interpretation of the population history of Southwest Asia and Europe during the late Pleistocene and early Holocene.
Subjects
Farmers, Genome, Agriculture, Mitochondrial DNA/genetics, Europe, Genetic Drift, Genomics, Ancient History, Human Migration, Humans
Abstract
The Avars settled the Carpathian Basin in 567/68 CE, establishing an empire lasting over 200 years. Who they were and where they came from is highly debated. Contemporaries have disagreed about whether they were, as they claimed, the direct successors of the Mongolian Steppe Rouran empire that was destroyed by the Turks in ~550 CE. Here, we analyze new genome-wide data from 66 pre-Avar and Avar-period Carpathian Basin individuals, including the 8 richest Avar-period burials and further elite sites from Avar's empire core region. Our results provide support for a rapid long-distance trans-Eurasian migration of Avar-period elites. These individuals carried Northeast Asian ancestry matching the profile of preceding Mongolian Steppe populations, particularly a genome available from the Rouran period. Some of the later elite individuals carried an additional non-local ancestry component broadly matching the steppe, which could point to a later migration or reflect greater genetic diversity within the initial migrant population.
Subjects
Asian People, Ancient DNA, Population Genetics, Asian People/genetics, Genome, Ancient History, Human Migration/history, Humans, Sulfur
Abstract
Here, we report inducible mosaic animal for perturbation (iMAP), a transgenic platform enabling in situ CRISPR targeting of at least 100 genes in parallel throughout the mouse body. iMAP combines Cre-loxP and CRISPR-Cas9 technologies and utilizes a germline-transmitted transgene carrying a large array of individually floxed, tandemly linked gRNA-coding units. Cre-mediated recombination triggers expression of all the gRNAs in the array but only one of them per cell, converting the mice to mosaic organisms suitable for phenotypic characterization and also for high-throughput derivation of conventional single-gene perturbation lines via breeding. Using gRNA representation as a readout, we mapped a miniature Perturb-Atlas cataloging the perturbations of 90 genes across 39 tissues, which yields rich insights into context-dependent gene functions and provides a glimpse of the potential of iMAP in genome decoding.
Subjects
CRISPR-Cas Systems, Kinetoplastida Guide RNA, Animals, CRISPR-Cas Systems/genetics, Gene Editing, Genome, Mice, Kinetoplastida Guide RNA/genetics, Kinetoplastida Guide RNA/metabolism, Transgenes
Abstract
The metabolic activities of microbial communities play a defining role in the evolution and persistence of life on Earth, driving redox reactions that give rise to global biogeochemical cycles. Community metabolism emerges from a hierarchy of processes, including gene expression, ecological interactions, and environmental factors. In wild communities, gene content is correlated with environmental context, but predicting metabolite dynamics from genomes remains elusive. Here, we show, for the process of denitrification, that metabolite dynamics of a community are predictable from the genes each member of the community possesses. A simple linear regression reveals a sparse and generalizable mapping from gene content to metabolite dynamics for genomically diverse bacteria. A consumer-resource model correctly predicts community metabolite dynamics from single-strain phenotypes. Our results demonstrate that the conserved impacts of metabolic genes can predict community metabolite dynamics, enabling the prediction of metabolite dynamics from metagenomes, designing denitrifying communities, and discovering how genome evolution impacts metabolism.
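As an illustration of what a linear mapping from gene content to a dynamics parameter can look like, the following is a minimal sketch and not the authors' code. It fits plain ordinary least squares with NumPy on made-up, noise-free data; the gene labels, the rate values, and the function names `fit_gene_to_phenotype` and `predict` are all hypothetical. The abstract describes the mapping as sparse; this sketch does not enforce sparsity.

```python
import numpy as np


def fit_gene_to_phenotype(presence, phenotype):
    """Ordinary least-squares fit: phenotype ~ intercept + presence @ coef.

    presence  : (n_strains, n_genes) array of 0/1 gene presence/absence
    phenotype : (n_strains,) array of one measured dynamics parameter
    Returns (intercept, coef).
    """
    presence = np.asarray(presence, dtype=float)
    phenotype = np.asarray(phenotype, dtype=float)
    # Prepend a column of ones so the first fitted weight is the intercept.
    design = np.column_stack([np.ones(len(presence)), presence])
    weights, *_ = np.linalg.lstsq(design, phenotype, rcond=None)
    return weights[0], weights[1:]


def predict(presence, intercept, coef):
    """Predict the parameter for new strains from their gene content."""
    return intercept + np.asarray(presence, dtype=float) @ coef


# Hypothetical toy data: 6 strains x 3 genes (labels are illustrative only).
GENES = ["narG", "nirS", "nosZ"]
PRESENCE = np.array([
    [1, 0, 0],
    [0, 1, 0],
    [1, 1, 0],
    [0, 0, 1],
    [1, 0, 1],
    [1, 1, 1],
])
# Made-up rates generated as 0.5 + 2.0*narG + 0.0*nirS + 1.0*nosZ, no noise.
RATES = 0.5 + PRESENCE @ np.array([2.0, 0.0, 1.0])

INTERCEPT, COEF = fit_gene_to_phenotype(PRESENCE, RATES)
```

With real measurements the fit would not be exact, and a regularized regression would be the more natural choice when only a few genes are expected to matter; that choice is an assumption here and is not stated in the abstract.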
Subjects
Genomics, Metabolomics, Microbiota/genetics, Biomass, Denitrification, Genome, Biological Models, Nitrates/metabolism, Nitrites/metabolism, Phenotype, Regression Analysis, Reproducibility of Results
Abstract
Regulatory landscapes drive complex developmental gene expression, but it remains unclear how their integrity is maintained when incorporating novel genes and functions during evolution. Here, we investigated how a placental mammal-specific gene, Zfp42, emerged in an ancient vertebrate topologically associated domain (TAD) without adopting or disrupting the conserved expression of its gene, Fat1. In ESCs, physical TAD partitioning separates Zfp42 and Fat1 with distinct local enhancers that drive their independent expression. This separation is driven by chromatin activity and not CTCF/cohesin. In contrast, in embryonic limbs, inactive Zfp42 shares Fat1's intact TAD without responding to active Fat1 enhancers. However, neither Fat1 enhancer-incompatibility nor nuclear envelope-attachment account for Zfp42's unresponsiveness. Rather, Zfp42's promoter is rendered inert to enhancers by context-dependent DNA methylation. Thus, diverse mechanisms enabled the integration of independent Zfp42 regulation in the Fat1 locus. Critically, such regulatory complexity appears common in evolution as, genome wide, most TADs contain multiple independently expressed genes.