Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 6.236
Filtrar
1.
Nat Commun ; 15(1): 3095, 2024 Apr 23.
Artigo em Inglês | MEDLINE | ID: mdl-38653976

RESUMO

Vocal rhythm plays a fundamental role in sexual selection and species recognition in birds, but little is known of its genetic basis due to the confounding effect of vocal learning in model systems. Uncovering its genetic basis could facilitate identifying genes potentially important in speciation. Here we investigate the genomic underpinnings of rhythm in vocal non-learning Pogoniulus tinkerbirds using 135 individual whole genomes distributed across a southern African hybrid zone. We find rhythm speed is associated with two genes that are also known to affect human speech, Neurexin-1 and Coenzyme Q8A. Models leveraging ancestry reveal these candidate loci also impact rhythmic stability, a trait linked with motor performance which is an indicator of quality. Character displacement in rhythmic stability suggests possible reinforcement against hybridization, supported by evidence of asymmetric assortative mating in the species producing faster, more stable rhythms. Because rhythm is omnipresent in animal communication, candidate genes identified here may shape vocal rhythm across birds and other vertebrates.


Assuntos
Vocalização Animal , Animais , Vocalização Animal/fisiologia , Masculino , Genômica , Genoma/genética , Feminino , Aves Canoras/genética , Aves Canoras/fisiologia , Aves/genética , Aves/fisiologia
2.
Methods Mol Biol ; 2757: 461-490, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38668979

RESUMO

Understanding gene evolution across genomes and organisms, including ctenophores, can provide unexpected biological insights. It enables powerful integrative approaches that leverage sequence diversity to advance biomedicine. Sequencing and bioinformatic tools can be inexpensive and user-friendly, but numerous options and coding can intimidate new users. Distinct challenges exist in working with data from diverse species but may go unrecognized by researchers accustomed to gold-standard genomes. Here, we provide a high-level workflow and detailed pipeline to enable animal collection, single-molecule sequencing, and phylogenomic analysis of gene and species evolution. As a demonstration, we focus on (1) PacBio RNA-seq of the genome-sequenced ctenophore Mnemiopsis leidyi, (2) diversity and evolution of the mechanosensitive ion channel Piezo in genetic models and basal-branching animals, and (3) associated challenges and solutions to working with diverse species and genomes, including gene model updating and repair using single-molecule RNA-seq. We provide a Python Jupyter Notebook version of our pipeline (GitHub Repository: Ctenophore-Ocean-To-Tree-2023 https://github.com/000generic/Ctenophore-Ocean-To-Tree-2023 ) that can be run for free in the Google Colab cloud to replicate our findings or modified for specific or greater use. Our protocol enables users to design new sequencing projects in ctenophores, marine invertebrates, or other novel organisms. It provides a simple, comprehensive platform that can ease new user entry into running their evolutionary sequence analyses.


Assuntos
Ctenóforos , Evolução Molecular , Filogenia , RNA-Seq , Animais , RNA-Seq/métodos , Ctenóforos/genética , Ctenóforos/classificação , Genoma/genética , Biologia Computacional/métodos , Software , Genômica/métodos , Modelos Genéticos
3.
Methods Mol Biol ; 2757: 185-200, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38668967

RESUMO

Long-read sequencing has proven the necessity for high-quality genomic assemblies of reference species, including enigmatic ctenophores. Obtaining high-molecular-weight genomic DNA is pivotal to this process and has proven highly problematic for many species. Here, we discuss different methodologies for gDNA isolation and present a protocol for isolating gDNA for several members of the phylum Ctenophora. Specifically, we describe a Pacific Biosciences library construction method used in conjunction with gDNA isolation methods that have proven successful in obtaining high-quality genomic assemblies in ctenophores.


Assuntos
Ctenóforos , DNA , Genômica , Análise de Sequência de DNA , Animais , Ctenóforos/genética , Genômica/métodos , DNA/genética , DNA/isolamento & purificação , Análise de Sequência de DNA/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Biblioteca Gênica , Genoma/genética
4.
PLoS Genet ; 20(4): e1011228, 2024 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-38598567

RESUMO

The laboratory mouse has served as the premier animal model system for both basic and preclinical investigations for over a century. However, laboratory mice capture only a subset of the genetic variation found in wild mouse populations, ultimately limiting the potential of classical inbred strains to uncover phenotype-associated variants and pathways. Wild mouse populations are reservoirs of genetic diversity that could facilitate the discovery of new functional and disease-associated alleles, but the scarcity of commercially available, well-characterized wild mouse strains limits their broader adoption in biomedical research. To overcome this barrier, we have recently developed, sequenced, and phenotyped a set of 11 inbred strains derived from wild-caught Mus musculus domesticus. Each of these "Nachman strains" immortalizes a unique wild haplotype sampled from one of five environmentally distinct locations across North and South America. Whole genome sequence analysis reveals that each strain carries between 4.73-6.54 million single nucleotide differences relative to the GRCm39 mouse reference, with 42.5% of variants in the Nachman strain genomes absent from current classical inbred mouse strain panels. We phenotyped the Nachman strains on a customized pipeline to assess the scope of disease-relevant neurobehavioral, biochemical, physiological, metabolic, and morphological trait variation. The Nachman strains exhibit significant inter-strain variation in >90% of 1119 surveyed traits and expand the range of phenotypic diversity captured in classical inbred strain panels. These novel wild-derived inbred mouse strain resources are set to empower new discoveries in both basic and preclinical research.


Assuntos
Variação Genética , Camundongos Endogâmicos , Fenótipo , Animais , Camundongos , Camundongos Endogâmicos/genética , Genômica/métodos , Animais Selvagens/genética , Genoma/genética , Polimorfismo de Nucleotídeo Único , Haplótipos , Sequenciamento Completo do Genoma
5.
Mol Biol Rep ; 51(1): 560, 2024 Apr 20.
Artigo em Inglês | MEDLINE | ID: mdl-38643284

RESUMO

BACKGROUND: Zygotic genome activation (ZGA) is an important event in the early embryo development, and human embryo developmental arrest has been highly correlated with ZGA failure in clinical studies. Although a few studies have linked maternal factors to mammalian ZGA, more studies are needed to fully elucidate the maternal factors that are involved in ZGA. METHODS AND RESULTS: In this study, we utilized published single-cell RNA sequencing data from a Dux-mediated mouse embryonic stem cell to induce a 2-cell-like transition state and selected potential drivers for the transition according to an RNA velocity analysis. CONCLUSIONS: An overlap of potential candidate markers of 2-cell-like-cells identified in this research with markers generated by various data sets suggests that Trim75 is a potential driver of minor ZGA and may recruit EP300 and establish H3K27ac in the gene body of minor ZGA genes, thereby contributing to mammalian preimplantation embryo development.


Assuntos
Regulação da Expressão Gênica no Desenvolvimento , Zigoto , Animais , Humanos , Camundongos , Desenvolvimento Embrionário/genética , Genoma/genética , Embrião de Mamíferos , Mamíferos
6.
Proc Natl Acad Sci U S A ; 121(15): e2319506121, 2024 Apr 09.
Artigo em Inglês | MEDLINE | ID: mdl-38557186

RESUMO

Genomes are typically mosaics of regions with different evolutionary histories. When speciation events are closely spaced in time, recombination makes the regions sharing the same history small, and the evolutionary history changes rapidly as we move along the genome. When examining rapid radiations such as the early diversification of Neoaves 66 Mya, typically no consistent history is observed across segments exceeding kilobases of the genome. Here, we report an exception. We found that a 21-Mb region in avian genomes, mapped to chicken chromosome 4, shows an extremely strong and discordance-free signal for a history different from that of the inferred species tree. Such a strong discordance-free signal, indicative of suppressed recombination across many millions of base pairs, is not observed elsewhere in the genome for any deep avian relationships. Although long regions with suppressed recombination have been documented in recently diverged species, our results pertain to relationships dating circa 65 Mya. We provide evidence that this strong signal may be due to an ancient rearrangement that blocked recombination and remained polymorphic for several million years prior to fixation. We show that the presence of this region has misled previous phylogenomic efforts with lower taxon sampling, showing the interplay between taxon and locus sampling. We predict that similar ancient rearrangements may confound phylogenetic analyses in other clades, pointing to a need for new analytical models that incorporate the possibility of such events.


Assuntos
Evolução Biológica , Genoma , Animais , Filogenia , Genoma/genética , Aves , Recombinação Genética
7.
Methods Mol Biol ; 2744: 247-265, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38683324

RESUMO

In this protocol paper, we review a set of methods developed in recent years for analyzing nuclear reads obtained from genome skimming. As the cost of sequencing drops, genome skimming (low-coverage shotgun sequencing of a sample) becomes increasingly a cost-effective method of measuring biodiversity at high resolution. While most practitioners only use assembled over-represented organelle reads from a genome skim, the vast majority of the reads are nuclear. Using assembly-free and alignment-free methods described in this protocol, we can compare samples to each other and reference genomes to compute distances, characterize underlying genomes, and infer evolutionary relationships.


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala , Análise de Sequência de DNA , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Sequência de DNA/métodos , Genômica/métodos , Genoma/genética , Software , Núcleo Celular/genética , Biologia Computacional/métodos , Humanos
9.
Genes (Basel) ; 15(4)2024 Apr 03.
Artigo em Inglês | MEDLINE | ID: mdl-38674388

RESUMO

The selection and breeding of good meat rabbit breeds are fundamental to their industrial development, and genomic selection (GS) can employ genomic information to make up for the shortcomings of traditional phenotype-based breeding methods. For the practical implementation of GS in meat rabbit breeding, it is necessary to assess different marker densities and GS models. Here, we obtained low-coverage whole-genome sequencing (lcWGS) data from 1515 meat rabbits (including parent herd and half-sibling offspring). The specific objectives were (1) to derive a baseline for heritability estimates and genomic predictions based on randomly selected marker densities and (2) to assess the accuracy of genomic predictions for single- and multiple-trait linear mixed models. We found that a marker density of 50 K can be used as a baseline for heritability estimation and genomic prediction. For GS, the multi-trait genomic best linear unbiased prediction (GBLUP) model results in more accurate predictions for virtually all traits compared to the single-trait model, with improvements greater than 15% for all of them, which may be attributed to the use of information on genetically related traits. In addition, we discovered a positive correlation between the performance of the multi-trait GBLUP and the genetic correlation between the traits. We anticipate that this approach will provide solutions for GS, as well as optimize breeding programs, in meat rabbits.


Assuntos
Genômica , Carne , Animais , Coelhos/genética , Genômica/métodos , Seleção Genética , Fenótipo , Cruzamento/métodos , Modelos Genéticos , Característica Quantitativa Herdável , Marcadores Genéticos , Sequenciamento Completo do Genoma/métodos , Locos de Características Quantitativas , Genoma/genética
10.
Genes (Basel) ; 15(4)2024 Apr 18.
Artigo em Inglês | MEDLINE | ID: mdl-38674443

RESUMO

Various bovine species have been domesticated and bred for thousands of years, and they provide adequate animal-derived products, including meat, milk, and leather, to meet human requirements. Despite the review studies on economic traits in cattle, the genetic basis of traits has only been partially explained by phenotype and pedigree breeding methods, due to the complexity of genomic regulation during animal development and growth. With the advent of next-generation sequencing technology, genomics projects, such as the 1000 Bull Genomes Project, Functional Annotation of Animal Genomes project, and Bovine Pangenome Consortium, have advanced bovine genomic research. These large-scale genomics projects gave us a comprehensive concept, technology, and public resources. In this review, we summarize the genomics research progress of the main bovine species during the past decade, including cattle (Bos taurus), yak (Bos grunniens), water buffalo (Bubalus bubalis), zebu (Bos indicus), and gayal (Bos frontalis). We mainly discuss the development of genome sequencing and functional annotation, focusing on how genomic analysis reveals genetic variation and its impact on phenotypes in several bovine species.


Assuntos
Genoma , Genômica , Animais , Bovinos/genética , Genoma/genética , Genômica/métodos , Búfalos/genética , Fenótipo , Sequenciamento de Nucleotídeos em Larga Escala , Cruzamento
11.
J Comput Biol ; 31(4): 294-311, 2024 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-38621180

RESUMO

Whole Genome Duplications (WGDs) are events that double the content and structure of a genome. In some organisms, multiple WGD events have been observed while loss of genetic material is a typical occurrence following a WGD event. The requirement of classic rearrangement models that every genetic marker has to occur exactly two times in a given problem instance, therefore, poses a serious restriction in this context. The Double-Cut and Join (DCJ) model is a simple and powerful model for the analysis of large structural rearrangements. After being extended to the DCJ-Indel model, capable of handling gains and losses of genetic material, research has shifted in recent years toward enabling it to handle natural genomes, for which no assumption about the distribution of markers has to be made. The traditional theoretical framework for studying WGD events is the Genome Halving Problem (GHP). While the GHP is solved for the DCJ model for genomes without losses, there are currently no exact algorithms utilizing the DCJ-Indel model that are able to handle natural genomes. In this work, we present a general view on the DCJ-Indel model that we apply to derive an exact polynomial time and space solution for the GHP on genomes with at most two genes per family before generalizing the problem to an integer linear program solution for natural genomes.


Assuntos
Algoritmos , Genoma , Modelos Genéticos , Genoma/genética , Duplicação Gênica , Evolução Molecular
12.
Nat Commun ; 15(1): 3451, 2024 Apr 24.
Artigo em Inglês | MEDLINE | ID: mdl-38658544

RESUMO

Enhancers are fast-evolving genomic sequences that control spatiotemporal gene expression patterns. By examining enhancer turnover across mammalian species and in multiple tissue types, we uncover a relationship between the emergence of enhancers and genome organization as a function of germline DNA replication time. While enhancers are most abundant in euchromatic regions, enhancers emerge almost twice as often in late compared to early germline replicating regions, independent of transposable elements. Using a deep learning sequence model, we demonstrate that new enhancers are enriched for mutations that alter transcription factor (TF) binding. Recently evolved enhancers appear to be mostly neutrally evolving and enriched in eQTLs. They also show more tissue specificity than conserved enhancers, and the TFs that bind to these elements, as inferred by binding sequences, also show increased tissue-specific gene expression. We find a similar relationship with DNA replication time in cancer, suggesting that these observations may be time-invariant principles of genome evolution. Our work underscores that genome organization has a profound impact in shaping mammalian gene regulation.


Assuntos
Replicação do DNA , Elementos Facilitadores Genéticos , Animais , Humanos , Evolução Molecular , Fatores de Transcrição/metabolismo , Fatores de Transcrição/genética , Camundongos , Regulação da Expressão Gênica , Especificidade de Órgãos/genética , Mutação , Genoma/genética , Elementos de DNA Transponíveis/genética
13.
PLoS One ; 19(4): e0297987, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38578816

RESUMO

Sex identification is a common objective in molecular ecology. While many vertebrates display sexual dimorphism, determining the sex can be challenging in certain situations, such as species lacking clear sex-related phenotypic characteristics or in studies using non-invasive methods. In these cases, DNA analyses serve as valuable tools not only for sex determination but also for validating sex assignment based on phenotypic traits. In this study, we developed a bioinformatic framework for sex assignment using genomic data obtained through GBS, and having an available closely related genome assembled at the chromosome level. Our method consists of two ad hoc indexes that rely on the different properties of the mammalian heteromorphic sex chromosomes. For this purpose, we mapped RAD-seq loci to a reference genome and then obtained missingness and coverage depth values for the autosomes and X and Y chromosomes of each individual. Our methodology successfully determined the sex of 165 fur seals that had been phenotypically sexed in a previous study and 40 sea lions sampled in a non-invasive way. Additionally, we evaluated the accuracy of each index in sequences with varying average coverage depths, with Index Y proving greater reliability and robustness in assigning sex to individuals with low-depth coverage. We believe that the approach presented here can be extended to any animal taxa with known heteromorphic XY/ZW sex chromosome systems and that it can tolerate various qualities of GBS sequencing data.


Assuntos
Genoma , Cromossomos Sexuais , Humanos , Animais , Reprodutibilidade dos Testes , Genoma/genética , Cromossomos Sexuais/genética , Cromossomo Y , Genômica , Mamíferos/genética
14.
Sci Rep ; 14(1): 8073, 2024 04 05.
Artigo em Inglês | MEDLINE | ID: mdl-38580653

RESUMO

The fishing cat, Prionailurus viverrinus, faces a population decline, increasing the importance of maintaining healthy zoo populations. Unfortunately, zoo-managed individuals currently face a high prevalence of transitional cell carcinoma (TCC), a form of bladder cancer. To investigate the genetics of inherited diseases among captive fishing cats, we present a chromosome-scale assembly, generate the pedigree of the zoo-managed population, reaffirm the close genetic relationship with the Asian leopard cat (Prionailurus bengalensis), and identify 7.4 million single nucleotide variants (SNVs) and 23,432 structural variants (SVs) from whole genome sequencing (WGS) data of healthy and TCC cats. Only BRCA2 was found to have a high recurrent number of missense mutations in fishing cats diagnosed with TCC when compared to inherited human cancer risk variants. These new fishing cat genomic resources will aid conservation efforts to improve their genetic fitness and enhance the comparative study of feline genomes.


Assuntos
Carcinoma de Células de Transição , Neoplasias da Bexiga Urinária , Gatos , Animais , Humanos , Genoma/genética , Neoplasias da Bexiga Urinária/patologia , Carcinoma de Células de Transição/patologia , Genômica , Células Germinativas/patologia
15.
Biosystems ; 239: 105199, 2024 May.
Artigo em Inglês | MEDLINE | ID: mdl-38641198

RESUMO

Over the past quarter-century, the field of evolutionary biology has been transformed by the emergence of complete genome sequences and the conceptual framework known as the 'Net of Life.' This paradigm shift challenges traditional notions of evolution as a tree-like process, emphasizing the complex, interconnected network of gene flow that may blur the boundaries between distinct lineages. In this context, gene loss, rather than horizontal gene transfer, is the primary driver of gene content, with vertical inheritance playing a principal role. The 'Net of Life' not only impacts our understanding of genome evolution but also has profound implications for classification systems, the rapid appearance of new traits, and the spread of diseases. Here, we explore the core tenets of the 'Net of Life' and its implications for genome-scale phylogenetic divergence, providing a comprehensive framework for further investigations in evolutionary biology.


Assuntos
Evolução Molecular , Fluxo Gênico , Genoma , Filogenia , Genoma/genética , Animais , Humanos , Transferência Genética Horizontal , Modelos Genéticos , Evolução Biológica
16.
Nature ; 629(8010): 127-135, 2024 May.
Artigo em Inglês | MEDLINE | ID: mdl-38658750

RESUMO

Phenotypic variation among species is a product of evolutionary changes to developmental programs1,2. However, how these changes generate novel morphological traits remains largely unclear. Here we studied the genomic and developmental basis of the mammalian gliding membrane, or patagium-an adaptative trait that has repeatedly evolved in different lineages, including in closely related marsupial species. Through comparative genomic analysis of 15 marsupial genomes, both from gliding and non-gliding species, we find that the Emx2 locus experienced lineage-specific patterns of accelerated cis-regulatory evolution in gliding species. By combining epigenomics, transcriptomics and in-pouch marsupial transgenics, we show that Emx2 is a critical upstream regulator of patagium development. Moreover, we identify different cis-regulatory elements that may be responsible for driving increased Emx2 expression levels in gliding species. Lastly, using mouse functional experiments, we find evidence that Emx2 expression patterns in gliders may have been modified from a pre-existing program found in all mammals. Together, our results suggest that patagia repeatedly originated through a process of convergent genomic evolution, whereby regulation of Emx2 was altered by distinct cis-regulatory elements in independently evolved species. Thus, different regulatory elements targeting the same key developmental gene may constitute an effective strategy by which natural selection has harnessed regulatory evolution in marsupial genomes to generate phenotypic novelty.


Assuntos
Evolução Molecular , Proteínas de Homeodomínio , Locomoção , Marsupiais , Fatores de Transcrição , Animais , Feminino , Masculino , Camundongos , Epigenômica , Perfilação da Expressão Gênica , Regulação da Expressão Gênica no Desenvolvimento , Genoma/genética , Genômica , Proteínas de Homeodomínio/genética , Proteínas de Homeodomínio/metabolismo , Locomoção/genética , Marsupiais/anatomia & histologia , Marsupiais/classificação , Marsupiais/genética , Marsupiais/crescimento & desenvolvimento , Filogenia , Sequências Reguladoras de Ácido Nucleico/genética , Fatores de Transcrição/metabolismo , Fatores de Transcrição/genética , Fenótipo , Humanos
17.
Nat Comput Sci ; 4(4): 285-298, 2024 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-38600256

RESUMO

The single-cell assay for transposase-accessible chromatin using sequencing (scATAC-seq) technology provides insight into gene regulation and epigenetic heterogeneity at single-cell resolution, but cell annotation from scATAC-seq remains challenging due to high dimensionality and extreme sparsity within the data. Existing cell annotation methods mostly focus on the cell peak matrix without fully utilizing the underlying genomic sequence. Here we propose a method, SANGO, for accurate single-cell annotation by integrating genome sequences around the accessibility peaks within scATAC data. The genome sequences of peaks are encoded into low-dimensional embeddings, and then iteratively used to reconstruct the peak statistics of cells through a fully connected network. The learned weights are considered as regulatory modes to represent cells, and utilized to align the query cells and the annotated cells in the reference data through a graph transformer network for cell annotations. SANGO was demonstrated to consistently outperform competing methods on 55 paired scATAC-seq datasets across samples, platforms and tissues. SANGO was also shown to be able to detect unknown tumor cells through attention edge weights learned by the graph transformer. Moreover, from the annotated cells, we found cell-type-specific peaks that provide functional insights/biological signals through expression enrichment analysis, cis-regulatory chromatin interaction analysis and motif enrichment analysis.


Assuntos
Cromatina , Análise de Célula Única , Humanos , Algoritmos , Cromatina/genética , Cromatina/metabolismo , Sequenciamento de Cromatina por Imunoprecipitação/métodos , Biologia Computacional/métodos , Genoma/genética , Genômica/métodos , Neoplasias/genética , Análise de Célula Única/métodos , Transposases/genética , Transposases/metabolismo
18.
Sci Rep ; 14(1): 6437, 2024 03 18.
Artigo em Inglês | MEDLINE | ID: mdl-38499737

RESUMO

Intra-colonial genetic variability (IGV), the presence of more than one genotype in a single colony, has been increasingly studied in scleractinians, revealing its high prevalence. Several studies hypothesised that IGV brings benefits, but few have investigated its roles from a genetic perspective. Here, using genomic data (SNPs), we investigated these potential benefits in populations of the coral Pocillopora acuta from Reunion Island (southwestern Indian Ocean). As the detection of IGV depends on sequencing and bioinformatics errors, we first explored the impact of the bioinformatics pipeline on its detection. Then, SNPs and genes variable within colonies were characterised. While most of the tested bioinformatics parameters did not significantly impact the detection of IGV, filtering on genotype depth of coverage strongly improved its detection by reducing genotyping errors. Mosaicism and chimerism, the two processes leading to IGV (the first through somatic mutations, the second through fusion of distinct organisms), were found in 7% and 12% of the colonies, respectively. Both processes led to several intra-colonial allelic differences, but most were non-coding or silent. However, 7% of the differences were non-silent and found in genes involved in a high diversity of biological processes, some of which were directly linked to responses to environmental stresses. IGV, therefore, appears as a source of genetic diversity and genetic plasticity, increasing the adaptive potential of colonies. Such benefits undoubtedly play an important role in the maintenance and the evolution of scleractinian populations and appear crucial for the future of coral reefs in the context of ongoing global changes.


Assuntos
Antozoários , Animais , Antozoários/genética , Recifes de Corais , Genótipo , Genoma/genética , Genômica
20.
Nature ; 628(8007): 373-380, 2024 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-38448583

RESUMO

Pervasive transcriptional activity is observed across diverse species. The genomes of extant organisms have undergone billions of years of evolution, making it unclear whether these genomic activities represent effects of selection or 'noise'1-4. Characterizing default genome states could help understand whether pervasive transcriptional activity has biological meaning. Here we addressed this question by introducing a synthetic 101-kb locus into the genomes of Saccharomyces cerevisiae and Mus musculus and characterizing genomic activity. The locus was designed by reversing but not complementing human HPRT1, including its flanking regions, thus retaining basic features of the natural sequence but ablating evolved coding or regulatory information. We observed widespread activity of both reversed and native HPRT1 loci in yeast, despite the lack of evolved yeast promoters. By contrast, the reversed locus displayed no activity at all in mouse embryonic stem cells, and instead exhibited repressive chromatin signatures. The repressive signature was alleviated in a locus variant lacking CpG dinucleotides; nevertheless, this variant was also transcriptionally inactive. These results show that synthetic genomic sequences that lack coding information are active in yeast, but inactive in mouse embryonic stem cells, consistent with a major difference in 'default genomic states' between these two divergent eukaryotic cell types, with implications for understanding pervasive transcription, horizontal transfer of genetic information and the birth of new genes.


Assuntos
Genes Sintéticos , Genoma , Saccharomyces cerevisiae , Transcrição Gênica , Animais , Humanos , Camundongos , Cromatina/genética , Ilhas de CpG , Genes Sintéticos/genética , Genoma/genética , Células-Tronco Embrionárias Murinas/metabolismo , Regiões Promotoras Genéticas/genética , Saccharomyces cerevisiae/genética , Hipoxantina Fosforribosiltransferase/genética , Evolução Molecular
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA