Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 25
Filtrar
1.
Mol Biol Evol ; 41(1)2024 Jan 03.
Artículo en Inglés | MEDLINE | ID: mdl-38069903

RESUMEN

The increasing availability of genomic resequencing data sets and high-quality reference genomes across the tree of life present exciting opportunities for comparative population genomic studies. However, substantial challenges prevent the simple reuse of data across different studies and species, arising from variability in variant calling pipelines, data quality, and the need for computationally intensive reanalysis. Here, we present snpArcher, a flexible and highly efficient workflow designed for the analysis of genomic resequencing data in nonmodel organisms. snpArcher provides a standardized variant calling pipeline and includes modules for variant quality control, data visualization, variant filtering, and other downstream analyses. Implemented in Snakemake, snpArcher is user-friendly, reproducible, and designed to be compatible with high-performance computing clusters and cloud environments. To demonstrate the flexibility of this pipeline, we applied snpArcher to 26 public resequencing data sets from nonmammalian vertebrates. These variant data sets are hosted publicly to enable future comparative population genomic analyses. With its extensibility and the availability of public data sets, snpArcher will contribute to a broader understanding of genetic variation across species by facilitating the rapid use and reuse of large genomic data sets.


Asunto(s)
Metagenómica , Programas Informáticos , Animales , Flujo de Trabajo , Genómica , Análisis de Secuencia de ADN , Secuenciación de Nucleótidos de Alto Rendimiento
2.
Mol Biol Evol ; 40(9)2023 09 01.
Artículo en Inglés | MEDLINE | ID: mdl-37665177

RESUMEN

An important goal of evolutionary genomics is to identify genomic regions whose substitution rates differ among lineages. For example, genomic regions experiencing accelerated molecular evolution in some lineages may provide insight into links between genotype and phenotype. Several comparative genomics methods have been developed to identify genomic accelerations between species, including a Bayesian method called PhyloAcc, which models shifts in substitution rate in multiple target lineages on a phylogeny. However, few methods consider the possibility of discordance between the trees of individual loci and the species tree due to incomplete lineage sorting, which might cause false positives. Here, we present PhyloAcc-GT, which extends PhyloAcc by modeling gene tree heterogeneity. Given a species tree, we adopt the multispecies coalescent model as the prior distribution of gene trees, use Markov chain Monte Carlo (MCMC) for inference, and design novel MCMC moves to sample gene trees efficiently. Through extensive simulations, we show that PhyloAcc-GT outperforms PhyloAcc and other methods in identifying target lineage-specific accelerations and detecting complex patterns of rate shifts, and is robust to specification of population size parameters. PhyloAcc-GT is usually more conservative than PhyloAcc in calling convergent rate shifts because it identifies more accelerations on ancestral than on terminal branches. We apply PhyloAcc-GT to two examples of convergent evolution: flightlessness in ratites and marine mammal adaptations, and show that PhyloAcc-GT is a robust tool to identify shifts in substitution rate associated with specific target lineages while accounting for incomplete lineage sorting.


Asunto(s)
Evolución Biológica , Modelos Genéticos , Animales , Teorema de Bayes , Filogenia , Genómica , Mamíferos
3.
Nature ; 553(7686): 77-81, 2018 01 03.
Artículo en Inglés | MEDLINE | ID: mdl-29300007

RESUMEN

In contrast to infections with human immunodeficiency virus (HIV) in humans and simian immunodeficiency virus (SIV) in macaques, SIV infection of a natural host, sooty mangabeys (Cercocebus atys), is non-pathogenic despite high viraemia. Here we sequenced and assembled the genome of a captive sooty mangabey. We conducted genome-wide comparative analyses of transcript assemblies from C. atys and AIDS-susceptible species, such as humans and macaques, to identify candidates for host genetic factors that influence susceptibility. We identified several immune-related genes in the genome of C. atys that show substantial sequence divergence from macaques or humans. One of these sequence divergences, a C-terminal frameshift in the toll-like receptor-4 (TLR4) gene of C. atys, is associated with a blunted in vitro response to TLR-4 ligands. In addition, we found a major structural change in exons 3-4 of the immune-regulatory protein intercellular adhesion molecule 2 (ICAM-2); expression of this variant leads to reduced cell surface expression of ICAM-2. These data provide a resource for comparative genomic studies of HIV and/or SIV pathogenesis and may help to elucidate the mechanisms by which SIV-infected sooty mangabeys avoid AIDS.


Asunto(s)
Síndrome de Inmunodeficiencia Adquirida/genética , Cercocebus atys/genética , Cercocebus atys/virología , Predisposición Genética a la Enfermedad , Genoma/genética , Especificidad del Huésped/genética , Virus de la Inmunodeficiencia de los Simios , Síndrome de Inmunodeficiencia Adquirida/virología , Secuencia de Aminoácidos , Animales , Moléculas de Adhesión Celular/química , Moléculas de Adhesión Celular/genética , Moléculas de Adhesión Celular/metabolismo , Cercocebus atys/inmunología , Exones/genética , Femenino , Mutación del Sistema de Lectura/genética , Variación Genética , Genómica , VIH/patogenicidad , Humanos , Macaca/virología , Eliminación de Secuencia , Síndrome de Inmunodeficiencia Adquirida del Simio/genética , Síndrome de Inmunodeficiencia Adquirida del Simio/virología , Virus de la Inmunodeficiencia de los Simios/patogenicidad , Especificidad de la Especie , Receptor Toll-Like 4/química , Receptor Toll-Like 4/genética , Receptor Toll-Like 4/inmunología , Transcriptoma/genética , Secuenciación Completa del Genoma
4.
Genome Res ; 30(6): 826-834, 2020 06.
Artículo en Inglés | MEDLINE | ID: mdl-32461224

RESUMEN

Mutation is the ultimate source of all genetic novelty and the cause of heritable genetic disorders. Mutational burden has been linked to complex disease, including neurodevelopmental disorders such as schizophrenia and autism. The rate of mutation is a fundamental genomic parameter and direct estimates of this parameter have been enabled by accurate comparisons of whole-genome sequences between parents and offspring. Studies in humans have revealed that the paternal age at conception explains most of the variation in mutation rate: Each additional year of paternal age in humans leads to approximately 1.5 additional inherited mutations. Here, we present an estimate of the de novo mutation rate in the rhesus macaque (Macaca mulatta) using whole-genome sequence data from 32 individuals in four large pedigrees. We estimated an average mutation rate of 0.58 × 10-8 per base pair per generation (at an average parental age of 7.5 yr), much lower than found in direct estimates from great apes. As in humans, older macaque fathers transmit more mutations to their offspring, increasing the per generation mutation rate by 4.27 × 10-10 per base pair per year. We found that the rate of mutation accumulation after puberty is similar between macaques and humans, but that a smaller number of mutations accumulate before puberty in macaques. We additionally investigated the role of paternal age on offspring sociability, a proxy for normal neurodevelopment, by studying 203 male macaques in large social groups.


Asunto(s)
Conducta Animal , Mutación de Línea Germinal , Acumulación de Mutaciones , Edad Paterna , Efectos Tardíos de la Exposición Prenatal/genética , Habilidades Sociales , Factores de Edad , Animales , Femenino , Humanos , Macaca mulatta , Masculino , Tasa de Mutación , Embarazo , Especificidad de la Especie
5.
Mol Biol Evol ; 38(4): 1460-1471, 2021 04 13.
Artículo en Inglés | MEDLINE | ID: mdl-33226085

RESUMEN

Mutations play a key role in the development of disease in an individual and the evolution of traits within species. Recent work in humans and other primates has clarified the origins and patterns of single-nucleotide variants, showing that most arise in the father's germline during spermatogenesis. It remains unknown whether larger mutations, such as deletions and duplications of hundreds or thousands of nucleotides, follow similar patterns. Such mutations lead to copy-number variation (CNV) within and between species, and can have profound effects by deleting or duplicating genes. Here, we analyze patterns of CNV mutations in 32 rhesus macaque individuals from 14 parent-offspring trios. We find the rate of CNV mutations per generation is low (less than one per genome) and we observe no correlation between parental age and the number of CNVs that are passed on to offspring. We also examine segregating CNVs within the rhesus macaque sample and compare them to a similar data set from humans, finding that both species have far more segregating deletions than duplications. We contrast this with long-term patterns of gene copy-number evolution between 17 mammals, where the proportion of deletions that become fixed along the macaque lineage is much smaller than the proportion of segregating deletions. These results suggest purifying selection acting on deletions, such that the majority of them are removed from the population over time. Rhesus macaques are an important biomedical model organism, so these results will aid in our understanding of this species and the disease models it supports.


Asunto(s)
Variaciones en el Número de Copia de ADN , Macaca mulatta/genética , Mutación , Animales , Femenino , Duplicación de Gen , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Masculino , Selección Genética , Eliminación de Secuencia , Secuenciación Completa del Genoma
6.
Mol Biol Evol ; 38(2): 486-501, 2021 01 23.
Artículo en Inglés | MEDLINE | ID: mdl-32946576

RESUMEN

Bumblebees are a diverse group of globally important pollinators in natural ecosystems and for agricultural food production. With both eusocial and solitary life-cycle phases, and some social parasite species, they are especially interesting models to understand social evolution, behavior, and ecology. Reports of many species in decline point to pathogen transmission, habitat loss, pesticide usage, and global climate change, as interconnected causes. These threats to bumblebee diversity make our reliance on a handful of well-studied species for agricultural pollination particularly precarious. To broadly sample bumblebee genomic and phenotypic diversity, we de novo sequenced and assembled the genomes of 17 species, representing all 15 subgenera, producing the first genus-wide quantification of genetic and genomic variation potentially underlying key ecological and behavioral traits. The species phylogeny resolves subgenera relationships, whereas incomplete lineage sorting likely drives high levels of gene tree discordance. Five chromosome-level assemblies show a stable 18-chromosome karyotype, with major rearrangements creating 25 chromosomes in social parasites. Differential transposable element activity drives changes in genome sizes, with putative domestications of repetitive sequences influencing gene coding and regulatory potential. Dynamically evolving gene families and signatures of positive selection point to genus-wide variation in processes linked to foraging, diet and metabolism, immunity and detoxification, as well as adaptations for life at high altitudes. Our study reveals how bumblebee genes and genomes have evolved across the Bombus phylogeny and identifies variations potentially linked to key ecological and behavioral traits of these important pollinators.


Asunto(s)
Adaptación Biológica/genética , Abejas/genética , Evolución Biológica , Genoma de los Insectos , Animales , Uso de Codones , Elementos Transponibles de ADN , Dieta , Conducta Alimentaria , Componentes del Gen , Tamaño del Genoma , Selección Genética
7.
BMC Evol Biol ; 19(1): 36, 2019 01 29.
Artículo en Inglés | MEDLINE | ID: mdl-30696414

RESUMEN

BACKGROUND: At the very end of the larval stage Drosophila expectorate a glue secreted by their salivary glands to attach themselves to a substrate while pupariating. The glue is a mixture of apparently unrelated proteins, some of which are highly glycosylated and possess internal repeats. Because species adhere to distinct substrates (i.e. leaves, wood, rotten fruits), glue genes are expected to evolve rapidly. RESULTS: We used available genome sequences and PCR-sequencing of regions of interest to investigate the glue genes in 20 Drosophila species. We discovered a new gene in addition to the seven glue genes annotated in D. melanogaster. We also identified a phase 1 intron at a conserved position present in five of the eight glue genes of D. melanogaster, suggesting a common origin for those glue genes. A slightly significant rate of gene turnover was inferred. Both the number of repeats and the repeat sequence were found to diverge rapidly, even between closely related species. We also detected high repeat number variation at the intrapopulation level in D. melanogaster. CONCLUSION: Most conspicuous signs of accelerated evolution are found in the repeat regions of several glue genes.


Asunto(s)
Drosophila/genética , Evolución Molecular , Genes de Insecto , Proteínas del Pegamento Salivar de Drosophila/genética , Glándulas Salivales/metabolismo , Secuencia de Aminoácidos , Animales , Secuencia de Bases , Codón sin Sentido/genética , Funciones de Verosimilitud , Familia de Multigenes , Nucleótidos/genética , Secuencias Repetitivas de Ácidos Nucleicos/genética , Especificidad de la Especie
8.
Genome Res ; 25(12): 1921-33, 2015 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-26377836

RESUMEN

We describe a genome reference of the African green monkey or vervet (Chlorocebus aethiops). This member of the Old World monkey (OWM) superfamily is uniquely valuable for genetic investigations of simian immunodeficiency virus (SIV), for which it is the most abundant natural host species, and of a wide range of health-related phenotypes assessed in Caribbean vervets (C. a. sabaeus), whose numbers have expanded dramatically since Europeans introduced small numbers of their ancestors from West Africa during the colonial era. We use the reference to characterize the genomic relationship between vervets and other primates, the intra-generic phylogeny of vervet subspecies, and genome-wide structural variations of a pedigreed C. a. sabaeus population. Through comparative analyses with human and rhesus macaque, we characterize at high resolution the unique chromosomal fission events that differentiate the vervets and their close relatives from most other catarrhine primates, in whom karyotype is highly conserved. We also provide a summary of transposable elements and contrast these with the rhesus macaque and human. Analysis of sequenced genomes representing each of the main vervet subspecies supports previously hypothesized relationships between these populations, which range across most of sub-Saharan Africa, while uncovering high levels of genetic diversity within each. Sequence-based analyses of major histocompatibility complex (MHC) polymorphisms reveal extremely low diversity in Caribbean C. a. sabaeus vervets, compared to vervets from putatively ancestral West African regions. In the C. a. sabaeus research population, we discover the first structural variations that are, in some cases, predicted to have a deleterious effect; future studies will determine the phenotypic impact of these variations.


Asunto(s)
Chlorocebus aethiops/genética , Genoma , Genómica , Animales , Chlorocebus aethiops/clasificación , Pintura Cromosómica , Biología Computacional/métodos , Evolución Molecular , Reordenamiento Génico , Variación Genética , Genómica/métodos , Cariotipo , Complejo Mayor de Histocompatibilidad/genética , Anotación de Secuencia Molecular , Filogenia , Filogeografía
10.
Proc Natl Acad Sci U S A ; 111(48): 17230-5, 2014 Dec 02.
Artículo en Inglés | MEDLINE | ID: mdl-25385592

RESUMEN

Little is known about the genetic changes that distinguish domestic cat populations from their wild progenitors. Here we describe a high-quality domestic cat reference genome assembly and comparative inferences made with other cat breeds, wildcats, and other mammals. Based upon these comparisons, we identified positively selected genes enriched for genes involved in lipid metabolism that underpin adaptations to a hypercarnivorous diet. We also found positive selection signals within genes underlying sensory processes, especially those affecting vision and hearing in the carnivore lineage. We observed an evolutionary tradeoff between functional olfactory and vomeronasal receptor gene repertoires in the cat and dog genomes, with an expansion of the feline chemosensory system for detecting pheromones at the expense of odorant detection. Genomic regions harboring signatures of natural selection that distinguish domestic cats from their wild congeners are enriched in neural crest-related genes associated with behavior and reward in mouse models, as predicted by the domestication syndrome hypothesis. Our description of a previously unidentified allele for the gloving pigmentation pattern found in the Birman breed supports the hypothesis that cat breeds experienced strong selection on specific mutations drawn from random bred populations. Collectively, these findings provide insight into how the process of domestication altered the ancestral wildcat genome and build a resource for future disease mapping and phylogenomic studies across all members of the Felidae.


Asunto(s)
Animales Domésticos/genética , Animales Salvajes/genética , Gatos/genética , Genoma/genética , Genómica/métodos , Adaptación Fisiológica/genética , Secuencia de Aminoácidos , Animales , Carnivoría , Gatos/clasificación , Mapeo Cromosómico , Variaciones en el Número de Copia de ADN , Perros , Femenino , Eliminación de Gen , Duplicación de Gen , Masculino , Proteínas de Transporte de Membrana/clasificación , Proteínas de Transporte de Membrana/genética , Datos de Secuencia Molecular , Filogenia , Selección Genética/genética , Análisis de Secuencia de ADN , Homología de Secuencia de Aminoácido , Especificidad de la Especie
11.
Mol Biol Evol ; 32(5): 1232-6, 2015 May.
Artículo en Inglés | MEDLINE | ID: mdl-25631926

RESUMEN

Convergent evolution occurs when the same trait arises independently in multiple lineages. In most cases of phenotypic convergence such transitions are adaptive, so finding the underlying molecular causes of convergence can provide insight into the process of adaptation. Convergent evolution at the genomic level also lends itself to study by comparative methods, although molecular convergence can also occur by chance, adding noise to this process. Parker et al. studied convergence across the genomes of several mammals, including echolocating bats and dolphins (Parker J, Tsagkogeorga G, Cotton JA, Liu Y, Provero P, Stupka E, Rossiter SJ. 2013. Genome-wide signatures of convergent evolution in echolocating mammals. Nature 502:228-231). On the basis of a null distribution of site-specific likelihood support (SSLS) generated using simulated topologies, they concluded that there was evidence for genome-wide adaptive convergence between echolocating taxa. Here, we demonstrate that methods based on SSLS do not adequately measure convergence, and reiterate the use of an empirical null model that directly compares convergent substitutions between all pairs of species. We find that when the proper comparisons are made there is no surprising excess of convergence between echolocating mammals, even in sensory genes.


Asunto(s)
Adaptación Fisiológica/genética , Ecolocación , Evolución Molecular , Genoma , Animales , Quirópteros/genética , Delfines/genética , Genómica , Modelos Genéticos , Filogenia
12.
Mol Biol Evol ; 31(2): 253-7, 2014 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-24202611

RESUMEN

Substitution rates vary between species, and many explanations regarding the causes of this variation have been proposed. Here we consider how new genomic data on the per-generation mutation rate impinge on proposed hypotheses for substitution rate variation in primates. We propose that the generation-time effect as it is usually understood cannot explain the observed rate variation, but instead that selection for decreased somatic mutation rates can. By considering the disparate causes underlying mutation rate changes in recent human history, we also show that the per-generation mutation rate is increasing even as the per-cell-division rate is decreasing.


Asunto(s)
Sustitución de Aminoácidos , División Celular , Tasa de Mutación , Primates/genética , Animales , Evolución Molecular , Variación Genética , Humanos , Modelos Genéticos
13.
Mol Biol Evol ; 30(8): 1987-97, 2013 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-23709260

RESUMEN

Current sequencing methods produce large amounts of data, but genome assemblies constructed from these data are often fragmented and incomplete. Incomplete and error-filled assemblies result in many annotation errors, especially in the number of genes present in a genome. This means that methods attempting to estimate rates of gene duplication and loss often will be misled by such errors and that rates of gene family evolution will be consistently overestimated. Here, we present a method that takes these errors into account, allowing one to accurately infer rates of gene gain and loss among genomes even with low assembly and annotation quality. The method is implemented in the newest version of the software package CAFE, along with several other novel features. We demonstrate the accuracy of the method with extensive simulations and reanalyze several previously published data sets. Our results show that errors in genome annotation do lead to higher inferred rates of gene gain and loss but that CAFE 3 sufficiently accounts for these errors to provide accurate estimates of important evolutionary parameters.


Asunto(s)
Genoma , Anotación de Secuencia Molecular/métodos , Análisis de Secuencia de ADN/métodos , Programas Informáticos , Algoritmos , Biología Computacional/métodos , Evolución Molecular , Genómica/métodos , Reproducibilidad de los Resultados
14.
Integr Comp Biol ; 2024 May 30.
Artículo en Inglés | MEDLINE | ID: mdl-38816211

RESUMEN

Comparative genomics provides ample ways to study genome evolution and its relationship to phenotypic traits. By developing and testing alternate models of evolution throughout a phylogeny, one can estimate rates of molecular evolution along different lineages in a phylogeny and link these rates with observations in extant species, such as convergent phenotypes. Pipelines for such work can help identify when and where genomic changes may be associated with, or possibly influence, phenotypic traits. We recently developed a set of models called PhyloAcc, using a Bayesian framework to estimate rates of nucleotide substitution on different branches a phylogenetic tree and evaluate their association with pre-defined or estimated phenotypic traits PhyloAcc-ST and PhyloAcc-GT both allow users to define a priori a set of target lineages and then compare different models to identify loci accelerating in one or more target lineages. Whereas ST considers only one species tree across all input loci, GT considers alternate topologies for every locus. PhyloAcc-C simultaneously models molecular rates and rates of continuous trait evolution,allowing the user to ask whether the two are associated. Here we describe these models and provide tips and workflows on how to prepare the input data and run PhyloAcc.

15.
Mol Ecol Resour ; 24(2): e13893, 2024 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-37966259

RESUMEN

Environmental change is intensifying the biodiversity crisis and threatening species across the tree of life. Conservation genomics can help inform conservation actions and slow biodiversity loss. However, more training, appropriate use of novel genomic methods and communication with managers are needed. Here, we review practical guidance to improve applied conservation genomics. We share insights aimed at ensuring effectiveness of conservation actions around three themes: (1) improving pedagogy and training in conservation genomics including for online global audiences, (2) conducting rigorous population genomic analyses properly considering theory, marker types and data interpretation and (3) facilitating communication and collaboration between managers and researchers. We aim to update students and professionals and expand their conservation toolkit with genomic principles and recent approaches for conserving and managing biodiversity. The biodiversity crisis is a global problem and, as such, requires international involvement, training, collaboration and frequent reviews of the literature and workshops as we do here.


Asunto(s)
Conservación de los Recursos Naturales , Genómica , Humanos , Conservación de los Recursos Naturales/métodos , Biodiversidad , Genoma
16.
bioRxiv ; 2023 Aug 29.
Artículo en Inglés | MEDLINE | ID: mdl-37693498

RESUMEN

A species tree is a central concept in evolutionary biology whereby a single branching phylogeny reflects relationships among species. However, the phylogenies of different genomic regions often differ from the species tree. Although tree discordance is often widespread in phylogenomic studies, we still lack a clear understanding of how variation in phylogenetic patterns is shaped by genome biology or the extent to which discordance may compromise comparative studies. We characterized patterns of phylogenomic discordance across the murine rodents (Old World mice and rats) - a large and ecologically diverse group that gave rise to the mouse and rat model systems. Combining new linked-read genome assemblies for seven murine species with eleven published rodent genomes, we first used ultra-conserved elements (UCEs) to infer a robust species tree. We then used whole genomes to examine finer-scale patterns of discordance and found that phylogenies built from proximate chromosomal regions had similar phylogenies. However, there was no relationship between tree similarity and local recombination rates in house mice, suggesting that genetic linkage influences phylogenetic patterns over deeper timescales. This signal may be independent of contemporary recombination landscapes. We also detected a strong influence of linked selection whereby purifying selection at UCEs led to less discordance, while genes experiencing positive selection showed more discordant and variable phylogenetic signals. Finally, we show that assuming a single species tree can result in high error rates when testing for positive selection under different models. Collectively, our results highlight the complex relationship between phylogenetic inference and genome biology and underscore how failure to account for this complexity can mislead comparative genomic studies.

17.
bioRxiv ; 2023 Sep 03.
Artículo en Inglés | MEDLINE | ID: mdl-37693452

RESUMEN

Sperm competition can drive rapid evolution of male reproductive traits, but it remains unclear how variation in sperm competition intensity shapes phenotypic and molecular diversity across clades. Old World mice and rats (subfamily Murinae) comprise a rapid radiation and exhibit incredible diversity in sperm morphology and production. We combined phenotype and sequence data to model the evolution of reproductive traits and genes across 78 murine species. We identified several shifts towards smaller relative testes mass, a trait reflective of reduced sperm competition. Several sperm traits were associated with relative testes mass, suggesting that mating system evolution likely selects for convergent traits related to sperm competitive ability. Molecular evolutionary rates of spermatogenesis proteins also correlated with relative testes mass, but in an unexpected direction. We predicted that sperm competition would result in rapid divergence among species with large relative testes mass, but instead found that many spermatogenesis genes evolve more rapidly in species with smaller relative testes mass due to relaxed purifying selection. While some reproductive genes evolved under positive selection, relaxed selection played a greater role underlying rapid evolution in small testes species. Our work demonstrates that sexual selection can impose strong purifying selection shaping the evolution of male reproduction.

18.
Genome Biol Evol ; 14(6)2022 05 31.
Artículo en Inglés | MEDLINE | ID: mdl-35642315

RESUMEN

The X chromosome of therian mammals shows strong conservation among distantly related species, limiting insights into the distinct selective processes that have shaped sex chromosome evolution. We constructed a chromosome-scale de novo genome assembly for the Siberian dwarf hamster (Phodopus sungorus), a species reported to show extensive recombination suppression across an entire arm of the X chromosome. Combining a physical genome assembly based on shotgun and long-range proximity ligation sequencing with a dense genetic map, we detected widespread suppression of female recombination across ∼65% of the Phodopus X chromosome. This region of suppressed recombination likely corresponds to the Xp arm, which has previously been shown to be highly heterochromatic. Using additional sequencing data from two closely related species (P. campbelli and P. roborovskii), we show that recombination suppression on Xp appears to be independent of major structural rearrangements. The suppressed Xp arm was enriched for several transposable element families and de-enriched for genes primarily expressed in placenta, but otherwise showed similar gene densities, expression patterns, and rates of molecular evolution when compared to the recombinant Xq arm. Phodopus Xp gene content and order was also broadly conserved relative to the more distantly related rat X chromosome. These data suggest that widespread suppression of recombination has likely evolved through the transient induction of facultative heterochromatin on the Phodopus Xp arm without major changes in chromosome structure or genetic content. Thus, substantial changes in the recombination landscape have so far had relatively subtle influences on patterns of X-linked molecular evolution in these species.


Asunto(s)
Phodopus , Cromosoma X , Animales , Cricetinae , Evolución Molecular , Femenino , Phodopus/genética , Embarazo , Ratas , Recombinación Genética , Cromosoma X/genética
19.
Genome Biol ; 21(1): 15, 2020 01 23.
Artículo en Inglés | MEDLINE | ID: mdl-31969194

RESUMEN

BACKGROUND: Arthropods comprise the largest and most diverse phylum on Earth and play vital roles in nearly every ecosystem. Their diversity stems in part from variations on a conserved body plan, resulting from and recorded in adaptive changes in the genome. Dissection of the genomic record of sequence change enables broad questions regarding genome evolution to be addressed, even across hyper-diverse taxa within arthropods. RESULTS: Using 76 whole genome sequences representing 21 orders spanning more than 500 million years of arthropod evolution, we document changes in gene and protein domain content and provide temporal and phylogenetic context for interpreting these innovations. We identify many novel gene families that arose early in the evolution of arthropods and during the diversification of insects into modern orders. We reveal unexpected variation in patterns of DNA methylation across arthropods and examples of gene family and protein domain evolution coincident with the appearance of notable phenotypic and physiological adaptations such as flight, metamorphosis, sociality, and chemoperception. CONCLUSIONS: These analyses demonstrate how large-scale comparative genomics can provide broad new insights into the genotype to phenotype map and generate testable hypotheses about the evolution of animal diversity.


Asunto(s)
Artrópodos/genética , Evolución Molecular , Animales , Artrópodos/clasificación , Metilación de ADN , Especiación Genética , Variación Genética , Filogenia
20.
Genome Biol Evol ; 11(5): 1483-1486, 2019 05 01.
Artículo en Inglés | MEDLINE | ID: mdl-31028392

RESUMEN

Genome assemblies from next-generation sequencing technologies are now an integral part of biological research, but many sequencing and assembly processes are still error-prone. Unfortunately, these errors can propagate to downstream analyses and wreak havoc on results and conclusions. Although such errors are recognized when dealing with diploid genotype data, modern reference assemblies (which are represented as haploid sequences) lack any type of succinct quality assessment for every position. Here we present Referee, a program that uses diploid genotype quality information in order to annotate a haploid assembly with a quality score for every position. Referee aims to provide an assembly with concise quality information on a Phred-like scale in FASTQ format for easy filtering of low-quality sites. Referee also provides output of quality scores in BED format that can be easily visualized as tracks on most genome browsers. Referee is freely available at https://gwct.github.io/referee/.


Asunto(s)
Genómica/métodos , Programas Informáticos , Animales , Genotipo , Estándares de Referencia
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA