RESUMEN
RNA sequencing (RNAseq) methodology has experienced a burst of technological developments in the last decade, which has opened up opportunities for studying the mechanisms of adaptation to environmental factors at both the organismal and cellular level. Selecting the most suitable experimental approach for specific research questions and model systems can, however, be a challenge and researchers in ecology and evolution are commonly faced with the choice of whether to study gene expression variation in whole bodies, specific tissues, and/or single cells. A wide range of sometimes polarised opinions exists over which approach is best. Here, we highlight the advantages and disadvantages of each of these approaches to provide a guide to help researchers make informed decisions and maximise the power of their study. Using illustrative examples of various ecological and evolutionary research questions, we guide the readers through the different RNAseq approaches and help them identify the most suitable design for their own projects.
RESUMEN
Drosophila melanogaster is a leading model in population genetics and genomics, and a growing number of whole-genome data sets from natural populations of this species have been published over the last years. A major challenge is the integration of disparate data sets, often generated using different sequencing technologies and bioinformatic pipelines, which hampers our ability to address questions about the evolution of this species. Here we address these issues by developing a bioinformatics pipeline that maps pooled sequencing (Pool-Seq) reads from D. melanogaster to a hologenome consisting of fly and symbiont genomes and estimates allele frequencies using either a heuristic (PoolSNP) or a probabilistic variant caller (SNAPE-pooled). We use this pipeline to generate the largest data repository of genomic data available for D. melanogaster to date, encompassing 271 previously published and unpublished population samples from over 100 locations in >20 countries on four continents. Several of these locations have been sampled at different seasons across multiple years. This data set, which we call Drosophila Evolution over Space and Time (DEST), is coupled with sampling and environmental metadata. A web-based genome browser and web portal provide easy access to the SNP data set. We further provide guidelines on how to use Pool-Seq data for model-based demographic inference. Our aim is to provide this scalable platform as a community resource which can be easily extended via future efforts for an even more extensive cosmopolitan data set. Our resource will enable population geneticists to analyze spatiotemporal genetic patterns and evolutionary dynamics of D. melanogaster populations in unprecedented detail.
Asunto(s)
Drosophila melanogaster , Metagenómica , Animales , Drosophila melanogaster/genética , Frecuencia de los Genes , Genética de Población , GenómicaRESUMEN
Genetic variation is the fuel of evolution, with standing genetic variation especially important for short-term evolution and local adaptation. To date, studies of spatiotemporal patterns of genetic variation in natural populations have been challenging, as comprehensive sampling is logistically difficult, and sequencing of entire populations costly. Here, we address these issues using a collaborative approach, sequencing 48 pooled population samples from 32 locations, and perform the first continent-wide genomic analysis of genetic variation in European Drosophila melanogaster. Our analyses uncover longitudinal population structure, provide evidence for continent-wide selective sweeps, identify candidate genes for local climate adaptation, and document clines in chromosomal inversion and transposable element frequencies. We also characterize variation among populations in the composition of the fly microbiome, and identify five new DNA viruses in our samples.
Asunto(s)
Drosophila melanogaster/genética , Genoma de los Insectos , Variación Estructural del Genoma , Microbiota , Selección Genética , Aclimatación/genética , Altitud , Animales , Virus ADN , Drosophila melanogaster/virología , Europa (Continente) , Genoma Mitocondrial , Haplotipos , Virus de Insectos , Masculino , Filogeografía , Polimorfismo de Nucleótido SimpleRESUMEN
The comparative analysis of genetic and physical maps as well as of whole genome sequences had revealed that in the Drosophila genus, most structural rearrangements occurred within chromosomal elements as a result of paracentric inversions. Genome sequence comparison would seem the best method to estimate rates of chromosomal evolution, but the high-quality reference genomes required for this endeavor are still scanty. Here, we have obtained dense physical maps for Muller elements A, C, and E of Drosophila subobscura, a species with an extensively studied rich and adaptive chromosomal polymorphism. These maps are based on 462 markers: 115, 236, and 111 markers for elements A, C, and E, respectively. The availability of these dense maps will facilitate genome assembly and will thus greatly contribute to obtaining a good reference genome, which is a required step for D. subobscura to attain the model species status. The comparative analysis of these physical maps and those obtained from the D. pseudoobscura and D. melanogaster genomes allowed us to infer the number of fixed inversions and chromosomal evolutionary rates for each pairwise comparison. For all three elements, rates inferred from the more closely related species were higher than those inferred from the more distantly related species, which together with results of relative-rate tests point to an acceleration in the D. subobscura lineage at least for elements A and E.
Asunto(s)
Genoma/genética , Mapeo Físico de Cromosoma/métodos , Animales , Inversión Cromosómica , Drosophila/genética , Evolución Molecular , Genes de Insecto , Marcadores Genéticos , Polimorfismo GenéticoRESUMEN
In Drosophila, chromosomes have been extensively reorganized during evolution, with most rearrangements affecting the gene order in chromosomal elements but not their gene content. The level of reorganization and the evidence for breakpoint reuse vary both between and within elements. The subito gene stands out as a gene involved in multiple rearrangements both because of its active single-gene transposition and because it is the nearest gene to diverse rearrangements breakpoints. Indeed, subito has undergone three single-gene transpositions and it is the nearest gene to the breakpoints of other single-gene transpositions and of two chromosomal inversions. Given that subito is involved in meiosis and therefore active in the female germ line, the high number of nearby fixed breakages might be related among others to the presumed high accessibility of the subito region to the machinery associated with double-strand breaks repair. A second important contributor would be the reduced and simple regulatory region of subito, which would imply that a fraction of the rearrangements originating from subito nearby breakages would have not affected either its pattern or timing of expression and would have, thus, not resulted in reduced fitness.
Asunto(s)
Puntos de Rotura del Cromosoma , Cromosomas de Insectos/química , Proteínas de Drosophila/genética , Drosophila/genética , Evolución Molecular , Cinesinas/genética , Filogenia , Animales , Inversión Cromosómica , Cromosomas de Insectos/metabolismo , Roturas del ADN de Doble Cadena , Reparación del ADN , Elementos Transponibles de ADN , Drosophila/clasificación , Proteínas de Drosophila/metabolismo , Femenino , Regulación de la Expresión Génica , Orden Génico , Cinesinas/metabolismo , MeiosisRESUMEN
Inversions are an integral part of structural variation within species, and they play a leading role in genome reorganization across species. Work at both the cytological and genome sequence levels has revealed heterogeneity in the distribution of inversion breakpoints, with some regions being recurrently used. Breakpoint reuse at the molecular level has mostly been assessed for fixed inversions through genome sequence comparison, and therefore rather broadly. Here, we have identified and sequenced the breakpoints of two polymorphic inversions-E1 and E2 that share a breakpoint-in the extant Est and E1 + 2 chromosomal arrangements of Drosophila subobscura. The breakpoints are two medium-sized repeated motifs that mediated the inversions by two different mechanisms: E1 via staggered breaks and subsequent repair and E2 via repeat-mediated ectopic recombination. The fine delimitation of the shared breakpoint revealed its strict reuse at the molecular level regardless of which was the intermediate arrangement. The occurrence of other rearrangements in the most proximal and distal extended breakpoint regions reveals the broad reuse of these regions. This differential degree of fragility might be related to their sharing the presence outside the inverted region of snoRNA-encoding genes.
Asunto(s)
Puntos de Rotura del Cromosoma , Paseo de Cromosoma/métodos , Cromosomas de Insectos/genética , Drosophila/genética , Animales , Inversión Cromosómica , Drosophila/clasificación , Evolución Molecular , Filogenia , Polimorfismo Genético , Secuencias Repetitivas de Ácidos Nucleicos , Análisis de Secuencia de ADNRESUMEN
The IT-insulin/target of rapamycin (TOR)-signal transduction pathway is a relatively well-characterized pathway that plays a central role in fundamental biological processes. Network-level analyses of DNA divergence in Drosophila and vertebrates have revealed a clear gradient in the levels of purifying selection along this pathway, with the downstream genes being the most constrained. Remarkably, this feature does not result from factors known to affect selective constraint such as gene expression, codon bias, protein length, and connectivity. The present work aims to establish whether the selective constraint gradient detected along the IT pathway at the between-species level can also be observed at a shorter time scale. With this purpose, we have surveyed DNA polymorphism in Drosophila melanogaster and divergence from D. simulans along the IT pathway. Our network-level analysis shows that DNA polymorphism exhibits the same polarity in the strength of purifying selection as previously detected at the divergence level. This equivalent feature detected both within species and between closely and distantly related species points to the action of a general mechanism, whose action is neither organism specific nor evolutionary time dependent. The detected polarity would be, therefore, intrinsic to the IT pathway architecture and function.
Asunto(s)
Proteínas de Drosophila/genética , Drosophila melanogaster/genética , Evolución Molecular , Insulina/genética , Proteínas Quinasas/genética , Transducción de Señal/genética , Animales , ADN , Drosophila , Proteínas de Drosophila/metabolismo , Genes de Insecto , Genética de Población , Insulina/metabolismo , Modelos Lineales , Polimorfismo Genético , Proteínas Quinasas/metabolismo , Serina-Treonina Quinasas TORRESUMEN
A usual approach to detect the spatial footprint left by recent adaptive events has been to follow up putative candidates emerging from multilocus scans of variation by sequencing additional fragments. We have used a similar experimental and analytical approach to study variation at 15 independently evolving and randomly chosen regions of the X chromosome of Drosophila melanogaster. These incompletely sequenced regions, each extending over approximately 40 kb, were subjected to two tests of positive selection that take into account the spatial distribution of nucleotide variation. Our analysis of variation at these genomic regions in a European population of D. melanogaster has allowed us to uncover a candidate region for positive selection and to empirically evaluate the comparative performance of the two tests of selection under a bottleneck scenario. Moreover, the boundaries here estimated for both the rate of adaptive substitution (delta) and the average selection coefficient (s) would support previous estimates obtained by maximum likelihood that suggest rather strong but uncommon positive selection.
Asunto(s)
Huella de ADN/métodos , Drosophila melanogaster/genética , Cromosoma X/genética , Análisis de Varianza , Animales , Modelos Genéticos , Polimorfismo Genético , Selección Genética , Análisis de Secuencia de ADNRESUMEN
Drosophila melanogaster is an important model for antiviral immunity in arthropods, but very few DNA viruses have been described from the family Drosophilidae. This deficiency limits our opportunity to use natural host-pathogen combinations in experimental studies, and may bias our understanding of the Drosophila virome. Here, we report fourteen DNA viruses detected in a metagenomic analysis of 6668 pool-sequenced Drosophila, sampled from forty-seven European locations between 2014 and 2016. These include three new nudiviruses, a new and divergent entomopoxvirus, a virus related to Leptopilina boulardi filamentous virus, and a virus related to Musca domestica salivary gland hypertrophy virus. We also find an endogenous genomic copy of galbut virus, a double-stranded RNA partitivirus, segregating at very low frequency. Remarkably, we find that Drosophila Vesanto virus, a small DNA virus previously described as a bidnavirus, may be composed of up to twelve segments and thus represent a new lineage of segmented DNA viruses. Two of the DNA viruses, Drosophila Kallithea nudivirus and Drosophila Vesanto virus are relatively common, found in 2 per cent or more of wild flies. The others are rare, with many likely to be represented by a single infected fly. We find that virus prevalence in Europe reflects the prevalence seen in publicly available datasets, with Drosophila Kallithea nudivirus and Drosophila Vesanto virus the only ones commonly detectable in public data from wild-caught flies and large population cages, and the other viruses being rare or absent. These analyses suggest that DNA viruses are at lower prevalence than RNA viruses in D.melanogaster, and may be less likely to persist in laboratory cultures. Our findings go some way to redressing an earlier bias toward RNA virus studies in Drosophila, and lay the foundation needed to harness the power of Drosophila as a model system for the study of DNA viruses.
RESUMEN
Cytological studies revealed that the number of chromosomes and their organization varies across species. The increasing availability of whole genome sequences of multiple species across specific phylogenies has confirmed and greatly extended these cytological observations. In the Drosophila genus, the ancestral karyotype consists of five rod-like acrocentric chromosomes (Muller elements A to E) and one dot-like chromosome (element F), each exhibiting a generally conserved gene content. Chromosomal fusions and paracentric inversions are thus the major contributors, respectively, to chromosome number variation among species and to gene order variation within chromosomal element. The subobscura cluster of Drosophila consists in three species that retain the genus ancestral karyotype and differ by a reduced number of fixed inversions. Here, we have used cytological information and the D. guanche genome sequence to identify and molecularly characterize the breakpoints of inversions that became fixed since the D. guanche-D. subobscura split. Our results have led us to propose a modified version of the D. guanche cytological map of its X chromosome, and to establish that (i) most inversions became fixed in the D. subobscura lineage and (ii) the order in which the four X chromosome overlapping inversions occurred and became fixed.
Asunto(s)
Inversión Cromosómica , Cromosomas de Insectos/genética , Drosophila/genética , Animales , Puntos de Rotura del Cromosoma , Mapeo Cromosómico , Drosophila/clasificación , Evolución Molecular , CariotipoRESUMEN
Cytological and molecular studies have revealed that inversion chromosomal polymorphism is widespread across taxa and that inversions are among the most common structural changes fixed between species. Two major mechanisms have been proposed for the origin of inversions considering that breaks occur at either repetitive or non-homologous sequences. While inversions originating through the first mechanism might have a multiple origin, those originating through the latter mechanism would have a unique origin. Variation at regions flanking inversion breakpoints can be informative on the origin and history of inversions given the reduced recombination in heterokaryotypes. Here, we have analyzed nucleotide variation at a fragment flanking the most centromere-proximal shared breakpoint of several sequential overlapping inversions of the E chromosome of Drosophila subobscura -inversions E1, E2, E9 and E3. The molecular genealogy inferred from variation at this shared fragment does not exhibit the branching pattern expected according to the sequential origin of inversions. The detected discordance between the molecular and cytological genealogies has led us to consider a novel possibility for the origin of an inversion, and more specifically that one of these inversions originated on a heterokaryotype for chromosomal arrangements. Based on this premise, we propose three new models for inversions origin.
Asunto(s)
Puntos de Rotura del Cromosoma , Inversión Cromosómica , Cromosomas de Insectos/genética , Drosophila/genética , Animales , Secuencia de Bases , Drosophila/clasificación , Evolución Molecular , Modelos Genéticos , Filogenia , Polimorfismo Genético , Recombinación Genética , Análisis de Secuencia de ADN/métodosRESUMEN
Drosophila guanche is a member of the obscura group that originated in the Canary Islands archipelago upon its colonization by D. subobscura. It evolved into a new species in the laurisilva, a laurel forest present in wet regions that in the islands have only minor long-term weather fluctuations. Oceanic island endemic species such as D. guanche can become model species to investigate not only the relative role of drift and adaptation in speciation processes but also how population size affects nucleotide variation. Moreover, the previous identification of two satellite DNAs in D. guanche makes this species attractive for studying how centromeric DNA evolves. As a prerequisite for its establishment as a model species suitable to address all these questions, we generated a high-quality D. guanche genome sequence composed of 42 cytologically mapped scaffolds, which are assembled into six super-scaffolds (one per chromosome). The comparative analysis of the D. guanche proteome with that of twelve other Drosophila species identified 151 genes that were subject to adaptive evolution in the D. guanche lineage, with a subset of them being involved in flight and genome stability. For example, the Centromere Identifier (CID) protein, directly interacting with centromeric satellite DNA, shows signals of adaptation in this species. Both genomic analyses and FISH of the two satellites would support an ongoing replacement of centromeric satellite DNA in D. guanche.
Asunto(s)
Adaptación Fisiológica/genética , Drosophila/genética , Evolución Molecular , Vuelo Animal/fisiología , Genes de Insecto , Inestabilidad Genómica , Islas , Animales , Secuencia de Bases , Cromosomas/genética , Elementos Transponibles de ADN/genética , Anotación de Secuencia Molecular , Océanos y Mares , FilogeniaRESUMEN
Chromosomal inversions are structural changes that alter gene order but generally not gene content in the affected region. In Drosophila, extensive cytological studies revealed the widespread character of inversion polymorphism, with evidence for its adaptive character. In Drosophila subobscura, polymorphism affects both its four large autosomal elements and its X (A) chromosome. The characterization of eight of these autosomal inversions breakpoints revealed that most of them originated through the staggered-breaks mechanism. Here, we have performed chromosomal walks to identify the breakpoints of two X-chromosome widely distributed inversions -A2 and A1- of D. subobscura. Inversion A2 is considered a warm-adapted arrangement that exhibits parallel latitudinal clines in the species ancestral distribution area and in both American subcontinents, whereas inversion A1 is only present in the Palearctic region where it presents an east-west cline. The duplication detected at the A2 inversion breakpoints is consistent with its origin by the staggered-breaks mechanism. Inversion A1 breakpoints could not be molecularly identified even though they could be narrowly delimited. This result points to chromosome walking limitations when using as a guide the genome of other species. Limitations stem from the rate of evolution by paracentric inversions, which in Drosophila is highest for the X chromosome.
Asunto(s)
Inversión Cromosómica , Drosophila/genética , Evolución Molecular , Animales , GenómicaRESUMEN
The insulin/TOR signal transduction pathway plays a critical role in determining such important traits as body and organ size, metabolic homeostasis and life span. Although this pathway is highly conserved across the animal kingdom, the affected traits can exhibit important differences even between closely related species. Evolutionary studies of regulatory regions require the reliable identification of transcription factor binding sites. Here we have focused on the Insulin Receptor (InR) expression from its P2 promoter in the Drosophila genus, which in D. melanogaster is up-regulated by hypophosphorylated Drosophila FOXO (dFOXO). We have finely characterized this transcription factor binding sites in vitro along the 1.3 kb region upstream of the InR P2 promoter in five Drosophila species. Moreover, we have tested the effect of mutations in the characterized dFOXO sites of D. melanogaster in transgenic flies. The number of experimentally established binding sites varies across the 1.3 kb region of any particular species, and their distribution also differs among species. In D. melanogaster, InR expression from P2 is differentially affected by dFOXO binding sites at the proximal and distal halves of the species 1.3 kb fragment. The observed uneven distribution of binding sites across this fragment might underlie their differential contribution to regulate InR transcription.
Asunto(s)
Proteínas de Drosophila/genética , Drosophila melanogaster/genética , Factores de Transcripción Forkhead/metabolismo , Regulación de la Expresión Génica/genética , Regiones Promotoras Genéticas , Animales , Animales Modificados Genéticamente , Proteínas de Drosophila/metabolismo , Drosophila melanogaster/metabolismo , Mutación , Fenotipo , Fosforilación , Filogenia , Unión Proteica/genética , Proteínas Tirosina Quinasas Receptoras/metabolismo , Transducción de Señal , Regulación hacia ArribaRESUMEN
Inversion polymorphism is widespread in the Drosophila genus as well as in other dipteran genera. The presence of polytene chromosomes in some insect organs and, thus, the possibility to observe the different arrangements generated by inversions through a microscope enhanced the cytological study of this structural polymorphism. In several Drosophila species, these studies provided evidence for the adaptive character of this polymorphism, which together with the standing interest to uncover targets of natural selection has led to a renewed interest for inversion polymorphism. Our recent molecular characterization of the breakpoint regions of five inversions of the E chromosome of D. subobscura has allowed us to design a PCR-based strategy to molecularly identify the different chromosomal arrangements and, most importantly, to determine the E chromosome karyotype of medium- and large-sized samples from natural populations. Individuals of a test sample that were both cytologically and molecularly karyotyped were used to establish the strategy that was subsequently applied to karyotype a larger sample. Our strategy has proved to be robust and time efficient, and it lays therefore the groundwork for future studies of the E chromosome structural polymorphism through space and time, and of its putative contribution to adaptation.
Asunto(s)
Puntos de Rotura del Cromosoma , Inversión Cromosómica/genética , Cromosomas de Insectos/genética , Drosophila/genética , Animales , Cariotipificación/métodos , Polimorfismo GenéticoRESUMEN
Chromosomal polymorphism is widespread in the Drosophila genus, with extensive evidence supporting its adaptive character in diverse species. Moreover, inversions are the major contributors to the genus chromosomal evolution. The molecular characterization of a reduced number of polymorphic inversion breakpoints in Drosophila melanogaster and Drosophila subobscura supports that their inversions would have mostly originated through a mechanism that generates duplications -staggered double-strand breaks- and has thus the potential to contribute to their adaptive character. There is also evidence for inversion breakpoint reuse at different time scales. Here, we have characterized the breakpoints of two inversions of D. subobscura -O4 and O8- involved in complex arrangements that are frequent in the warm parts of the species distribution area. The duplications detected at their breakpoints are consistent with their origin through the staggered-break mechanism, which further supports it as the prevalent mechanism in D. subobscura. The comparative analysis of inversions breakpoint regions across the Drosophila genus has revealed several genes affected by multiple disruptions due not only to inversions but also to single-gene transpositions and duplications.
Asunto(s)
Inversión Cromosómica , Cromosomas de Insectos/genética , Drosophila/genética , Animales , Puntos de Rotura del Cromosoma , Paseo de Cromosoma , Drosophila melanogaster/genética , Femenino , Duplicación de Gen , Masculino , Filogenia , Polimorfismo GenéticoRESUMEN
Chromosomal inversions can contribute to the adaptation of organisms to their environment by capturing particular advantageous allelic combinations of a set of genes included in the inverted fragment and also by advantageous functional changes due to the inversion process itself that might affect not only the expression of flanking genes but also their dose and structure. Of the two mechanisms originating inversions -ectopic recombination, and staggered double-strand breaks and subsequent repair- only the latter confers the inversion the potential to have dosage effects and/or to generate advantageous chimeric genes. In Drosophila subobscura, there is ample evidence for the adaptive character of its chromosomal polymorphism, with an important contribution of some warm-climate arrangements such as E1+2+9+12. Here, we have characterized the breakpoints of inversion E12 and established that it originated through the staggered-break mechanism like four of the five inversions of D. subobscura previously studied. This mechanism that also predominates in the D. melanogaster lineage might be prevalent in the Sophophora subgenus and contribute to the adaptive character of the polymorphic and fixed inversions of its species. Finally, we have shown that the D. subobscura inversion breakpoint regions have generally been disrupted by additional structural changes occurred at different time scales.
Asunto(s)
Inversión Cromosómica , Drosophila/genética , Duplicaciones Segmentarias en el Genoma , Animales , Roturas del ADN de Doble Cadena , Reparación del ADN , Evolución Molecular , Dosificación de Gen , Orden GénicoRESUMEN
The effects on nucleotide variation of adaptations to temperate habitats and of the possible bottleneck associated with the origin of European populations of Drosophila melanogaster should be detectable in DNA sequences given the short time elapsed relative to the species population size. We surveyed nucleotide variation in 109 fragments distributed across the X chromosome in a European population of D. melanogaster to detect the footprint of positive selection. Fragments were located primarily in large noncoding regions. Multilocus tests based on Tajima's D statistic revealed a significant departure from neutral expectations in a stationary panmictic population, with an important contribution from both positive and negative D values. A positive relationship between Tajima's D values and distance to coding region was detected, with a comparative excess of significantly negative D values in the subset of fragments closer to coding regions. Also, there was a significant heterogeneity in the polymorphism to divergence ratio, with 12 fragments contributing 42% to the test statistic. Moreover, these fragments were comparatively closer to coding regions. These findings would imply positive selection events, and thus selective sweeps, during the species expansion to Europe.
Asunto(s)
Drosophila melanogaster/genética , Selección Genética , Animales , Mapeo Cromosómico , ADN/genética , Demografía , Variación Genética , Modelos Genéticos , Datos de Secuencia Molecular , Polimorfismo Genético , Mapeo RestrictivoRESUMEN
Nucleotide variation in populations originating from the recent range expansion of a species should reflect their adaptation to new habitats as well as their demographic history. A survey of nucleotide variation at 109 noncoding X-chromosome fragments in a European population of Drosophila melanogaster allowed identifying some candidates to have been recently affected by positive selection. Adaptive changes leave a spatial differential footprint that can be used to discriminate among candidates by extending their study to neighboring regions. Here, we surveyed variation at an approximately 190-kb region spanning a locus exhibiting a significantly skewed frequency spectrum. A stretch of approximately 12 kb with reduced variation was detected within a continuously sequenced region that included the focal fragment. Moreover, the regions flanking this stretch exhibited an excess of high-frequency derived variants. Application of maximum likelihood ratio and goodness-of-fit tests suggested that the pattern of variation detected at the studied region (at cytological bands 17C-17D) might have been shaped by a recent selective change, most probably at or around the phantom gene that encodes CYP306A1, a cytochrome P450 enzyme in the ecdysteroidogenic pathway.