RESUMO
The schizophoran superfamily Ephydroidea (Diptera: Cyclorrhapha) includes eight families, ranging from the well-known vinegar flies (Drosophilidae) and shore flies (Ephydridae), to several small, relatively unusual groups, the phylogenetic placement of which has been particularly challenging for systematists. An extraordinary diversity in life histories, feeding habits and morphology are a hallmark of fly biology, and the Ephydroidea are no exception. Extreme specialization can lead to "orphaned" taxa with no clear evidence for their phylogenetic position. To resolve relationships among a diverse sample of Ephydroidea, including the highly modified flies in the families Braulidae and Mormotomyiidae, we conducted phylogenomic sampling. Using exon capture from Anchored Hybrid Enrichment and transcriptomics to obtain 320 orthologous nuclear genes sampled for 32 species of Ephydroidea and 11 outgroups, we evaluate a new phylogenetic hypothesis for representatives of the superfamily. These data strongly support monophyly of Ephydroidea with Ephydridae as an early branching radiation and the placement of Mormotomyiidae as a family-level lineage sister to all remaining families. We confirm placement of Cryptochetidae as sister taxon to a large clade containing both Drosophilidae and Braulidae-the latter a family of honeybee ectoparasites. Our results reaffirm that sampling of both taxa and characters is critical in hyperdiverse clades and that these factors have a major influence on phylogenomic reconstruction of the history of the schizophoran fly radiation.
Assuntos
Drosophilidae , Ácido Acético , Animais , Drosophilidae/genética , FilogeniaRESUMO
Y chromosomes play important roles in sex determination and male fertility. In several groups (e.g., mammals) there is strong evidence that they evolved through gene loss from a common X-Y ancestor, but in Drosophila the acquisition of new genes plays a major role. This conclusion came mostly from studies in two species. Here we report the identification of the 22 Y-linked genes in D. willistoni. They all fit the previously observed pattern of autosomal or X-linked testis-specific genes that duplicated to the Y. The ratio of gene gains to gene losses is ~25 in D. willistoni, confirming the prominent role of gene gains in the evolution of Drosophila Y chromosomes. We also found four large segmental duplications (ranging from 62 kb to 303 kb) from autosomal regions to the Y, containing ~58 genes. All but four of these duplicated genes became pseudogenes in the Y or disappeared. In the GK20609 gene the Y-linked copy remained functional, whereas its original autosomal copy degenerated, demonstrating how autosomal genes are transferred to the Y chromosome. Since the segmental duplication that carried GK20609 contained six other testis-specific genes, it seems that chance plays a significant role in the acquisition of new genes by the Drosophila Y chromosome.
Assuntos
Drosophila/genética , Genes de Insetos , Cromossomo Y/genética , Animais , Duplicação Gênica , MasculinoRESUMO
Rhodnius prolixus not only has served as a model organism for the study of insect physiology, but also is a major vector of Chagas disease, an illness that affects approximately seven million people worldwide. We sequenced the genome of R. prolixus, generated assembled sequences covering 95% of the genome (â¼ 702 Mb), including 15,456 putative protein-coding genes, and completed comprehensive genomic analyses of this obligate blood-feeding insect. Although immune-deficiency (IMD)-mediated immune responses were observed, R. prolixus putatively lacks key components of the IMD pathway, suggesting a reorganization of the canonical immune signaling network. Although both Toll and IMD effectors controlled intestinal microbiota, neither affected Trypanosoma cruzi, the causal agent of Chagas disease, implying the existence of evasion or tolerance mechanisms. R. prolixus has experienced an extensive loss of selenoprotein genes, with its repertoire reduced to only two proteins, one of which is a selenocysteine-based glutathione peroxidase, the first found in insects. The genome contained actively transcribed, horizontally transferred genes from Wolbachia sp., which showed evidence of codon use evolution toward the insect use pattern. Comparative protein analyses revealed many lineage-specific expansions and putative gene absences in R. prolixus, including tandem expansions of genes related to chemoreception, feeding, and digestion that possibly contributed to the evolution of a blood-feeding lifestyle. The genome assembly and these associated analyses provide critical information on the physiology and evolution of this important vector species and should be instrumental for the development of innovative disease control methods.
Assuntos
Adaptação Fisiológica/genética , Doença de Chagas , Interações Hospedeiro-Parasita/genética , Insetos Vetores , Rhodnius , Trypanosoma cruzi/fisiologia , Animais , Sequência de Bases , Transferência Genética Horizontal , Humanos , Insetos Vetores/genética , Insetos Vetores/parasitologia , Dados de Sequência Molecular , Rhodnius/genética , Rhodnius/parasitologia , Wolbachia/genéticaRESUMO
The autosomal gene Mst77F of Drosophila melanogaster is essential for male fertility. In 2010, Krsticevic et al. (Genetics 184: 295-307) found 18 Y-linked copies of Mst77F ("Mst77Y"), which collectively account for 20% of the functional Mst77F-like mRNA. The Mst77Y genes were severely misassembled in the then-available genome assembly and were identified by cloning and sequencing polymerase chain reaction products. The genomic structure of the Mst77Y region and the possible existence of additional copies remained unknown. The recent publication of two long-read assemblies of D. melanogaster prompted us to reinvestigate this challenging region of the Y chromosome. We found that the Illumina Synthetic Long Reads assembly failed in the Mst77Y region, most likely because of its tandem duplication structure. The PacBio MHAP assembly of the Mst77Y region seems to be very accurate, as revealed by comparisons with the previously found Mst77Y genes, a bacterial artificial chromosome sequence, and Illumina reads of the same strain. We found that the Mst77Y region spans 96 kb and originated from a 3.4-kb transposition from chromosome 3L to the Y chromosome, followed by tandem duplications inside the Y chromosome and invasion of transposable elements, which account for 48% of its length. Twelve of the 18 Mst77Y genes found in 2010 were confirmed in the PacBio assembly, the remaining six being polymerase chain reaction-induced artifacts. There are several identical copies of some Mst77Y genes, coincidentally bringing the total copy number to 18. Besides providing a detailed picture of the Mst77Y region, our results highlight the utility of PacBio technology in assembling difficult genomic regions such as tandemly repeated genes.
Assuntos
Drosophila melanogaster/genética , Dosagem de Genes , Genes de Insetos , Análise de Sequência de DNA/métodos , Cromossomo Y/genética , Algoritmos , Animais , Evolução Molecular , Dados de Sequência Molecular , Reprodutibilidade dos TestesRESUMO
Drosophila melanogaster plays an important role in molecular, genetic, and genomic studies of heredity, development, metabolism, behavior, and human disease. The initial reference genome sequence reported more than a decade ago had a profound impact on progress in Drosophila research, and improving the accuracy and completeness of this sequence continues to be important to further progress. We previously described improvement of the 117-Mb sequence in the euchromatic portion of the genome and 21 Mb in the heterochromatic portion, using a whole-genome shotgun assembly, BAC physical mapping, and clone-based finishing. Here, we report an improved reference sequence of the single-copy and middle-repetitive regions of the genome, produced using cytogenetic mapping to mitotic and polytene chromosomes, clone-based finishing and BAC fingerprint verification, ordering of scaffolds by alignment to cDNA sequences, incorporation of other map and sequence data, and validation by whole-genome optical restriction mapping. These data substantially improve the accuracy and completeness of the reference sequence and the order and orientation of sequence scaffolds into chromosome arm assemblies. Representation of the Y chromosome and other heterochromatic regions is particularly improved. The new 143.9-Mb reference sequence, designated Release 6, effectively exhausts clone-based technologies for mapping and sequencing. Highly repeat-rich regions, including large satellite blocks and functional elements such as the ribosomal RNA genes and the centromeres, are largely inaccessible to current sequencing and assembly methods and remain poorly represented. Further significant improvements will require sequencing technologies that do not depend on molecular cloning and that produce very long reads.
Assuntos
Drosophila melanogaster/genética , Genoma , Animais , Mapeamento Cromossômico , Cromossomos Artificiais Bacterianos , Biologia Computacional , Mapeamento de Sequências Contíguas , Sequenciamento de Nucleotídeos em Larga Escala , Hibridização in Situ Fluorescente , Dados de Sequência Molecular , Cromossomos Politênicos , Mapeamento por RestriçãoRESUMO
The Y chromosome of Drosophila melanogaster has <20 protein-coding genes. These genes originated from the duplication of autosomal genes and have male-related functions. In 1993, Russell and Kaiser found three Y-linked pseudogenes of the Mst77F gene, which is a testis-expressed autosomal gene that is essential for male fertility. We did a thorough search using experimental and computational methods and found 18 Y-linked copies of this gene (named Mst77Y-1-Mst77Y-18). Ten Mst77Y genes encode defective proteins and the other eight are potentially functional. These eight genes produce approximately 20% of the functional Mst77F-like mRNA, and molecular evolutionary analysis shows that they evolved under purifying selection. Hence several Mst77Y genes have all the features of functional genes. Mst77Y genes are present only in D. melanogaster, and phylogenetic analysis confirmed that the duplication is a recent event. The identification of functional Mst77Y genes reinforces the previous finding that gene gains play a prominent role in the evolution of the Drosophila Y chromosome.
Assuntos
Proteínas de Drosophila/genética , Drosophila melanogaster/genética , Dosagem de Genes , Genes de Insetos/genética , Histonas/genética , Cromossomo Y/genética , Animais , Enzimas de Restrição do DNA/metabolismo , Proteínas de Drosophila/metabolismo , Evolução Molecular , Feminino , Genes Ligados ao Cromossomo Y/genética , Histonas/metabolismo , Masculino , Análise de Sequência de DNA , Transcrição GênicaRESUMO
The Y chromosome and other heterochromatic regions present special challenges for genome sequencing and for the annotation of genes. Here we describe two new genes (ARY and WDY) on the Drosophila melanogaster Y, bringing its number of known single-copy genes to 12. WDY may correspond to the fertility factor kl-1.
Assuntos
Drosophila melanogaster/genética , Genes Ligados ao Cromossomo Y , Cromossomo Y/genética , Aldeído Redutase/genética , Animais , Proteínas de Drosophila/genética , Masculino , Dados de Sequência MolecularRESUMO
Whole genome shotgun assemblies have proven remarkably successful in reconstructing the bulk of euchromatic genes, with the only limit appearing to be determined by the sequencing depth. For genes imbedded in heterochromatin, however, the low cloning efficiency of repetitive sequences, combined with the computational challenges, demand that additional clues be used to annotate the sequences. One approach that has proven very successful in identifying protein coding genes in Y-linked heterochromatin of Drosophila melanogaster has been to make a BLASTable database of the small, unmapped contigs and fragments leftover at the end of a shotgun assembly, and to attempt to capture these by blasting with an appropriate query sequence. This approach often yields a staggered alignment of contigs from the unmapped set to the query sequence, as though the disjoint contigs represent small portions of the gene. Further inspection frequently shows that the contigs are broken by very large, heterochromatic introns. Methods of this sort are being expanded to make best use of all available clues to determine which unmapped contigs are associated with genes. These include use of EST libraries, and, in the case of the Y chromosome, testing of male specific genes and reduced shotgun depth of relevant contigs. It appears much more hopeful than anyone would have imagined that whole genome shotgun assemblies can recover the great bulk of even heterochromatic genes.
Assuntos
Drosophila melanogaster/genética , Heterocromatina/genética , Cromossomo Y/genética , Animais , Bases de Dados de Ácidos Nucleicos , Etiquetas de Sequências Expressas , Análise de Sequência de DNA/métodosRESUMO
Three recent findings are making a deep impact on our understanding of the Drosophila Y. First, the sequencing of the Drosophila genome and the development of proper computational methods increased the number of known single-copy Y-linked genes from 1 to 16, and revealed a chromosome packed with genes acquired from the autosomes. Second, this, coupled with the finding that B-chromosomes are able to show very regular segregation from the X chromosome, reinforce the hypothesis that the Drosophila Y is a specialized B-chromosome, instead of a degenerated homologue of the X. Third and finally, Y chromosomes seem to have a strong effect on male fitness.
Assuntos
Drosophila/genética , Evolução Molecular , Cromossomo Y/genética , Animais , Masculino , Espermatozoides/metabolismoRESUMO
BACKGROUND: Most eukaryotic genomes include a substantial repeat-rich fraction termed heterochromatin, which is concentrated in centric and telomeric regions. The repetitive nature of heterochromatic sequence makes it difficult to assemble and analyze. To better understand the heterochromatic component of the Drosophila melanogaster genome, we characterized and annotated portions of a whole-genome shotgun sequence assembly. RESULTS: WGS3, an improved whole-genome shotgun assembly, includes 20.7 Mb of draft-quality sequence not represented in the Release 3 sequence spanning the euchromatin. We annotated this sequence using the methods employed in the re-annotation of the Release 3 euchromatic sequence. This analysis predicted 297 protein-coding genes and six non-protein-coding genes, including known heterochromatic genes, and regions of similarity to known transposable elements. Bacterial artificial chromosome (BAC)-based fluorescence in situ hybridization analysis was used to correlate the genomic sequence with the cytogenetic map in order to refine the genomic definition of the centric heterochromatin; on the basis of our cytological definition, the annotated Release 3 euchromatic sequence extends into the centric heterochromatin on each chromosome arm. CONCLUSIONS: Whole-genome shotgun assembly produced a reliable draft-quality sequence of a significant part of the Drosophila heterochromatin. Annotation of this sequence defined the intron-exon structures of 30 known protein-coding genes and 267 protein-coding gene models. The cytogenetic mapping suggests that an additional 150 predicted genes are located in heterochromatin at the base of the Release 3 euchromatic sequence. Our analysis suggests strategies for improving the sequence and annotation of the heterochromatic portions of the Drosophila and other complex genomes.