ABSTRACT
RNA sequencing (RNA-seq) has recently been used in translational research settings to facilitate diagnoses of Mendelian disorders. A significant obstacle for clinical laboratories in adopting RNA-seq is the low or absent expression of a significant number of disease-associated genes/transcripts in clinically accessible samples. As this is especially problematic in neurological diseases, we developed a clinical diagnostic approach that enhanced the detection and evaluation of tissue-specific genes/transcripts through fibroblast-to-neuron cell transdifferentiation. The approach is designed specifically to suit clinical implementation, emphasizing simplicity, cost effectiveness, turnaround time, and reproducibility. For clinical validation, we generated induced neurons (iNeurons) from 71 individuals with primary neurological phenotypes recruited to the Undiagnosed Diseases Network. The overall diagnostic yield was 25.4%. Over a quarter of the diagnostic findings benefited from transdifferentiation and could not be achieved by fibroblast RNA-seq alone. This iNeuron transcriptomic approach can be effectively integrated into diagnostic whole-transcriptome evaluation of individuals with genetic disorders.
Subjects
Cell Transdifferentiation , Fibroblasts , Neurons , RNA Sequence Analysis , Humans , Cell Transdifferentiation/genetics , Fibroblasts/metabolism , Fibroblasts/cytology , RNA Sequence Analysis/methods , Neurons/metabolism , Neurons/cytology , Transcriptome , Reproducibility of Results , Nervous System Diseases/genetics , Nervous System Diseases/diagnosis , RNA-Seq/methods , Female , Male
ABSTRACT
Systemic sclerosis (SSc) is a heterogeneous rare autoimmune fibrosing disorder affecting connective tissue. The etiology of systemic sclerosis is largely unknown, and many genes have been suggested as susceptibility loci of modest impact by genome-wide association studies (GWAS). Multiple factors can contribute to the pathological process of the disease, which makes it more difficult to identify possible disease-causing genetic alterations. In this study, we applied whole genome sequencing (WGS) to 101 indexed family trios, supplemented with transcriptome sequencing of cultured fibroblast cells from four patients and five family controls where available. Single nucleotide variants (SNVs) and copy number variants (CNVs) were examined, with emphasis on de novo variants. We also performed an enrichment test for rare variants in candidate genes previously proposed in association with systemic sclerosis. We identified 42 exonic and 34 ncRNA de novo SNV changes in 101 trios, from a total of over 6000 de novo variants genome wide. We observed a higher-than-expected number of de novo variants in the PRKXP1 gene. We observed the same phenomenon, along with increased expression in the patient group, in the NEK7 gene. Additionally, we observed significant enrichment of rare variants in candidate genes in the patient cohort, further supporting the complex, multifactorial etiology of systemic sclerosis. Our findings identify new candidate genes, including PRKXP1 and NEK7, for future studies in SSc. The rare variant enrichment in candidate genes previously proposed in association with SSc suggests that more effort should be made to investigate possible pathogenetic mechanisms associated with those genes.
Subjects
DNA Copy Number Variations , Genetic Predisposition to Disease , Genome-Wide Association Study , Single Nucleotide Polymorphism , Systemic Scleroderma , Whole Genome Sequencing , Humans , Systemic Scleroderma/genetics , Systemic Scleroderma/pathology , DNA Copy Number Variations/genetics , Male , Female , Adult , NIMA-Related Kinases/genetics , Middle Aged , Fibroblasts/metabolism , Fibroblasts/pathology
ABSTRACT
In contrast to the western honey bee, Apis mellifera, other honey bee species have been largely neglected despite their importance and diversity. The genetic basis of the evolutionary diversification of honey bees remains largely unknown. Here, we provide a genome-wide comparison of three honey bee species, each representing one of the three subgenera of honey bees, namely the dwarf (Apis florea), giant (A. dorsata), and cavity-nesting (A. mellifera) honey bees with bumblebees as an outgroup. Our analyses resolve the phylogeny of honey bees with the dwarf honey bees diverging first. We find that evolution of increased eusocial complexity in Apis proceeds via increases in the complexity of gene regulation, which is in agreement with previous studies. However, this process seems to be related to pathways other than transcriptional control. Positive selection patterns across Apis reveal a trade-off between maintaining genome stability and generating genetic diversity, with a rapidly evolving piRNA pathway leading to genomes depleted of transposable elements, and a rapidly evolving DNA repair pathway associated with high recombination rates in all Apis species. Diversification within Apis is accompanied by positive selection in several genes whose putative functions present candidate mechanisms for lineage-specific adaptations, such as migration, immunity, and nesting behavior.
ABSTRACT
BACKGROUND: Helios (encoded by IKZF2), a member of the Ikaros family of transcription factors, is a zinc finger protein involved in embryogenesis and immune function. Although predominantly recognised for its role in the development and function of T lymphocytes, particularly the CD4+ regulatory T cells (Tregs), the expression and function of Helios extends beyond the immune system. During embryogenesis, Helios is expressed in a wide range of tissues, making genetic variants that disrupt the function of Helios strong candidates for causing widespread immune-related and developmental abnormalities in humans. METHODS: We performed detailed phenotypic, genomic and functional investigations on two unrelated individuals with a phenotype of immune dysregulation combined with syndromic features including craniofacial differences, sensorineural hearing loss and congenital abnormalities. RESULTS: Genome sequencing revealed de novo heterozygous variants that alter the critical DNA-binding zinc fingers (ZFs) of Helios. Proband 1 had a tandem duplication of ZFs 2 and 3 in the DNA-binding domain of Helios (p.Gly136_Ser191dup) and Proband 2 had a missense variant impacting one of the key residues for specific base recognition and DNA interaction in ZF2 of Helios (p.Gly153Arg). Functional studies confirmed that both these variant proteins are expressed and that they interfere with the ability of the wild-type Helios protein to perform its canonical function (repressing IL2 transcription activity) in a dominant negative manner. CONCLUSION: This study is the first to describe dominant negative IKZF2 variants. These variants cause a novel genetic syndrome characterised by immunodysregulation, craniofacial anomalies, hearing impairment, athelia and developmental delay.
Subjects
Craniofacial Abnormalities , Developmental Disabilities , Hearing Loss , Ikaros Transcription Factor , Humans , DNA-Binding Proteins/genetics , Ikaros Transcription Factor/genetics , Syndrome , Developmental Disabilities/genetics , Craniofacial Abnormalities/genetics
ABSTRACT
Studies of Y Chromosome evolution have focused primarily on gene decay, a consequence of suppression of crossing-over with the X Chromosome. Here, we provide evidence that suppression of X-Y crossing-over unleashed a second dynamic: selfish X-Y arms races that reshaped the sex chromosomes in mammals as different as cattle, mice, and men. Using super-resolution sequencing, we explore the Y Chromosome of Bos taurus (bull) and find it to be dominated by massive, lineage-specific amplification of testis-expressed gene families, making it the most gene-dense Y Chromosome sequenced to date. As in mice, an X-linked homolog of a bull Y-amplified gene has become testis-specific and amplified. This evolutionary convergence implies that lineage-specific X-Y coevolution through gene amplification, and the selfish forces underlying this phenomenon, were dominatingly powerful among diverse mammalian lineages. Together with Y gene decay, X-Y arms races molded mammalian sex chromosomes and influenced the course of mammalian evolution.
Subjects
DNA Sequence Analysis/veterinary , X Chromosome/genetics , Y Chromosome/genetics , Animals , Cattle , Cell Lineage , Genetic Crossing Over , Molecular Evolution , Female , Gene Amplification , Humans , Male , Mice , Organ Specificity , Testis/chemistry
ABSTRACT
Our understanding of the evolutionary history of primates is undergoing continual revision due to ongoing genome sequencing efforts. Bolstered by growing fossil evidence, these data have led to increased acceptance of once controversial hypotheses regarding phylogenetic relationships, hybridization and introgression, and the biogeographical history of primate groups. Among these findings is a pattern of recent introgression between species within all major primate groups examined to date, though little is known about introgression deeper in time. To address this and other phylogenetic questions, here, we present new reference genome assemblies for 3 Old World monkey (OWM) species: Colobus angolensis ssp. palliatus (the black and white colobus), Macaca nemestrina (southern pig-tailed macaque), and Mandrillus leucophaeus (the drill). We combine these data with 23 additional primate genomes to estimate both the species tree and individual gene trees using thousands of loci. While our species tree is largely consistent with previous phylogenetic hypotheses, the gene trees reveal high levels of genealogical discordance associated with multiple primate radiations. We use strongly asymmetric patterns of gene tree discordance around specific branches to identify multiple instances of introgression between ancestral primate lineages. In addition, we exploit recent fossil evidence to perform fossil-calibrated molecular dating analyses across the tree. Taken together, our genome-wide data help to resolve multiple contentious sets of relationships among primates, while also providing insight into the biological processes and technical artifacts that led to the disagreements in the first place.
Subjects
Genetic Introgression/genetics , Primates/genetics , Animals , Biological Evolution , Cercopithecidae/genetics , Computational Biology/methods , Genetic Databases , Fossils , Gene Flow/genetics , Genome/genetics , Genetic Models , Phylogeny , DNA Sequence Analysis/methods
ABSTRACT
Stroke causes significant disability and is a common cause of death worldwide. Previous studies have estimated that 1%-5% of stroke is attributable to monogenic etiologies. We set out to assess the utility of clinical exome sequencing (ES) in the evaluation of stroke. We retrospectively analyzed 124 individuals who received ES at the Baylor Genetics reference lab between 2012 and 2021 and who had stroke as a major part of their reported phenotype. Ages ranged from 10 days to 69 years. 8.9% of the cohort received a diagnosis, including 25% of infants less than 1 year old; an additional 10.5% of the cohort received a probable diagnosis. We identified several syndromes that predispose to stroke, such as COL4A1-related brain small vessel disease, homocystinuria caused by CBS mutation, POLG-related disorders, TTC19-linked mitochondrial disease, and RNASEH2A-associated Aicardi-Goutières syndrome. We also observed pathogenic variants in NSD1, PKHD1, HRAS, and ATP13A2, which are genes rarely associated with stroke. Although stroke is a complex phenotype with varying pathologies and risk factors, these results show that use of exome sequencing can be highly relevant in stroke, especially for those presenting at <1 year of age.
Subjects
Exome , Stroke , Exome/genetics , Humans , Phenotype , Retrospective Studies , Stroke/diagnosis , Stroke/genetics , Exome Sequencing/methods
ABSTRACT
Leiomodin-2 (LMOD2) is an important regulator of the thin filament length, known to promote elongation of actin through polymerization at pointed ends. Mice with Lmod2 deficiency die around 3 weeks of age due to severe dilated cardiomyopathy (DCM), resulting from decreased heart contractility due to shorter thin filaments. To date, there have been three infants from two families reported with biallelic variants in LMOD2, presenting with perinatal onset DCM. Here, we describe a third family with a child harboring a previously described homozygous frameshift variant, c.1243_1244delCT (p.L415Vfs*108) with DCM, presenting later in infancy at 9 months of age. Family history was relevant for a sibling who died suddenly at 1 year of age after being diagnosed with cardiomegaly. LMOD2-related cardiomyopathy is a rare form of inherited cardiomyopathy resulting from thin filament length dysregulation and should be considered in genetic evaluation of newborns and infants with suspected autosomal recessive inheritance or sporadic early onset cardiomyopathy.
Subjects
Cardiomyopathies , Dilated Cardiomyopathy , Actin Cytoskeleton/genetics , Animals , Dilated Cardiomyopathy/diagnosis , Dilated Cardiomyopathy/genetics , Cytoskeletal Proteins/genetics , Heart , Humans , Newborn Infant , Mice , Muscle Proteins/genetics , Sarcomeres
ABSTRACT
BACKGROUND: The western flower thrips, Frankliniella occidentalis (Pergande), is a globally invasive pest and plant virus vector on a wide array of food, fiber, and ornamental crops. The underlying genetic mechanisms of the processes governing thrips pest and vector biology, feeding behaviors, ecology, and insecticide resistance are largely unknown. To address this gap, we present the F. occidentalis draft genome assembly and official gene set. RESULTS: We report on the first genome sequence for any member of the insect order Thysanoptera. Benchmarking Universal Single-Copy Ortholog (BUSCO) assessments of the genome assembly (size = 415.8 Mb, scaffold N50 = 948.9 kb) revealed a relatively complete and well-annotated assembly in comparison to other insect genomes. The genome is unusually GC-rich (50%) compared to other insect genomes to date. The official gene set (OGS v1.0) contains 16,859 genes, of which ~10% were manually verified and corrected by our consortium. We focused on manual annotation, phylogenetic, and expression evidence analyses for gene sets centered on primary themes in the life histories and activities of plant-colonizing insects. Highlights include the following: (1) divergent clades and large expansions in genes associated with environmental sensing (chemosensory receptors) and detoxification (CYP4, CYP6, and CCE enzymes) of substances encountered in agricultural environments; (2) a comprehensive set of salivary gland genes supported by enriched expression; (3) apparent absence of members of the IMD innate immune defense pathway; and (4) developmental- and sex-specific expression analyses of genes associated with progression from larvae to adulthood through neometaboly, a distinct form of maturation differing from either incomplete or complete metamorphosis in the Insecta. CONCLUSIONS: Analysis of the F. occidentalis genome offers insights into the polyphagous behavior of this insect pest that finds, colonizes, and survives on a widely diverse array of plants. The genomic resources presented here enable a more complete analysis of insect evolution and biology, providing a missing taxon for contemporary insect genomics-based analyses. Our study also offers a genomic benchmark for molecular and evolutionary investigations of other Thysanoptera species.
Subjects
Insect Genome , Life History Traits , Thysanoptera/physiology , Transcriptome , Animals , Agricultural Crops , Feeding Behavior , Food Chain , Innate Immunity/genetics , Perception , Phylogeny , Reproduction/genetics , Thysanoptera/genetics , Thysanoptera/immunology
ABSTRACT
An amendment to this paper has been published and can be accessed via the original article.
ABSTRACT
BACKGROUND: Human chromosome 19 has many unique characteristics, including gene density more than double the genome-wide average and 20 large tandemly clustered gene families. It also has the highest GC content of any chromosome, especially outside gene clusters. The high GC content and concomitant high content of hypermutable CpG sites raise the possibility that chromosome 19 exhibits higher levels of nucleotide diversity both within and between species, and may possess greater variation in DNA methylation that regulates gene expression. RESULTS: We examined GC and CpG content of chromosome 19 orthologs across representatives of the primate order. In all 12 primate species with suitable genome assemblies, chromosome 19 orthologs have the highest GC content of any chromosome. CpG dinucleotides and CpG islands are also more prevalent in chromosome 19 orthologs than in other chromosomes. GC and CpG content are generally higher outside the gene clusters. Intra-species variation based on SNPs in human common dbSNP, rhesus, crab-eating macaque, baboon and marmoset datasets is most prevalent on chromosome 19 and its orthologs. Inter-species comparisons based on phyloP conservation show accelerated nucleotide evolution for chromosome 19 promoter flanking and enhancer regions. These same regulatory regions show the highest CpG density of any chromosome, suggesting they possess considerable methylome regulatory potential. CONCLUSIONS: The pattern of high GC and CpG content in chromosome 19 orthologs, particularly outside gene clusters, is present from human to mouse lemur, representing 74 million years of primate evolution. Much CpG variation exists both within and between primate species, with a portion of this variation occurring in regulatory regions.
Subjects
Human Chromosomes Pair 19/genetics , Conserved Sequence , Primates/classification , Primates/genetics , Animals , Base Composition , Base Sequence , Chromosomes/genetics , Conserved Sequence/genetics , CpG Islands , DNA Methylation , Dinucleoside Phosphates/genetics , Genome , Humans , Lemur/classification , Lemur/genetics , Mice , Multigene Family , Phylogeny , Genetic Promoter Regions/genetics , Nucleic Acid Regulatory Sequences/genetics
ABSTRACT
BACKGROUND: Trichogrammatids are minute parasitoid wasps that develop within other insect eggs. They are less than half a millimeter long, smaller than some protozoans. The Trichogrammatidae are one of the earliest branching families of Chalcidoidea: a diverse superfamily of approximately half a million species of parasitoid wasps, proposed to have evolved from a miniaturized ancestor. Trichogramma are frequently used in agriculture, released as biological control agents against major moth and butterfly pests. Additionally, Trichogramma are well known for their symbiotic bacteria that induce asexual reproduction in infected females. Knowledge of the genome sequence of Trichogramma is a major step towards further understanding its biology and potential applications in pest control. RESULTS: We report the 195-Mb genome sequence of Trichogramma pretiosum and uncover signatures of miniaturization and adaptation in Trichogramma and related parasitoids. Comparative analyses reveal relatively rapid evolution of proteins involved in ribosome biogenesis and function, transcriptional regulation, and ploidy regulation. Chalcids also show loss or especially rapid evolution of 285 gene clusters conserved in other Hymenoptera, including many that are involved in signal transduction and embryonic development. Comparisons between sexual and asexual lineages of Trichogramma pretiosum reveal that there is no strong evidence for genome degradation (e.g., gene loss) in the asexual lineage, although it does contain a lower repeat content than the sexual lineage. Trichogramma shows particularly rapid genome evolution compared to other hymenopterans. We speculate these changes reflect adaptations to miniaturization, and to life as a specialized egg parasitoid. 
CONCLUSIONS: The genomes of Trichogramma and related parasitoids are a valuable resource for future studies of these diverse and economically important insects, including explorations of parasitoid biology, symbiosis, asexuality, biological control, and the evolution of miniaturization. Understanding the molecular determinants of parasitism can also inform mass rearing of Trichogramma and other parasitoids for biological control.
Subjects
Molecular Evolution , Biological Pest Control , Wasps/classification , Wasps/genetics , Animals , Genomics , Moths/parasitology , Phylogeny , Wasps/pathogenicity , Whole Genome Sequencing/methods
ABSTRACT
BACKGROUND: Having conquered water surfaces worldwide, the semi-aquatic bugs occupy ponds, streams, lakes, mangroves, and even open oceans. The diversity of this group has inspired a range of scientific studies from ecology and evolution to developmental genetics and hydrodynamics of fluid locomotion. However, the lack of a representative water strider genome hinders our ability to more thoroughly investigate the molecular mechanisms underlying the processes of adaptation and diversification within this group. RESULTS: Here we report the sequencing and manual annotation of the Gerris buenoi (G. buenoi) genome, the first water strider genome to be sequenced thus far. The size of the G. buenoi genome is approximately 1,000 Mb, and this sequencing effort has recovered 20,949 predicted protein-coding genes. Manual annotation uncovered a number of local (tandem and proximal) gene duplications and expansions of gene families known for their importance in a variety of processes associated with morphological and physiological adaptations to a water surface lifestyle. These expansions may affect key processes associated with growth, vision, desiccation resistance, detoxification, olfaction and epigenetic regulation. Strikingly, the G. buenoi genome contains three insulin receptors, suggesting key changes in the rewiring and function of the insulin pathway. Other genomic changes affecting opsin genes may be associated with wavelength sensitivity shifts in opsins, which are likely to be key in facilitating specific adaptations in vision for diverse water habitats. CONCLUSIONS: Our findings suggest that local gene duplications might have played an important role during the evolution of water striders. Along with these findings, the sequencing of the G. buenoi genome now provides the opportunity to further understand the genomic underpinnings of traits associated with the extreme body plan and life history of water striders.
Subjects
Genome , Heteroptera/genetics , Heteroptera/physiology , Insect Proteins/genetics , Physiological Adaptation , Animals , Molecular Evolution , Genomics , Heteroptera/classification , Phenotype , Phylogeny
ABSTRACT
Chemosensory-related gene (CRG) families have been studied extensively in insects, but their evolutionary history across the Arthropoda has remained relatively unexplored. Here, we address current hypotheses and prior conclusions on CRG family evolution using a more comprehensive data set. In particular, odorant receptors were hypothesized to have proliferated during terrestrial colonization by insects (hexapods), but their association with other pancrustacean clades and with independent terrestrial colonizations in other arthropod subphyla has been unclear. We also examine hypotheses on which arthropod CRG family is most ancient. Thus, we reconstructed phylogenies of CRGs, including those from new arthropod genomes and transcriptomes, and mapped CRG gains and losses across arthropod lineages. Our analysis was strengthened by including crustaceans, especially copepods, which reside outside the hexapod/branchiopod clade within the subphylum Pancrustacea. We generated the first high-resolution genome sequence of the copepod Eurytemora affinis and annotated its CRGs. We found odorant receptors and odorant binding proteins present only in hexapods (insects) and absent from all other arthropod lineages, indicating that they are not universal adaptations to land. Gustatory receptors likely represent the oldest chemosensory receptors among CRGs, dating back to the Placozoa. We also clarified and confirmed the evolutionary history of antennal ionotropic receptors across the Arthropoda. All antennal ionotropic receptors in E. affinis were expressed more highly in males than in females, suggestive of an association with male mate-recognition behavior. This study is the most comprehensive comparative analysis to date of CRG family evolution across the largest and most speciose metazoan phylum, Arthropoda.
Subjects
Arthropods/genetics , Odorant Receptors/genetics , Animals , Chemoreceptor Cells/physiology , Copepoda/genetics , Crustacea/genetics , Nucleic Acid Databases , Molecular Evolution , Genome/genetics , Insects/genetics , Multigene Family/genetics , Phylogeny
ABSTRACT
Hyalella azteca is a cryptic species complex of epibenthic amphipods of interest to ecotoxicology and evolutionary biology. It is the primary crustacean used in North America for sediment toxicity testing and an emerging model for molecular ecotoxicology. To provide molecular resources for sediment quality assessments and evolutionary studies, we sequenced, assembled, and annotated the genome of the H. azteca U.S. Lab Strain. The genome quality and completeness are comparable with those of other ecotoxicological model species. Through targeted investigation and use of gene expression data sets of H. azteca exposed to pesticides, metals, and other emerging contaminants, we annotated and characterized the major gene families involved in sequestration, detoxification, oxidative stress, and toxicant response. Our results revealed gene loss related to light sensing, but a large expansion in chemoreceptors, likely underlying sensory shifts necessary in their low-light habitats. Gene family expansions were also noted for cytochrome P450 genes, cuticle proteins, and ion transporters, and include recent gene duplications in the metal sequestration protein metallothionein. Mapping of differentially expressed transcripts to the genome significantly increased the ability to functionally annotate toxicant-responsive genes. The H. azteca genome will greatly facilitate development of genomic tools for environmental assessments and promote an understanding of how evolution shapes toxicological pathways with implications for environmental and human health.
Subjects
Amphipoda , Chemical Water Pollutants , Animals , Ecotoxicology , Geologic Sediments , North America , Toxicity Tests
ABSTRACT
Genomic information has become a ubiquitous and almost essential aspect of biological research. Over the last 10-15 years, the cost of generating sequence data from DNA or RNA samples has dramatically declined and our ability to interpret those data has increased just as remarkably. Although it is still possible for biologists to conduct interesting and valuable research on species for which genomic data are not available, the impact of having access to a high quality whole genome reference assembly for a given species is nothing short of transformational. Research on a species for which we have no DNA or RNA sequence data is restricted in fundamental ways. In contrast, even access to an initial draft quality genome (see below for definitions) opens a wide range of opportunities that are simply not available without that reference genome assembly. Although a complete discussion of the impact of genome sequencing and assembly is beyond the scope of this short paper, the goal of this review is to summarize the most common and highest impact contributions that whole genome sequencing and assembly have made to comparative and evolutionary biology.
Subjects
Base Sequence/genetics , Chromosome Mapping , Computational Biology , Genome/genetics , Animals , Chromosome Mapping/economics , Computational Biology/methods , High-Throughput Nucleotide Sequencing/methods , Humans , DNA Sequence Analysis/methods
ABSTRACT
A major challenge of biology is understanding the relationship between molecular genetic variation and variation in quantitative traits, including fitness. This relationship determines our ability to predict phenotypes from genotypes and to understand how evolutionary forces shape variation within and between species. Previous efforts to dissect the genotype-phenotype map were based on incomplete genotypic information. Here, we describe the Drosophila melanogaster Genetic Reference Panel (DGRP), a community resource for analysis of population genomics and quantitative traits. The DGRP consists of fully sequenced inbred lines derived from a natural population. Population genomic analyses reveal reduced polymorphism in centromeric autosomal regions and the X chromosome, evidence for positive and negative selection, and rapid evolution of the X chromosome. Many variants in novel genes, most at low frequency, are associated with quantitative traits and explain a large fraction of the phenotypic variance. The DGRP facilitates genotype-phenotype mapping using the power of Drosophila genetics.
Subjects
Drosophila melanogaster/genetics , Genome-Wide Association Study , Genomics , Quantitative Trait Loci/genetics , Alleles , Animals , Centromere/genetics , Insect Chromosomes/genetics , Genotype , Phenotype , Single Nucleotide Polymorphism/genetics , Genetic Selection/genetics , Starvation/genetics , Telomere/genetics , X Chromosome/genetics
ABSTRACT
BACKGROUND: The de novo assembly of repeat-rich mammalian genomes using only high-throughput short read sequencing data typically results in highly fragmented genome assemblies that limit downstream applications. Here, we present an iterative approach to hybrid de novo genome assembly that incorporates datasets stemming from multiple genomic technologies and methods. We used this approach to improve the gray mouse lemur (Microcebus murinus) genome from early draft status to a near chromosome-scale assembly. METHODS: We used a combination of advanced genomic technologies to iteratively resolve conflicts and super-scaffold the M. murinus genome. RESULTS: We improved the M. murinus genome assembly to a scaffold N50 of 93.32 Mb. Whole genome alignments between our primary super-scaffolds and 23 human chromosomes revealed patterns that are congruent with historical comparative cytogenetic data, thus demonstrating the accuracy of our de novo scaffolding approach and allowing assignment of scaffolds to M. murinus chromosomes. Moreover, we utilized our independent datasets to discover and characterize sequences associated with centromeres across the mouse lemur genome. Quality assessment of the final assembly found 96% of mouse lemur canonical transcripts nearly complete, comparable to other published high-quality reference genome assemblies. CONCLUSIONS: We describe a new assembly of the gray mouse lemur (Microcebus murinus) genome with chromosome-scale scaffolds produced using a hybrid bioinformatic and sequencing approach. The approach is cost effective and produces superior results based on metrics of contiguity and completeness. Our results show that emerging genomic technologies can be used in combination to characterize centromeres of non-model species and to produce accurate de novo chromosome-scale genome assemblies of complex mammalian genomes.
Subjects
Centromere/genetics , Cheirogaleidae/genetics , Genome , Animals , Computational Biology , Female , High-Throughput Nucleotide Sequencing , DNA Sequence Analysis
ABSTRACT
BACKGROUND: The duplication of genes can occur through various mechanisms and is thought to make a major contribution to the evolutionary diversification of organisms. There is increasing evidence for a large-scale duplication of genes in some chelicerate lineages including two rounds of whole genome duplication (WGD) in horseshoe crabs. To investigate this further, we sequenced and analyzed the genome of the common house spider Parasteatoda tepidariorum. RESULTS: We found pervasive duplication of both coding and non-coding genes in this spider, including two clusters of Hox genes. Analysis of synteny conservation across the P. tepidariorum genome suggests that there has been an ancient WGD in spiders. Comparison with the genomes of other chelicerates, including that of the newly sequenced bark scorpion Centruroides sculpturatus, suggests that this event occurred in the common ancestor of spiders and scorpions, and is probably independent of the WGDs in horseshoe crabs. Furthermore, characterization of the sequence and expression of the Hox paralogs in P. tepidariorum suggests that many have been subject to neo-functionalization and/or sub-functionalization since their duplication. CONCLUSIONS: Our results reveal that spiders and scorpions are likely the descendants of a polyploid ancestor that lived more than 450 MYA. Given the extensive morphological diversity and ecological adaptations found among these animals, rivaling those of vertebrates, our study of the ancient WGD event in Arachnopulmonata provides a new comparative platform to explore common and divergent evolutionary outcomes of polyploidization events across eukaryotes.