RESUMO
Heteromorphic sex chromosomes are usually thought to have originated from a pair of autosomes that acquired a sex-determining locus and subsequently stopped recombining, leading to degeneration of the sex-limited chromosome. The majority of nematode species lack heteromorphic sex chromosomes and determine sex using an X-chromosome counting mechanism, with males being hemizygous for one or more X chromosomes (XX/X0). Some filarial nematode species, including important parasites of humans, have heteromorphic XX/XY karyotypes. It has been assumed that sex is determined by a Y-linked locus in these species. However, karyotypic analyses suggested that filarial Y chromosomes are derived from the unfused homologue of an autosome involved in an X-autosome fusion event. Here, we generated a chromosome-level reference genome for Litomosoides sigmodontis, a filarial nematode with the ancestral filarial karyotype and sex determination mechanism (XX/X0). By mapping the assembled chromosomes to the rhabditid nematode ancestral linkage (or Nigon) elements, we infer that the ancestral filarial X chromosome was the product of a fusion between NigonX (the ancestrally X-linked element) and NigonD (ancestrally autosomal). In the two filarial lineages with XY systems, there have been two independent X-autosome chromosome fusion events involving different autosomal Nigon elements. In both lineages, the region shared by the neo-X and neo-Y chromosomes is within the ancestrally autosomal portion of the X, confirming that the filarial Y chromosomes are derived from the unfused homologue of the autosome. Sex determination in XY filarial nematodes therefore likely continues to operate via the ancestral X-chromosome counting mechanism, rather than via a Y-linked sex-determining locus.
Assuntos
Filarioidea , Nematoides , Animais , Masculino , Humanos , Cromossomo Y/genética , Cromossomos Sexuais , Cromossomo X/genética , Cromossomos Humanos X , Filarioidea/genéticaRESUMO
The blue whale, Balaenoptera musculus, is the largest animal known to have ever existed, making it an important case study in longevity and resistance to cancer. To further this and other blue whale-related research, we report a reference-quality, long-read-based genome assembly of this fascinating species. We assembled the genome from PacBio long reads and utilized Illumina/10×, optical maps, and Hi-C data for scaffolding, polishing, and manual curation. We also provided long read RNA-seq data to facilitate the annotation of the assembly by NCBI and Ensembl. Additionally, we annotated both haplotypes using TOGA and measured the genome size by flow cytometry. We then compared the blue whale genome with other cetaceans and artiodactyls, including vaquita (Phocoena sinus), the world's smallest cetacean, to investigate blue whale's unique biological traits. We found a dramatic amplification of several genes in the blue whale genome resulting from a recent burst in segmental duplications, though the possible connection between this amplification and giant body size requires further study. We also discovered sites in the insulin-like growth factor-1 gene correlated with body size in cetaceans. Finally, using our assembly to examine the heterozygosity and historical demography of Pacific and Atlantic blue whale populations, we found that the genomes of both populations are highly heterozygous and that their genetic isolation dates to the last interglacial period. Taken together, these results indicate how a high-quality, annotated blue whale genome will serve as an important resource for biology, evolution, and conservation research.
Assuntos
Balaenoptera , Neoplasias , Animais , Balaenoptera/genética , Duplicações Segmentares Genômicas , Genoma , Demografia , Neoplasias/genéticaRESUMO
Advances in long-read sequencing technologies and genome assembly methods have enabled the recent completion of the first telomere-to-telomere human genome assembly, which resolves complex segmental duplications and large tandem repeats, including centromeric satellite arrays in a complete hydatidiform mole (CHM13). Although derived from highly accurate sequences, evaluation revealed evidence of small errors and structural misassemblies in the initial draft assembly. To correct these errors, we designed a new repeat-aware polishing strategy that made accurate assembly corrections in large repeats without overcorrection, ultimately fixing 51% of the existing errors and improving the assembly quality value from 70.2 to 73.9 measured from PacBio high-fidelity and Illumina k-mers. By comparing our results to standard automated polishing tools, we outline common polishing errors and offer practical suggestions for genome projects with limited resources. We also show how sequencing biases in both high-fidelity and Oxford Nanopore Technologies reads cause signature assembly errors that can be corrected with a diverse panel of sequencing technologies.
Assuntos
Sequenciamento de Nucleotídeos em Larga Escala , Nanoporos , Feminino , Genoma Humano , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Humanos , Gravidez , Análise de Sequência de DNA/métodos , Telômero/genéticaRESUMO
We have generated an improved assembly and gene annotation of the pig X Chromosome, and a first draft assembly of the pig Y Chromosome, by sequencing BAC and fosmid clones from Duroc animals and incorporating information from optical mapping and fiber-FISH. The X Chromosome carries 1033 annotated genes, 690 of which are protein coding. Gene order closely matches that found in primates (including humans) and carnivores (including cats and dogs), which is inferred to be ancestral. Nevertheless, several protein-coding genes present on the human X Chromosome were absent from the pig, and 38 pig-specific X-chromosomal genes were annotated, 22 of which were olfactory receptors. The pig Y-specific Chromosome sequence generated here comprises 30 megabases (Mb). A 15-Mb subset of this sequence was assembled, revealing two clusters of male-specific low copy number genes, separated by an ampliconic region including the HSFY gene family, which together make up most of the short arm. Both clusters contain palindromes with high sequence identity, presumably maintained by gene conversion. Many of the ancestral X-related genes previously reported in at least one mammalian Y Chromosome are represented either as active genes or partial sequences. This sequencing project has allowed us to identify genes--both single copy and amplified--on the pig Y Chromosome, to compare the pig X and Y Chromosomes for homologous sequences, and thereby to reveal mechanisms underlying pig X and Y Chromosome evolution.
Assuntos
Cromossomos de Mamíferos/genética , Evolução Molecular , Suínos/genética , Cromossomo X/genética , Cromossomo Y/genética , Animais , Sequência de Bases , Gatos/genética , Cães/genética , Feminino , Conversão Gênica , Expressão Gênica , Biblioteca Gênica , Ordem dos Genes , Humanos , Masculino , Dados de Sequência Molecular , Alinhamento de Sequência , Análise de Sequência de DNARESUMO
Theory predicts that compensatory genetic changes reduce negative indirect effects of selected variants during adaptive evolution, but evidence is scarce. Here, we test this in a wild population of Hawaiian crickets using temporal genomics and a high-quality chromosome-level cricket genome. In this population, a mutation, flatwing, silences males and rapidly spread due to an acoustically-orienting parasitoid. Our sampling spanned a social transition during which flatwing fixed and the population went silent. We find long-range linkage disequilibrium around the putative flatwing locus was maintained over time, and hitchhiking genes had functions related to negative flatwing-associated effects. We develop a combinatorial enrichment approach using transcriptome data to test for compensatory, intragenomic coevolution. Temporal changes in genomic selection were distributed genome-wide and functionally associated with the population's transition to silence, particularly behavioural responses to silent environments. Our results demonstrate how 'adaptation begets adaptation'; changes to the sociogenetic environment accompanying rapid trait evolution can generate selection provoking further, compensatory adaptation.
Assuntos
Genômica , Gryllidae , Animais , Gryllidae/genética , Gryllidae/fisiologia , Masculino , Genômica/métodos , Havaí , Adaptação Fisiológica/genética , Desequilíbrio de Ligação , Genoma de Inseto , Evolução Biológica , Feminino , Mutação , Seleção Genética , Evolução Molecular , Transcriptoma/genéticaRESUMO
The intertidal gastropod Littorina saxatilis is a model system to study speciation and local adaptation. The repeated occurrence of distinct ecotypes showing different levels of genetic divergence makes L. saxatilis particularly suited to study different stages of the speciation continuum in the same lineage. A major finding is the presence of several large chromosomal inversions associated with the divergence of ecotypes and, specifically, the species offers a system to study the role of inversions in this divergence. The genome of L. saxatilis is 1.35â Gb and composed of 17 chromosomes. The first reference genome of the species was assembled using Illumina data, was highly fragmented (N50 of 44â kb), and was quite incomplete, with a BUSCO completeness of 80.1% on the Metazoan dataset. A linkage map of one full-sibling family enabled the placement of 587â Mbp of the genome into 17 linkage groups corresponding to the haploid number of chromosomes, but the fragmented nature of this reference genome limited the understanding of the interplay between divergent selection and gene flow during ecotype formation. Here, we present a newly generated reference genome that is highly contiguous, with a N50 of 67â Mb and 90.4% of the total assembly length placed in 17 super-scaffolds. It is also highly complete with a BUSCO completeness of 94.1% of the Metazoa dataset. This new reference will allow for investigations into the genomic regions implicated in ecotype formation as well as better characterization of the inversions and their role in speciation.
Assuntos
Cromossomos , Genoma , Animais , Cromossomos/genética , Gastrópodes/genética , Inversão Cromossômica , EcótipoRESUMO
Suncus etruscus is one of the world's smallest mammals, with an average body mass of about 2 grams. The Etruscan shrew's small body is accompanied by a very high energy demand and numerous metabolic adaptations. Here we report a chromosome-level genome assembly using PacBio long read sequencing, 10X Genomics linked short reads, optical mapping, and Hi-C linked reads. The assembly is partially phased, with the 2.472 Gbp primary pseudohaplotype and 1.515 Gbp alternate. We manually curated the primary assembly and identified 22 chromosomes, including X and Y sex chromosomes. The NCBI genome annotation pipeline identified 39,091 genes, 19,819 of them protein-coding. We also identified segmental duplications, inferred GO term annotations, and computed orthologs of human and mouse genes. This reference-quality genome will be an important resource for research on mammalian development, metabolism, and body size control.
Assuntos
Cromossomos , Musaranhos , Animais , Camundongos , Cromossomos/genética , Genoma , Genômica , Anotação de Sequência Molecular , Musaranhos/genéticaRESUMO
We present genome assembly from individual female An. coustani (African malaria mosquito; Arthropoda; Insecta; Diptera; Culicidae) from Lopé, Gabon. The genome sequence is 270 megabases in span. Most of the assembly is scaffolded into three chromosomal pseudomolecules with the X sex chromosome assembled for both species. The complete mitochondrial genome was also assembled and is 15.4 kilobases in length.
RESUMO
Sex-limited polymorphism has evolved in many species including our own. Yet, we lack a detailed understanding of the underlying genetic variation and evolutionary processes at work. The brood parasitic common cuckoo (Cuculus canorus) is a prime example of female-limited color polymorphism, where adult males are monochromatic gray and females exhibit either gray or rufous plumage. This polymorphism has been hypothesized to be governed by negative frequency-dependent selection whereby the rarer female morph is protected against harassment by males or from mobbing by parasitized host species. Here, we show that female plumage dichromatism maps to the female-restricted genome. We further demonstrate that, consistent with balancing selection, ancestry of the rufous phenotype is shared with the likewise female dichromatic sister species, the oriental cuckoo (Cuculus optatus). This study shows that sex-specific polymorphism in trait variation can be resolved by genetic variation residing on a sex-limited chromosome and be maintained across species boundaries.
Assuntos
Polimorfismo Genético , Animais , Feminino , Masculino , Aves/genética , Fenótipo , Evolução Biológica , Pigmentação/genética , Caracteres Sexuais , Evolução MolecularRESUMO
A genomic database of all Earth's eukaryotic species could contribute to many scientific discoveries; however, only a tiny fraction of species have genomic information available. In 2018, scientists across the world united under the Earth BioGenome Project (EBP), aiming to produce a database of high-quality reference genomes containing all ~1.5 million recognized eukaryotic species. As the European node of the EBP, the European Reference Genome Atlas (ERGA) sought to implement a new decentralised, equitable and inclusive model for producing reference genomes. For this, ERGA launched a Pilot Project establishing the first distributed reference genome production infrastructure and testing it on 98 eukaryotic species from 33 European countries. Here we outline the infrastructure and explore its effectiveness for scaling high-quality reference genome production, whilst considering equity and inclusion. The outcomes and lessons learned provide a solid foundation for ERGA while offering key learnings to other transnational, national genomic resource projects and the EBP.
RESUMO
We present a genome assembly from an individual male Anopheles moucheti (the malaria mosquito; Arthropoda; Insecta; Diptera; Culicidae), from a wild population in Cameroon. The genome sequence is 271 megabases in span. The majority of the assembly is scaffolded into three chromosomal pseudomolecules with the X sex chromosome assembled. The complete mitochondrial genome was also assembled and is 15.5 kilobases in length.
RESUMO
Numerous novel adaptations characterise the radiation of notothenioids, the dominant fish group in the freezing seas of the Southern Ocean. To improve understanding of the evolution of this iconic fish group, here we generate and analyse new genome assemblies for 24 species covering all major subgroups of the radiation, including five long-read assemblies. We present a new estimate for the onset of the radiation at 10.7 million years ago, based on a time-calibrated phylogeny derived from genome-wide sequence data. We identify a two-fold variation in genome size, driven by expansion of multiple transposable element families, and use the long-read data to reconstruct two evolutionarily important, highly repetitive gene family loci. First, we present the most complete reconstruction to date of the antifreeze glycoprotein gene family, whose emergence enabled survival in sub-zero temperatures, showing the expansion of the antifreeze gene locus from the ancestral to the derived state. Second, we trace the loss of haemoglobin genes in icefishes, the only vertebrates lacking functional haemoglobins, through complete reconstruction of the two haemoglobin gene clusters across notothenioid families. Both the haemoglobin and antifreeze genomic loci are characterised by multiple transposon expansions that may have driven the evolutionary history of these genes.
Assuntos
Peixes , Perciformes , Animais , Peixes/genética , Genômica , Vertebrados , Filogenia , Hemoglobinas/genética , Regiões AntárticasRESUMO
Hermetia illucens L. (Diptera: Stratiomyidae), the Black Soldier Fly (BSF) is an increasingly important species for bioconversion of organic material into animal feed. We generated a high-quality chromosome-scale genome assembly of the BSF using Pacific Bioscience, 10X Genomics linked read and high-throughput chromosome conformation capture sequencing technology. Scaffolding the final assembly with Hi-C data produced a highly contiguous 1.01 Gb genome with 99.75% of scaffolds assembled into pseudochromosomes representing seven chromosomes with 16.01 Mb contig and 180.46 Mb scaffold N50 values. The highly complete genome obtained a Benchmarking Universal Single-Copy Orthologs (BUSCO) completeness of 98.6%. We masked 67.32% of the genome as repetitive sequences and annotated a total of 16,478 protein-coding genes using the BRAKER2 pipeline. We analyzed an established lab population to investigate the genomic variation and architecture of the BSF revealing six autosomes and an X chromosome. Additionally, we estimated the inbreeding coefficient (1.9%) of the lab population by assessing runs of homozygosity. This provided evidence for inbreeding events including long runs of homozygosity on chromosome 5. The release of this novel chromosome-scale BSF genome assembly will provide an improved resource for further genomic studies, functional characterization of genes of interest and genetic modification of this economically important species.
Assuntos
Cromossomos , Dípteros , Animais , Cromossomos/genética , Dípteros/genética , Genoma , Genômica , Sequências Repetitivas de Ácido NucleicoRESUMO
BACKGROUND: The tufted duck is a non-model organism that experiences high mortality in highly pathogenic avian influenza outbreaks. It belongs to the same bird family (Anatidae) as the mallard, one of the best-studied natural hosts of low-pathogenic avian influenza viruses. Studies in non-model bird species are crucial to disentangle the role of the host response in avian influenza virus infection in the natural reservoir. Such endeavour requires a high-quality genome assembly and transcriptome. FINDINGS: This study presents the first high-quality, chromosome-level reference genome assembly of the tufted duck using the Vertebrate Genomes Project pipeline. We sequenced RNA (complementary DNA) from brain, ileum, lung, ovary, spleen, and testis using Illumina short-read and Pacific Biosciences long-read sequencing platforms, which were used for annotation. We found 34 autosomes plus Z and W sex chromosomes in the curated genome assembly, with 99.6% of the sequence assigned to chromosomes. Functional annotation revealed 14,099 protein-coding genes that generate 111,934 transcripts, which implies a mean of 7.9 isoforms per gene. We also identified 246 small RNA families. CONCLUSIONS: This annotated genome contributes to continuing research into the host response in avian influenza virus infections in a natural reservoir. Our findings from a comparison between short-read and long-read reference transcriptomics contribute to a deeper understanding of these competing options. In this study, both technologies complemented each other. We expect this annotation to be a foundation for further comparative and evolutionary genomic studies, including many waterfowl relatives with differing susceptibilities to avian influenza viruses.
Assuntos
Patos , Influenza Aviária , Animais , Patos/genética , Feminino , Genoma , Genômica , Humanos , Influenza Aviária/epidemiologia , Influenza Aviária/genética , Masculino , TranscriptomaRESUMO
Optical mapping is a technology that gathers long-range information on genome sequences similar to ordered restriction digest maps. Because it is not subject to cloning, amplification, hybridisation or sequencing bias, it is ideally suited to the improvement of fragmented genome assemblies that can no longer be improved by classical methods. In addition, its low cost and rapid turnaround make it equally useful during the scaffolding process of de novo assembly from high throughput sequencing reads. We describe how optical mapping has been used in practice to produce high quality vertebrate genome assemblies. In particular, we detail the efforts undertaken by the Genome Reference Consortium (GRC), which maintains the reference genomes for human, mouse, zebrafish and chicken, and uses different optical mapping platforms for genome curation.