Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 76
Filter
Add more filters

Country/Region as subject
Publication year range
1.
Nature ; 611(7936): 519-531, 2022 Nov.
Article in English | MEDLINE | ID: mdl-36261518

ABSTRACT

The current human reference genome, GRCh38, represents over 20 years of effort to generate a high-quality assembly, which has benefitted society1,2. However, it still has many gaps and errors, and does not represent a biological genome as it is a blend of multiple individuals3,4. Recently, a high-quality telomere-to-telomere reference, CHM13, was generated with the latest long-read technologies, but it was derived from a hydatidiform mole cell line with a nearly homozygous genome5. To address these limitations, the Human Pangenome Reference Consortium formed with the goal of creating high-quality, cost-effective, diploid genome assemblies for a pangenome reference that represents human genetic diversity6. Here, in our first scientific report, we determined which combination of current genome sequencing and assembly approaches yield the most complete and accurate diploid genome assembly with minimal manual curation. Approaches that used highly accurate long reads and parent-child data with graph-based haplotype phasing during assembly outperformed those that did not. Developing a combination of the top-performing methods, we generated our first high-quality diploid reference assembly, containing only approximately four gaps per chromosome on average, with most chromosomes within ±1% of the length of CHM13. Nearly 48% of protein-coding genes have non-synonymous amino acid changes between haplotypes, and centromeric regions showed the highest diversity. Our findings serve as a foundation for assembling near-complete diploid human genomes at scale for a pangenome reference to capture global genetic variation from single nucleotides to structural rearrangements.


Subject(s)
Chromosome Mapping , Diploidy , Genome, Human , Genomics , Humans , Chromosome Mapping/standards , Genome, Human/genetics , Haplotypes/genetics , High-Throughput Nucleotide Sequencing/methods , High-Throughput Nucleotide Sequencing/standards , Sequence Analysis, DNA/methods , Sequence Analysis, DNA/standards , Reference Standards , Genomics/methods , Genomics/standards , Chromosomes, Human/genetics , Genetic Variation/genetics
2.
Nature ; 612(7940): 495-502, 2022 12.
Article in English | MEDLINE | ID: mdl-36450981

ABSTRACT

Fanconi anaemia (FA), a model syndrome of genome instability, is caused by a deficiency in DNA interstrand crosslink repair resulting in chromosome breakage1-3. The FA repair pathway protects against endogenous and exogenous carcinogenic aldehydes4-7. Individuals with FA are hundreds to thousands fold more likely to develop head and neck (HNSCC), oesophageal and anogenital squamous cell carcinomas8 (SCCs). Molecular studies of SCCs from individuals with FA (FA SCCs) are limited, and it is unclear how FA SCCs relate to sporadic HNSCCs primarily driven by tobacco and alcohol exposure or infection with human papillomavirus9 (HPV). Here, by sequencing genomes and exomes of FA SCCs, we demonstrate that the primary genomic signature of FA repair deficiency is the presence of high numbers of structural variants. Structural variants are enriched for small deletions, unbalanced translocations and fold-back inversions, and are often connected, thereby forming complex rearrangements. They arise in the context of TP53 loss, but not in the context of HPV infection, and lead to somatic copy-number alterations of HNSCC driver genes. We further show that FA pathway deficiency may lead to epithelial-to-mesenchymal transition and enhanced keratinocyte-intrinsic inflammatory signalling, which would contribute to the aggressive nature of FA SCCs. We propose that the genomic instability in sporadic HPV-negative HNSCC may arise as a result of the FA repair pathway being overwhelmed by DNA interstrand crosslink damage caused by alcohol and tobacco-derived aldehydes, making FA SCC a powerful model to study tumorigenesis resulting from DNA-crosslinking damage.


Subject(s)
DNA Repair , Fanconi Anemia , Genomics , Head and Neck Neoplasms , Humans , Aldehydes/adverse effects , Aldehydes/metabolism , DNA Repair/genetics , Fanconi Anemia/genetics , Fanconi Anemia/metabolism , Fanconi Anemia/pathology , Head and Neck Neoplasms/chemically induced , Head and Neck Neoplasms/genetics , Head and Neck Neoplasms/metabolism , Head and Neck Neoplasms/pathology , Papillomavirus Infections , Squamous Cell Carcinoma of Head and Neck/chemically induced , Squamous Cell Carcinoma of Head and Neck/genetics , Squamous Cell Carcinoma of Head and Neck/metabolism , Squamous Cell Carcinoma of Head and Neck/pathology , DNA Damage/drug effects
3.
Nature ; 594(7862): 227-233, 2021 06.
Article in English | MEDLINE | ID: mdl-33910227

ABSTRACT

The accurate and complete assembly of both haplotype sequences of a diploid organism is essential to understanding the role of variation in genome functions, phenotypes and diseases1. Here, using a trio-binning approach, we present a high-quality, diploid reference genome, with both haplotypes assembled independently at the chromosome level, for the common marmoset (Callithrix jacchus), an primate model system that is widely used in biomedical research2,3. The full spectrum of heterozygosity between the two haplotypes involves 1.36% of the genome-much higher than the 0.13% indicated by the standard estimation based on single-nucleotide heterozygosity alone. The de novo mutation rate is 0.43 × 10-8 per site per generation, and the paternal inherited genome acquired twice as many mutations as the maternal. Our diploid assembly enabled us to discover a recent expansion of the sex-differentiation region and unique evolutionary changes in the marmoset Y chromosome. In addition, we identified many genes with signatures of positive selection that might have contributed to the evolution of Callithrix biological features. Brain-related genes were highly conserved between marmosets and humans, although several genes experienced lineage-specific copy number variations or diversifying selection, with implications for the use of marmosets as a model system.


Subject(s)
Callithrix/genetics , Diploidy , Evolution, Molecular , Genome/genetics , Genomics/standards , Animals , Biomedical Research , DNA Copy Number Variations , Female , Germ-Line Mutation/genetics , Haplotypes/genetics , Heterozygote , Humans , INDEL Mutation/genetics , Male , Reference Standards , Selection, Genetic , Sex Differentiation/genetics , Y Chromosome/genetics
4.
Nature ; 592(7856): 756-762, 2021 04.
Article in English | MEDLINE | ID: mdl-33408411

ABSTRACT

Egg-laying mammals (monotremes) are the only extant mammalian outgroup to therians (marsupial and eutherian animals) and provide key insights into mammalian evolution1,2. Here we generate and analyse reference genomes of the platypus (Ornithorhynchus anatinus) and echidna (Tachyglossus aculeatus), which represent the only two extant monotreme lineages. The nearly complete platypus genome assembly has anchored almost the entire genome onto chromosomes, markedly improving the genome continuity and gene annotation. Together with our echidna sequence, the genomes of the two species allow us to detect the ancestral and lineage-specific genomic changes that shape both monotreme and mammalian evolution. We provide evidence that the monotreme sex chromosome complex originated from an ancestral chromosome ring configuration. The formation of such a unique chromosome complex may have been facilitated by the unusually extensive interactions between the multi-X and multi-Y chromosomes that are shared by the autosomal homologues in humans. Further comparative genomic analyses unravel marked differences between monotremes and therians in haptoglobin genes, lactation genes and chemosensory receptor genes for smell and taste that underlie the ecological adaptation of monotremes.


Subject(s)
Biological Evolution , Genome , Platypus/genetics , Tachyglossidae/genetics , Animals , Female , Male , Mammals/genetics , Phylogeny , Sex Chromosomes/genetics
5.
Nature ; 592(7856): 737-746, 2021 04.
Article in English | MEDLINE | ID: mdl-33911273

ABSTRACT

High-quality and complete reference genome assemblies are fundamental for the application of genomics to biology, disease, and biodiversity conservation. However, such assemblies are available for only a few non-microbial species1-4. To address this issue, the international Genome 10K (G10K) consortium5,6 has worked over a five-year period to evaluate and develop cost-effective methods for assembling highly accurate and nearly complete reference genomes. Here we present lessons learned from generating assemblies for 16 species that represent six major vertebrate lineages. We confirm that long-read sequencing technologies are essential for maximizing genome quality, and that unresolved complex repeats and haplotype heterozygosity are major sources of assembly error when not handled correctly. Our assemblies correct substantial errors, add missing sequence in some of the best historical reference genomes, and reveal biological discoveries. These include the identification of many false gene duplications, increases in gene sizes, chromosome rearrangements that are specific to lineages, a repeated independent chromosome breakpoint in bat genomes, and a canonical GC-rich pattern in protein-coding genes and their regulatory regions. Adopting these lessons, we have embarked on the Vertebrate Genomes Project (VGP), an international effort to generate high-quality, complete reference genomes for all of the roughly 70,000 extant vertebrate species and to help to enable a new era of discovery across the life sciences.


Subject(s)
Genome , Genomics/methods , Vertebrates/genetics , Animals , Birds , Gene Library , Genome Size , Genome, Mitochondrial , Haplotypes , High-Throughput Nucleotide Sequencing , Molecular Sequence Annotation , Sequence Alignment , Sequence Analysis, DNA , Sex Chromosomes/genetics
6.
Proc Natl Acad Sci U S A ; 121(15): e2319506121, 2024 Apr 09.
Article in English | MEDLINE | ID: mdl-38557186

ABSTRACT

Genomes are typically mosaics of regions with different evolutionary histories. When speciation events are closely spaced in time, recombination makes the regions sharing the same history small, and the evolutionary history changes rapidly as we move along the genome. When examining rapid radiations such as the early diversification of Neoaves 66 Mya, typically no consistent history is observed across segments exceeding kilobases of the genome. Here, we report an exception. We found that a 21-Mb region in avian genomes, mapped to chicken chromosome 4, shows an extremely strong and discordance-free signal for a history different from that of the inferred species tree. Such a strong discordance-free signal, indicative of suppressed recombination across many millions of base pairs, is not observed elsewhere in the genome for any deep avian relationships. Although long regions with suppressed recombination have been documented in recently diverged species, our results pertain to relationships dating circa 65 Mya. We provide evidence that this strong signal may be due to an ancient rearrangement that blocked recombination and remained polymorphic for several million years prior to fixation. We show that the presence of this region has misled previous phylogenomic efforts with lower taxon sampling, showing the interplay between taxon and locus sampling. We predict that similar ancient rearrangements may confound phylogenetic analyses in other clades, pointing to a need for new analytical models that incorporate the possibility of such events.


Subject(s)
Biological Evolution , Genome , Animals , Phylogeny , Genome/genetics , Birds , Recombination, Genetic
7.
Nature ; 583(7817): 578-584, 2020 07.
Article in English | MEDLINE | ID: mdl-32699395

ABSTRACT

Bats possess extraordinary adaptations, including flight, echolocation, extreme longevity and unique immunity. High-quality genomes are crucial for understanding the molecular basis and evolution of these traits. Here we incorporated long-read sequencing and state-of-the-art scaffolding protocols1 to generate, to our knowledge, the first reference-quality genomes of six bat species (Rhinolophus ferrumequinum, Rousettus aegyptiacus, Phyllostomus discolor, Myotis myotis, Pipistrellus kuhlii and Molossus molossus). We integrated gene projections from our 'Tool to infer Orthologs from Genome Alignments' (TOGA) software with de novo and homology gene predictions as well as short- and long-read transcriptomics to generate highly complete gene annotations. To resolve the phylogenetic position of bats within Laurasiatheria, we applied several phylogenetic methods to comprehensive sets of orthologous protein-coding and noncoding regions of the genome, and identified a basal origin for bats within Scrotifera. Our genome-wide screens revealed positive selection on hearing-related genes in the ancestral branch of bats, which is indicative of laryngeal echolocation being an ancestral trait in this clade. We found selection and loss of immunity-related genes (including pro-inflammatory NF-κB regulators) and expansions of anti-viral APOBEC3 genes, which highlights molecular mechanisms that may contribute to the exceptional immunity of bats. Genomic integrations of diverse viruses provide a genomic record of historical tolerance to viral infection in bats. Finally, we found and experimentally validated bat-specific variation in microRNAs, which may regulate bat-specific gene-expression programs. Our reference-quality bat genomes provide the resources required to uncover and validate the genomic basis of adaptations of bats, and stimulate new avenues of research that are directly relevant to human health and disease1.


Subject(s)
Adaptation, Physiological/genetics , Chiroptera/genetics , Evolution, Molecular , Genome/genetics , Genomics/standards , Adaptation, Physiological/immunology , Animals , Chiroptera/classification , Chiroptera/immunology , DNA Transposable Elements/genetics , Immunity/genetics , Molecular Sequence Annotation/standards , Phylogeny , RNA, Untranslated/genetics , Reference Standards , Reproducibility of Results , Virus Integration/genetics , Viruses/genetics
8.
Proc Natl Acad Sci U S A ; 120(7): e2201076120, 2023 02 14.
Article in English | MEDLINE | ID: mdl-36749728

ABSTRACT

Sea turtles represent an ancient lineage of marine vertebrates that evolved from terrestrial ancestors over 100 Mya. The genomic basis of the unique physiological and ecological traits enabling these species to thrive in diverse marine habitats remains largely unknown. Additionally, many populations have drastically declined due to anthropogenic activities over the past two centuries, and their recovery is a high global conservation priority. We generated and analyzed high-quality reference genomes for the leatherback (Dermochelys coriacea) and green (Chelonia mydas) turtles, representing the two extant sea turtle families. These genomes are highly syntenic and homologous, but localized regions of noncollinearity were associated with higher copy numbers of immune, zinc-finger, and olfactory receptor (OR) genes in green turtles, with ORs related to waterborne odorants greatly expanded in green turtles. Our findings suggest that divergent evolution of these key gene families may underlie immunological and sensory adaptations assisting navigation, occupancy of neritic versus pelagic environments, and diet specialization. Reduced collinearity was especially prevalent in microchromosomes, with greater gene content, heterozygosity, and genetic distances between species, supporting their critical role in vertebrate evolutionary adaptation. Finally, diversity and demographic histories starkly contrasted between species, indicating that leatherback turtles have had a low yet stable effective population size, exhibit extremely low diversity compared with other reptiles, and harbor a higher genetic load compared with green turtles, reinforcing concern over their persistence under future climate scenarios. These genomes provide invaluable resources for advancing our understanding of evolution and conservation best practices in an imperiled vertebrate lineage.


Subject(s)
Turtles , Animals , Ecosystem , Population Dynamics
9.
Mol Biol Evol ; 41(3)2024 Mar 01.
Article in English | MEDLINE | ID: mdl-38376487

ABSTRACT

The blue whale, Balaenoptera musculus, is the largest animal known to have ever existed, making it an important case study in longevity and resistance to cancer. To further this and other blue whale-related research, we report a reference-quality, long-read-based genome assembly of this fascinating species. We assembled the genome from PacBio long reads and utilized Illumina/10×, optical maps, and Hi-C data for scaffolding, polishing, and manual curation. We also provided long read RNA-seq data to facilitate the annotation of the assembly by NCBI and Ensembl. Additionally, we annotated both haplotypes using TOGA and measured the genome size by flow cytometry. We then compared the blue whale genome with other cetaceans and artiodactyls, including vaquita (Phocoena sinus), the world's smallest cetacean, to investigate blue whale's unique biological traits. We found a dramatic amplification of several genes in the blue whale genome resulting from a recent burst in segmental duplications, though the possible connection between this amplification and giant body size requires further study. We also discovered sites in the insulin-like growth factor-1 gene correlated with body size in cetaceans. Finally, using our assembly to examine the heterozygosity and historical demography of Pacific and Atlantic blue whale populations, we found that the genomes of both populations are highly heterozygous and that their genetic isolation dates to the last interglacial period. Taken together, these results indicate how a high-quality, annotated blue whale genome will serve as an important resource for biology, evolution, and conservation research.


Subject(s)
Balaenoptera , Neoplasms , Animals , Balaenoptera/genetics , Segmental Duplications, Genomic , Genome , Demography , Neoplasms/genetics
10.
J Hered ; 115(3): 311-316, 2024 May 09.
Article in English | MEDLINE | ID: mdl-38513109

ABSTRACT

Animals living in caves are of broad relevance to evolutionary biologists interested in understanding the mechanisms underpinning convergent evolution. In the Eastern Andes of Colombia, populations from at least two distinct clades of Trichomycterus catfishes (Siluriformes) independently colonized cave environments and converged in phenotype by losing their eyes and pigmentation. We are pursuing several research questions using genomics to understand the evolutionary forces and molecular mechanisms responsible for repeated morphological changes in this system. As a foundation for such studies, here we describe a diploid, chromosome-scale, long-read reference genome for Trichomycterus rosablanca, a blind, depigmented species endemic to the karstic system of the department of Santander. The nuclear genome comprises 1 Gb in 27 chromosomes, with a 40.0× HiFi long-read genome coverage having an N50 scaffold of 40.4 Mb and N50 contig of 13.1 Mb, with 96.9% (Eukaryota) and 95.4% (Actinopterygii) universal single-copy orthologs (BUSCO). This assembly provides the first reference genome for the speciose genus Trichomycterus, serving as a key resource for research on the genomics of phenotypic evolution.


Subject(s)
Biological Evolution , Catfishes , Caves , Genome , Catfishes/genetics , Male , Animals , Sequence Analysis, DNA , Eye , Pigmentation , Chromosomes , Phenotype
11.
J Hered ; 115(2): 212-220, 2024 Mar 13.
Article in English | MEDLINE | ID: mdl-38245832

ABSTRACT

The dugong (Dugong dugon) is a marine mammal widely distributed throughout the Indo-Pacific and the Red Sea, with a Vulnerable conservation status, and little is known about many of the more peripheral populations, some of which are thought to be close to extinction. We present a de novo high-quality genome assembly for the dugong from an individual belonging to the well-monitored Moreton Bay population in Queensland, Australia. Our assembly uses long-read PacBio HiFi sequencing and Omni-C data following the Vertebrate Genome Project pipeline to reach chromosome-level contiguity (24 chromosome-level scaffolds; 3.16 Gbp) and high completeness (97.9% complete BUSCOs). We observed relatively high genome-wide heterozygosity, which likely reflects historical population abundance before the last interglacial period, approximately 125,000 yr ago. Demographic inference suggests that dugong populations began declining as sea levels fell after the last interglacial period, likely a result of population fragmentation and habitat loss due to the exposure of seagrass meadows. We find no evidence for ongoing recent inbreeding in this individual. However, runs of homozygosity indicate some past inbreeding. Our draft genome assembly will enable range-wide assessments of genetic diversity and adaptation, facilitate effective management of dugong populations, and allow comparative genomics analyses including with other sirenians, the oldest marine mammal lineage.


Subject(s)
Caniformia , Dugong , Animals , Australia , Ecosystem , Indian Ocean , Cetacea , Chromosomes
12.
BMC Biol ; 21(1): 267, 2023 11 22.
Article in English | MEDLINE | ID: mdl-37993882

ABSTRACT

BACKGROUND: The red junglefowl, the wild outgroup of domestic chickens, has historically served as a reference for genomic studies of domestic chickens. These studies have provided insight into the etiology of traits of commercial importance. However, the use of a single reference genome does not capture diversity present among modern breeds, many of which have accumulated molecular changes due to drift and selection. While reference-based resequencing is well-suited to cataloging simple variants such as single-nucleotide changes and short insertions and deletions, it is mostly inadequate to discover more complex structural variation in the genome. METHODS: We present a pangenome for the domestic chicken consisting of thirty assemblies of chickens from different breeds and research lines. RESULTS: We demonstrate how this pangenome can be used to catalog structural variants present in modern breeds and untangle complex nested variation. We show that alignment of short reads from 100 diverse wild and domestic chickens to this pangenome reduces reference bias by 38%, which affects downstream genotyping results. This approach also allows for the accurate genotyping of a large and complex pair of structural variants at the K feathering locus using short reads, which would not be possible using a linear reference. CONCLUSIONS: We expect that this new paradigm of genomic reference will allow better pinpointing of exact mutations responsible for specific phenotypes, which will in turn be necessary for breeding chickens that meet new sustainability criteria and are resilient to quickly evolving pathogen threats.


Subject(s)
Chickens , Genome , Animals , Chickens/genetics , Genotype , Sequence Analysis, DNA , Genomics
13.
Bioinformatics ; 38(17): 4214-4216, 2022 09 02.
Article in English | MEDLINE | ID: mdl-35799367

ABSTRACT

MOTIVATION: With the current pace at which reference genomes are being produced, the availability of tools that can reliably and efficiently generate genome assembly summary statistics has become critical. Additionally, with the emergence of new algorithms and data types, tools that can improve the quality of existing assemblies through automated and manual curation are required. RESULTS: We sought to address both these needs by developing gfastats, as part of the Vertebrate Genomes Project (VGP) effort to generate high-quality reference genomes at scale. Gfastats is a standalone tool to compute assembly summary statistics and manipulate assembly sequences in FASTA, FASTQ or GFA [.gz] format. Gfastats stores assembly sequences internally in a GFA-like format. This feature allows gfastats to seamlessly convert FAST* to and from GFA [.gz] files. Gfastats can also build an assembly graph that can in turn be used to manipulate the underlying sequences following instructions provided by the user, while simultaneously generating key metrics for the new sequences. AVAILABILITY AND IMPLEMENTATION: Gfastats is implemented in C++. Precompiled releases (Linux, MacOS, Windows) and commented source code for gfastats are available under MIT licence at https://github.com/vgl-hub/gfastats. Examples of how to run gfastats are provided in the GitHub. Gfastats is also available in Bioconda, in Galaxy (https://assembly.usegalaxy.eu) and as a MultiQC module (https://github.com/ewels/MultiQC). An automated test workflow is available to ensure consistency of software updates. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Genome , Software , Algorithms , Workflow , Licensure
14.
J Hered ; 114(3): 279-285, 2023 05 25.
Article in English | MEDLINE | ID: mdl-36866448

ABSTRACT

The Aeolian wall lizard, Podarcis raffonei, is an endangered species endemic to the Aeolian archipelago, Italy, where it is present only in 3 tiny islets and a narrow promontory of a larger island. Because of the extremely limited area of occupancy, severe population fragmentation and observed decline, it has been classified as Critically Endangered by the International Union for the Conservation of Nature (IUCN). Using Pacific Biosciences (PacBio) High Fidelity (HiFi) long-read sequencing, Bionano optical mapping and Arima chromatin conformation capture sequencing (Hi-C), we produced a high-quality, chromosome-scale reference genome for the Aeolian wall lizard, including Z and W sexual chromosomes. The final assembly spans 1.51 Gb across 28 scaffolds with a contig N50 of 61.4 Mb, a scaffold N50 of 93.6 Mb, and a BUSCO completeness score of 97.3%. This genome constitutes a valuable resource for the species to guide potential conservation efforts and more generally for the squamate reptiles that are underrepresented in terms of available high-quality genomic resources.


Subject(s)
Genome , Lizards , Animals , Chromosomes/genetics , Genomics , Molecular Sequence Annotation , Lizards/genetics , Sex Chromosomes
15.
BMC Biol ; 20(1): 245, 2022 11 08.
Article in English | MEDLINE | ID: mdl-36344967

ABSTRACT

BACKGROUND: The Nile rat (Avicanthis niloticus) is an important animal model because of its robust diurnal rhythm, a cone-rich retina, and a propensity to develop diet-induced diabetes without chemical or genetic modifications. A closer similarity to humans in these aspects, compared to the widely used Mus musculus and Rattus norvegicus models, holds the promise of better translation of research findings to the clinic. RESULTS: We report a 2.5 Gb, chromosome-level reference genome assembly with fully resolved parental haplotypes, generated with the Vertebrate Genomes Project (VGP). The assembly is highly contiguous, with contig N50 of 11.1 Mb, scaffold N50 of 83 Mb, and 95.2% of the sequence assigned to chromosomes. We used a novel workflow to identify 3613 segmental duplications and quantify duplicated genes. Comparative analyses revealed unique genomic features of the Nile rat, including some that affect genes associated with type 2 diabetes and metabolic dysfunctions. We discuss 14 genes that are heterozygous in the Nile rat or highly diverged from the house mouse. CONCLUSIONS: Our findings reflect the exceptional level of genomic resolution present in this assembly, which will greatly expand the potential of the Nile rat as a model organism.


Subject(s)
Diabetes Mellitus, Type 2 , Humans , Animals , Haplotypes , Diabetes Mellitus, Type 2/genetics , Murinae , Genome , Genomics
16.
Int J Mol Sci ; 24(19)2023 Oct 01.
Article in English | MEDLINE | ID: mdl-37834264

ABSTRACT

The European mink Mustela lutreola (Mustelidae) ranks among the most endangered mammalian species globally, experiencing a rapid and severe decline in population size, density, and distribution. Given the critical need for effective conservation strategies, understanding its genomic characteristics becomes paramount. To address this challenge, the platinum-quality, chromosome-level reference genome assembly for the European mink was successfully generated under the project of the European Mink Centre consortium. Leveraging PacBio HiFi long reads, we obtained a 2586.3 Mbp genome comprising 25 scaffolds, with an N50 length of 154.1 Mbp. Through Hi-C data, we clustered and ordered the majority of the assembly (>99.9%) into 20 chromosomal pseudomolecules, including heterosomes, ranging from 6.8 to 290.1 Mbp. The newly sequenced genome displays a GC base content of 41.9%. Additionally, we successfully assembled the complete mitochondrial genome, spanning 16.6 kbp in length. The assembly achieved a BUSCO (Benchmarking Universal Single-Copy Orthologs) completeness score of 98.2%. This high-quality reference genome serves as a valuable genomic resource for future population genomics studies concerning the European mink and related taxa. Furthermore, the newly assembled genome holds significant potential in addressing key conservation challenges faced by M. lutreola. Its applications encompass potential revision of management units, assessment of captive breeding impacts, resolution of phylogeographic questions, and facilitation of monitoring and evaluating the efficiency and effectiveness of dedicated conservation strategies for the European mink. This species serves as an example that highlights the paramount importance of prioritizing endangered species in genome sequencing projects due to the race against time, which necessitates the comprehensive exploration and characterization of their genomic resources before their populations face extinction.


Subject(s)
Endangered Species , Mink , Animals , Mink/genetics , Platinum , Conservation of Natural Resources , Genomics
17.
Am J Primatol ; 83(6): e23255, 2021 06.
Article in English | MEDLINE | ID: mdl-33792947

ABSTRACT

The novel coronavirus SARS-CoV-2, which in humans leads to the disease COVID-19, has caused global disruption and more than 2 million fatalities since it first emerged in late 2019. As we write, infection rates are at their highest point globally and are rising extremely rapidly in some areas due to more infectious variants. The primary target of SARS-CoV-2 is the cellular receptor angiotensin-converting enzyme-2 (ACE2). Recent sequence analyses of the ACE2 gene predict that many nonhuman primates are also likely to be highly susceptible to infection. However, the anticipated risk is not equal across the Order. Furthermore, some taxonomic groups show high ACE2 amino acid conservation, while others exhibit high variability at this locus. As an example of the latter, analyses of strepsirrhine primate ACE2 sequences to date indicate large variation among lemurs and lorises compared to other primate clades despite low sampling effort. Here, we report ACE2 gene and protein sequences for 71 individual strepsirrhines, spanning 51 species and 19 genera. Our study reinforces previous results while finding additional variability in other strepsirrhine species, and suggests several clades of lemurs have high potential susceptibility to SARS-CoV-2 infection. Troublingly, some species, including the rare and endangered aye-aye (Daubentonia madagascariensis), as well as those in the genera Avahi and Propithecus, may be at high risk. Given that lemurs are endemic to Madagascar and among the primates at highest risk of extinction globally, further understanding of the potential threat of COVID-19 to their health should be a conservation priority. All feasible actions should be taken to limit their exposure to SARS-CoV-2.


Subject(s)
COVID-19/veterinary , Lemur , Lorisidae , Primate Diseases/epidemiology , Angiotensin-Converting Enzyme 2/chemistry , Angiotensin-Converting Enzyme 2/genetics , Animals , COVID-19/epidemiology , Lemur/genetics , Lorisidae/genetics , Primate Diseases/virology , Risk Factors
18.
FASEB J ; 33(12): 13825-13836, 2019 12.
Article in English | MEDLINE | ID: mdl-31604057

ABSTRACT

The zebra finch has been used as a valuable vocal learning animal model for human spoken language. It is representative of vocal learning songbirds specifically, which comprise half of all bird species, and of Neoaves broadly, which comprise 95% of all bird species. Although transgenesis in the zebra finch has been accomplished, it is with a very low efficiency of germ-line transmission and far from the efficiency with a more genetically tractable but vocal nonlearning species, the chicken (a Galloanseriformes). To improve germ-line transmission in the zebra finch, we identified and characterized its primordial germ cells (PGCs) and compared them with chicken. We found striking differences between the 2 species, including that zebra finch PGCs were more numerous, more widely distributed in early embryos before colonization into the gonads, had slower timing of colonization, and had a different developmental gene-expression program. We improved conditions for isolating and culturing zebra finch PGCs in vitro and were able to transfect them with gene-expression vectors and incorporate them into the gonads of host embryos. Our findings demonstrate important differences in the PGCs of the zebra finch and advance the first stage of creating PGC-mediated germ-line transgenics of a vocal learning species.-Jung, K. M., Kim, Y. M., Keyte, A. L., Biegler, M. T., Rengaraj, D., Lee, H. J., Mello, C. V., Velho, T. A. F., Fedrigo, O., Haase, B., Jarvis, E. D., Han, J. Y. Identification and characterization of primordial germ cells in a vocal learning Neoaves species, the zebra finch.


Subject(s)
Finches/physiology , Germ Cells/physiology , Learning/physiology , Animals , Disease Models, Animal , Embryo, Nonmammalian/physiology , Female , Gene Expression/physiology , Male
SELECTION OF CITATIONS
SEARCH DETAIL