Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 15 de 15
Filtrar
1.
J Hered ; 115(3): 311-316, 2024 May 09.
Artigo em Inglês | MEDLINE | ID: mdl-38513109

RESUMO

Animals living in caves are of broad relevance to evolutionary biologists interested in understanding the mechanisms underpinning convergent evolution. In the Eastern Andes of Colombia, populations from at least two distinct clades of Trichomycterus catfishes (Siluriformes) independently colonized cave environments and converged in phenotype by losing their eyes and pigmentation. We are pursuing several research questions using genomics to understand the evolutionary forces and molecular mechanisms responsible for repeated morphological changes in this system. As a foundation for such studies, here we describe a diploid, chromosome-scale, long-read reference genome for Trichomycterus rosablanca, a blind, depigmented species endemic to the karstic system of the department of Santander. The nuclear genome comprises 1 Gb in 27 chromosomes, with a 40.0× HiFi long-read genome coverage having an N50 scaffold of 40.4 Mb and N50 contig of 13.1 Mb, with 96.9% (Eukaryota) and 95.4% (Actinopterygii) universal single-copy orthologs (BUSCO). This assembly provides the first reference genome for the speciose genus Trichomycterus, serving as a key resource for research on the genomics of phenotypic evolution.


Assuntos
Evolução Biológica , Peixes-Gato , Cavernas , Genoma , Peixes-Gato/genética , Masculino , Animais , Análise de Sequência de DNA , Olho , Pigmentação , Cromossomos , Fenótipo
2.
J Hered ; 115(2): 212-220, 2024 Mar 13.
Artigo em Inglês | MEDLINE | ID: mdl-38245832

RESUMO

The dugong (Dugong dugon) is a marine mammal widely distributed throughout the Indo-Pacific and the Red Sea, with a Vulnerable conservation status, and little is known about many of the more peripheral populations, some of which are thought to be close to extinction. We present a de novo high-quality genome assembly for the dugong from an individual belonging to the well-monitored Moreton Bay population in Queensland, Australia. Our assembly uses long-read PacBio HiFi sequencing and Omni-C data following the Vertebrate Genome Project pipeline to reach chromosome-level contiguity (24 chromosome-level scaffolds; 3.16 Gbp) and high completeness (97.9% complete BUSCOs). We observed relatively high genome-wide heterozygosity, which likely reflects historical population abundance before the last interglacial period, approximately 125,000 yr ago. Demographic inference suggests that dugong populations began declining as sea levels fell after the last interglacial period, likely a result of population fragmentation and habitat loss due to the exposure of seagrass meadows. We find no evidence for ongoing recent inbreeding in this individual. However, runs of homozygosity indicate some past inbreeding. Our draft genome assembly will enable range-wide assessments of genetic diversity and adaptation, facilitate effective management of dugong populations, and allow comparative genomics analyses including with other sirenians, the oldest marine mammal lineage.


Assuntos
Caniformia , Dugong , Animais , Austrália , Ecossistema , Oceano Índico , Cetáceos , Cromossomos
3.
BMC Bioinformatics ; 24(1): 288, 2023 Jul 18.
Artigo em Inglês | MEDLINE | ID: mdl-37464285

RESUMO

BACKGROUND:  PacBio high fidelity (HiFi) sequencing reads are both long (15-20 kb) and highly accurate (> Q20). Because of these properties, they have revolutionised genome assembly leading to more accurate and contiguous genomes. In eukaryotes the mitochondrial genome is sequenced alongside the nuclear genome often at very high coverage. A dedicated tool for mitochondrial genome assembly using HiFi reads is still missing. RESULTS:  MitoHiFi was developed within the Darwin Tree of Life Project to assemble mitochondrial genomes from the HiFi reads generated for target species. The input for MitoHiFi is either the raw reads or the assembled contigs, and the tool outputs a mitochondrial genome sequence fasta file along with annotation of protein and RNA genes. Variants arising from heteroplasmy are assembled independently, and nuclear insertions of mitochondrial sequences are identified and not used in organellar genome assembly. MitoHiFi has been used to assemble 374 mitochondrial genomes (368 Metazoa and 6 Fungi species) for the Darwin Tree of Life Project, the Vertebrate Genomes Project and the Aquatic Symbiosis Genome Project. Inspection of 60 mitochondrial genomes assembled with MitoHiFi for species that already have reference sequences in public databases showed the widespread presence of previously unreported repeats. CONCLUSIONS:  MitoHiFi is able to assemble mitochondrial genomes from a wide phylogenetic range of taxa from Pacbio HiFi data. MitoHiFi is written in python and is freely available on GitHub ( https://github.com/marcelauliano/MitoHiFi ). MitoHiFi is available with its dependencies as a Docker container on GitHub (ghcr.io/marcelauliano/mitohifi:master).


Assuntos
Genoma Mitocondrial , Filogenia , RNA , Eucariotos , Análise de Sequência de DNA , Sequenciamento de Nucleotídeos em Larga Escala
4.
Bioinformatics ; 38(17): 4214-4216, 2022 09 02.
Artigo em Inglês | MEDLINE | ID: mdl-35799367

RESUMO

MOTIVATION: With the current pace at which reference genomes are being produced, the availability of tools that can reliably and efficiently generate genome assembly summary statistics has become critical. Additionally, with the emergence of new algorithms and data types, tools that can improve the quality of existing assemblies through automated and manual curation are required. RESULTS: We sought to address both these needs by developing gfastats, as part of the Vertebrate Genomes Project (VGP) effort to generate high-quality reference genomes at scale. Gfastats is a standalone tool to compute assembly summary statistics and manipulate assembly sequences in FASTA, FASTQ or GFA [.gz] format. Gfastats stores assembly sequences internally in a GFA-like format. This feature allows gfastats to seamlessly convert FAST* to and from GFA [.gz] files. Gfastats can also build an assembly graph that can in turn be used to manipulate the underlying sequences following instructions provided by the user, while simultaneously generating key metrics for the new sequences. AVAILABILITY AND IMPLEMENTATION: Gfastats is implemented in C++. Precompiled releases (Linux, MacOS, Windows) and commented source code for gfastats are available under MIT licence at https://github.com/vgl-hub/gfastats. Examples of how to run gfastats are provided in the GitHub. Gfastats is also available in Bioconda, in Galaxy (https://assembly.usegalaxy.eu) and as a MultiQC module (https://github.com/ewels/MultiQC). An automated test workflow is available to ensure consistency of software updates. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Genoma , Software , Algoritmos , Fluxo de Trabalho , Licenciamento
5.
Int J Mol Sci ; 24(19)2023 Oct 01.
Artigo em Inglês | MEDLINE | ID: mdl-37834264

RESUMO

The European mink Mustela lutreola (Mustelidae) ranks among the most endangered mammalian species globally, experiencing a rapid and severe decline in population size, density, and distribution. Given the critical need for effective conservation strategies, understanding its genomic characteristics becomes paramount. To address this challenge, the platinum-quality, chromosome-level reference genome assembly for the European mink was successfully generated under the project of the European Mink Centre consortium. Leveraging PacBio HiFi long reads, we obtained a 2586.3 Mbp genome comprising 25 scaffolds, with an N50 length of 154.1 Mbp. Through Hi-C data, we clustered and ordered the majority of the assembly (>99.9%) into 20 chromosomal pseudomolecules, including heterosomes, ranging from 6.8 to 290.1 Mbp. The newly sequenced genome displays a GC base content of 41.9%. Additionally, we successfully assembled the complete mitochondrial genome, spanning 16.6 kbp in length. The assembly achieved a BUSCO (Benchmarking Universal Single-Copy Orthologs) completeness score of 98.2%. This high-quality reference genome serves as a valuable genomic resource for future population genomics studies concerning the European mink and related taxa. Furthermore, the newly assembled genome holds significant potential in addressing key conservation challenges faced by M. lutreola. Its applications encompass potential revision of management units, assessment of captive breeding impacts, resolution of phylogeographic questions, and facilitation of monitoring and evaluating the efficiency and effectiveness of dedicated conservation strategies for the European mink. This species serves as an example that highlights the paramount importance of prioritizing endangered species in genome sequencing projects due to the race against time, which necessitates the comprehensive exploration and characterization of their genomic resources before their populations face extinction.


Assuntos
Espécies em Perigo de Extinção , Vison , Animais , Vison/genética , Platina , Conservação dos Recursos Naturais , Genômica
6.
Wellcome Open Res ; 9: 361, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-39239167

RESUMO

We present a reference genome assembly from an individual male Rhynchonycteris naso (Chordata; Mammalia; Chiroptera; Emballonuridae). The genome sequence is 2.46 Gb in span. The majority of the assembly is scaffolded into 22 chromosomal pseudomolecules, with the Y sex chromosome assembled.

7.
Wellcome Open Res ; 9: 522, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-39411461

RESUMO

We present a genome assembly from an individual female Molossus alvarezi (Chordata; Mammalia; Chiroptera; Molossidae). The genome sequence is 2.490 Gb in span. The majority of the assembly is scaffolded into 24 chromosomal pseudomolecules, with the X sex chromosomes assembled.

8.
Sci Data ; 11(1): 176, 2024 Feb 07.
Artigo em Inglês | MEDLINE | ID: mdl-38326333

RESUMO

Suncus etruscus is one of the world's smallest mammals, with an average body mass of about 2 grams. The Etruscan shrew's small body is accompanied by a very high energy demand and numerous metabolic adaptations. Here we report a chromosome-level genome assembly using PacBio long read sequencing, 10X Genomics linked short reads, optical mapping, and Hi-C linked reads. The assembly is partially phased, with the 2.472 Gbp primary pseudohaplotype and 1.515 Gbp alternate. We manually curated the primary assembly and identified 22 chromosomes, including X and Y sex chromosomes. The NCBI genome annotation pipeline identified 39,091 genes, 19,819 of them protein-coding. We also identified segmental duplications, inferred GO term annotations, and computed orthologs of human and mouse genes. This reference-quality genome will be an important resource for research on mammalian development, metabolism, and body size control.


Assuntos
Cromossomos , Musaranhos , Animais , Camundongos , Cromossomos/genética , Genoma , Genômica , Anotação de Sequência Molecular , Musaranhos/genética
9.
G3 (Bethesda) ; 13(7)2023 07 05.
Artigo em Inglês | MEDLINE | ID: mdl-37141262

RESUMO

The Rock Ptarmigan (Lagopus muta) is a cold-adapted, largely sedentary, game bird with a Holarctic distribution. The species represents an important example of an organism likely to be affected by ongoing climatic shifts across a disparate range. We provide here a high-quality reference genome and mitogenome for the Rock Ptarmigan assembled from PacBio HiFi and Hi-C sequencing of a female bird from Iceland. The total size of the genome is 1.03 Gb with a scaffold N50 of 71.23 Mb and a contig N50 of 17.91 Mb. The final scaffolds represent all 40 predicted chromosomes, and the mitochondria with a BUSCO score of 98.6%. Gene annotation resulted in 16,078 protein-coding genes out of a total 19,831 predicted (81.08% excluding pseudogenes). The genome included 21.07% repeat sequences, and the average length of genes, exons, and introns were 33605, 394, and 4265 bp, respectively. The availability of a new reference-quality genome will contribute to understanding the Rock Ptarmigan's unique evolutionary history, vulnerability to climate change, and demographic trajectories around the globe while serving as a benchmark for species in the family Phasianidae (order Galliformes).


Assuntos
Galliformes , Codorniz , Animais , Feminino , Galliformes/genética , Sequências Repetitivas de Ácido Nucleico , Cromossomos/genética , Genoma , Filogenia
10.
Sci Data ; 10(1): 880, 2023 Dec 08.
Artigo em Inglês | MEDLINE | ID: mdl-38066002

RESUMO

Chub mackerels (Scomber japonicus) are a migratory marine fish widely distributed in the Indo-Pacific Ocean. They are globally consumed for their high Omega-3 content, but their population is declining due to global warming. Here, we generated the first chromosome-level genome assembly of chub mackerel (fScoJap1) using the Vertebrate Genomes Project assembly pipeline with PacBio HiFi genomic sequencing and Arima Hi-C chromosome contact data. The final assembly is 828.68 Mb with 24 chromosomes, nearly all containing telomeric repeats at their ends. We annotated 31,656 genes and discovered that approximately 2.19% of the genome contained DNA transposon elements repressed within duplicated genes. Analyzing 5-methylcytosine (5mC) modifications using HiFi reads, we observed open/close chromatin patterns at gene promoters, including the FADS2 gene involved in Omega-3 production. This chromosome-level reference genome provides unprecedented opportunities for advancing our knowledge of chub mackerels in biology, industry, and conservation.


Assuntos
Cyprinidae , Genoma , Perciformes , Animais , Cromossomos , Cyprinidae/genética , Oceano Pacífico , Perciformes/genética
11.
Cell Rep ; 42(1): 111992, 2023 01 31.
Artigo em Inglês | MEDLINE | ID: mdl-36662619

RESUMO

Insights into the evolution of non-model organisms are limited by the lack of reference genomes of high accuracy, completeness, and contiguity. Here, we present a chromosome-level, karyotype-validated reference genome and pangenome for the barn swallow (Hirundo rustica). We complement these resources with a reference-free multialignment of the reference genome with other bird genomes and with the most comprehensive catalog of genetic markers for the barn swallow. We identify potentially conserved and accelerated genes using the multialignment and estimate genome-wide linkage disequilibrium using the catalog. We use the pangenome to infer core and accessory genes and to detect variants using it as a reference. Overall, these resources will foster population genomics studies in the barn swallow, enable detection of candidate genes in comparative genomics studies, and help reduce bias toward a single reference genome.


Assuntos
Andorinhas , Animais , Andorinhas/genética , Metagenômica , Genoma/genética , Genômica , Cromossomos
12.
bioRxiv ; 2023 Jun 30.
Artigo em Inglês | MEDLINE | ID: mdl-37425881

RESUMO

Improvements in genome sequencing and assembly are enabling high-quality reference genomes for all species. However, the assembly process is still laborious, computationally and technically demanding, lacks standards for reproducibility, and is not readily scalable. Here we present the latest Vertebrate Genomes Project assembly pipeline and demonstrate that it delivers high-quality reference genomes at scale across a set of vertebrate species arising over the last ~500 million years. The pipeline is versatile and combines PacBio HiFi long-reads and Hi-C-based haplotype phasing in a new graph-based paradigm. Standardized quality control is performed automatically to troubleshoot assembly issues and assess biological complexities. We make the pipeline freely accessible through Galaxy, accommodating researchers even without local computational resources and enhanced reproducibility by democratizing the training and assembly process. We demonstrate the flexibility and reliability of the pipeline by assembling reference genomes for 51 vertebrate species from major taxonomic groups (fish, amphibians, reptiles, birds, and mammals).

13.
Ecol Evol ; 11(23): 17191-17201, 2021 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-34938502

RESUMO

Understanding the forces that drive genotypic and phenotypic change in wild populations is a central goal of evolutionary biology. We examined exome variation in populations of deer mice from two of the California Channel Islands: Peromyscus maniculatus elusus from Santa Barbara Island and P. m. santacruzae from Santa Cruz Island exhibit significant differences in olfactory predator recognition, activity timing, aggressive behavior, morphology, prevalence of Sin Nombre virus, and population densities. We characterized variation in protein-coding regions using exome capture and sequencing of 25 mice from Santa Barbara Island and 22 mice from Santa Cruz Island. We identified and examined 386,256 SNPs using three complementary methods (BayeScan, pcadapt, and LFMM). We found strong differences in molecular variation between the two populations and 710 outlier SNPs in protein-coding genes that were detected by all three methods. We identified 35 candidate genes from this outlier set that were related to differences in phenotypes between island populations. Enrichment analyses demonstrated that patterns of molecular variation were associated with biological processes related to response to chemical stimuli and regulation of immune processes. Candidate genes associated with olfaction (Gfy, Tlr2, Vmn13r2, numerous olfactory receptor genes), circadian activity (Cry1), anxiety (Brca1), immunity (Cd28, Eif2ak4, Il12a, Syne1), aggression (Cyp19a, Lama2), and body size (Bc16, Syne1) exhibited non-synonymous mutations predicted to have moderate to large effects. Variation in olfaction-related genes, including a stop codon in the Santa Barbara Island population, suggests loss of predator-recognition traits at the molecular level, consistent with a lack of behavioral aversion to fox feces. These findings also suggest that divergent pathogen prevalence and population density may have influenced adaptive immunity and behavioral phenotypes, such as reduced aggression. Overall, our study indicates that ecological differences between islands are associated with signatures of selection in protein-coding genes underlying phenotypes that promote success in those environments.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA