Results 1 - 5 of 5
1.
Proc Natl Acad Sci U S A ; 121(15): e2319506121, 2024 Apr 09.
Article in English | MEDLINE | ID: mdl-38557186

ABSTRACT

Genomes are typically mosaics of regions with different evolutionary histories. When speciation events are closely spaced in time, recombination makes the regions sharing the same history small, and the evolutionary history changes rapidly as we move along the genome. When examining rapid radiations such as the early diversification of Neoaves 66 Mya, typically no consistent history is observed across segments exceeding kilobases of the genome. Here, we report an exception. We found that a 21-Mb region in avian genomes, mapped to chicken chromosome 4, shows an extremely strong and discordance-free signal for a history different from that of the inferred species tree. Such a strong discordance-free signal, indicative of suppressed recombination across many millions of base pairs, is not observed elsewhere in the genome for any deep avian relationships. Although long regions with suppressed recombination have been documented in recently diverged species, our results pertain to relationships dating circa 65 Mya. We provide evidence that this strong signal may be due to an ancient rearrangement that blocked recombination and remained polymorphic for several million years prior to fixation. We show that the presence of this region has misled previous phylogenomic efforts with lower taxon sampling, showing the interplay between taxon and locus sampling. We predict that similar ancient rearrangements may confound phylogenetic analyses in other clades, pointing to a need for new analytical models that incorporate the possibility of such events.


Subject(s)
Biological Evolution , Genome , Animals , Phylogeny , Genome/genetics , Birds , Recombination, Genetic
3.
bioRxiv ; 2023 Jun 30.
Article in English | MEDLINE | ID: mdl-37425881

ABSTRACT

Improvements in genome sequencing and assembly are enabling high-quality reference genomes for all species. However, the assembly process is still laborious, computationally and technically demanding, lacks standards for reproducibility, and is not readily scalable. Here we present the latest Vertebrate Genomes Project assembly pipeline and demonstrate that it delivers high-quality reference genomes at scale across a set of vertebrate species arising over the last ~500 million years. The pipeline is versatile and combines PacBio HiFi long-reads and Hi-C-based haplotype phasing in a new graph-based paradigm. Standardized quality control is performed automatically to troubleshoot assembly issues and assess biological complexities. We make the pipeline freely accessible through Galaxy, accommodating researchers even without local computational resources and enhancing reproducibility by democratizing the training and assembly process. We demonstrate the flexibility and reliability of the pipeline by assembling reference genomes for 51 vertebrate species from major taxonomic groups (fish, amphibians, reptiles, birds, and mammals).

4.
G3 (Bethesda) ; 13(7)2023 07 05.
Article in English | MEDLINE | ID: mdl-37141262

ABSTRACT

The Rock Ptarmigan (Lagopus muta) is a cold-adapted, largely sedentary, game bird with a Holarctic distribution. The species represents an important example of an organism likely to be affected by ongoing climatic shifts across a disparate range. We provide here a high-quality reference genome and mitogenome for the Rock Ptarmigan assembled from PacBio HiFi and Hi-C sequencing of a female bird from Iceland. The total size of the genome is 1.03 Gb with a scaffold N50 of 71.23 Mb and a contig N50 of 17.91 Mb. The final scaffolds represent all 40 predicted chromosomes and the mitochondria, with a BUSCO score of 98.6%. Gene annotation resulted in 16,078 protein-coding genes out of a total 19,831 predicted (81.08% excluding pseudogenes). The genome included 21.07% repeat sequences, and the average lengths of genes, exons, and introns were 33,605, 394, and 4,265 bp, respectively. The availability of a new reference-quality genome will contribute to understanding the Rock Ptarmigan's unique evolutionary history, vulnerability to climate change, and demographic trajectories around the globe while serving as a benchmark for species in the family Phasianidae (order Galliformes).


Subject(s)
Galliformes , Quail , Animals , Female , Galliformes/genetics , Repetitive Sequences, Nucleic Acid , Chromosomes/genetics , Genome , Phylogeny
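
The abstract above reports both a scaffold N50 and a contig N50. As a rough illustration of how the two differ, the following Python sketch computes N50 from a list of sequence lengths and derives contigs by splitting scaffolds at runs of N. The function names and the toy sequences are hypothetical, and the gap convention (any run of N marks a gap) is an assumption; this is not the method used in the paper.

```python
import re


def n50(lengths):
    """Return the N50: the length L such that sequences of length >= L
    together cover at least half of the total assembly length."""
    if not lengths:
        raise ValueError("need at least one sequence length")
    half_total = sum(lengths) / 2
    running = 0
    # Walk from the longest sequence down until half the assembly is covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if running >= half_total:
            return length


def split_into_contigs(scaffold):
    """Split a scaffold at runs of N (assumed here to mark gaps)."""
    return [piece for piece in re.split(r"[Nn]+", scaffold) if piece]


# Toy scaffolds (hypothetical data): the first contains one 4-bp gap.
scaffolds = ["ACGT" * 5 + "NNNN" + "ACGT" * 3, "ACGTACGT", "ACG"]

scaffold_lengths = [len(s) for s in scaffolds]
contig_lengths = [len(c) for s in scaffolds for c in split_into_contigs(s)]

print("scaffold N50:", n50(scaffold_lengths))  # 36
print("contig N50:", n50(contig_lengths))      # 12
```

Because splitting at gaps can only shorten sequences, the contig N50 is never larger than the scaffold N50 of the same assembly, which matches the ordering of the two values reported in the abstract.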
5.
Bioinformatics ; 38(17): 4214-4216, 2022 09 02.
Article in English | MEDLINE | ID: mdl-35799367

ABSTRACT

MOTIVATION: With the current pace at which reference genomes are being produced, the availability of tools that can reliably and efficiently generate genome assembly summary statistics has become critical. Additionally, with the emergence of new algorithms and data types, tools that can improve the quality of existing assemblies through automated and manual curation are required. RESULTS: We sought to address both these needs by developing gfastats, as part of the Vertebrate Genomes Project (VGP) effort to generate high-quality reference genomes at scale. Gfastats is a standalone tool to compute assembly summary statistics and manipulate assembly sequences in FASTA, FASTQ or GFA [.gz] format. Gfastats stores assembly sequences internally in a GFA-like format. This feature allows gfastats to seamlessly convert FAST* to and from GFA [.gz] files. Gfastats can also build an assembly graph that can in turn be used to manipulate the underlying sequences following instructions provided by the user, while simultaneously generating key metrics for the new sequences. AVAILABILITY AND IMPLEMENTATION: Gfastats is implemented in C++. Precompiled releases (Linux, MacOS, Windows) and commented source code for gfastats are available under MIT licence at https://github.com/vgl-hub/gfastats. Examples of how to run gfastats are provided in the GitHub repository. Gfastats is also available in Bioconda, in Galaxy (https://assembly.usegalaxy.eu) and as a MultiQC module (https://github.com/ewels/MultiQC). An automated test workflow is available to ensure consistency of software updates. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Genome , Software , Algorithms , Workflow , Licensure
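
The abstract above mentions converting FASTA to and from GFA. To give a feel for what such a conversion involves, here is a minimal Python sketch that writes each FASTA record as a GFA 1 segment (`S`) line and reads segment lines back into FASTA. It is not gfastats code and does not reflect its internals: the function names are hypothetical, only the header and segment lines are handled (no links, paths, or gaps), and FASTA descriptions after the identifier are assumed to be disposable.

```python
def parse_fasta(text):
    """Yield (name, sequence) pairs from FASTA-formatted text."""
    name, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if name is not None:
                yield name, "".join(chunks)
            fields = line[1:].split()
            if not fields:
                raise ValueError("FASTA header without an identifier")
            # Keep only the identifier; the description is dropped (assumption).
            name, chunks = fields[0], []
        elif name is not None:
            chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


def fasta_to_gfa(text):
    """Write each FASTA record as a GFA 1 segment line with a length tag."""
    lines = ["H\tVN:Z:1.0"]
    for name, seq in parse_fasta(text):
        # In GFA, '*' stands for an absent sequence.
        lines.append(f"S\t{name}\t{seq or '*'}\tLN:i:{len(seq)}")
    return "\n".join(lines) + "\n"


def gfa_to_fasta(text, width=60):
    """Read GFA segment lines back into FASTA, wrapping at `width` columns."""
    out = []
    for line in text.splitlines():
        fields = line.split("\t")
        if fields[0] != "S" or len(fields) < 3:
            continue  # Skip headers, links, and anything that is not a segment.
        name, seq = fields[1], "" if fields[2] == "*" else fields[2]
        out.append(f">{name}")
        out.extend(seq[i:i + width] for i in range(0, len(seq), width))
    return "\n".join(out) + "\n"


# Toy input (hypothetical data).
fasta = ">scaffold_1 demo\nACGT\nACGT\n>scaffold_2\nGGCC\n"
gfa = fasta_to_gfa(fasta)
print(gfa, end="")
print(gfa_to_fasta(gfa), end="")
```

The round trip preserves identifiers and sequences but not the original line wrapping or header descriptions, which is one reason a real tool needs a richer internal representation than this sketch.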