Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 9 de 9
Filtrar
Mais filtros








Base de dados
Intervalo de ano de publicação
2.
bioRxiv ; 2023 Jun 30.
Artigo em Inglês | MEDLINE | ID: mdl-37425881

RESUMO

Improvements in genome sequencing and assembly are enabling high-quality reference genomes for all species. However, the assembly process is still laborious, computationally and technically demanding, lacks standards for reproducibility, and is not readily scalable. Here we present the latest Vertebrate Genomes Project assembly pipeline and demonstrate that it delivers high-quality reference genomes at scale across a set of vertebrate species arising over the last ~500 million years. The pipeline is versatile and combines PacBio HiFi long-reads and Hi-C-based haplotype phasing in a new graph-based paradigm. Standardized quality control is performed automatically to troubleshoot assembly issues and assess biological complexities. We make the pipeline freely accessible through Galaxy, accommodating researchers even without local computational resources and enhanced reproducibility by democratizing the training and assembly process. We demonstrate the flexibility and reliability of the pipeline by assembling reference genomes for 51 vertebrate species from major taxonomic groups (fish, amphibians, reptiles, birds, and mammals).

3.
iScience ; 26(3): 106236, 2023 Mar 17.
Artigo em Inglês | MEDLINE | ID: mdl-36915682

RESUMO

Today, breeds with ornamental traits such as exceptionally long tail feathers are economically valuable. However, the genetic basis of long-tail feathers is yet to be understood. To provide better understanding of long tail feathers, we sequenced Korean long-tailed chicken (KLC) genomes and compared them with genomes of other chicken breeds. We first analyzed the genome structure of KLC and its genomic relationship with other chickens and observed unique characteristics. Subsequently, we searched for genomic regions under selection. Feather keratin 1-like enriched region and several genes were found to have novel putative functions and effects on the long tail trait in KLC. Our findings support the value of KLC as a unique genetic resource and cast light on the genetic basis of long tail traits in avian species. We expect this novel knowledge to provide new genomic evidence and options for designing and implementing genetic improvements of ornamental chicken productivity through precision crossbreeding aids.

4.
Genome Biol ; 23(1): 204, 2022 09 27.
Artigo em Inglês | MEDLINE | ID: mdl-36167554

RESUMO

BACKGROUND: Many short-read genome assemblies have been found to be incomplete and contain mis-assemblies. The Vertebrate Genomes Project has been producing new reference genome assemblies with an emphasis on being as complete and error-free as possible, which requires utilizing long reads, long-range scaffolding data, new assembly algorithms, and manual curation. A more thorough evaluation of the recent references relative to prior assemblies can provide a detailed overview of the types and magnitude of improvements. RESULTS: Here we evaluate new vertebrate genome references relative to the previous assemblies for the same species and, in two cases, the same individuals, including a mammal (platypus), two birds (zebra finch, Anna's hummingbird), and a fish (climbing perch). We find that up to 11% of genomic sequence is entirely missing in the previous assemblies. In the Vertebrate Genomes Project zebra finch assembly, we identify eight new GC- and repeat-rich micro-chromosomes with high gene density. The impact of missing sequences is biased towards GC-rich 5'-proximal promoters and 5' exon regions of protein-coding genes and long non-coding RNAs. Between 26 and 60% of genes include structural or sequence errors that could lead to misunderstanding of their function when using the previous genome assemblies. CONCLUSIONS: Our findings reveal novel regulatory landscapes and protein coding sequences that have been greatly underestimated in previous assemblies and are now present in the Vertebrate Genomes Project reference genomes.


Assuntos
Genoma , Vertebrados , Animais , Composição de Bases/genética , Cromossomos , Genoma/genética , Análise de Sequência de DNA , Vertebrados/genética
5.
Genome Biol ; 23(1): 205, 2022 09 27.
Artigo em Inglês | MEDLINE | ID: mdl-36167596

RESUMO

BACKGROUND: False duplications in genome assemblies lead to false biological conclusions. We quantified false duplications in popularly used previous genome assemblies for platypus, zebra finch, and Anna's Hummingbird, and their new counterparts of the same species generated by the Vertebrate Genomes Project, of which the Vertebrate Genomes Project pipeline attempted to eliminate false duplications through haplotype phasing and purging. These assemblies are among the first generated by the Vertebrate Genomes Project where there was a prior chromosomal level reference assembly to compare with. RESULTS: Whole genome alignments revealed that 4 to 16% of the sequences are falsely duplicated in the previous assemblies, impacting hundreds to thousands of genes. These lead to overestimated gene family expansions. The main source of the false duplications is heterotype duplications, where the haplotype sequences were relatively more divergent than other parts of the genome leading the assembly algorithms to classify them as separate genes or genomic regions. A minor source is sequencing errors. Ancient ATP nucleotide binding gene families have a higher prevalence of false duplications compared to other gene families. Although present in a smaller proportion, we observe false duplications remaining in the Vertebrate Genomes Project assemblies that can be identified and purged. CONCLUSIONS: This study highlights the need for more advanced assembly methods that better separate haplotypes and sequence errors, and the need for cautious analyses on gene gains.


Assuntos
Genoma , Genômica , Trifosfato de Adenosina , Animais , Duplicação Gênica , Nucleotídeos , Análise de Sequência de DNA/métodos , Vertebrados/genética
6.
BMC Genomics ; 22(1): 594, 2021 Aug 05.
Artigo em Inglês | MEDLINE | ID: mdl-34348642

RESUMO

BACKGROUND: Annual molt is a critical stage in the life cycle of birds. Although the most extensively documented aspects of molt are the renewing of plumage and the remodeling of the reproductive tract in laying hens, in chicken, molt deeply affects various tissues and physiological functions. However, with exception of the reproductive tract, the effect of molt on gene expression across the tissues known to be affected by molt has to date never been investigated. The present study aimed to decipher the transcriptomic effects of molt in Ginkkoridak, a Korean long-tailed chicken. Messenger RNA data available across 24 types of tissue samples (9 males) and a combination of mRNA and miRNA data on 10 males and 10 females blood were used. RESULTS: The impact of molt on gene expression and gene transcript usage appeared to vary substantially across tissues types in terms of histological entities or physiological functions particularly related to nervous system. Blood was the tissue most affected by molt in terms of differentially expressed genes in both sexes, closely followed by meninges, bone marrow and heart. The effect of molt in blood appeared to differ between males and females, with a more than fivefold difference in the number of down-regulated genes between both sexes. The blueprint of molt in roosters appeared to be specific to tissues or group of tissues, with relatively few genes replicating extensively across tissues, excepted for the spliceosome genes (U1, U4) and the ribosomal proteins (RPL21, RPL23). By integrating miRNA and mRNA data, when chickens molt, potential roles of miRNA were discovered such as regulation of neurogenesis, regulation of immunity and development of various organs. Furthermore, reliable candidate biomarkers of molt were found, which are related to cell dynamics, nervous system or immunity, processes or functions that have been shown to be extensively modulated in response to molt. CONCLUSIONS: Our results provide a comprehensive description at the scale of the whole organism deciphering the effects of molt on the transcriptome in chicken. Also, the conclusion of this study can be used as a valuable resource in transcriptome analyses of chicken in the future and provide new insights related to molt.


Assuntos
Galinhas , Transcriptoma , Animais , Galinhas/genética , Feminino , Perfilação da Expressão Gênica , Masculino , Muda/genética , República da Coreia
7.
Nature ; 592(7856): 737-746, 2021 04.
Artigo em Inglês | MEDLINE | ID: mdl-33911273

RESUMO

High-quality and complete reference genome assemblies are fundamental for the application of genomics to biology, disease, and biodiversity conservation. However, such assemblies are available for only a few non-microbial species1-4. To address this issue, the international Genome 10K (G10K) consortium5,6 has worked over a five-year period to evaluate and develop cost-effective methods for assembling highly accurate and nearly complete reference genomes. Here we present lessons learned from generating assemblies for 16 species that represent six major vertebrate lineages. We confirm that long-read sequencing technologies are essential for maximizing genome quality, and that unresolved complex repeats and haplotype heterozygosity are major sources of assembly error when not handled correctly. Our assemblies correct substantial errors, add missing sequence in some of the best historical reference genomes, and reveal biological discoveries. These include the identification of many false gene duplications, increases in gene sizes, chromosome rearrangements that are specific to lineages, a repeated independent chromosome breakpoint in bat genomes, and a canonical GC-rich pattern in protein-coding genes and their regulatory regions. Adopting these lessons, we have embarked on the Vertebrate Genomes Project (VGP), an international effort to generate high-quality, complete reference genomes for all of the roughly 70,000 extant vertebrate species and to help to enable a new era of discovery across the life sciences.


Assuntos
Genoma , Genômica/métodos , Vertebrados/genética , Animais , Aves , Biblioteca Gênica , Tamanho do Genoma , Genoma Mitocondrial , Haplótipos , Sequenciamento de Nucleotídeos em Larga Escala , Anotação de Sequência Molecular , Alinhamento de Sequência , Análise de Sequência de DNA , Cromossomos Sexuais/genética
8.
Animals (Basel) ; 9(10)2019 Oct 15.
Artigo em Inglês | MEDLINE | ID: mdl-31618984

RESUMO

Copy number variation (CNV) has great significance both functionally and evolutionally. Various CNV studies are in progress to find the cause of human disease and to understand the population structure of livestock. Recent advances in next-generation sequencing (NGS) technology have made CNV detection more reliable and accurate at whole-genome level. However, there is a lack of CNV studies on chickens using NGS. Therefore, we obtained whole-genome sequencing data of 65 chickens including Red Jungle Fowl, Cornish (broiler), Rhode Island Red (hybrid), and White Leghorn (layer) from the public databases for CNV region (CNVR) detection. Using CNVnator, a read-depth based software, a total of 663 domesticated-specific CNVRs were identified across autosomes. Gene ontology analysis of genes annotated in CNVRs showed that mainly enriched terms involved in organ development, metabolism, and immune regulation. Population analysis revealed that CN and RIR are closer to each other than WL, and many genes (LOC772271, OR52R1, RD3, ADH6, TLR2B, PRSS2, TPK1, POPDC3, etc.) with different copy numbers between breeds found. In conclusion, this study has helped to understand the genetic characteristics of domestic chickens at CNV level, which may provide useful information for the development of breeding systems in chickens.

9.
Genes Genomics ; 40(5): 521-530, 2018 05.
Artigo em Inglês | MEDLINE | ID: mdl-29892957

RESUMO

To optimize conservation efforts, it is necessary to determine the risk of extinction by collecting reliable population information for a given species. We developed eight novel, polymorphic microsatellite markers and used these markers in conjunction with twelve existing markers to measure genetic diversity of South Korean populations of leopard cat (Prionailurus bengalensis), a species for which population size and habitat area data are unknown in the country, to assess its conservation status. The average number of alleles and the observed heterozygosity of the species were 3.8 and 0.41, respectively, and microsatellite diversity was lower than the average genetic diversity of 57 populations of 12 other felid species, and lower than that of other mammal populations occurring in South Korea, including the raccoon dog (Nyctereutes procyonoides), water deer (Hydropotes inermis), and endangered long-tailed goral (Naemorhedus caudatus). Furthermore, analysis of genetic structure in the national leopard cat population showed no clear genetic differentiation, suggesting that it is not necessary to divide the South Korean leopard cat population into multiple management units for the purposes of conservation. These results indicate that the genetic diversity of the leopard cat in South Korea is unexpectedly low, and that the risk of local extinction is, as a result, substantial. Thus, it is necessary to begin appropriate conservation efforts at a national level to conserve the leopard cat population in South Korea.


Assuntos
Variação Genética/genética , Panthera/genética , Alelos , Animais , Gatos , DNA , Espécies em Perigo de Extinção , Felidae/genética , Frequência do Gene/genética , Repetições de Microssatélites/genética , República da Coreia
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA