Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 12 de 12
Filtrar
1.
Nature ; 629(8010): 136-145, 2024 May.
Artigo em Inglês | MEDLINE | ID: mdl-38570684

RESUMO

Human centromeres have been traditionally very difficult to sequence and assemble owing to their repetitive nature and large size1. As a result, patterns of human centromeric variation and models for their evolution and function remain incomplete, despite centromeres being among the most rapidly mutating regions2,3. Here, using long-read sequencing, we completely sequenced and assembled all centromeres from a second human genome and compared it to the finished reference genome4,5. We find that the two sets of centromeres show at least a 4.1-fold increase in single-nucleotide variation when compared with their unique flanks and vary up to 3-fold in size. Moreover, we find that 45.8% of centromeric sequence cannot be reliably aligned using standard methods owing to the emergence of new α-satellite higher-order repeats (HORs). DNA methylation and CENP-A chromatin immunoprecipitation experiments show that 26% of the centromeres differ in their kinetochore position by >500 kb. To understand evolutionary change, we selected six chromosomes and sequenced and assembled 31 orthologous centromeres from the common chimpanzee, orangutan and macaque genomes. Comparative analyses reveal a nearly complete turnover of α-satellite HORs, with characteristic idiosyncratic changes in α-satellite HORs for each species. Phylogenetic reconstruction of human haplotypes supports limited to no recombination between the short (p) and long (q) arms across centromeres and reveals that novel α-satellite HORs share a monophyletic origin, providing a strategy to estimate the rate of saltatory amplification and mutation of human centromeric DNA.


Assuntos
Centrômero , Evolução Molecular , Variação Genética , Animais , Humanos , Centrômero/genética , Centrômero/metabolismo , Proteína Centromérica A/metabolismo , Metilação de DNA/genética , DNA Satélite/genética , Cinetocoros/metabolismo , Macaca/genética , Pan troglodytes/genética , Polimorfismo de Nucleotídeo Único/genética , Pongo/genética , Masculino , Feminino , Padrões de Referência , Imunoprecipitação da Cromatina , Haplótipos , Mutação , Amplificação de Genes , Alinhamento de Sequência , Cromatina/genética , Cromatina/metabolismo , Especificidade da Espécie
2.
Genome Res ; 34(3): 454-468, 2024 04 25.
Artigo em Inglês | MEDLINE | ID: mdl-38627094

RESUMO

Reference-free genome phasing is vital for understanding allele inheritance and the impact of single-molecule DNA variation on phenotypes. To achieve thorough phasing across homozygous or repetitive regions of the genome, long-read sequencing technologies are often used to perform phased de novo assembly. As a step toward reducing the cost and complexity of this type of analysis, we describe new methods for accurately phasing Oxford Nanopore Technologies (ONT) sequence data with the Shasta genome assembler and a modular tool for extending phasing to the chromosome scale called GFAse. We test using new variants of ONT PromethION sequencing, including those using proximity ligation, and show that newer, higher accuracy ONT reads substantially improve assembly quality.


Assuntos
Nanoporos , Humanos , Análise de Sequência de DNA/métodos , Sequenciamento por Nanoporos/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Software , Genômica/métodos
3.
Genome Res ; 34(3): 498-513, 2024 04 25.
Artigo em Inglês | MEDLINE | ID: mdl-38508693

RESUMO

Hydractinia is a colonial marine hydroid that shows remarkable biological properties, including the capacity to regenerate its entire body throughout its lifetime, a process made possible by its adult migratory stem cells, known as i-cells. Here, we provide an in-depth characterization of the genomic structure and gene content of two Hydractinia species, Hydractinia symbiolongicarpus and Hydractinia echinata, placing them in a comparative evolutionary framework with other cnidarian genomes. We also generated and annotated a single-cell transcriptomic atlas for adult male H. symbiolongicarpus and identified cell-type markers for all major cell types, including key i-cell markers. Orthology analyses based on the markers revealed that Hydractinia's i-cells are highly enriched in genes that are widely shared amongst animals, a striking finding given that Hydractinia has a higher proportion of phylum-specific genes than any of the other 41 animals in our orthology analysis. These results indicate that Hydractinia's stem cells and early progenitor cells may use a toolkit shared with all animals, making it a promising model organism for future exploration of stem cell biology and regenerative medicine. The genomic and transcriptomic resources for Hydractinia presented here will enable further studies of their regenerative capacity, colonial morphology, and ability to distinguish self from nonself.


Assuntos
Genoma , Hidrozoários , Animais , Hidrozoários/genética , Evolução Molecular , Transcriptoma , Células-Tronco/metabolismo , Masculino , Filogenia , Análise de Célula Única/métodos
4.
Nat Methods ; 21(6): 967-970, 2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38730258

RESUMO

Despite advances in long-read sequencing technologies, constructing a near telomere-to-telomere assembly is still computationally demanding. Here we present hifiasm (UL), an efficient de novo assembly algorithm combining multiple sequencing technologies to scale up population-wide near telomere-to-telomere assemblies. Applied to 22 human and two plant genomes, our algorithm produces better diploid assemblies at a cost of an order of magnitude lower than existing methods, and it also works with polyploid genomes.


Assuntos
Algoritmos , Diploide , Poliploidia , Telômero , Humanos , Telômero/genética , Genoma de Planta , Genoma Humano , Análise de Sequência de DNA/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos
5.
bioRxiv ; 2024 Jun 20.
Artigo em Inglês | MEDLINE | ID: mdl-38529499

RESUMO

Haplotype information is crucial for biomedical and population genetics research. However, current strategies to produce de-novo haplotype-resolved assemblies often require either difficult-to-acquire parental data or an intermediate haplotype-collapsed assembly. Here, we present Graphasing, a workflow which synthesizes the global phase signal of Strand-seq with assembly graph topology to produce chromosome-scale de-novo haplotypes for diploid genomes. Graphasing readily integrates with any assembly workflow that both outputs an assembly graph and has a haplotype assembly mode. Graphasing performs comparably to trio-phasing in contiguity, phasing accuracy, and assembly quality, outperforms Hi-C in phasing accuracy, and generates human assemblies with over 18 chromosome-spanning haplotypes.

6.
ArXiv ; 2024 Mar 03.
Artigo em Inglês | MEDLINE | ID: mdl-38903742

RESUMO

Metagenomic studies have primarily relied on de novo assembly for reconstructing genes and genomes from microbial mixtures. While reference-guided approaches have been employed in the assembly of single organisms, they have not been used in a metagenomic context. Here we describe the first effective approach for reference-guided metagenomic assembly that can complement and improve upon de novo metagenomic assembly methods for certain organisms. Such approaches will be increasingly useful as more genomes are sequenced and made publicly available.

7.
Microbiol Resour Announc ; 13(7): e0006224, 2024 Jul 18.
Artigo em Inglês | MEDLINE | ID: mdl-38899875

RESUMO

The draft genome of Mucor velutinosus NIH1002, a 2011 isolate from a case of disseminated disease, was sequenced using PacBio long-read and HiSeq short-read technologies. The genome has 43 contigs, an N50 of 2.65 Mb, and 13,295 protein-coding genes. It is the most complete M. velutinosus genome to date.

8.
Res Sq ; 2024 Apr 03.
Artigo em Inglês | MEDLINE | ID: mdl-38712074

RESUMO

Reference genomes of cattle and sheep have lacked contiguous assemblies of the sex-determining Y chromosome. We assembled complete and gapless telomere to telomere (T2T) Y chromosomes for these species. The pseudo-autosomal regions were similar in length, but the total chromosome size was substantially different, with the cattle Y more than twice the length of the sheep Y. The length disparity was accounted for by expanded ampliconic region in cattle. The genic amplification in cattle contrasts with pseudogenization in sheep suggesting opposite evolutionary mechanisms since their divergence 18MYA. The centromeres also differed dramatically despite the close relationship between these species at the overall genome sequence level. These Y chromosome have been added to the current reference assemblies in GenBank opening new opportunities for the study of evolution and variation while supporting efforts to improve sustainability in these important livestock species that generally use sire-driven genetic improvement strategies.

9.
Sci Data ; 11(1): 540, 2024 May 25.
Artigo em Inglês | MEDLINE | ID: mdl-38796485

RESUMO

Amongst fishes, zebrafish (Danio rerio) has gained popularity as a model system over most other species and while their value as a model is well documented, their usefulness is limited in certain fields of research such as behavior. By embracing other, less conventional experimental organisms, opportunities arise to gain broader insights into evolution and development, as well as studying behavioral aspects not available in current popular model systems. The anabantoid paradise fish (Macropodus opercularis), an "air-breather" species has a highly complex behavioral repertoire and has been the subject of many ethological investigations but lacks genomic resources. Here we report the reference genome assembly of M. opercularis using long-read sequences at 150-fold coverage. The final assembly consisted of 483,077,705 base pairs (~483 Mb) on 152 contigs. Within the assembled genome we identified and annotated 20,157 protein coding genes and assigned ~90% of them to orthogroups.


Assuntos
Peixes , Genoma , Animais , Peixes/genética
10.
bioRxiv ; 2024 Mar 19.
Artigo em Inglês | MEDLINE | ID: mdl-38529488

RESUMO

The combination of ultra-long Oxford Nanopore (ONT) sequencing reads with long, accurate PacBio HiFi reads has enabled the completion of a human genome and spurred similar efforts to complete the genomes of many other species. However, this approach for complete, "telomere-to-telomere" genome assembly relies on multiple sequencing platforms, limiting its accessibility. ONT "Duplex" sequencing reads, where both strands of the DNA are read to improve quality, promise high per-base accuracy. To evaluate this new data type, we generated ONT Duplex data for three widely-studied genomes: human HG002, Solanum lycopersicum Heinz 1706 (tomato), and Zea mays B73 (maize). For the diploid, heterozygous HG002 genome, we also used "Pore-C" chromatin contact mapping to completely phase the haplotypes. We found the accuracy of Duplex data to be similar to HiFi sequencing, but with read lengths tens of kilobases longer, and the Pore-C data to be compatible with existing diploid assembly algorithms. This combination of read length and accuracy enables the construction of a high-quality initial assembly, which can then be further resolved using the ultra-long reads, and finally phased into chromosome-scale haplotypes with Pore-C. The resulting assemblies have a base accuracy exceeding 99.999% (Q50) and near-perfect continuity, with most chromosomes assembled as single contigs. We conclude that ONT sequencing is a viable alternative to HiFi sequencing for de novo genome assembly, and has the potential to provide a single-instrument solution for the reconstruction of complete genomes.

11.
Nat Genet ; 56(8): 1566-1573, 2024 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-39103649

RESUMO

Telomere-to-telomere (T2T) assemblies reveal new insights into the structure and function of the previously 'invisible' parts of the genome and allow comparative analyses of complete genomes across entire clades. We present here an open collaborative effort, termed the 'Ruminant T2T Consortium' (RT2T), that aims to generate complete diploid assemblies for numerous species of the Artiodactyla suborder Ruminantia to examine chromosomal evolution in the context of natural selection and domestication of species used as livestock.


Assuntos
Ruminantes , Telômero , Telômero/genética , Animais , Ruminantes/genética , Evolução Molecular , Genoma/genética , Seleção Genética , Filogenia , Diploide
12.
Genes (Basel) ; 14(12)2023 12 14.
Artigo em Inglês | MEDLINE | ID: mdl-38137031

RESUMO

BACKGROUND: Insects are a sustainable source of protein for human food and animal feed. We present a genome assembly, CRISPR gene editing, and life stage-specific transcriptomes for the yellow mealworm, Tenebrio molitor, one of the most intensively farmed insects worldwide. METHODS: Long and short reads and long-range data were obtained from a T. molitor male pupa. Sequencing transcripts from 12 T. molitor life stages resulted in 279 million reads for gene prediction and genetic engineering. A unique plasmid delivery system containing guide RNAs targeting the eye color gene vermilion flanking the muscle actin gene promoter and EGFP marker was used in CRISPR/Cas9 transformation. RESULTS: The assembly is approximately 53% of the genome size of 756.8 ± 9.6 Mb, measured using flow cytometry. Assembly was complicated by a satellitome of at least 11 highly conserved satDNAs occupying 28% of the genome. The injection of the plasmid into embryos resulted in knock-out of Tm vermilion and knock-in of EGFP. CONCLUSIONS: The genome of T. molitor is longer than current assemblies (including ours) due to a substantial amount (26.5%) of only one highly abundant satellite DNA sequence. Genetic sequences and transformation tools for an insect important to the food and feed industries will promote the sustainable utilization of mealworms and other farmed insects.


Assuntos
Tenebrio , Animais , Masculino , Humanos , Tenebrio/genética , Tenebrio/metabolismo , RNA Guia de Sistemas CRISPR-Cas , Cor de Olho , Ração Animal/análise , Larva/metabolismo
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA