Results 1 - 20 of 62
1.
Mol Cell ; 81(14): 2975-2988.e6, 2021 07 15.
Article in English | MEDLINE | ID: mdl-34157308

ABSTRACT

The heterogeneous nature of eukaryotic replication kinetics and the low efficiency of individual initiation sites make mapping the location and timing of replication initiation in human cells difficult. To address this challenge, we have developed optical replication mapping (ORM), a high-throughput single-molecule approach, and used it to map early-initiation events in human cells. The single-molecule nature of our data and a total of >2,500-fold coverage of the human genome on 27 million fibers averaging ∼300 kb in length allow us to identify initiation sites and their firing probability with high confidence. We find that the distribution of human replication initiation is consistent with inefficient, stochastic activation of heterogeneously distributed potential initiation complexes enriched in accessible chromatin. These observations are consistent with stochastic models of initiation-timing regulation and suggest that stochastic regulation of replication kinetics is a fundamental feature of eukaryotic replication, conserved from yeast to humans.
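
The coverage figure quoted in the abstract above can be checked with simple arithmetic: total fiber length divided by genome length. The sketch below is only a back-of-the-envelope check; the haploid genome size of roughly 3.1 Gb is an assumption that the abstract does not state, and `fold_coverage` is a hypothetical helper name.

```python
# Rough consistency check of the ">2,500-fold coverage" figure in the abstract.
# Assumption (not from the abstract): haploid human genome of about 3.1 Gb.

def fold_coverage(n_fibers: float, mean_fiber_bp: float, genome_bp: float) -> float:
    """Return total mapped length divided by genome length."""
    return n_fibers * mean_fiber_bp / genome_bp

# 27 million fibers averaging ~300 kb, as reported in the abstract.
coverage = fold_coverage(n_fibers=27e6, mean_fiber_bp=300e3, genome_bp=3.1e9)
print(f"approx. {coverage:,.0f}-fold coverage")
```

Under that assumed genome size, the result lands slightly above 2,500-fold, which is consistent with the figure reported in the abstract.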


Subject(s)
DNA Replication/genetics; Eukaryotic Cells/physiology; Genome, Human/genetics; Cell Line, Tumor; Chromatin/genetics; DNA Replication Timing/genetics; Genome, Fungal/genetics; Genome-Wide Association Study/methods; HeLa Cells; Humans; Replication Origin/genetics; Saccharomyces cerevisiae/genetics; Transcription Initiation Site/physiology
2.
Nature ; 611(7936): 519-531, 2022 Nov.
Article in English | MEDLINE | ID: mdl-36261518

ABSTRACT

The current human reference genome, GRCh38, represents over 20 years of effort to generate a high-quality assembly, which has benefitted society (refs. 1, 2). However, it still has many gaps and errors, and does not represent a biological genome as it is a blend of multiple individuals (refs. 3, 4). Recently, a high-quality telomere-to-telomere reference, CHM13, was generated with the latest long-read technologies, but it was derived from a hydatidiform mole cell line with a nearly homozygous genome (ref. 5). To address these limitations, the Human Pangenome Reference Consortium formed with the goal of creating high-quality, cost-effective, diploid genome assemblies for a pangenome reference that represents human genetic diversity (ref. 6). Here, in our first scientific report, we determined which combination of current genome sequencing and assembly approaches yields the most complete and accurate diploid genome assembly with minimal manual curation. Approaches that used highly accurate long reads and parent-child data with graph-based haplotype phasing during assembly outperformed those that did not. Developing a combination of the top-performing methods, we generated our first high-quality diploid reference assembly, containing only approximately four gaps per chromosome on average, with most chromosomes within ±1% of the length of CHM13. Nearly 48% of protein-coding genes have non-synonymous amino acid changes between haplotypes, and centromeric regions showed the highest diversity. Our findings serve as a foundation for assembling near-complete diploid human genomes at scale for a pangenome reference to capture global genetic variation from single nucleotides to structural rearrangements.


Subject(s)
Chromosome Mapping; Diploidy; Genome, Human; Genomics; Humans; Chromosome Mapping/standards; Genome, Human/genetics; Haplotypes/genetics; High-Throughput Nucleotide Sequencing/methods; High-Throughput Nucleotide Sequencing/standards; Sequence Analysis, DNA/methods; Sequence Analysis, DNA/standards; Reference Standards; Genomics/methods; Genomics/standards; Chromosomes, Human/genetics; Genetic Variation/genetics
3.
Nature ; 594(7861): 77-81, 2021 06.
Article in English | MEDLINE | ID: mdl-33953399

ABSTRACT

The divergence of chimpanzee and bonobo provides one of the few examples of recent hominid speciation (refs. 1, 2). Here we describe a fully annotated, high-quality bonobo genome assembly, which was constructed without guidance from reference genomes by applying a multiplatform genomics approach. We generate a bonobo genome assembly in which more than 98% of genes are completely annotated and 99% of the gaps are closed, including the resolution of about half of the segmental duplications and almost all of the full-length mobile elements. We compare the bonobo genome to those of other great apes (refs. 1, 3-5) and identify more than 5,569 fixed structural variants that specifically distinguish the bonobo and chimpanzee lineages. We focus on genes that have been lost, changed in structure or expanded in the last few million years of bonobo evolution. We produce a high-resolution map of incomplete lineage sorting and estimate that around 5.1% of the human genome is genetically closer to chimpanzee or bonobo and that more than 36.5% of the genome shows incomplete lineage sorting if we consider a deeper phylogeny including gorilla and orangutan. We also show that 26% of the segments of incomplete lineage sorting between human and chimpanzee or human and bonobo are non-randomly distributed and that genes within these clustered segments show significant excess of amino acid replacement compared to the rest of the genome.


Subject(s)
Evolution, Molecular; Genome/genetics; Genomics; Pan paniscus/genetics; Phylogeny; Animals; Eukaryotic Initiation Factor-4A/genetics; Female; Genes; Gorilla gorilla/genetics; Molecular Sequence Annotation/standards; Pan troglodytes/genetics; Pongo/genetics; Segmental Duplications, Genomic; Sequence Analysis, DNA
4.
Am J Hum Genet ; 109(10): 1789-1813, 2022 10 06.
Article in English | MEDLINE | ID: mdl-36152629

ABSTRACT

Chromosome 16p11.2 reciprocal genomic disorder, resulting from recurrent copy-number variants (CNVs), involves intellectual disability, autism spectrum disorder (ASD), and schizophrenia, but the responsible mechanisms are not known. To systemically dissect molecular effects, we performed transcriptome profiling of 350 libraries from six tissues (cortex, cerebellum, striatum, liver, brown fat, and white fat) in mouse models harboring CNVs of the syntenic 7qF3 region, as well as cellular, transcriptional, and single-cell analyses in 54 isogenic neural stem cell, induced neuron, and cerebral organoid models of CRISPR-engineered 16p11.2 CNVs. Transcriptome-wide differentially expressed genes were largely tissue-, cell-type-, and dosage-specific, although more effects were shared between deletion and duplication and across tissue than expected by chance. The broadest effects were observed in the cerebellum (2,163 differentially expressed genes), and the greatest enrichments were associated with synaptic pathways in mouse cerebellum and human induced neurons. Pathway and co-expression analyses identified energy and RNA metabolism as shared processes and enrichment for ASD-associated, loss-of-function constraint, and fragile X messenger ribonucleoprotein target gene sets. Intriguingly, reciprocal 16p11.2 dosage changes resulted in consistent decrements in neurite and electrophysiological features, and single-cell profiling of organoids showed reciprocal alterations to the proportions of excitatory and inhibitory GABAergic neurons. Changes both in neuronal ratios and in gene expression in our organoid analyses point most directly to calretinin GABAergic inhibitory neurons and the excitatory/inhibitory balance as targets of disruption that might contribute to changes in neurodevelopmental and cognitive function in 16p11.2 carriers. 
Collectively, our data indicate the genomic disorder involves disruption of multiple contributing biological processes and that this disruption has relative impacts that are context specific.


Subject(s)
Autism Spectrum Disorder; Chromosome Disorders; Intellectual Disability; Animals; Autism Spectrum Disorder/genetics; Calbindin 2/genetics; Cerebral Cortex; Chromosome Deletion; Chromosome Disorders/genetics; Chromosomes, Human, Pair 16/genetics; DNA Copy Number Variations; Genomics; Humans; Intellectual Disability/genetics; Mice; Neurons; RNA
5.
Nature ; 554(7690): 50-55, 2018 02 01.
Article in English | MEDLINE | ID: mdl-29364872

ABSTRACT

Salamanders serve as important tetrapod models for developmental, regeneration and evolutionary studies. An extensive molecular toolkit makes the Mexican axolotl (Ambystoma mexicanum) a key representative salamander for molecular investigations. Here we report the sequencing and assembly of the 32-gigabase-pair axolotl genome using an approach that combined long-read sequencing, optical mapping and development of a new genome assembler (MARVEL). We observed a size expansion of introns and intergenic regions, largely attributable to multiplication of long terminal repeat retroelements. We provide evidence that intron size in developmental genes is under constraint and that species-restricted genes may contribute to limb regeneration. The axolotl genome assembly does not contain the essential developmental gene Pax3. However, mutation of the axolotl Pax3 paralogue Pax7 resulted in an axolotl phenotype that was similar to those seen in Pax3-/- and Pax7-/- mutant mice. The axolotl genome provides a rich biological resource for developmental and evolutionary studies.


Subject(s)
Ambystoma mexicanum/genetics; Evolution, Molecular; Genome/genetics; Genomics; Animals; DNA, Intergenic/genetics; Genes, Essential/genetics; Homeodomain Proteins/genetics; Introns/genetics; Male; Mice; PAX3 Transcription Factor/genetics; PAX7 Transcription Factor/genetics; Picea/genetics; Pinus/genetics; Regeneration/genetics; Retroelements/genetics; Terminal Repeat Sequences/genetics
6.
Nature ; 559(7712): E2, 2018 07.
Article in English | MEDLINE | ID: mdl-29795340

ABSTRACT

In the originally published version of this Article, the sequenced axolotl strain (the homozygous white mutant) was denoted as 'D/D' rather than 'd/d' in Fig. 1a and the accompanying legend, the main text and the Methods section. The original Article has been corrected online.

7.
Nature ; 546(7659): 524-527, 2017 06 22.
Article in English | MEDLINE | ID: mdl-28605751

ABSTRACT

Complete and accurate reference genomes and annotations provide fundamental tools for characterization of genetic and functional variation. These resources facilitate the determination of biological processes and support translation of research findings into improved and sustainable agricultural technologies. Many reference genomes for crop plants have been generated over the past decade, but these genomes are often fragmented and missing complex repeat regions. Here we report the assembly and annotation of a reference genome of maize, a genetic and agricultural model species, using single-molecule real-time sequencing and high-resolution optical mapping. Relative to the previous reference genome, our assembly features a 52-fold increase in contig length and notable improvements in the assembly of intergenic spaces and centromeres. Characterization of the repetitive portion of the genome revealed more than 130,000 intact transposable elements, allowing us to identify transposable element lineage expansions that are unique to maize. Gene annotations were updated using 111,000 full-length transcripts obtained by single-molecule real-time sequencing. In addition, comparative optical mapping of two other inbred maize lines revealed a prevalence of deletions in regions of low gene density and maize lineage-specific genes.


Subject(s)
Genome, Plant/genetics; High-Throughput Nucleotide Sequencing/methods; Single Molecule Imaging/methods; Zea mays/genetics; Centromere/genetics; Chromosomes, Plant/genetics; Contig Mapping; Crops, Agricultural/genetics; DNA Transposable Elements/genetics; DNA, Intergenic/genetics; Genes, Plant/genetics; Molecular Sequence Annotation; Optics and Photonics; Phylogeny; RNA, Messenger/analysis; RNA, Messenger/genetics; Reference Standards; Sorghum/genetics
8.
Nature ; 544(7651): 427-433, 2017 04 26.
Article in English | MEDLINE | ID: mdl-28447635

ABSTRACT

Cereal grasses of the Triticeae tribe have been the major food source in temperate regions since the dawn of agriculture. Their large genomes are characterized by a high content of repetitive elements and large pericentromeric regions that are virtually devoid of meiotic recombination. Here we present a high-quality reference genome assembly for barley (Hordeum vulgare L.). We use chromosome conformation capture mapping to derive the linear order of sequences across the pericentromeric space and to investigate the spatial organization of chromatin in the nucleus at megabase resolution. The composition of genes and repetitive elements differs between distal and proximal regions. Gene family analyses reveal lineage-specific duplications of genes involved in the transport of nutrients to developing seeds and the mobilization of carbohydrates in grains. We demonstrate the importance of the barley reference sequence for breeding by inspecting the genomic partitioning of sequence variation in modern elite germplasm, highlighting regions vulnerable to genetic erosion.


Subject(s)
Chromosomes, Plant/genetics; Genome, Plant/genetics; Hordeum/genetics; Cell Nucleus/genetics; Centromere/genetics; Chromatin/genetics; Chromatin/metabolism; Chromosome Mapping; Chromosomes, Artificial, Bacterial/genetics; Genetic Variation; Genomics; Haplotypes/genetics; Meiosis/genetics; Repetitive Sequences, Nucleic Acid/genetics; Seeds/genetics
9.
J Hered ; 114(5): 539-548, 2023 08 23.
Article in English | MEDLINE | ID: mdl-37249392

ABSTRACT

The black-footed ferret (Mustela nigripes) narrowly avoided extinction to become an oft-cited example of the benefits of intensive management, research, and collaboration to save a species through ex situ conservation breeding and reintroduction into its former range. However, the species remains at risk due to possible inbreeding, disease susceptibility, and multiple fertility challenges. Here, we report the de novo genome assembly of a male black-footed ferret generated through a combination of linked-read sequencing, optical mapping, and Hi-C proximity ligation. In addition, we report the karyotype for this species, which was used to anchor and assign chromosome numbers to the chromosome-length scaffolds. The draft assembly was ~2.5 Gb in length, with 95.6% of it anchored to 19 chromosome-length scaffolds, corresponding to the 2n = 38 chromosomes revealed by the karyotype. The assembly has contig and scaffold N50 values of 148.8 kbp and 145.4 Mbp, respectively, and is up to 96% complete based on BUSCO analyses. Annotation of the assembly, including evidence from RNA-seq data, identified 21,406 protein-coding genes and a repeat content of 37.35%. Phylogenomic analyses indicated that the black-footed ferret diverged from the European polecat/domestic ferret lineage 1.6 million yr ago. This assembly will enable research on the conservation genomics of black-footed ferrets and thereby aid in the further restoration of this endangered species.
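
The contig and scaffold N50 values quoted in the abstract above are contiguity summaries: N50 is the largest length L such that sequences of length L or longer together contain at least half of the assembled bases. The following is a minimal sketch of that standard definition; the function name `n50` and the toy lengths are illustrative assumptions and are not taken from the ferret assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Sequences are visited from longest to shortest; the N50 is the length
    of the sequence at which the running total first reaches half of the
    total assembled length.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("lengths must be non-empty")

# Hypothetical toy contig lengths (total 300, half 150).
toy_lengths = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_lengths))  # prints 70, since 80 + 70 = 150 reaches half of 300
```

The same function applies to contigs or scaffolds alike; only the input list changes, which is why an assembly can report a small contig N50 alongside a much larger scaffold N50.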


Subject(s)
Endangered Species; Ferrets; Animals; Male; Ferrets/genetics; Karyotype; Karyotyping; Fertility
10.
J Med Genet ; 59(10): 976-983, 2022 Oct.
Article in English | MEDLINE | ID: mdl-34911816

ABSTRACT

PURPOSE: Approximately 20% of patients with clinical familial adenomatous polyposis (FAP) remain unsolved after molecular genetic analysis of the APC and other polyposis genes, suggesting additional pathomechanisms. METHODS: We applied multidimensional genomic analysis employing chromosomal microarray profiling, optical mapping, long-read genome and RNA sequencing combined with FISH and standard PCR of genomic and complementary DNA to decode a patient with an attenuated FAP that had remained unsolved by Sanger sequencing and multigene panel next-generation sequencing for years. RESULTS: We identified a complex 3.9 Mb rearrangement involving 14 fragments from chromosome 5q22.1q22.3 of which three were lost, 1 reinserted into chromosome 5 and 10 inserted into chromosome 10q21.3 in a seemingly random order and orientation thus fulfilling the major criteria of chromothripsis. The rearrangement separates APC promoter 1B from the coding ORF (open reading frame) thus leading to allele-specific downregulation of APC mRNA. The rearrangement also involves three additional genes implicated in the APC-Axin-GSK3B-β-catenin signalling pathway. CONCLUSIONS: Based on comprehensive genomic analysis, we propose that constitutional chromothripsis dampening APC expression, possibly modified by additional APC-Axin-GSK3B-β-catenin pathway disruptions, underlies the patient's clinical phenotype. The combinatorial approach we deployed provides a powerful tool set for deciphering unsolved familial polyposis and potentially other tumour syndromes and monogenic diseases.


Subject(s)
Adenomatous Polyposis Coli; Chromothripsis; Colonic Neoplasms; Adenomatous Polyposis Coli/genetics; Adenomatous Polyposis Coli/pathology; Adenomatous Polyposis Coli Protein/genetics; Axin Protein/genetics; Colonic Neoplasms/complications; Colonic Neoplasms/genetics; DNA, Complementary; Genes, APC; Genetic Predisposition to Disease; Humans; RNA, Messenger; beta Catenin/genetics
12.
Genome Res ; 29(4): 646-656, 2019 04.
Article in English | MEDLINE | ID: mdl-30846530

ABSTRACT

We report on the development of a methylation analysis workflow for optical detection of fluorescent methylation profiles along chromosomal DNA molecules. In combination with Bionano Genomics genome mapping technology, these profiles provide a hybrid genetic/epigenetic genome-wide map composed of DNA molecules spanning hundreds of kilobase pairs. The method provides kilobase pair-scale genomic methylation patterns comparable to whole-genome bisulfite sequencing (WGBS) along genes and regulatory elements. These long single-molecule reads allow for methylation variation calling and analysis of large structural aberrations such as pathogenic macrosatellite arrays not accessible to single-cell second-generation sequencing. The method is applied here to study facioscapulohumeral muscular dystrophy (FSHD), simultaneously recording the haplotype, copy number, and methylation status of the disease-associated, highly repetitive locus on Chromosome 4q.


Subject(s)
DNA Methylation; Sequence Analysis, DNA/methods; Genetic Variation; Humans; Muscular Dystrophy, Facioscapulohumeral/genetics; Sequence Analysis, DNA/standards
13.
Nature ; 538(7624): 243-247, 2016 Oct 13.
Article in English | MEDLINE | ID: mdl-27706134

ABSTRACT

Advances in genome assembly and phasing provide an opportunity to investigate the diploid architecture of the human genome and reveal the full range of structural variation across population groups. Here we report the de novo assembly and haplotype phasing of the Korean individual AK1 (ref. 1) using single-molecule real-time sequencing, next-generation mapping, microfluidics-based linked reads, and bacterial artificial chromosome (BAC) sequencing approaches. Single-molecule sequencing coupled with next-generation mapping generated a highly contiguous assembly, with a contig N50 size of 17.9 Mb and a scaffold N50 size of 44.8 Mb, resolving 8 chromosomal arms into single scaffolds. The de novo assembly, along with local assemblies and spanning long reads, closes 105 and extends into 72 out of 190 euchromatic gaps in the reference genome, adding 1.03 Mb of previously intractable sequence. High concordance between the assembly and paired-end sequences from 62,758 BAC clones provides strong support for the robustness of the assembly. We identify 18,210 structural variants by direct comparison of the assembly with the human reference, identifying thousands of breakpoints that, to our knowledge, have not been reported before. Many of the insertions are reflected in the transcriptome and are shared across the Asian population. We performed haplotype phasing of the assembly with short reads, long reads and linked reads from whole-genome sequencing and with short reads from 31,719 BAC clones, thereby achieving phased blocks with an N50 size of 11.6 Mb. Haplotigs assembled from single-molecule real-time reads assigned to haplotypes on phased blocks covered 89% of genes. The haplotigs accurately characterized the hypervariable major histocompatibility complex region as well as demonstrating allele configuration in clinically relevant genes such as CYP2D6.
This work presents the most contiguous diploid human genome assembly so far, with extensive investigation of unreported and Asian-specific structural variants, and high-quality haplotyping of clinically relevant alleles for precision medicine.


Subject(s)
Asian People/genetics; Contig Mapping; Genome, Human/genetics; Genomics; Haplotypes/genetics; Sequence Analysis, DNA; Alleles; Chromosomes, Artificial, Bacterial/genetics; Cytochrome P-450 CYP2D6/genetics; Diploidy; Genetic Variation/genetics; Histocompatibility Antigens Class II/genetics; Humans; Precision Medicine; Reference Standards; Republic of Korea
14.
Nature ; 527(7579): 508-11, 2015 Nov 26.
Article in English | MEDLINE | ID: mdl-26560029

ABSTRACT

Plant genomes, and eukaryotic genomes in general, are typically repetitive, polyploid and heterozygous, which complicates genome assembly. The short read lengths of early Sanger and current next-generation sequencing platforms hinder assembly through complex repeat regions, and many draft and reference genomes are fragmented, lacking skewed GC and repetitive intergenic sequences, which are gaining importance due to projects like the Encyclopedia of DNA Elements (ENCODE). Here we report the whole-genome sequencing and assembly of the desiccation-tolerant grass Oropetium thomaeum. Using only single-molecule real-time sequencing, which generates long (>16 kilobases) reads with random errors, we assembled 99% (244 megabases) of the Oropetium genome into 625 contigs with an N50 length of 2.4 megabases. Oropetium is an example of a 'near-complete' draft genome which includes gapless coverage over gene space as well as intergenic sequences such as centromeres, telomeres, transposable elements and rRNA clusters that are typically unassembled in draft genomes. Oropetium has 28,466 protein-coding genes and 43% repeat sequences, yet with 30% more compact euchromatic regions it is the smallest known grass genome. The Oropetium genome demonstrates the utility of single-molecule real-time sequencing for assembling high-quality plant and other eukaryotic genomes, and serves as a valuable resource for the plant comparative genomics community.


Subject(s)
Genome, Plant/genetics; Poaceae/genetics; Sequence Analysis, DNA/methods; Acclimatization/genetics; Contig Mapping; Dehydration; Desiccation; Droughts; Genes, Plant/genetics; Genomics; Molecular Sequence Data
15.
Plant J ; 100(5): 1066-1082, 2019 12.
Article in English | MEDLINE | ID: mdl-31433882

ABSTRACT

We report reference-quality genome assemblies and annotations for two accessions of soybean (Glycine max) and for one accession of Glycine soja, the closest wild relative of G. max. The G. max assemblies provided are for widely used US cultivars: the northern line Williams 82 (Wm82) and the southern line Lee. The Wm82 assembly improves the prior published assembly, and the Lee and G. soja assemblies are new for these accessions. Comparisons among the three accessions show generally high structural conservation, but nucleotide differences of 1.7 single-nucleotide polymorphisms (SNPs) per kb between Wm82 and Lee, and 4.7 SNPs per kb between these lines and G. soja. SNP distributions and comparisons with genotypes of the Lee and Wm82 parents highlight patterns of introgression and haplotype structure. Comparisons against the US germplasm collection show placement of the sequenced accessions relative to global soybean diversity. Analysis of a pan-gene collection shows generally high conservation, with variation occurring primarily in genomically clustered gene families. We found approximately 40-42 inversions per chromosome between either Lee or Wm82v4 and G. soja, and approximately 32 inversions per chromosome between Wm82 and Lee. We also investigated five domestication loci. For each locus, we found two different alleles with functional differences between G. soja and the two domesticated accessions. The genome assemblies for multiple cultivated accessions and for the closest wild ancestor of soybean provide a valuable set of resources for identifying causal variants that underlie traits for the domestication and improvement of soybean, serving as a basis for future research and crop improvement efforts for this important crop species.


Subject(s)
Fabaceae/genetics; Genetic Variation; Genome, Plant; Alleles; Centromere/genetics; Disease Resistance/genetics; Genetics, Population; Genotype; Haplotypes; Hardness; Multigene Family; Phylogeny; Polymorphism, Single Nucleotide; Quantitative Trait Loci; Repetitive Sequences, Nucleic Acid; Seed Bank/classification; Sequence Inversion; Telomere/genetics
16.
Plant J ; 96(6): 1309-1316, 2018 12.
Article in English | MEDLINE | ID: mdl-30256471

ABSTRACT

Chromosomal inversions occur in natural populations of many species, and may underlie reproductive isolation and local adaptation. Traditional methods of inversion discovery are labor-intensive and lack sensitivity. Here, we report the use of three-dimensional contact probabilities between genomic loci as assayed by chromosome-conformation capture sequencing (Hi-C) to detect multi-megabase polymorphic inversions in four barley genotypes. Inversions are validated by fluorescence in situ hybridization and Bionano optical mapping. We propose Hi-C as a generally applicable method for inversion discovery in natural populations.


Subject(s)
Chromosome Inversion/genetics; Chromosomes, Plant/genetics; Genome, Plant/genetics; Hordeum/genetics; Chromosome Mapping; Genotype; In Situ Hybridization, Fluorescence
17.
Nat Methods ; 13(7): 587-90, 2016 07.
Article in English | MEDLINE | ID: mdl-27159086

ABSTRACT

Despite tremendous progress in genome sequencing, the basic goal of producing a phased (haplotype-resolved) genome sequence with end-to-end contiguity for each chromosome at reasonable cost and effort is still unrealized. In this study, we describe an approach to performing de novo genome assembly and experimental phasing by integrating the data from Illumina short-read sequencing, 10X Genomics linked-read sequencing, and BioNano Genomics genome mapping to yield a high-quality, phased, de novo assembled human genome.


Subject(s)
Chromosome Mapping/methods; Genome, Human; Genomics/methods; Haplotypes/genetics; High-Throughput Nucleotide Sequencing/methods; Humans
18.
Adv Exp Med Biol ; 1129: 97-118, 2019.
Article in English | MEDLINE | ID: mdl-30968363

ABSTRACT

Next Generation Sequencing (NGS) has rapidly advanced genomic research with tremendously increased throughput and reduced cost, through reading the fragmented genome content in a massively parallel fashion. We have been able to sequence and map genomes to reference sequences with relative ease compared to the past. However, this mapping can only be accurately accomplished in the single-copy regions of the genome, leaving out most duplicated genes and structural variation. Additionally, assembly of long genomic segments remains elusive since multi-copy regions of the genome produce ambiguity when short-read sequence is used.


Subject(s)
Chromosome Mapping/methods; Genomics/methods; Haploidy; High-Throughput Nucleotide Sequencing; Sequence Analysis, DNA
19.
Plant J ; 89(3): 617-635, 2017 02.
Article in English | MEDLINE | ID: mdl-27754575

ABSTRACT

Spirodela polyrhiza is a fast-growing aquatic monocot with highly reduced morphology, genome size and number of protein-coding genes. Considering these biological features of Spirodela and its basal position in the monocot lineage, understanding its genome architecture could shed light on plant adaptation and genome evolution. Like many draft genomes, however, the 158-Mb Spirodela genome sequence has not been resolved to chromosomes, and important genome characteristics have not been defined. Here we deployed rapid genome-wide physical maps combined with high-coverage short-read sequencing to resolve the 20 chromosomes of Spirodela and to empirically delineate its genome features. Our data revealed a dramatic reduction in the number of the rDNA repeat units in Spirodela to fewer than 100, which is even fewer than that reported for yeast. Consistent with its unique phylogenetic position, small RNA sequencing revealed 29 Spirodela-specific microRNAs, with only two being shared with Elaeis guineensis (oil palm) and Musa balbisiana (banana). Combining DNA methylation data and small RNA sequencing enabled the accurate prediction of 20.5% long terminal repeats (LTRs) that doubled the previous estimate, and revealed a high Solo:Intact LTR ratio of 8.2. Interestingly, we found that Spirodela has the lowest global DNA methylation levels (9%) of any plant species tested. Taken together, our results reveal a genome that has undergone reduction, likely through eliminating non-essential protein-coding genes, rDNA and LTRs. In addition to delineating the genome features of this unique plant, the methodologies described and large-scale genome resources from this work will enable future evolutionary and functional studies of this basal monocot family.


Subject(s)
Araceae/genetics; Chromosome Mapping/methods; Genome, Plant/genetics; Sequence Analysis, DNA/methods; Chromosomes, Plant/genetics; DNA Methylation; Gene Expression Regulation, Plant; Gene Ontology; Genes, Plant/genetics; Genetic Variation; Plant Proteins/genetics
20.
Nat Methods ; 12(8): 780-6, 2015 Aug.
Article in English | MEDLINE | ID: mdl-26121404

ABSTRACT

We present the first comprehensive analysis of a diploid human genome that combines single-molecule sequencing with single-molecule genome maps. Our hybrid assembly markedly improves upon the contiguity observed from traditional shotgun sequencing approaches, with scaffold N50 values approaching 30 Mb, and we identified complex structural variants (SVs) missed by other high-throughput approaches. Furthermore, by combining Illumina short-read data with long reads, we phased both single-nucleotide variants and SVs, generating haplotypes with over 99% consistency with previous trio-based studies. Our work shows that it is now possible to integrate single-molecule and high-throughput sequence data to generate de novo assembled genomes that approach reference quality.


Subject(s)
Computational Biology/methods; Genome, Human; High-Throughput Nucleotide Sequencing/methods; Polymorphism, Single Nucleotide; Algorithms; Chromosome Mapping; Diploidy; Gene Library; Genetic Variation; Genome; Haplotypes; Humans; Nucleotides/genetics; Reproducibility of Results; Sequence Analysis, DNA; Tandem Repeat Sequences