Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 17 de 17
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
País de afiliação
Intervalo de ano de publicação
1.
PLoS Comput Biol ; 16(9): e1008173, 2020 09.
Artigo em Inglês | MEDLINE | ID: mdl-32946435

RESUMO

Single-cell Hi-C (scHi-C) interrogates genome-wide chromatin interaction in individual cells, allowing us to gain insights into 3D genome organization. However, the extremely sparse nature of scHi-C data poses a significant barrier to analysis, limiting our ability to tease out hidden biological information. In this work, we approach this problem by applying topic modeling to scHi-C data. Topic modeling is well-suited for discovering latent topics in a collection of discrete data. For our analysis, we generate nine different single-cell combinatorial indexed Hi-C (sci-Hi-C) libraries from five human cell lines (GM12878, H1Esc, HFF, IMR90, and HAP1), consisting over 19,000 cells. We demonstrate that topic modeling is able to successfully capture cell type differences from sci-Hi-C data in the form of "chromatin topics." We further show enrichment of particular compartment structures associated with locus pairs in these topics.


Assuntos
Cromatina , Biologia Computacional/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Célula Única/métodos , Linhagem Celular , Cromatina/química , Cromatina/genética , Análise por Conglomerados , Biblioteca Gênica , Humanos , Processamento de Linguagem Natural
2.
Methods ; 170: 61-68, 2020 01 01.
Artigo em Inglês | MEDLINE | ID: mdl-31536770

RESUMO

The highly dynamic nature of chromosome conformation and three-dimensional (3D) genome organization leads to cell-to-cell variability in chromatin interactions within a cell population, even if the cells of the population appear to be functionally homogeneous. Hence, although Hi-C is a powerful tool for mapping 3D genome organization, this heterogeneity of chromosome higher order structure among individual cells limits the interpretive power of population based bulk Hi-C assays. Moreover, single-cell studies have the potential to enable the identification and characterization of rare cell populations or cell subtypes in a heterogeneous population. However, it may require surveying relatively large numbers of single cells to achieve statistically meaningful observations in single-cell studies. By applying combinatorial cellular indexing to chromosome conformation capture, we developed single-cell combinatorial indexed Hi-C (sci-Hi-C), a high throughput method that enables mapping chromatin interactomes in large number of single cells. We demonstrated the use of sci-Hi-C data to separate cells by karytoypic and cell-cycle state differences and to identify cellular variability in mammalian chromosomal conformation. Here, we provide a detailed description of method design and step-by-step working protocols for sci-Hi-C.


Assuntos
Mapeamento Cromossômico/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Célula Única/métodos , Animais , Linhagem Celular , Núcleo Celular/genética , Separação Celular/métodos , Cromatina/genética , Cromatina/isolamento & purificação , Cromatina/metabolismo , Simulação por Computador , Biblioteca Gênica , Humanos , Camundongos , Conformação de Ácido Nucleico
3.
Nat Methods ; 14(3): 263-266, 2017 03.
Artigo em Inglês | MEDLINE | ID: mdl-28135255

RESUMO

We present single-cell combinatorial indexed Hi-C (sciHi-C), a method that applies combinatorial cellular indexing to chromosome conformation capture. In this proof of concept, we generate and sequence six sciHi-C libraries comprising a total of 10,696 single cells. We use sciHi-C data to separate cells by karyotypic and cell-cycle state differences and identify cell-to-cell heterogeneity in mammalian chromosomal conformation. Our results demonstrate that combinatorial indexing is a generalizable strategy for single-cell genomics.


Assuntos
Cromossomos/genética , DNA/genética , Genoma Humano/genética , Genômica/métodos , Conformação Molecular , Análise de Célula Única/métodos , Ciclo Celular/genética , Linhagem Celular Tumoral , DNA/análise , Biblioteca Gênica , Células HeLa , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Humanos , Análise de Sequência de DNA/métodos
4.
Nature ; 500(7461): 207-11, 2013 Aug 08.
Artigo em Inglês | MEDLINE | ID: mdl-23925245

RESUMO

The HeLa cell line was established in 1951 from cervical cancer cells taken from a patient, Henrietta Lacks. This was the first successful attempt to immortalize human-derived cells in vitro. The robust growth and unrestricted distribution of HeLa cells resulted in its broad adoption--both intentionally and through widespread cross-contamination--and for the past 60 years it has served a role analogous to that of a model organism. The cumulative impact of the HeLa cell line on research is demonstrated by its occurrence in more than 74,000 PubMed abstracts (approximately 0.3%). The genomic architecture of HeLa remains largely unexplored beyond its karyotype, partly because like many cancers, its extensive aneuploidy renders such analyses challenging. We carried out haplotype-resolved whole-genome sequencing of the HeLa CCL-2 strain, examined point- and indel-mutation variations, mapped copy-number variations and loss of heterozygosity regions, and phased variants across full chromosome arms. We also investigated variation and copy-number profiles for HeLa S3 and eight additional strains. We find that HeLa is relatively stable in terms of point variation, with few new mutations accumulating after early passaging. Haplotype resolution facilitated reconstruction of an amplified, highly rearranged region of chromosome 8q24.21 at which integration of the human papilloma virus type 18 (HPV-18) genome occurred and that is likely to be the event that initiated tumorigenesis. We combined these maps with RNA-seq and ENCODE Project data sets to phase the HeLa epigenome. This revealed strong, haplotype-specific activation of the proto-oncogene MYC by the integrated HPV-18 genome approximately 500 kilobases upstream, and enabled global analyses of the relationship between gene dosage and expression. These data provide an extensively phased, high-quality reference genome for past and future experiments relying on HeLa, and demonstrate the value of haplotype resolution for characterizing cancer genomes and epigenomes.


Assuntos
Epigenômica , Genoma Humano/genética , Aneuploidia , Variações do Número de Cópias de DNA , Feminino , Genes myc/genética , Haplótipos , Células HeLa , Papillomavirus Humano 18/genética , Papillomavirus Humano 18/fisiologia , Humanos , Dados de Sequência Molecular , Mutação , Proto-Oncogene Mas , Análise de Sequência de DNA , Ativação Transcricional/genética , Neoplasias do Colo do Útero/genética , Neoplasias do Colo do Útero/patologia , Neoplasias do Colo do Útero/virologia
5.
Am J Hum Genet ; 86(1): 34-44, 2010 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-20085712

RESUMO

It is well known that average levels of population structure are higher on the X chromosome compared to autosomes in humans. However, there have been surprisingly few analyses on the spatial distribution of population structure along the X chromosome. With publicly available data from the HapMap Project and Perlegen Sciences, we show a strikingly punctuated pattern of X chromosome population structure. Specifically, 87% of X-linked HapMap SNPs within the top 1% of F(ST) values cluster into five distinct loci. The largest of these regions spans 5.4 Mb and contains 66% of the most highly differentiated HapMap SNPs on the X chromosome. We demonstrate that the extreme clustering of highly differentiated SNPs on the X chromosome is not an artifact of ascertainment bias, nor is it specific to the populations genotyped in the HapMap Project. Rather, additional analyses and resequencing data suggest that these five regions have been substrates of recent and strong adaptive evolution. Finally, we discuss the implications that patterns of X-linked population structure have on the evolutionary history of African populations.


Assuntos
População Negra/genética , Cromossomos Humanos X , Evolução Molecular , Genética Populacional , Grupos Populacionais/genética , Alelos , Bases de Dados Genéticas , Frequência do Gene , Genoma Humano , Genótipo , Humanos , Modelos Genéticos , Polimorfismo de Nucleotídeo Único , Análise de Sequência de DNA , Estados Unidos
6.
Genome Biol ; 22(1): 279, 2021 09 27.
Artigo em Inglês | MEDLINE | ID: mdl-34579774

RESUMO

BACKGROUND: Mammalian development is associated with extensive changes in gene expression, chromatin accessibility, and nuclear structure. Here, we follow such changes associated with mouse embryonic stem cell differentiation and X inactivation by integrating, for the first time, allele-specific data from these three modalities obtained by high-throughput single-cell RNA-seq, ATAC-seq, and Hi-C. RESULTS: Allele-specific contact decay profiles obtained by single-cell Hi-C clearly show that the inactive X chromosome has a unique profile in differentiated cells that have undergone X inactivation. Loss of this inactive X-specific structure at mitosis is followed by its reappearance during the cell cycle, suggesting a "bookmark" mechanism. Differentiation of embryonic stem cells to follow the onset of X inactivation is associated with changes in contact decay profiles that occur in parallel on both the X chromosomes and autosomes. Single-cell RNA-seq and ATAC-seq show evidence of a delay in female versus male cells, due to the presence of two active X chromosomes at early stages of differentiation. The onset of the inactive X-specific structure in single cells occurs later than gene silencing, consistent with the idea that chromatin compaction is a late event of X inactivation. Single-cell Hi-C highlights evidence of discrete changes in nuclear structure characterized by the acquisition of very long-range contacts throughout the nucleus. Novel computational approaches allow for the effective alignment of single-cell gene expression, chromatin accessibility, and 3D chromosome structure. CONCLUSIONS: Based on trajectory analyses, three distinct nuclear structure states are detected reflecting discrete and profound simultaneous changes not only to the structure of the X chromosomes, but also to that of autosomes during differentiation. Our study reveals that long-range structural changes to chromosomes appear as discrete events, unlike progressive changes in gene expression and chromatin accessibility.


Assuntos
Diferenciação Celular/genética , Expressão Gênica , Células-Tronco Embrionárias Murinas/metabolismo , Inativação do Cromossomo X , Alelos , Animais , Ciclo Celular , Linhagem Celular , Núcleo Celular/genética , Feminino , Genoma , Masculino , Camundongos , RNA-Seq , Análise de Célula Única , Cromossomo X/química
7.
Cell Rep ; 26(9): 2465-2476.e4, 2019 02 26.
Artigo em Inglês | MEDLINE | ID: mdl-30811994

RESUMO

A complete view of eukaryotic gene regulation requires that we accurately delineate how transcription factors (TFs) and nucleosomes are arranged along linear DNA in a sensitive, unbiased manner. Here we introduce MNase-SSP, a single-stranded sequencing library preparation method for nuclease-digested chromatin that enables simultaneous mapping of TF and nucleosome positions. As a proof of concept, we apply MNase-SSP toward the genome-wide, high-resolution mapping of nucleosome and TF occupancy in murine embryonic stem cells (mESCs). Compared with existing MNase-seq protocols, MNase-SSP markedly enriches for short DNA fragments, enabling detection of binding by subnucleosomal particles and TFs, in addition to nucleosomes. From these same data, we identify multiple, sequence-dependent binding modes of the architectural TF Ctcf and extend this analysis to the TF Nrsf/Rest. Looking forward, we anticipate that single stranded protocol (SSP) adaptations of any protein-DNA interaction mapping technique (e.g., ChIP-exo and CUT&RUN) will enhance the information content of the resulting data.


Assuntos
Cromatina/química , Nuclease do Micrococo , Análise de Sequência de DNA/métodos , Animais , Sítios de Ligação , Linhagem Celular , DNA/química , DNA/metabolismo , DNA de Cadeia Simples/química , Células-Tronco Embrionárias/metabolismo , Feminino , Biblioteca Gênica , Masculino , Camundongos , Nucleossomos , Regiões Promotoras Genéticas , Sequências Reguladoras de Ácido Nucleico , Fatores de Transcrição/metabolismo
8.
Genome Biol Evol ; 11(12): 3353-3371, 2019 12 01.
Artigo em Inglês | MEDLINE | ID: mdl-31702783

RESUMO

The genus Rhododendron (Ericaceae), which includes horticulturally important plants such as azaleas, is a highly diverse and widely distributed genus of >1,000 species. Here, we report the chromosome-scale de novo assembly and genome annotation of Rhododendron williamsianum as a basis for continued study of this large genus. We created multiple short fragment genomic libraries, which were assembled using ALLPATHS-LG. This was followed by contiguity preserving transposase sequencing (CPT-seq) and fragScaff scaffolding of a large fragment library, which improved the assembly by decreasing the number of scaffolds and increasing scaffold length. Chromosome-scale scaffolding was performed by proximity-guided assembly (LACHESIS) using chromatin conformation capture (Hi-C) data. Chromosome-scale scaffolding was further refined and linkage groups defined by restriction-site associated DNA (RAD) sequencing of the parents and progeny of a genetic cross. The resulting linkage map confirmed the LACHESIS clustering and ordering of scaffolds onto chromosomes and rectified large-scale inversions. Assessments of the R. williamsianum genome assembly and gene annotation estimate them to be 89% and 79% complete, respectively. Predicted coding sequences from genome annotation were used in syntenic analyses and for generating age distributions of synonymous substitutions/site between paralgous gene pairs, which identified whole-genome duplications (WGDs) in R. williamsianum. We then analyzed other publicly available Ericaceae genomes for shared WGDs. Based on our spatial and temporal analyses of paralogous gene pairs, we find evidence for two shared, ancient WGDs in Rhododendron and Vaccinium (cranberry/blueberry) members that predate the Ericaceae family and, in one case, the Ericales order.


Assuntos
Cromossomos de Plantas/genética , Ericaceae/genética , Genoma de Planta/genética , Rhododendron/genética , Sintenia , Sequência de Bases , Cromatina/genética , Mapeamento Cromossômico , Ligação Genética , Biblioteca Genômica , Anotação de Sequência Molecular , Transposases/genética
9.
Science ; 360(6393)2018 06 08.
Artigo em Inglês | MEDLINE | ID: mdl-29880660

RESUMO

Genetic studies of human evolution require high-quality contiguous ape genome assemblies that are not guided by the human reference. We coupled long-read sequence assembly and full-length complementary DNA sequencing with a multiplatform scaffolding approach to produce ab initio chimpanzee and orangutan genome assemblies. By comparing these with two long-read de novo human genome assemblies and a gorilla genome assembly, we characterized lineage-specific and shared great ape genetic variation ranging from single- to mega-base pair-sized variants. We identified ~17,000 fixed human-specific structural variants identifying genic and putative regulatory changes that have emerged in humans since divergence from nonhuman apes. Interestingly, these variants are enriched near genes that are down-regulated in human compared to chimpanzee cerebral organoids, particularly in cells analogous to radial glial neural progenitors.


Assuntos
Evolução Molecular , Genoma Humano , Hominidae/genética , Animais , Mapeamento de Sequências Contíguas , Variação Genética , Humanos , Anotação de Sequência Molecular , Análise de Sequência de DNA
11.
Nat Protoc ; 11(11): 2104-21, 2016 11.
Artigo em Inglês | MEDLINE | ID: mdl-27685100

RESUMO

With the advent of massively parallel sequencing, considerable work has gone into adapting chromosome conformation capture (3C) techniques to study chromosomal architecture at a genome-wide scale. We recently demonstrated that the inactive murine X chromosome adopts a bipartite structure using a novel 3C protocol, termed in situ DNase Hi-C. Like traditional Hi-C protocols, in situ DNase Hi-C requires that chromatin be chemically cross-linked, digested, end-repaired, and proximity-ligated with a biotinylated bridge adaptor. The resulting ligation products are optionally sheared, affinity-purified via streptavidin bead immobilization, and subjected to traditional next-generation library preparation for Illumina paired-end sequencing. Importantly, in situ DNase Hi-C obviates the dependence on a restriction enzyme to digest chromatin, instead relying on the endonuclease DNase I. Libraries generated by in situ DNase Hi-C have a higher effective resolution than traditional Hi-C libraries, which makes them valuable in cases in which high sequencing depth is allowed for, or when hybrid capture technologies are expected to be used. The protocol described here, which involves ∼4 d of bench work, is optimized for the study of mammalian cells, but it can be broadly applicable to any cell or tissue of interest, given experimental parameter optimization.


Assuntos
Mapeamento Cromossômico/métodos , Desoxirribonucleases/metabolismo , Biotinilação , Formaldeído/metabolismo , Biblioteca Gênica , Humanos
12.
Nat Biotechnol ; 33(9): 980-4, 2015 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-26237516

RESUMO

We present an unbiased method to globally resolve RNA structures through pairwise contact measurements between interacting regions. RNA proximity ligation (RPL) uses proximity ligation of native RNA followed by deep sequencing to yield chimeric reads with ligation junctions in the vicinity of structurally proximate bases. We apply RPL in both baker's yeast (Saccharomyces cerevisiae) and human cells and generate contact probability maps for ribosomal and other abundant RNAs, including yeast snoRNAs, the RNA subunit of the signal recognition particle and the yeast U2 spliceosomal RNA homolog. RPL measurements correlate with established secondary structures for these RNA molecules, including stem-loop structures and long-range pseudoknots. We anticipate that RPL will complement the current repertoire of computational and experimental approaches in enabling the high-throughput determination of secondary and tertiary RNA structures.


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala/métodos , Modelos Químicos , Modelos Moleculares , RNA/genética , RNA/ultraestrutura , Análise de Sequência de RNA/métodos , Sequência de Bases , Simulação por Computador , Modelos Genéticos , Dados de Sequência Molecular , Conformação de Ácido Nucleico , Reação em Cadeia da Polimerase em Tempo Real/métodos , Alinhamento de Sequência/métodos
13.
Nat Biotechnol ; 31(12): 1119-25, 2013 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-24185095

RESUMO

Genomes assembled de novo from short reads are highly fragmented relative to the finished chromosomes of Homo sapiens and key model organisms generated by the Human Genome Project. To address this problem, we need scalable, cost-effective methods to obtain assemblies with chromosome-scale contiguity. Here we show that genome-wide chromatin interaction data sets, such as those generated by Hi-C, are a rich source of long-range information for assigning, ordering and orienting genomic sequences to chromosomes, including across centromeres. To exploit this finding, we developed an algorithm that uses Hi-C data for ultra-long-range scaffolding of de novo genome assemblies. We demonstrate the approach by combining shotgun fragment and short jump mate-pair sequences with Hi-C data to generate chromosome-scale de novo assemblies of the human, mouse and Drosophila genomes, achieving--for the human genome--98% accuracy in assigning scaffolds to chromosome groups and 99% accuracy in ordering and orienting scaffolds within chromosome groups. Hi-C data can also be used to validate chromosomal translocations in cancer genomes.


Assuntos
Algoritmos , Cromatina/genética , Mapeamento Cromossômico/métodos , Mapeamento de Sequências Contíguas/métodos , Análise de Sequência de DNA/métodos , Animais , Sequência de Bases , Drosophila , Humanos , Camundongos , Dados de Sequência Molecular
14.
Sci Transl Med ; 4(137): 137ra76, 2012 Jun 06.
Artigo em Inglês | MEDLINE | ID: mdl-22674554

RESUMO

Analysis of cell-free fetal DNA in maternal plasma holds promise for the development of noninvasive prenatal genetic diagnostics. Previous studies have been restricted to detection of fetal trisomies, to specific paternally inherited mutations, or to genotyping common polymorphisms using material obtained invasively, for example, through chorionic villus sampling. Here, we combine genome sequencing of two parents, genome-wide maternal haplotyping, and deep sequencing of maternal plasma DNA to noninvasively determine the genome sequence of a human fetus at 18.5 weeks of gestation. Inheritance was predicted at 2.8 × 10(6) parental heterozygous sites with 98.1% accuracy. Furthermore, 39 of 44 de novo point mutations in the fetal genome were detected, albeit with limited specificity. Subsampling these data and analyzing a second family trio by the same approach indicate that parental haplotype blocks of ~300 kilo-base pairs combined with shallow sequencing of maternal plasma DNA is sufficient to substantially determine the inherited complement of a fetal genome. However, ultradeep sequencing of maternal plasma DNA is necessary for the practical detection of fetal de novo mutations genome-wide. Although technical and analytical challenges remain, we anticipate that noninvasive analysis of inherited variation and de novo mutations in fetal genomes will facilitate prenatal diagnosis of both recessive and dominant Mendelian disorders.


Assuntos
DNA/sangue , Feto/metabolismo , Diagnóstico Pré-Natal/métodos , DNA/genética , Feminino , Idade Gestacional , Haplótipos/genética , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Gravidez
15.
Nat Biotechnol ; 29(1): 59-63, 2011 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-21170042

RESUMO

Haplotype information is essential to the complete description and interpretation of genomes, genetic diversity and genetic ancestry. Although individual human genome sequencing is increasingly routine, nearly all such genomes are unresolved with respect to haplotype. Here we combine the throughput of massively parallel sequencing with the contiguity information provided by large-insert cloning to experimentally determine the haplotype-resolved genome of a South Asian individual. A single fosmid library was split into a modest number of pools, each providing ∼3% physical coverage of the diploid genome. Sequencing of each pool yielded reads overwhelmingly derived from only one homologous chromosome at any given location. These data were combined with whole-genome shotgun sequence to directly phase 94% of ascertained heterozygous single nucleotide polymorphisms (SNPs) into long haplotype blocks (N50 of 386 kilobases (kbp)). This method also facilitates the analysis of structural variation, for example, to anchor novel insertions to specific locations and haplotypes.


Assuntos
Povo Asiático/genética , Genoma Humano/genética , Haplótipos/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Sequência de DNA/métodos , Sequência de Bases , Linhagem Celular , Heterozigoto , Humanos , Modelos Moleculares , Polimorfismo de Nucleotídeo Único/genética
16.
Genome Res ; 15(9): 1250-7, 2005 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-16140993

RESUMO

Allelic variation in codons that specify amino acids that line the peptide-binding pockets of HLA's Class II antigen-presenting proteins is superimposed on strikingly few deeply diverged haplotypes. These haplotypes appear to have been evolving almost independently for tens of millions of years. By complete resequencing of 20 haplotypes across the approximately 100-kbp region that spans the HLA-DQA1, -DQB1, and -DRB1 genes, we provide a detailed view of the way in which the genome structure at this locus has been shaped by the interplay of selection, gene-gene interaction, and recombination.


Assuntos
Genes MHC da Classe II , Alelos , Animais , Evolução Molecular , Variação Genética , Genoma Humano , Gorilla gorilla/genética , Gorilla gorilla/imunologia , Antígenos HLA-DQ/genética , Cadeias alfa de HLA-DQ , Cadeias beta de HLA-DQ , Antígenos HLA-DR/genética , Cadeias HLA-DRB1 , Haplótipos , Humanos , Desequilíbrio de Ligação , Modelos Genéticos , Dados de Sequência Molecular , Pan troglodytes/genética , Pan troglodytes/imunologia , Filogenia , Polimorfismo de Nucleotídeo Único , Recombinação Genética , Seleção Genética
17.
Genomics ; 86(6): 759-66, 2005 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-16249066

RESUMO

Currently, challenges exist to acquire long-range (hundreds of kilobase pairs) phase-discriminated sequence across substantial numbers of individuals. We have developed a straightforward method for isolating and characterizing specific genomic regions in a haplospecific manner. Real-time PCR is carried out to STS content map and genotype pools of fosmid clones arrayed in 384-well microtiter plates. Single-nucleotide polymorphisms, microsatellite markers, and insertion-deletion polymorphisms are used to differentiate the target region into haplotype-specific tiling paths. DNA of clones from these tiling paths is retrieved from the library and either sequenced by standard shotgun methods or amplified in vitro and sequenced by a primer-based, directed method. This approach provides convenient access to complete, haplotype-resolved resequencing data from multiple individuals across tens to hundreds of thousands of basepairs. We illustrate its implementation with a detailed example of more than 400 kbp from the human CFTR region, across 15 individuals, and summarize our experience applying it to many other human loci.


Assuntos
Genoma Humano/genética , Haplótipos/genética , Análise de Sequência de DNA/métodos , Sequência de Bases , Clonagem Molecular/métodos , Genótipo , Humanos , Repetições de Microssatélites/genética , Dados de Sequência Molecular , Polimorfismo de Nucleotídeo Único/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA