Results 1 - 10 of 10
1.
Cell ; 163(3): 698-711, 2015 Oct 22.
Article in English | MEDLINE | ID: mdl-26496609

ABSTRACT

Most human transcripts are alternatively spliced, and many disease-causing mutations affect RNA splicing. Toward better modeling the sequence determinants of alternative splicing, we measured the splicing patterns of over two million (M) synthetic mini-genes, which include degenerate subsequences totaling over 100 M bases of variation. The massive size of these training data allowed us to improve upon current models of splicing, as well as to gain new mechanistic insights. Our results show that the vast majority of hexamer sequence motifs measurably influence splice site selection when positioned within alternative exons, with multiple motifs acting additively rather than cooperatively. Intriguingly, motifs that enhance (suppress) exon inclusion in alternative 5' splicing also enhance (suppress) exon inclusion in alternative 3' or cassette exon splicing, suggesting a universal mechanism for alternative exon recognition. Finally, our empirically trained models are highly predictive of the effects of naturally occurring variants on alternative splicing in vivo.
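
The additive behavior of hexamer motifs described in this abstract can be illustrated with a minimal sketch. It assumes a hypothetical lookup table `hexamer_effects` of per-hexamer scores (the values are invented for illustration and are not taken from the study) and simply sums the scores of every overlapping hexamer in a sequence. It is not the authors' trained model.

```python
def additive_hexamer_score(sequence, hexamer_effects):
    """Sum the effects of all overlapping 6-mers in `sequence`.

    Hexamers missing from the table contribute 0.0 (an assumption of this sketch).
    """
    k = 6
    total = 0.0
    for i in range(len(sequence) - k + 1):
        total += hexamer_effects.get(sequence[i:i + k], 0.0)
    return total


# Hypothetical effect sizes, invented for illustration only.
hexamer_effects = {"GAAGAA": 0.5, "AAGAAG": 0.25, "TAGGGT": -0.75}
```

Under a purely additive model like this one, the contribution of a motif does not depend on which other motifs are present, which is the contrast with cooperative action that the abstract draws.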


Subjects
Alternative Splicing , Human Genome , Genetic Models , Single Nucleotide Polymorphism , Base Sequence , Humans , Molecular Sequence Data , Nucleotide Motifs , RNA Splice Sites
2.
PLoS Genet ; 10(10): e1004592, 2014 Oct.
Article in English | MEDLINE | ID: mdl-25340400

ABSTRACT

In addition to their protein coding function, exons can also serve as transcriptional enhancers. Mutations in these exonic-enhancers (eExons) could alter both protein function and transcription. However, the functional consequence of eExon mutations is not well known. Here, using massively parallel reporter assays, we dissect the enhancer activity of three liver eExons (SORL1 exon 17, TRAF3IP2 exon 2, PPARG exon 6) at single nucleotide resolution in the mouse liver. We find that both synonymous and non-synonymous mutations have similar effects on enhancer activity and many of the deleterious mutation clusters overlap known liver-associated transcription factor binding sites. Carrying out a similar massively parallel reporter assay in HeLa cells with these three eExons revealed differences in their mutation profiles compared to the liver, suggesting that enhancers could have distinct operating profiles in different tissues. Our results demonstrate that eExon mutations could lead to multiple phenotypes by disrupting both the protein sequence and enhancer activity and that enhancers can have distinct mutation profiles in different cell types.


Subjects
Signal Transducing Adaptor Proteins/genetics , Genetic Enhancer Elements , Exons/genetics , Membrane Transport Proteins/genetics , PPAR gamma/genetics , LDL Receptors/genetics , Animals , Binding Sites , Gene Expression Regulation , HeLa Cells , Humans , Liver/metabolism , Mice , Missense Mutation , Single Nucleotide Polymorphism , RNA Splicing/genetics , Transcription Factors/biosynthesis
3.
Nat Methods ; 7(2): 119-22, 2010 Feb.
Article in English | MEDLINE | ID: mdl-20081835

ABSTRACT

We demonstrate subassembly, an in vitro library construction method that extends the utility of short-read sequencing platforms to applications requiring long, accurate reads. A long DNA fragment library is converted to a population of nested sublibraries, and a tag sequence directs grouping of short reads derived from the same long fragment, enabling localized assembly of long fragment sequences. Subassembly may facilitate accurate de novo genome assembly and metagenome sequencing.
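
The tag-directed grouping step described in this abstract can be sketched as follows. This is a minimal illustration, not the authors' pipeline: it assumes, hypothetically, that each short read carries its tag as a fixed-length prefix, and the function name `group_reads_by_tag` is invented here.

```python
from collections import defaultdict


def group_reads_by_tag(reads, tag_length):
    """Group reads by their leading tag.

    Assumes (for this sketch only) that the first `tag_length` bases of each
    read are the tag and the remainder is sequence from the long fragment.
    Returns {tag: [read bodies in input order]}.
    """
    groups = defaultdict(list)
    for read in reads:
        tag, body = read[:tag_length], read[tag_length:]
        groups[tag].append(body)
    return dict(groups)
```

Each resulting group holds reads presumed to come from the same long fragment, so a local assembly can then be run per group rather than on the whole read set.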


Subjects
Chromosome Mapping/methods , DNA Sequence Analysis/methods , Base Sequence , Expressed Sequence Tags , Molecular Sequence Data
4.
Genome Biol Evol ; 11(12): 3353-3371, 2019 Dec 01.
Article in English | MEDLINE | ID: mdl-31702783

ABSTRACT

The genus Rhododendron (Ericaceae), which includes horticulturally important plants such as azaleas, is a highly diverse and widely distributed genus of >1,000 species. Here, we report the chromosome-scale de novo assembly and genome annotation of Rhododendron williamsianum as a basis for continued study of this large genus. We created multiple short fragment genomic libraries, which were assembled using ALLPATHS-LG. This was followed by contiguity preserving transposase sequencing (CPT-seq) and fragScaff scaffolding of a large fragment library, which improved the assembly by decreasing the number of scaffolds and increasing scaffold length. Chromosome-scale scaffolding was performed by proximity-guided assembly (LACHESIS) using chromatin conformation capture (Hi-C) data. Chromosome-scale scaffolding was further refined and linkage groups defined by restriction-site associated DNA (RAD) sequencing of the parents and progeny of a genetic cross. The resulting linkage map confirmed the LACHESIS clustering and ordering of scaffolds onto chromosomes and rectified large-scale inversions. Assessments of the R. williamsianum genome assembly and gene annotation estimate them to be 89% and 79% complete, respectively. Predicted coding sequences from genome annotation were used in syntenic analyses and for generating age distributions of synonymous substitutions/site between paralogous gene pairs, which identified whole-genome duplications (WGDs) in R. williamsianum. We then analyzed other publicly available Ericaceae genomes for shared WGDs. Based on our spatial and temporal analyses of paralogous gene pairs, we find evidence for two shared, ancient WGDs in Rhododendron and Vaccinium (cranberry/blueberry) members that predate the Ericaceae family and, in one case, the Ericales order.


Subjects
Plant Chromosomes/genetics , Ericaceae/genetics , Plant Genome/genetics , Rhododendron/genetics , Synteny , Base Sequence , Chromatin/genetics , Chromosome Mapping , Genetic Linkage , Genomic Library , Molecular Sequence Annotation , Transposases/genetics
5.
Nat Biotechnol ; 31(12): 1119-25, 2013 Dec.
Article in English | MEDLINE | ID: mdl-24185095

ABSTRACT

Genomes assembled de novo from short reads are highly fragmented relative to the finished chromosomes of Homo sapiens and key model organisms generated by the Human Genome Project. To address this problem, we need scalable, cost-effective methods to obtain assemblies with chromosome-scale contiguity. Here we show that genome-wide chromatin interaction data sets, such as those generated by Hi-C, are a rich source of long-range information for assigning, ordering and orienting genomic sequences to chromosomes, including across centromeres. To exploit this finding, we developed an algorithm that uses Hi-C data for ultra-long-range scaffolding of de novo genome assemblies. We demonstrate the approach by combining shotgun fragment and short jump mate-pair sequences with Hi-C data to generate chromosome-scale de novo assemblies of the human, mouse and Drosophila genomes, achieving, for the human genome, 98% accuracy in assigning scaffolds to chromosome groups and 99% accuracy in ordering and orienting scaffolds within chromosome groups. Hi-C data can also be used to validate chromosomal translocations in cancer genomes.
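
The idea of assigning a sequence to a chromosome group from interaction data can be illustrated with a deliberately simplified sketch. It is not the published algorithm: it assumes a hypothetical `links` table of Hi-C read-pair counts between contigs and assigns a contig to whichever non-empty group has the highest mean link count to it. All names here are invented for illustration.

```python
def assign_to_group(contig, groups, links):
    """Return the name of the group with the highest mean link count to `contig`.

    groups: {group_name: [contig names]}, every group assumed non-empty
    links:  {frozenset({a, b}): number of read pairs joining contigs a and b}
    Pairs absent from `links` count as 0.
    """
    def mean_links(members):
        counts = [links.get(frozenset((contig, m)), 0) for m in members]
        return sum(counts) / len(counts)

    return max(groups, key=lambda name: mean_links(groups[name]))
```

Averaging over group members, rather than summing, keeps a large group from winning merely because it contains more contigs.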


Subjects
Algorithms , Chromatin/genetics , Chromosome Mapping/methods , Contig Mapping/methods , DNA Sequence Analysis/methods , Animals , Base Sequence , Drosophila , Humans , Mice , Molecular Sequence Data
6.
Nat Genet ; 45(9): 1021-1028, 2013 Sep.
Article in English | MEDLINE | ID: mdl-23892608

ABSTRACT

Despite continual progress in the cataloging of vertebrate regulatory elements, little is known about their organization and regulatory architecture. Here we describe a massively parallel experiment to systematically test the impact of copy number, spacing, combination and order of transcription factor binding sites on gene expression. A complex library of ∼5,000 synthetic regulatory elements containing patterns from 12 liver-specific transcription factor binding sites was assayed in mice and in HepG2 cells. We find that certain transcription factors act as direct drivers of gene expression in homotypic clusters of binding sites, independent of spacing between sites, whereas others function only synergistically. Heterotypic enhancers are stronger than their homotypic analogs and favor specific transcription factor binding site combinations, mimicking putative native enhancers. Exhaustive testing of binding site permutations suggests that there is flexibility in binding site order. Our findings provide quantitative support for a flexible model of regulatory element activity and suggest a framework for the design of synthetic tissue-specific enhancers.


Subjects
Gene Expression Regulation , Biological Models , Nucleic Acid Regulatory Sequences , Transcription Factors/metabolism , Animals , Binding Sites , Cell Line , Cluster Analysis , Genetic Enhancer Elements , Gene Amplification , Gene Dosage , Gene Expression , Reporter Genes , Humans , Liver/metabolism , Male , Mice , Nucleotide Motifs , Organ Specificity/genetics , Protein Binding
7.
Nat Biotechnol ; 30(3): 265-70, 2012 Feb 26.
Article in English | MEDLINE | ID: mdl-22371081

ABSTRACT

The functional consequences of genetic variation in mammalian regulatory elements are poorly understood. We report the in vivo dissection of three mammalian enhancers at single-nucleotide resolution through a massively parallel reporter assay. For each enhancer, we synthesized a library of >100,000 mutant haplotypes with 2-3% divergence from the wild-type sequence. Each haplotype was linked to a unique sequence tag embedded within a transcriptional cassette. We introduced each enhancer library into mouse liver and measured the relative activities of individual haplotypes en masse by sequencing the transcribed tags. Linear regression analysis yielded highly reproducible estimates of the effect of every possible single-nucleotide change on enhancer activity. The functional consequence of most mutations was modest, with ∼22% affecting activity by >1.2-fold and ∼3% by >2-fold. Several, but not all, positions with higher effects showed evidence for purifying selection, or co-localized with known liver-associated transcription factor binding sites, demonstrating the value of empirical high-resolution functional analysis.


Subjects
Genetic Enhancer Elements , Transcription Factors/genetics , Animals , Binding Sites , Molecular Evolution , Reporter Genes , Haplotypes , Humans , Linear Models , Liver/metabolism , Mice , Mutagenesis , Mutation , Transcription Factors/metabolism , Genetic Transcription
8.
Nat Biotechnol ; 29(1): 59-63, 2011 Jan.
Article in English | MEDLINE | ID: mdl-21170042

ABSTRACT

Haplotype information is essential to the complete description and interpretation of genomes, genetic diversity and genetic ancestry. Although individual human genome sequencing is increasingly routine, nearly all such genomes are unresolved with respect to haplotype. Here we combine the throughput of massively parallel sequencing with the contiguity information provided by large-insert cloning to experimentally determine the haplotype-resolved genome of a South Asian individual. A single fosmid library was split into a modest number of pools, each providing ∼3% physical coverage of the diploid genome. Sequencing of each pool yielded reads overwhelmingly derived from only one homologous chromosome at any given location. These data were combined with whole-genome shotgun sequence to directly phase 94% of ascertained heterozygous single nucleotide polymorphisms (SNPs) into long haplotype blocks (N50 of 386 kilobases (kbp)). This method also facilitates the analysis of structural variation, for example, to anchor novel insertions to specific locations and haplotypes.
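
The N50 statistic quoted for the haplotype blocks can be computed as in the sketch below. It uses the common definition (the largest length L such that blocks of length at least L together cover at least half of the total length). The function name and the example lengths are illustrative, not data from the study.

```python
def n50(lengths):
    """Return the N50 of a collection of block lengths.

    Blocks are visited from longest to shortest; the N50 is the length of the
    block at which the running total first reaches half of the overall total.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    half = sum(lengths) / 2
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running >= half:
            return length
```

For example, for blocks of length 2, 3, 4, 5 and 6 (total 20), the two longest blocks already cover 11, so the N50 is 5.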


Subjects
Asian People/genetics , Human Genome/genetics , Haplotypes/genetics , High-Throughput Nucleotide Sequencing/methods , DNA Sequence Analysis/methods , Base Sequence , Cell Line , Heterozygote , Humans , Molecular Models , Single Nucleotide Polymorphism/genetics
9.
Science ; 331(6017): 555-61, 2011 Feb 04.
Article in English | MEDLINE | ID: mdl-21292972

ABSTRACT

We describe the draft genome of the microcrustacean Daphnia pulex, which is only 200 megabases and contains at least 30,907 genes. The high gene count is a consequence of an elevated rate of gene duplication resulting in tandem gene clusters. More than a third of Daphnia's genes have no detectable homologs in any other available proteome, and the most amplified gene families are specific to the Daphnia lineage. The coexpansion of gene families interacting within metabolic pathways suggests that the maintenance of duplicated genes is not random, and the analysis of gene expression under different environmental conditions reveals that numerous paralogs acquire divergent expression patterns soon after duplication. Daphnia-specific genes, including many additional loci within sequenced regions that are otherwise devoid of annotations, are the most responsive genes to ecological challenges.


Subjects
Daphnia/genetics , Ecosystem , Genome , Physiological Adaptation , Amino Acid Sequence , Animals , Base Sequence , Chromosome Mapping , Daphnia/physiology , Environment , Molecular Evolution , Gene Conversion , Gene Duplication , Gene Expression , Gene Expression Profiling , Gene Expression Regulation , Genes , Duplicate Genes , Metabolic Networks and Pathways/genetics , Molecular Sequence Annotation , Molecular Sequence Data , Multigene Family , Phylogeny , DNA Sequence Analysis
10.
Nat Biotechnol ; 27(12): 1173-5, 2009 Dec.
Article in English | MEDLINE | ID: mdl-19915551

ABSTRACT

We present a method that harnesses massively parallel DNA synthesis and sequencing for the high-throughput functional analysis of regulatory sequences at single-nucleotide resolution. As a proof of concept, we quantitatively assayed the effects of all possible single-nucleotide mutations for three bacteriophage promoters and three mammalian core promoters in a single experiment per promoter. The method may also serve as a rapid screening tool for regulatory element engineering in synthetic biology.
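
To make concrete what "all possible single-nucleotide mutations" means for a sequence of length n (3n variants over the DNA alphabet), here is a minimal enumeration sketch. The function name and the 0-based position convention are assumptions of this example, and it says nothing about how the study's libraries were actually designed or synthesized.

```python
def single_nucleotide_variants(sequence, alphabet="ACGT"):
    """Yield (position, ref_base, alt_base, mutant_sequence) for every
    possible single-base substitution. Positions are 0-based."""
    for pos, ref in enumerate(sequence):
        for alt in alphabet:
            if alt != ref:
                yield pos, ref, alt, sequence[:pos] + alt + sequence[pos + 1:]
```

A two-base sequence such as `"AC"` therefore yields six variants, three per position.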


Subjects
Algorithms , DNA/chemistry , DNA/genetics , Site-Directed Mutagenesis/methods , Transcriptional Regulatory Elements/genetics , DNA Sequence Analysis/methods , Base Sequence , Molecular Sequence Data