Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 19 de 19
Filtrar
1.
Cell ; 180(6): 1262-1271.e15, 2020 03 19.
Artículo en Inglés | MEDLINE | ID: mdl-32169219

RESUMEN

Establishing causal links between non-coding variants and human phenotypes is an increasing challenge. Here, we introduce a high-throughput mouse reporter assay for assessing the pathogenic potential of human enhancer variants in vivo and examine nearly a thousand variants in an enhancer repeatedly linked to polydactyly. We show that 71% of all rare non-coding variants previously proposed as causal lead to reporter gene expression in a pattern consistent with their pathogenic role. Variants observed to alter enhancer activity were further confirmed to cause polydactyly in knockin mice. We also used combinatorial and single-nucleotide mutagenesis to evaluate the in vivo impact of mutations affecting all positions of the enhancer and identified additional functional substitutions, including potentially pathogenic variants hitherto not observed in humans. Our results uncover the functional consequences of hundreds of mutations in a phenotype-associated enhancer and establish a widely applicable strategy for systematic in vivo evaluation of human enhancer variants.


Asunto(s)
Elementos de Facilitación Genéticos/genética , Ensayos Analíticos de Alto Rendimiento/métodos , Polidactilia/genética , Animales , Elementos de Facilitación Genéticos/fisiología , Regulación del Desarrollo de la Expresión Génica/genética , Técnicas de Sustitución del Gen/métodos , Proteínas Hedgehog/genética , Proteínas Hedgehog/metabolismo , Humanos , Ratones , Mutación , Fenotipo , Polidactilia/metabolismo , ARN no Traducido/genética
2.
Cell ; 172(3): 491-499.e15, 2018 01 25.
Artículo en Inglés | MEDLINE | ID: mdl-29358049

RESUMEN

Non-coding "ultraconserved" regions containing hundreds of consecutive bases of perfect sequence conservation across mammalian genomes can function as distant-acting enhancers. However, initial deletion studies in mice revealed that loss of such extraordinarily constrained sequences had no immediate impact on viability. Here, we show that ultraconserved enhancers are required for normal development. Focusing on some of the longest ultraconserved sites genome wide, located near the essential neuronal transcription factor Arx, we used genome editing to create an expanded series of knockout mice lacking individual or combinations of ultraconserved enhancers. Mice with single or pairwise deletions of ultraconserved enhancers were viable and fertile but in nearly all cases showed neurological or growth abnormalities, including substantial alterations of neuron populations and structural brain defects. Our results demonstrate the functional importance of ultraconserved enhancers and indicate that remarkably strong sequence conservation likely results from fitness deficits that appear subtle in a laboratory setting.


Asunto(s)
Secuencia Conservada , Desarrollo Embrionario/genética , Elementos de Facilitación Genéticos , Animales , Encéfalo/anomalías , Encéfalo/embriología , Encéfalo/metabolismo , Femenino , Eliminación de Gen , Proteínas de Homeodominio/genética , Proteínas de Homeodominio/metabolismo , Masculino , Ratones , Factores de Transcripción/genética , Factores de Transcripción/metabolismo
3.
Cell ; 167(3): 633-642.e11, 2016 Oct 20.
Artículo en Inglés | MEDLINE | ID: mdl-27768887

RESUMEN

The evolution of body shape is thought to be tightly coupled to changes in regulatory sequences, but specific molecular events associated with major morphological transitions in vertebrates have remained elusive. We identified snake-specific sequence changes within an otherwise highly conserved long-range limb enhancer of Sonic hedgehog (Shh). Transgenic mouse reporter assays revealed that the in vivo activity pattern of the enhancer is conserved across a wide range of vertebrates, including fish, but not in snakes. Genomic substitution of the mouse enhancer with its human or fish ortholog results in normal limb development. In contrast, replacement with snake orthologs caused severe limb reduction. Synthetic restoration of a single transcription factor binding site lost in the snake lineage reinstated full in vivo function to the snake enhancer. Our results demonstrate changes in a regulatory sequence associated with a major body plan transition and highlight the role of enhancers in morphological evolution. PAPERCLIP.


Asunto(s)
Evolución Biológica , Elementos de Facilitación Genéticos , Extremidades/crecimiento & desarrollo , Proteínas Hedgehog/genética , Serpientes/genética , Animales , Secuencia de Bases , Evolución Molecular , Técnicas de Sustitución del Gen , Ratones , Ratones Transgénicos , Mutación , Filogenia , Serpientes/clasificación
4.
Nature ; 583(7818): 744-751, 2020 07.
Artículo en Inglés | MEDLINE | ID: mdl-32728240

RESUMEN

The Encyclopedia of DNA Elements (ENCODE) project has established a genomic resource for mammalian development, profiling a diverse panel of mouse tissues at 8 developmental stages from 10.5 days after conception until birth, including transcriptomes, methylomes and chromatin states. Here we systematically examined the state and accessibility of chromatin in the developing mouse fetus. In total we performed 1,128 chromatin immunoprecipitation with sequencing (ChIP-seq) assays for histone modifications and 132 assay for transposase-accessible chromatin using sequencing (ATAC-seq) assays for chromatin accessibility across 72 distinct tissue-stages. We used integrative analysis to develop a unified set of chromatin state annotations, infer the identities of dynamic enhancers and key transcriptional regulators, and characterize the relationship between chromatin state and accessibility during developmental gene regulation. We also leveraged these data to link enhancers to putative target genes and demonstrate tissue-specific enrichments of sequence variants associated with disease in humans. The mouse ENCODE data sets provide a compendium of resources for biomedical researchers and achieve, to our knowledge, the most comprehensive view of chromatin dynamics during mammalian fetal development to date.


Asunto(s)
Cromatina/genética , Cromatina/metabolismo , Conjuntos de Datos como Asunto , Desarrollo Fetal/genética , Histonas/metabolismo , Anotación de Secuencia Molecular , Secuencias Reguladoras de Ácidos Nucleicos/genética , Animales , Cromatina/química , Secuenciación de Inmunoprecipitación de Cromatina , Enfermedad/genética , Elementos de Facilitación Genéticos/genética , Femenino , Regulación del Desarrollo de la Expresión Génica/genética , Variación Genética , Histonas/química , Humanos , Masculino , Ratones , Ratones Endogámicos C57BL , Especificidad de Órganos/genética , Reproducibilidad de los Resultados , Transposasas/metabolismo
6.
Nature ; 554(7691): 239-243, 2018 02 08.
Artículo en Inglés | MEDLINE | ID: mdl-29420474

RESUMEN

Distant-acting tissue-specific enhancers, which regulate gene expression, vastly outnumber protein-coding genes in mammalian genomes, but the functional importance of this regulatory complexity remains unclear. Here we show that the pervasive presence of multiple enhancers with similar activities near the same gene confers phenotypic robustness to loss-of-function mutations in individual enhancers. We used genome editing to create 23 mouse deletion lines and inter-crosses, including both single and combinatorial enhancer deletions at seven distinct loci required for limb development. Unexpectedly, none of the ten deletions of individual enhancers caused noticeable changes in limb morphology. By contrast, the removal of pairs of limb enhancers near the same gene resulted in discernible phenotypes, indicating that enhancers function redundantly in establishing normal morphology. In a genetic background sensitized by reduced baseline expression of the target gene, even single enhancer deletions caused limb abnormalities, suggesting that functional redundancy is conferred by additive effects of enhancers on gene expression levels. A genome-wide analysis integrating epigenomic and transcriptomic data from 29 developmental mouse tissues revealed that mammalian genes are very commonly associated with multiple enhancers that have similar spatiotemporal activity. Systematic exploration of three representative developmental structures (limb, brain and heart) uncovered more than one thousand cases in which five or more enhancers with redundant activity patterns were found near the same gene. Together, our data indicate that enhancer redundancy is a remarkably widespread feature of mammalian genomes that provides an effective regulatory buffer to prevent deleterious phenotypic consequences upon the loss of individual enhancers.


Asunto(s)
Elementos de Facilitación Genéticos/genética , Extremidades/embriología , Regulación del Desarrollo de la Expresión Génica/genética , Fenotipo , Animales , Encéfalo/embriología , Femenino , Genoma , Corazón/embriología , Deformidades Congénitas de las Extremidades/embriología , Deformidades Congénitas de las Extremidades/genética , Masculino , Ratones , Eliminación de Secuencia , Análisis Espacio-Temporal
7.
Nat Methods ; 17(8): 807-814, 2020 08.
Artículo en Inglés | MEDLINE | ID: mdl-32737473

RESUMEN

Enhancers are important non-coding elements, but they have traditionally been hard to characterize experimentally. The development of massively parallel assays allows the characterization of large numbers of enhancers for the first time. Here, we developed a framework using Drosophila STARR-seq to create shape-matching filters based on meta-profiles of epigenetic features. We integrated these features with supervised machine-learning algorithms to predict enhancers. We further demonstrated that our model could be transferred to predict enhancers in mammals. We comprehensively validated the predictions using a combination of in vivo and in vitro approaches, involving transgenic assays in mice and transduction-based reporter assays in human cell lines (153 enhancers in total). The results confirmed that our model can accurately predict enhancers in different species without re-parameterization. Finally, we examined the transcription factor binding patterns at predicted enhancers versus promoters. We demonstrated that these patterns enable the construction of a secondary model that effectively distinguishes enhancers and promoters.


Asunto(s)
Epigénesis Genética/fisiología , Reconocimiento de Normas Patrones Automatizadas/métodos , Animales , Línea Celular , Drosophila , Histonas/genética , Histonas/metabolismo , Humanos , Ratones , Ratones Transgénicos , Reproducibilidad de los Resultados
9.
Genome Res ; 28(8): 1126-1135, 2018 08.
Artículo en Inglés | MEDLINE | ID: mdl-29954844

RESUMEN

The SK-BR-3 cell line is one of the most important models for HER2+ breast cancers, which affect one in five breast cancer patients. SK-BR-3 is known to be highly rearranged, although much of the variation is in complex and repetitive regions that may be underreported. Addressing this, we sequenced SK-BR-3 using long-read single molecule sequencing from Pacific Biosciences and develop one of the most detailed maps of structural variations (SVs) in a cancer genome available, with nearly 20,000 variants present, most of which were missed by short-read sequencing. Surrounding the important ERBB2 oncogene (also known as HER2), we discover a complex sequence of nested duplications and translocations, suggesting a punctuated progression. Full-length transcriptome sequencing further revealed several novel gene fusions within the nested genomic variants. Combining long-read genome and transcriptome sequencing enables an in-depth analysis of how SVs disrupt the genome and sheds new light on the complex mechanisms involved in cancer genome evolution.


Asunto(s)
Neoplasias de la Mama/genética , Amplificación de Genes/genética , Reordenamiento Génico/genética , Oncogenes/genética , Neoplasias de la Mama/patología , Femenino , Genoma Humano , Variación Estructural del Genoma , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Células MCF-7 , Receptor ErbB-2/genética , Secuencias Repetitivas de Ácidos Nucleicos/genética , Transcriptoma/genética
10.
PLoS Genet ; 12(4): e1005954, 2016 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-27082250

RESUMEN

We report here the ~670 Mb genome assembly of the Asian seabass (Lates calcarifer), a tropical marine teleost. We used long-read sequencing augmented by transcriptomics, optical and genetic mapping along with shared synteny from closely related fish species to derive a chromosome-level assembly with a contig N50 size over 1 Mb and scaffold N50 size over 25 Mb that span ~90% of the genome. The population structure of L. calcarifer species complex was analyzed by re-sequencing 61 individuals representing various regions across the species' native range. SNP analyses identified high levels of genetic diversity and confirmed earlier indications of a population stratification comprising three clades with signs of admixture apparent in the South-East Asian population. The quality of the Asian seabass genome assembly far exceeds that of any other fish species, and will serve as a new standard for fish genomics.


Asunto(s)
Lubina/genética , Mapeo Cromosómico , Animales , Lubina/clasificación , Genoma , Hibridación Fluorescente in Situ , Filogenia
12.
Nat Methods ; 12(11): 1058-60, 2015 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-26344043

RESUMEN

We present Ginkgo (http://qb.cshl.edu/ginkgo), a user-friendly, open-source web platform for the analysis of single-cell copy-number variations (CNVs). Ginkgo automatically constructs copy-number profiles of cells from mapped reads and constructs phylogenetic trees of related cells. We validated Ginkgo by reproducing the results of five major studies. After comparing three commonly used single-cell amplification techniques, we concluded that degenerate oligonucleotide-primed PCR is the most consistent for CNV analysis.


Asunto(s)
Biología Computacional , Variaciones en el Número de Copia de ADN , Genoma Humano , Oligonucleótidos/genética , Algoritmos , Animales , Automatización , Análisis por Conglomerados , Drosophila , Femenino , Dosificación de Gen , Genoma , Humanos , Internet , Neoplasias Pulmonares/diagnóstico , Neoplasias Pulmonares/genética , Masculino , Ratones , Pan troglodytes , Filogenia , Reacción en Cadena de la Polimerasa , Ratas , Reproducibilidad de los Resultados , Cromosomas Sexuales , Carcinoma Pulmonar de Células Pequeñas/diagnóstico , Carcinoma Pulmonar de Células Pequeñas/genética , Programas Informáticos
13.
PLoS Comput Biol ; 13(8): e1005720, 2017 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-28827824

RESUMEN

Epigenomic mapping of enhancer-associated chromatin modifications facilitates the genome-wide discovery of tissue-specific enhancers in vivo. However, reliance on single chromatin marks leads to high rates of false-positive predictions. More sophisticated, integrative methods have been described, but commonly suffer from limited accessibility to the resulting predictions and reduced biological interpretability. Here we present the Limb-Enhancer Genie (LEG), a collection of highly accurate, genome-wide predictions of enhancers in the developing limb, available through a user-friendly online interface. We predict limb enhancers using a combination of >50 published limb-specific datasets and clusters of evolutionarily conserved transcription factor binding sites, taking advantage of the patterns observed at previously in vivo validated elements. By combining different statistical models, our approach outperforms current state-of-the-art methods and provides interpretable measures of feature importance. Our results indicate that including a previously unappreciated score that quantifies tissue-specific nuclease accessibility significantly improves prediction performance. We demonstrate the utility of our approach through in vivo validation of newly predicted elements. Moreover, we describe general features that can guide the type of datasets to include when predicting tissue-specific enhancers genome-wide, while providing an accessible resource to the general biological community and facilitating the functional interpretation of genetic studies of limb malformations.


Asunto(s)
Elementos de Facilitación Genéticos/genética , Extremidades/crecimiento & desarrollo , Genómica/métodos , Crecimiento y Desarrollo/genética , Programas Informáticos , Animales , Genoma/genética , Aprendizaje Automático , Ratones
14.
Cell Rep ; 40(12): 111400, 2022 09 20.
Artículo en Inglés | MEDLINE | ID: mdl-36130500

RESUMEN

Heart disease is associated with re-expression of key transcription factors normally active only during prenatal development of the heart. However, the impact of this reactivation on the regulatory landscape in heart disease is unclear. Here, we use RNA-seq and ChIP-seq targeting a histone modification associated with active transcriptional enhancers to generate genome-wide enhancer maps from left ventricle tissue from up to 26 healthy controls, 18 individuals with idiopathic dilated cardiomyopathy (DCM), and five fetal hearts. Healthy individuals have a highly reproducible epigenomic landscape, consisting of more than 33,000 predicted heart enhancers. In contrast, we observe reproducible disease-associated changes in activity at 6,850 predicted heart enhancers. Combined analysis of adult and fetal samples reveals that the heart disease epigenome and transcriptome both acquire fetal-like characteristics, with 3,400 individual enhancers sharing fetal regulatory properties. We also provide a comprehensive data resource (http://heart.lbl.gov) for the mechanistic exploration of DCM etiology.


Asunto(s)
Cardiomiopatía Dilatada , Elementos de Facilitación Genéticos , Adulto , Elementos de Facilitación Genéticos/genética , Epigenoma , Epigenómica , Humanos , Factores de Transcripción
15.
Plant Genome ; 14(2): e20101, 2021 07.
Artículo en Inglés | MEDLINE | ID: mdl-34109759

RESUMEN

Spinach (Spinacia oleracea L.) is a member of the Caryophyllales family, a basal eudicot asterid that consists of sugar beet (Beta vulgaris L. subsp. vulgaris), quinoa (Chenopodium quinoa Willd.), and amaranth (Amaranthus hypochondriacus L.). With the introduction of baby leaf types, spinach has become a staple food in many homes. Production issues focus on yield, nitrogen-use efficiency and resistance to downy mildew (Peronospora effusa). Although genomes are available for the above species, a chromosome-level assembly exists only for quinoa, allowing for proper annotation and structural analyses to enhance crop improvement. We independently assembled and annotated genomes of the cultivar Viroflay using short-read strategy (Illumina) and long-read strategies (Pacific Biosciences) to develop a chromosome-level, genetically anchored assembly for spinach. Scaffold N50 for the Illumina assembly was 389 kb, whereas that for Pacific BioSciences was 4.43 Mb, representing 911 Mb (93% of the genome) in 221 scaffolds, 80% of which are anchored and oriented on a sequence-based genetic map, also described within this work. The two assemblies were 99.5% collinear. Independent annotation of the two assemblies with the same comprehensive transcriptome dataset show that the quality of the assembly directly affects the annotation with significantly more genes predicted (26,862 vs. 34,877) in the long-read assembly. Analysis of resistance genes confirms a bias in resistant gene motifs more typical of monocots. Evolutionary analysis indicates that Spinacia is a paleohexaploid with a whole-genome triplication followed by extensive gene rearrangements identified in this work. Diversity analysis of 75 lines indicate that variation in genes is ample for hypothesis-driven, genomic-assisted breeding enabled by this work.


Asunto(s)
Peronospora , Spinacia oleracea , Cromosomas , Reordenamiento Génico , Fitomejoramiento , Spinacia oleracea/genética
16.
Nat Genet ; 49(6): 825-833, 2017 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-28481342

RESUMEN

The contributions of coding mutations to tumorigenesis are relatively well known; however, little is known about somatic alterations in noncoding DNA. Here we describe GECCO (Genomic Enrichment Computational Clustering Operation) to analyze somatic noncoding alterations in 308 pancreatic ductal adenocarcinomas (PDAs) and identify commonly mutated regulatory regions. We find recurrent noncoding mutations to be enriched in PDA pathways, including axon guidance and cell adhesion, and newly identified processes, including transcription and homeobox genes. We identified mutations in protein binding sites correlating with differential expression of proximal genes and experimentally validated effects of mutations on expression. We developed an expression modulation score that quantifies the strength of gene regulation imposed by each class of regulatory elements, and found the strongest elements were most frequently mutated, suggesting a selective advantage. Our detailed single-cancer analysis of noncoding alterations identifies regulatory mutations as candidates for diagnostic and prognostic markers, and suggests new mechanisms for tumor evolution.


Asunto(s)
Adenocarcinoma/genética , Carcinoma Ductal Pancreático/genética , Regulación Neoplásica de la Expresión Génica , Mutación , Neoplasias Pancreáticas/genética , Adenocarcinoma/mortalidad , Carcinoma Ductal Pancreático/mortalidad , Humanos , Neoplasias Pancreáticas/mortalidad , Proteínas Tirosina Fosfatasas Clase 8 Similares a Receptores/genética , Simportadores de Cloruro de Sodio-Potasio/genética
17.
Elife ; 62017 01 16.
Artículo en Inglés | MEDLINE | ID: mdl-28092266

RESUMEN

Many lines of evidence have indicated that both genetic and non-genetic determinants can contribute to intra-tumor heterogeneity and influence cancer outcomes. Among the best described sub-population of cancer cells generated by non-genetic mechanisms are cells characterized by a CD44+/CD24- cell surface marker profile. Here, we report that human CD44+/CD24- cancer cells are genetically highly unstable because of intrinsic defects in their DNA-repair capabilities. In fact, in CD44+/CD24- cells, constitutive activation of the TGF-beta axis was both necessary and sufficient to reduce the expression of genes that are crucial in coordinating DNA damage repair mechanisms. Consequently, we observed that cancer cells that reside in a CD44+/CD24- state are characterized by increased accumulation of DNA copy number alterations, greater genetic diversity and improved adaptability to drug treatment. Together, these data suggest that the transition into a CD44+/CD24- cell state can promote intra-tumor genetic heterogeneity, spur tumor evolution and increase tumor fitness.


Asunto(s)
Antígeno CD24/análisis , Roturas del ADN de Doble Cadena , Reparación del ADN , Variación Genética , Receptores de Hialuranos/análisis , Neoplasias/fisiopatología , Factor de Crecimiento Transformador beta/metabolismo , Línea Celular Tumoral , Dosificación de Gen , Humanos , Mutación
18.
Nat Commun ; 7: 12923, 2016 10 05.
Artículo en Inglés | MEDLINE | ID: mdl-27703156

RESUMEN

Whole-genome sequencing is identifying growing numbers of non-coding variants in human disease studies, but the lack of accurate functional annotations prevents their interpretation. We describe the genome-wide landscape of distant-acting enhancers active in the developing and adult human heart, an organ whose impairment is a predominant cause of mortality and morbidity. Using integrative analysis of >35 epigenomic data sets from mouse and human pre- and postnatal hearts we created a comprehensive reference of >80,000 putative human heart enhancers. To illustrate the importance of enhancers in the regulation of genes involved in heart disease, we deleted the mouse orthologs of two human enhancers near cardiac myosin genes. In both cases, we observe in vivo expression changes and cardiac phenotypes consistent with human heart disease. Our study provides a comprehensive catalogue of human heart enhancers for use in clinical whole-genome sequencing studies and highlights the importance of enhancers for cardiac function.


Asunto(s)
Elementos de Facilitación Genéticos , Corazón/fisiología , Animales , Ecocardiografía , Epigenómica , Femenino , Perfilación de la Expresión Génica , Regulación de la Expresión Génica , Regulación del Desarrollo de la Expresión Génica , Genoma Humano , Histonas/metabolismo , Humanos , Masculino , Ratones , Ratones Endogámicos C57BL , Ratones Noqueados , Mutación , Fenotipo
19.
PLoS One ; 8(12): e81148, 2013.
Artículo en Inglés | MEDLINE | ID: mdl-24324667

RESUMEN

DNA methylation is implicated in a surprising diversity of regulatory, evolutionary processes and diseases in eukaryotes. The introduction of whole-genome bisulfite sequencing has enabled the study of DNA methylation at a single-base resolution, revealing many new aspects of DNA methylation and highlighting the usefulness of methylome data in understanding a variety of genomic phenomena. As the number of publicly available whole-genome bisulfite sequencing studies reaches into the hundreds, reliable and convenient tools for comparing and analyzing methylomes become increasingly important. We present MethPipe, a pipeline for both low and high-level methylome analysis, and MethBase, an accompanying database of annotated methylomes from the public domain. Together these resources enable researchers to extract interesting features from methylomes and compare them with those identified in public methylomes in our database.


Asunto(s)
Metilación de ADN , ADN/metabolismo , Epigenómica/métodos , Genoma , Proteínas de Neoplasias/metabolismo , Neoplasias/metabolismo , Animales , ADN/genética , Bases de Datos Factuales , Epigénesis Genética , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Proteínas de Neoplasias/genética , Neoplasias/genética , Neoplasias/patología , Pan troglodytes/genética , Análisis de Secuencia de ADN
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA