Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 31
Filtrar
Más filtros

Banco de datos
Tipo del documento
Intervalo de año de publicación
1.
Cell ; 184(24): 5985-6001.e19, 2021 11 24.
Artículo en Inglés | MEDLINE | ID: mdl-34774128

RESUMEN

Current catalogs of regulatory sequences in the human genome are still incomplete and lack cell type resolution. To profile the activity of gene regulatory elements in diverse cell types and tissues in the human body, we applied single-cell chromatin accessibility assays to 30 adult human tissue types from multiple donors. We integrated these datasets with previous single-cell chromatin accessibility data from 15 fetal tissue types to reveal the status of open chromatin for ∼1.2 million candidate cis-regulatory elements (cCREs) in 222 distinct cell types comprised of >1.3 million nuclei. We used these chromatin accessibility maps to delineate cell-type-specificity of fetal and adult human cCREs and to systematically interpret the noncoding variants associated with complex human traits and diseases. This rich resource provides a foundation for the analysis of gene regulatory programs in human cell types across tissues, life stages, and organ systems.


Asunto(s)
Cromatina/metabolismo , Genoma Humano , Análisis de la Célula Individual , Adulto , Análisis por Conglomerados , Feto/metabolismo , Variación Genética , Estudio de Asociación del Genoma Completo , Humanos , Especificidad de Órganos , Filogenia , Secuencias Reguladoras de Ácidos Nucleicos/genética , Factores de Riesgo
2.
Nature ; 591(7848): 147-151, 2021 03.
Artículo en Inglés | MEDLINE | ID: mdl-33505025

RESUMEN

Many sequence variants have been linked to complex human traits and diseases1, but deciphering their biological functions remains challenging, as most of them reside in noncoding DNA. Here we have systematically assessed the binding of 270 human transcription factors to 95,886 noncoding variants in the human genome using an ultra-high-throughput multiplex protein-DNA binding assay, termed single-nucleotide polymorphism evaluation by systematic evolution of ligands by exponential enrichment (SNP-SELEX). The resulting 828 million measurements of transcription factor-DNA interactions enable estimation of the relative affinity of these transcription factors to each variant in vitro and evaluation of the current methods to predict the effects of noncoding variants on transcription factor binding. We show that the position weight matrices of most transcription factors lack sufficient predictive power, whereas the support vector machine combined with the gapped k-mer representation show much improved performance, when assessed on results from independent SNP-SELEX experiments involving a new set of 61,020 sequence variants. We report highly predictive models for 94 human transcription factors and demonstrate their utility in genome-wide association studies and understanding of the molecular pathways involved in diverse human traits and diseases.


Asunto(s)
Polimorfismo de Nucleótido Simple/genética , Técnica SELEX de Producción de Aptámeros , Máquina de Vectores de Soporte , Factores de Transcripción/metabolismo , Sitios de Unión/genética , Enfermedad/genética , Genoma Humano/genética , Humanos , Ligandos , Unión Proteica
3.
Nature ; 598(7879): 129-136, 2021 10.
Artículo en Inglés | MEDLINE | ID: mdl-34616068

RESUMEN

The mammalian cerebrum performs high-level sensory perception, motor control and cognitive functions through highly specialized cortical and subcortical structures1. Recent surveys of mouse and human brains with single-cell transcriptomics2-6 and high-throughput imaging technologies7,8 have uncovered hundreds of neural cell types distributed in different brain regions, but the transcriptional regulatory programs that are responsible for the unique identity and function of each cell type remain unknown. Here we probe the accessible chromatin in more than 800,000 individual nuclei from 45 regions that span the adult mouse isocortex, olfactory bulb, hippocampus and cerebral nuclei, and use the resulting data to map the state of 491,818 candidate cis-regulatory DNA elements in 160 distinct cell types. We find high specificity of spatial distribution for not only excitatory neurons, but also most classes of inhibitory neurons and a subset of glial cell types. We characterize the gene regulatory sequences associated with the regional specificity within these cell types. We further link a considerable fraction of the cis-regulatory elements to putative target genes expressed in diverse cerebral cell types and predict transcriptional regulators that are involved in a broad spectrum of molecular and cellular pathways in different neuronal and glial cell populations. Our results provide a foundation for comprehensive analysis of gene regulatory programs of the mammalian brain and assist in the interpretation of noncoding risk variants associated with various neurological diseases and traits in humans.


Asunto(s)
Cerebro/citología , Cerebro/metabolismo , Secuencias Reguladoras de Ácidos Nucleicos/genética , Animales , Atlas como Asunto , Cromatina/química , Cromatina/genética , Cromatina/metabolismo , Ensamble y Desensamble de Cromatina , Regulación de la Expresión Génica , Predisposición Genética a la Enfermedad/genética , Humanos , Masculino , Ratones , Ratones Endogámicos C57BL , Enfermedades del Sistema Nervioso/genética , Neuroglía/clasificación , Neuroglía/metabolismo , Neuronas/clasificación , Neuronas/metabolismo , Análisis de Secuencia de ADN , Análisis de la Célula Individual
4.
Brief Bioinform ; 25(3)2024 Mar 27.
Artículo en Inglés | MEDLINE | ID: mdl-38711367

RESUMEN

Hi-C data are commonly normalized using single sample processing methods, with focus on comparisons between regions within a given contact map. Here, we aim to compare contact maps across different samples. We demonstrate that unwanted variation, of likely technical origin, is present in Hi-C data with replicates from different individuals, and that properties of this unwanted variation change across the contact map. We present band-wise normalization and batch correction, a method for normalization and batch correction of Hi-C data and show that it substantially improves comparisons across samples, including in a quantitative trait loci analysis as well as differential enrichment across cell types.


Asunto(s)
Sitios de Carácter Cuantitativo , Humanos , Biología Computacional
5.
Nature ; 583(7818): 744-751, 2020 07.
Artículo en Inglés | MEDLINE | ID: mdl-32728240

RESUMEN

The Encyclopedia of DNA Elements (ENCODE) project has established a genomic resource for mammalian development, profiling a diverse panel of mouse tissues at 8 developmental stages from 10.5 days after conception until birth, including transcriptomes, methylomes and chromatin states. Here we systematically examined the state and accessibility of chromatin in the developing mouse fetus. In total we performed 1,128 chromatin immunoprecipitation with sequencing (ChIP-seq) assays for histone modifications and 132 assay for transposase-accessible chromatin using sequencing (ATAC-seq) assays for chromatin accessibility across 72 distinct tissue-stages. We used integrative analysis to develop a unified set of chromatin state annotations, infer the identities of dynamic enhancers and key transcriptional regulators, and characterize the relationship between chromatin state and accessibility during developmental gene regulation. We also leveraged these data to link enhancers to putative target genes and demonstrate tissue-specific enrichments of sequence variants associated with disease in humans. The mouse ENCODE data sets provide a compendium of resources for biomedical researchers and achieve, to our knowledge, the most comprehensive view of chromatin dynamics during mammalian fetal development to date.


Asunto(s)
Cromatina/genética , Cromatina/metabolismo , Conjuntos de Datos como Asunto , Desarrollo Fetal/genética , Histonas/metabolismo , Anotación de Secuencia Molecular , Secuencias Reguladoras de Ácidos Nucleicos/genética , Animales , Cromatina/química , Secuenciación de Inmunoprecipitación de Cromatina , Enfermedad/genética , Elementos de Facilitación Genéticos/genética , Femenino , Regulación del Desarrollo de la Expresión Génica/genética , Variación Genética , Histonas/química , Humanos , Masculino , Ratones , Ratones Endogámicos C57BL , Especificidad de Órganos/genética , Reproducibilidad de los Resultados , Transposasas/metabolismo
7.
Nature ; 560(7720): 655-660, 2018 08.
Artículo en Inglés | MEDLINE | ID: mdl-30135582

RESUMEN

Mammalian cells are surrounded by neighbouring cells and extracellular matrix (ECM), which provide cells with structural support and mechanical cues that influence diverse biological processes1. The Hippo pathway effectors YAP (also known as YAP1) and TAZ (also known as WWTR1) are regulated by mechanical cues and mediate cellular responses to ECM stiffness2,3. Here we identified the Ras-related GTPase RAP2 as a key intracellular signal transducer that relays ECM rigidity signals to control mechanosensitive cellular activities through YAP and TAZ. RAP2 is activated by low ECM stiffness, and deletion of RAP2 blocks the regulation of YAP and TAZ by stiffness signals and promotes aberrant cell growth. Mechanistically, matrix stiffness acts through phospholipase Cγ1 (PLCγ1) to influence levels of phosphatidylinositol 4,5-bisphosphate and phosphatidic acid, which activates RAP2 through PDZGEF1 and PDZGEF2 (also known as RAPGEF2 and RAPGEF6). At low stiffness, active RAP2 binds to and stimulates MAP4K4, MAP4K6, MAP4K7 and ARHGAP29, resulting in activation of LATS1 and LATS2 and inhibition of YAP and TAZ. RAP2, YAP and TAZ have pivotal roles in mechanoregulated transcription, as deletion of YAP and TAZ abolishes the ECM stiffness-responsive transcriptome. Our findings show that RAP2 is a molecular switch in mechanotransduction, thereby defining a mechanosignalling pathway from ECM stiffness to the nucleus.


Asunto(s)
Proteínas Serina-Treonina Quinasas/metabolismo , Transducción de Señal , Proteínas de Unión al GTP rap/metabolismo , Proteínas Adaptadoras Transductoras de Señales/metabolismo , Animales , Transformación Celular Neoplásica , Matriz Extracelular/química , Matriz Extracelular/genética , Matriz Extracelular/metabolismo , Femenino , Proteínas Activadoras de GTPasa/metabolismo , Quinasas del Centro Germinal , Factores de Intercambio de Guanina Nucleótido/metabolismo , Células HEK293 , Vía de Señalización Hippo , Humanos , Péptidos y Proteínas de Señalización Intracelular/metabolismo , Ratones , Ratones Endogámicos NOD , Ratones Desnudos , Ratones SCID , Proteínas del Tejido Nervioso/metabolismo , Fosfolipasa C gamma/metabolismo , Fosfoproteínas/metabolismo , Transactivadores , Factores de Transcripción , Proteínas Coactivadoras Transcripcionales con Motivo de Unión a PDZ , Transcriptoma , Proteínas Señalizadoras YAP , Proteínas de Unión al GTP rap/genética
9.
BMC Genomics ; 22(1): 84, 2021 Jan 28.
Artículo en Inglés | MEDLINE | ID: mdl-33509077

RESUMEN

BACKGROUND: Co-localized combinations of histone modifications ("chromatin states") have been shown to correlate with promoter and enhancer activity. Changes in chromatin states over multiple time points ("chromatin state trajectories") have previously been analyzed at promoter and enhancers separately. With the advent of time series Hi-C data it is now possible to connect promoters and enhancers and to analyze chromatin state trajectories at promoter-enhancer pairs. RESULTS: We present TimelessFlex, a framework for investigating chromatin state trajectories at promoters and enhancers and at promoter-enhancer pairs based on Hi-C information. TimelessFlex extends our previous approach Timeless, a Bayesian network for clustering multiple histone modification data sets at promoter and enhancer feature regions. We utilize time series ATAC-seq data measuring open chromatin to define promoters and enhancer candidates. We developed an expectation-maximization algorithm to assign promoters and enhancers to each other based on Hi-C interactions and jointly cluster their feature regions into paired chromatin state trajectories. We find jointly clustered promoter-enhancer pairs showing the same activation patterns on both sides but with a stronger trend at the enhancer side. While the promoter side remains accessible across the time series, the enhancer side becomes dynamically more open towards the gene activation time point. Promoter cluster patterns show strong correlations with gene expression signals, whereas Hi-C signals get only slightly stronger towards activation. The code of the framework is available at https://github.com/henriettemiko/TimelessFlex . CONCLUSIONS: TimelessFlex clusters time series histone modifications at promoter-enhancer pairs based on Hi-C and it can identify distinct chromatin states at promoter and enhancer feature regions and their changes over time.


Asunto(s)
Cromatina , Elementos de Facilitación Genéticos , Teorema de Bayes , Cromatina/genética , Cromosomas , Regiones Promotoras Genéticas
10.
Nature ; 518(7539): 350-354, 2015 Feb 19.
Artículo en Inglés | MEDLINE | ID: mdl-25693566

RESUMEN

Allelic differences between the two homologous chromosomes can affect the propensity of inheritance in humans; however, the extent of such differences in the human genome has yet to be fully explored. Here we delineate allelic chromatin modifications and transcriptomes among a broad set of human tissues, enabled by a chromosome-spanning haplotype reconstruction strategy. The resulting large collection of haplotype-resolved epigenomic maps reveals extensive allelic biases in both chromatin state and transcription, which show considerable variation across tissues and between individuals, and allow us to investigate cis-regulatory relationships between genes and their control sequences. Analyses of histone modification maps also uncover intriguing characteristics of cis-regulatory elements and tissue-restricted activities of repetitive elements. The rich data sets described here will enhance our understanding of the mechanisms by which cis-regulatory elements control gene expression programs.


Asunto(s)
Alelos , Epigénesis Genética/genética , Epigenómica , Haplotipos/genética , Acetilación , Cromatina/genética , Cromatina/metabolismo , Cromosomas Humanos/genética , Conjuntos de Datos como Asunto , Elementos de Facilitación Genéticos/genética , Variación Genética/genética , Histonas/metabolismo , Humanos , Motivos de Nucleótidos , Especificidad de Órganos/genética , Transcripción Genética/genética
11.
Nat Methods ; 14(6): 629-635, 2017 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-28417999

RESUMEN

Millions of cis-regulatory elements are predicted to be present in the human genome, but direct evidence for their biological function is scarce. Here we report a high-throughput method, cis-regulatory element scan by tiling-deletion and sequencing (CREST-seq), for the unbiased discovery and functional assessment of cis-regulatory sequences in the genome. We used it to interrogate the 2-Mb POU5F1 locus in human embryonic stem cells, and identified 45 cis-regulatory elements. A majority of these elements have active chromatin marks, DNase hypersensitivity, and occupancy by multiple transcription factors, which confirms the utility of chromatin signatures in cis-element mapping. Notably, 17 of them are previously annotated promoters of functionally unrelated genes, and like typical enhancers, they form extensive spatial contacts with the POU5F1 promoter. These results point to the commonality of enhancer-like promoters in the human genome.


Asunto(s)
Mapeo Cromosómico/métodos , Pruebas Genéticas/métodos , Secuencias Reguladoras de Ácidos Nucleicos/genética , Algoritmos , Células Cultivadas , Células Madre Embrionarias/fisiología , Regulación de la Expresión Génica/genética , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Análisis de Secuencia de ADN , Análisis de la Célula Individual
12.
PLoS Comput Biol ; 15(4): e1006982, 2019 04.
Artículo en Inglés | MEDLINE | ID: mdl-30986246

RESUMEN

Hi-C and chromatin immunoprecipitation (ChIP) have been combined to identify long-range chromatin interactions genome-wide at reduced cost and enhanced resolution, but extracting information from the resulting datasets has been challenging. Here we describe a computational method, MAPS, Model-based Analysis of PLAC-seq and HiChIP, to process the data from such experiments and identify long-range chromatin interactions. MAPS adopts a zero-truncated Poisson regression framework to explicitly remove systematic biases in the PLAC-seq and HiChIP datasets, and then uses the normalized chromatin contact frequencies to identify significant chromatin interactions anchored at genomic regions bound by the protein of interest. MAPS shows superior performance over existing software tools in the analysis of chromatin interactions from multiple PLAC-seq and HiChIP datasets centered on different transcriptional factors and histone marks. MAPS is freely available at https://github.com/ijuric/MAPS.


Asunto(s)
Ensamble y Desensamble de Cromatina/fisiología , Mapeo Cromosómico/métodos , Biología Computacional/métodos , Cromatina/metabolismo , Cromatina/fisiología , Inmunoprecipitación de Cromatina/métodos , Simulación por Computador , Genoma , Genómica/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Código de Histonas , Humanos , Análisis de Secuencia de ADN/métodos , Programas Informáticos
13.
J Biol Chem ; 293(28): 11230-11240, 2018 07 13.
Artículo en Inglés | MEDLINE | ID: mdl-29802201

RESUMEN

The Hippo pathway plays an important role in regulating tissue homeostasis, and its effectors, the transcriptional co-activators Yes-associated protein (YAP) and WW domain-containing transcription regulator 1 (WWTR1 or TAZ), are responsible for mediating the vast majority of its physiological functions. Although YAP and TAZ are thought to be largely redundant and similarly regulated by Hippo signaling, they have developmental, structural, and physiological differences that suggest they may differ in their regulation and downstream functions. To better understand the functions of YAP and TAZ in the Hippo pathway, using CRISPR/Cas9, we generated YAP KO, TAZ KO, and YAP/TAZ KO cell lines in HEK293A cells. We evaluated them in response to many environmental conditions and stimuli and used RNA-Seq to compare their transcriptional profiles. We found that YAP inactivation has a greater effect on cellular physiology (namely, cell spreading, volume, granularity, glucose uptake, proliferation, and migration) than TAZ inactivation. However, functional redundancy between YAP and TAZ was also observed. In summary, our findings confirm that the Hippo pathway effectors YAP and TAZ are master regulators for multiple cellular processes but also reveal that YAP has a stronger influence than TAZ.


Asunto(s)
Proteínas Adaptadoras Transductoras de Señales/metabolismo , Fenómenos Fisiológicos Celulares , Fosfoproteínas/metabolismo , Proteínas Serina-Treonina Quinasas/metabolismo , Transducción de Señal , Factores de Transcripción/metabolismo , Proteínas Supresoras de Tumor/metabolismo , Aciltransferasas , Proteínas Adaptadoras Transductoras de Señales/antagonistas & inhibidores , Proteínas Adaptadoras Transductoras de Señales/genética , Sistemas CRISPR-Cas , Perfilación de la Expresión Génica , Células HEK293 , Vía de Señalización Hippo , Homeostasis , Humanos , Fosfoproteínas/antagonistas & inhibidores , Fosfoproteínas/genética , Proteínas Serina-Treonina Quinasas/antagonistas & inhibidores , Proteínas Serina-Treonina Quinasas/genética , Factores de Transcripción/antagonistas & inhibidores , Factores de Transcripción/genética , Proteínas Supresoras de Tumor/antagonistas & inhibidores , Proteínas Supresoras de Tumor/genética , Proteínas Señalizadoras YAP
14.
Nucleic Acids Res ; 43(1): 104-14, 2015 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-25505163

RESUMEN

To find signature features shared by various ncRNA sub-types and characterize novel ncRNAs, we have developed a method, RNAfeature, to investigate >600 sets of genomic and epigenomic data with various evolutionary and biophysical scores. RNAfeature utilizes a fine-tuned intra-species wrapper algorithm that is followed by a novel feature selection strategy across species. It considers long distance effect of certain features (e.g. histone modification at the promoter region). We finally narrow down on 10 informative features (including sequences, structures, expression profiles and epigenetic signals). These features are complementary to each other and as a whole can accurately distinguish canonical ncRNAs from CDSs and UTRs (accuracies: >92% in human, mouse, worm and fly). Moreover, the feature pattern is conserved across multiple species. For instance, the supervised 10-feature model derived from animal species can predict ncRNAs in Arabidopsis (accuracy: 82%). Subsequently, we integrate the 10 features to define a set of noncoding potential scores, which can identify, evaluate and characterize novel noncoding RNAs. The score covers all transcribed regions (including unconserved ncRNAs), without requiring assembly of the full-length transcripts. Importantly, the noncoding potential allows us to identify and characterize potential functional domains with feature patterns similar to canonical ncRNAs (e.g. tRNA, snRNA, miRNA, etc) on ∼70% of human long ncRNAs (lncRNAs).


Asunto(s)
Genómica/métodos , ARN no Traducido/química , ARN no Traducido/genética , Algoritmos , Animales , Humanos , Ratones , Conformación de Ácido Nucleico , ARN Largo no Codificante/química , ARN no Traducido/metabolismo
15.
Nat Commun ; 14(1): 6711, 2023 10 23.
Artículo en Inglés | MEDLINE | ID: mdl-37872149

RESUMEN

Tandem repeats (TRs) represent one of the largest sources of genetic variation in humans and are implicated in a range of phenotypes. Here we present a deep characterization of TR variation based on high coverage whole genome sequencing from 3550 diverse individuals from the 1000 Genomes Project and H3Africa cohorts. We develop a method, EnsembleTR, to integrate genotypes from four separate methods resulting in high-quality genotypes at more than 1.7 million TR loci. Our catalog reveals novel sequence features influencing TR heterozygosity, identifies population-specific trinucleotide expansions, and finds hundreds of novel eQTL signals. Finally, we generate a phased haplotype panel which can be used to impute most TRs from nearby single nucleotide polymorphisms (SNPs) with high accuracy. Overall, the TR genotypes and reference haplotype panel generated here will serve as valuable resources for future genome-wide and population-wide studies of TRs and their role in human phenotypes.


Asunto(s)
Polimorfismo de Nucleótido Simple , Secuencias Repetidas en Tándem , Humanos , Genotipo , Secuenciación Completa del Genoma
16.
bioRxiv ; 2023 Mar 12.
Artículo en Inglés | MEDLINE | ID: mdl-36945429

RESUMEN

Tandem repeats (TRs) represent one of the largest sources of genetic variation in humans and are implicated in a range of phenotypes. Here we present a deep characterization of TR variation based on high coverage whole genome sequencing from 3,550 diverse individuals from the 1000 Genomes Project and H3Africa cohorts. We develop a method, EnsembleTR, to integrate genotypes from four separate methods resulting in high-quality genotypes at more than 1.7 million TR loci. Our catalog reveals novel sequence features influencing TR heterozygosity, identifies population-specific trinucleotide expansions, and finds hundreds of novel eQTL signals. Finally, we generate a phased haplotype panel which can be used to impute most TRs from nearby single nucleotide polymorphisms (SNPs) with high accuracy. Overall, the TR genotypes and reference haplotype panel generated here will serve as valuable resources for future genome-wide and population-wide studies of TRs and their role in human phenotypes.

17.
Sci Adv ; 8(21): eabl9806, 2022 May 27.
Artículo en Inglés | MEDLINE | ID: mdl-35613278

RESUMEN

Semaphorins were originally identified as axonal guidance molecules, but they also control processes such as vascular development and tumorigenesis. The downstream signaling cascades of Semaphorins in these biological processes remain unclear. Here, we show that the class 3 Semaphorins (SEMA3s) activate the Hippo pathway to attenuate tissue growth, angiogenesis, and tumorigenesis. SEMA3B restoration in lung cancer cells with SEMA3B loss of heterozygosity suppresses cancer cell growth via activating the core Hippo kinases LATS1/2 (large tumor suppressor kinase 1/2). Furthermore, SEMA3 also acts through LATS1/2 to inhibit angiogenesis. We identified p190RhoGAPs as essential partners of the SEMA3A receptor PlexinA in Hippo regulation. Upon SEMA3 treatment, PlexinA interacts with the pseudo-guanosine triphosphatase (GTPase) domain of p190RhoGAP and simultaneously recruits RND GTPases to activate p190RhoGAP, which then stimulates LATS1/2. Disease-associated etiological factors, such as genetic lesions and oscillatory shear, diminish Hippo pathway regulation by SEMA3. Our study thus discovers a critical role of Hippo signaling in mediating SEMA3 physiological function.

18.
Cell Genom ; 2(12): 100214, 2022 Dec 14.
Artículo en Inglés | MEDLINE | ID: mdl-36778047

RESUMEN

We combined functional genomics and human genetics to investigate processes that affect type 1 diabetes (T1D) risk by mediating beta cell survival in response to proinflammatory cytokines. We mapped 38,931 cytokine-responsive candidate cis-regulatory elements (cCREs) in beta cells using ATAC-seq and snATAC-seq and linked them to target genes using co-accessibility and HiChIP. Using a genome-wide CRISPR screen in EndoC-ßH1 cells, we identified 867 genes affecting cytokine-induced survival, and genes promoting survival and up-regulated in cytokines were enriched at T1D risk loci. Using SNP-SELEX, we identified 2,229 variants in cytokine-responsive cCREs altering transcription factor (TF) binding, and variants altering binding of TFs regulating stress, inflammation, and apoptosis were enriched for T1D risk. At the 16p13 locus, a fine-mapped T1D variant altering TF binding in a cytokine-induced cCRE interacted with SOCS1, which promoted survival in cytokine exposure. Our findings reveal processes and genes acting in beta cells during inflammation that modulate T1D risk.

19.
Genome Med ; 14(1): 84, 2022 08 11.
Artículo en Inglés | MEDLINE | ID: mdl-35948990

RESUMEN

BACKGROUND: Expansions of short tandem repeats are the cause of many neurogenetic disorders including familial amyotrophic lateral sclerosis, Huntington disease, and many others. Multiple methods have been recently developed that can identify repeat expansions in whole genome or exome sequencing data. Despite the widely recognized need for visual assessment of variant calls in clinical settings, current computational tools lack the ability to produce such visualizations for repeat expansions. Expanded repeats are difficult to visualize because they correspond to large insertions relative to the reference genome and involve many misaligning and ambiguously aligning reads. RESULTS: We implemented REViewer, a computational method for visualization of sequencing data in genomic regions containing long repeat expansions and FlipBook, a companion image viewer designed for manual curation of large collections of REViewer images. To generate a read pileup, REViewer reconstructs local haplotype sequences and distributes reads to these haplotypes in a way that is most consistent with the fragment lengths and evenness of read coverage. To create appropriate training materials for onboarding new users, we performed a concordance study involving 12 scientists involved in short tandem repeat research. We used the results of this study to create a user guide that describes the basic principles of using REViewer as well as a guide to the typical features of read pileups that correspond to low confidence repeat genotype calls. Additionally, we demonstrated that REViewer can be used to annotate clinically relevant repeat interruptions by comparing visual assessment results of 44 FMR1 repeat alleles with the results of triplet repeat primed PCR. For 38 of these alleles, the results of visual assessment were consistent with triplet repeat primed PCR. CONCLUSIONS: Read pileup plots generated by REViewer offer an intuitive way to visualize sequencing data in regions containing long repeat expansions. Laboratories can use REViewer and FlipBook to assess the quality of repeat genotype calls as well as to visually detect interruptions or other imperfections in the repeat sequence and the surrounding flanking regions. REViewer and FlipBook are available under open-source licenses at https://github.com/illumina/REViewer and https://github.com/broadinstitute/flipbook respectively.


Asunto(s)
Esclerosis Amiotrófica Lateral , Secuencias Repetidas en Tándem , Alelos , Esclerosis Amiotrófica Lateral/genética , Exoma , Proteína de la Discapacidad Intelectual del Síndrome del Cromosoma X Frágil/genética , Haplotipos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Humanos
20.
Comput Struct Biotechnol J ; 19: 355-362, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-33489005

RESUMEN

Hi-C experiments have been widely adopted to study chromatin spatial organization, which plays an essential role in genome function. We have recently identified frequently interacting regions (FIREs) and found that they are closely associated with cell-type-specific gene regulation. However, computational tools for detecting FIREs from Hi-C data are still lacking. In this work, we present FIREcaller, a stand-alone, user-friendly R package for detecting FIREs from Hi-C data. FIREcaller takes raw Hi-C contact matrices as input, performs within-sample and cross-sample normalization, and outputs continuous FIRE scores, dichotomous FIREs, and super-FIREs. Applying FIREcaller to Hi-C data from various human tissues, we demonstrate that FIREs and super-FIREs identified, in a tissue-specific manner, are closely related to gene regulation, are enriched for enhancer-promoter (E-P) interactions, tend to overlap with regions exhibiting epigenomic signatures of cis-regulatory roles, and aid the interpretation or GWAS variants. The FIREcaller package is implemented in R and freely available at https://yunliweb.its.unc.edu/FIREcaller.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA