Results 1 - 5 of 5
1.
Science ; 384(6694): eadj0116, 2024 Apr 26.
Article in English | MEDLINE | ID: mdl-38662817

ABSTRACT

Transcription initiation is a process that is essential to ensuring the proper function of any gene, yet we still lack a unified understanding of sequence patterns and rules that explain most transcription start sites in the human genome. By predicting transcription initiation at base-pair resolution from sequences with a deep learning-inspired explainable model called Puffin, we show that a small set of simple rules can explain transcription initiation at most human promoters. We identify key sequence patterns that contribute to human promoter activity, each activating transcription with distinct position-specific effects. Furthermore, we explain the sequence basis of bidirectional transcription at promoters, identify the links between promoter sequence and gene expression variation across cell types, and explore the conservation of sequence determinants of transcription initiation across mammalian species.


Subjects
Genome, Human; Promoter Regions, Genetic; Transcription Initiation Site; Transcription Initiation, Genetic; Humans; Deep Learning; Animals; Base Sequence
2.
bioRxiv ; 2023 Aug 09.
Article in English | MEDLINE | ID: mdl-37609196

ABSTRACT

The role of non-coding regulatory elements and how they might contribute to tissue type specificity of disease phenotypes is poorly understood. Autosomal Dominant Leukodystrophy (ADLD) is a fatal, adult-onset neurological disorder that is characterized by extensive CNS demyelination. Most cases of ADLD are caused by tandem genomic duplications involving the lamin B1 gene (LMNB1), while a small subset are caused by genomic deletions upstream of the gene. Utilizing data from recently identified families that carry LMNB1 gene duplications but do not exhibit demyelination, ADLD patient tissues, CRISPR-modified cell lines, and mouse models, we have identified a novel silencer element that is lost in ADLD patients and that specifically targets overexpression to oligodendrocytes. This element consists of CTCF binding sites that mediate three-dimensional chromatin looping involving LMNB1 and the recruitment of the PRC2 repressor complex. Loss of the silencer element in ADLD identifies a previously unknown role for silencer elements in tissue specificity and disease causation.

3.
bioRxiv ; 2023 Jun 29.
Article in English | MEDLINE | ID: mdl-37425823

ABSTRACT

Transcription initiation is an essential process for ensuring proper function of any gene; however, a unified understanding of sequence patterns and rules that determine transcription initiation sites in the human genome remains elusive. By explaining transcription initiation at base-pair resolution from sequence with a deep learning-inspired explainable modeling approach, here we show that simple rules can explain the vast majority of human promoters. We identified key sequence patterns that contribute to human promoter function, each activating transcription with a distinct position-specific effect curve that likely reflects its mechanism of promoting transcription initiation. Most of these position-specific effects have not been previously characterized, and we verified them using experimental perturbations of transcription factors and sequences. We revealed the sequence basis of bidirectional transcription at promoters and links between promoter selectivity and gene expression variation across cell types. Additionally, by analyzing 241 mammalian genomes and mouse transcription initiation site data, we showed that the sequence determinants are conserved across mammalian species. Taken together, we provide a unified model of the sequence basis of transcription initiation at the base-pair level that is broadly applicable across mammalian species, and shed new light on basic questions related to promoter sequence and function.

4.
ArXiv ; 2023 Jun 16.
Article in English | MEDLINE | ID: mdl-37292476

ABSTRACT

Designing biological sequences is an important challenge that requires satisfying complex constraints and thus is a natural problem to address with deep generative modeling. Diffusion generative models have achieved considerable success in many applications. The score-based generative stochastic differential equation (SDE) model is a continuous-time diffusion model framework that enjoys many benefits, but the originally proposed SDEs are not naturally designed for modeling discrete data. To develop generative SDE models for discrete data such as biological sequences, here we introduce a diffusion process defined in the probability simplex space whose stationary distribution is the Dirichlet distribution. This makes diffusion in continuous space natural for modeling discrete data. We refer to this approach as the Dirichlet diffusion score model. We demonstrate that this technique can generate samples that satisfy hard constraints using a Sudoku generation task. This generative model can also solve Sudoku, including hard puzzles, without additional training. Finally, we applied this approach to develop the first human promoter DNA sequence design model and showed that designed sequences share similar properties with natural promoter sequences.

5.
Blood ; 142(4): 336-351, 2023 Jul 27.
Article in English | MEDLINE | ID: mdl-36947815

ABSTRACT

Structural variants (SVs) involving enhancer hijacking can rewire chromatin topologies to cause oncogene activation in human cancers, including hematologic malignancies; however, because of the lack of tools to assess their effects on gene regulation and chromatin organization, the molecular determinants for the functional output of enhancer hijacking remain poorly understood. Here, we developed a multimodal approach to integrate genome sequencing, chromosome conformation, chromatin state, and transcriptomic alteration for quantitative analysis of transcriptional effects and structural reorganization imposed by SVs in leukemic genomes. We identified known and new pathogenic SVs, including recurrent t(5;14) translocations that cause the hijacking of BCL11B enhancers for the allele-specific activation of TLX3 in a subtype of pediatric leukemia. Epigenetic perturbation of SV-hijacked BCL11B enhancers impairs TLX3 transcription, which is required for the growth of t(5;14) leukemia cells. By CRISPR engineering of patient-derived t(5;14) in isogenic leukemia cells, we uncovered a new mechanism whereby the transcriptional output of SV-induced BCL11B enhancer hijacking is dependent on the loss of DNA hypermethylation at the TLX3 promoter. Our results highlight the importance of the cooperation between genetic alteration and permissive chromatin as a critical determinant of SV-mediated oncogene activation, with implications for understanding aberrant gene transcription after epigenetic therapies in patients with leukemia. Hence, leveraging the interdependency of genetic alteration on chromatin variation may provide new opportunities to reprogram gene regulation as targeted interventions in human disease.


Subjects
Chromatin; Leukemia; Humans; Child; Chromatin/genetics; Enhancer Elements, Genetic; Chromosomes/metabolism; Transcription Factors/genetics; Leukemia/genetics; Tumor Suppressor Proteins/genetics; Repressor Proteins/genetics