Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 109
Filtrar
Mais filtros

Bases de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
Mol Cell ; 81(1): 166-182.e6, 2021 01 07.
Artigo em Inglês | MEDLINE | ID: mdl-33238161

RESUMO

The repeating structural unit of metazoan chromatin is the chromatosome, a nucleosome bound to a linker histone, H1. There are 11 human H1 isoforms with diverse cellular functions, but how they interact with the nucleosome remains elusive. Here, we determined the cryoelectron microscopy (cryo-EM) structures of chromatosomes containing 197 bp DNA and three different human H1 isoforms, respectively. The globular domains of all three H1 isoforms bound to the nucleosome dyad. However, the flanking/linker DNAs displayed substantial distinct dynamic conformations. Nuclear magnetic resonance (NMR) and H1 tail-swapping cryo-EM experiments revealed that the C-terminal tails of the H1 isoforms mainly controlled the flanking DNA orientations. We also observed partial ordering of the core histone H2A C-terminal and H3 N-terminal tails in the chromatosomes. Our results provide insights into the structures and dynamics of the chromatosomes and have implications for the structure and function of chromatin.


Assuntos
DNA/química , Histonas/química , Nucleossomos/química , Microscopia Crioeletrônica , DNA/ultraestrutura , Humanos , Nucleossomos/ultraestrutura , Isoformas de Proteínas/química
2.
Genes Dev ; 31(19): 1958-1972, 2017 10 01.
Artigo em Inglês | MEDLINE | ID: mdl-29074736

RESUMO

Histone CENP-A-containing nucleosomes play an important role in nucleating kinetochores at centromeres for chromosome segregation. However, the molecular mechanisms by which CENP-A nucleosomes engage with kinetochore proteins are not well understood. Here, we report the finding of a new function for the budding yeast Cse4/CENP-A histone-fold domain interacting with inner kinetochore protein Mif2/CENP-C. Strikingly, we also discovered that AT-rich centromere DNA has an important role for Mif2 recruitment. Mif2 contacts one side of the nucleosome dyad, engaging with both Cse4 residues and AT-rich nucleosomal DNA. Both interactions are directed by a contiguous DNA- and histone-binding domain (DHBD) harboring the conserved CENP-C motif, an AT hook, and RK clusters (clusters enriched for arginine-lysine residues). Human CENP-C has two related DHBDs that bind preferentially to DNA sequences of higher AT content. Our findings suggest that a DNA composition-based mechanism together with residues characteristic for the CENP-A histone variant contribute to the specification of centromere identity.


Assuntos
Proteína Centromérica A/metabolismo , Centrômero/metabolismo , Proteínas Cromossômicas não Histona/metabolismo , Modelos Moleculares , Nucleossomos/química , Nucleossomos/metabolismo , Saccharomyces cerevisiae , Sequência Rica em At , Centrômero/química , Proteína Centromérica A/química , Proteínas Cromossômicas não Histona/química , DNA Satélite/metabolismo , Proteínas de Ligação a DNA/metabolismo , Dimerização , Humanos , Ligação Proteica , Estrutura Terciária de Proteína , Saccharomyces cerevisiae/química , Saccharomyces cerevisiae/metabolismo , Proteínas de Saccharomyces cerevisiae/metabolismo
3.
Nucleic Acids Res ; 50(4): 1864-1874, 2022 02 28.
Artigo em Inglês | MEDLINE | ID: mdl-35166834

RESUMO

Cytosine methylation at the 5-carbon position is an essential DNA epigenetic mark in many eukaryotic organisms. Although countless structural and functional studies of cytosine methylation have been reported, our understanding of how it influences the nucleosome assembly, structure, and dynamics remains obscure. Here, we investigate the effects of cytosine methylation at CpG sites on nucleosome dynamics and stability. By applying long molecular dynamics simulations on several microsecond time scale, we generate extensive atomistic conformational ensembles of full nucleosomes. Our results reveal that methylation induces pronounced changes in geometry for both linker and nucleosomal DNA, leading to a more curved, under-twisted DNA, narrowing the adjacent minor grooves, and shifting the population equilibrium of sugar-phosphate backbone geometry. These DNA conformational changes are associated with a considerable enhancement of interactions between methylated DNA and the histone octamer, doubling the number of contacts at some key arginines. H2A and H3 tails play important roles in these interactions, especially for DNA methylated nucleosomes. This, in turn, prevents a spontaneous DNA unwrapping of 3-4 helical turns for the methylated nucleosome with truncated histone tails, otherwise observed in the unmethylated system on several microseconds time scale.


Assuntos
Metilação de DNA , Nucleossomos , Sinais (Psicologia) , Citosina , DNA/química , Histonas/metabolismo , Nucleossomos/genética
4.
Mol Biol Evol ; 37(2): 320-326, 2020 02 01.
Artigo em Inglês | MEDLINE | ID: mdl-31642480

RESUMO

Cancer progression is an evolutionary process. During this process, evolving cancer cell populations encounter restrictive ecological niches within the body, such as the primary tumor, circulatory system, and diverse metastatic sites. Efforts to prevent or delay cancer evolution-and progression-require a deep understanding of the underlying molecular evolutionary processes. Herein we discuss a suite of concepts and tools from evolutionary and ecological theory that can inform cancer biology in new and meaningful ways. We also highlight current challenges to applying these concepts, and propose ways in which incorporating these concepts could identify new therapeutic modes and vulnerabilities in cancer.


Assuntos
Genômica/métodos , Neoplasias/genética , Progressão da Doença , Evolução Molecular , Aptidão Genética , Humanos , Filogenia , Nicho de Células-Tronco
5.
Brief Bioinform ; 19(6): 1085-1101, 2018 11 27.
Artigo em Inglês | MEDLINE | ID: mdl-28498882

RESUMO

Cancer is a genetic disorder, meaning that a plethora of different mutations, whether somatic or germ line, underlie the etiology of the 'Emperor of Maladies'. Point mutations, chromosomal rearrangements and copy number changes, whether they have occurred spontaneously in predisposed individuals or have been induced by intrinsic or extrinsic (environmental) mutagens, lead to the activation of oncogenes and inactivation of tumor suppressor genes, thereby promoting malignancy. This scenario has now been recognized and experimentally confirmed in a wide range of different contexts. Over the past decade, a surge in available sequencing technologies has allowed the sequencing of whole genomes from liquid malignancies and solid tumors belonging to different types and stages of cancer, giving birth to the new field of cancer genomics. One of the most striking discoveries has been that cancer genomes are highly enriched with mutations of specific kinds. It has been suggested that these mutations can be classified into 'families' based on their mutational signatures. A mutational signature may be regarded as a type of base substitution (e.g. C:G to T:A) within a particular context of neighboring nucleotide sequence (the bases upstream and/or downstream of the mutation). These mutational signatures, supplemented by mutable motifs (a wider mutational context), promise to help us to understand the nature of the mutational processes that operate during tumor evolution because they represent the footprints of interactions between DNA, mutagens and the enzymes of the repair/replication/modification pathways.


Assuntos
Genômica , Mutação , Neoplasias/genética , DNA/genética , Metilação de DNA , Evolução Molecular , Expressão Gênica , Predisposição Genética para Doença , Humanos , Modelos Genéticos , Mutagênicos/farmacologia , Oncogenes , Seleção Genética
6.
PLoS Comput Biol ; 15(4): e1006981, 2019 04.
Artigo em Inglês | MEDLINE | ID: mdl-31034466

RESUMO

Identifying driver mutations in cancer is notoriously difficult. To date, recurrence of a mutation in patients remains one of the most reliable markers of mutation driver status. However, some mutations are more likely to occur than others due to differences in background mutation rates arising from various forms of infidelity of DNA replication and repair machinery, endogenous, and exogenous mutagens. We calculated nucleotide and codon mutability to study the contribution of background processes in shaping the observed mutational spectrum in cancer. We developed and tested probabilistic pan-cancer and cancer-specific models that adjust the number of mutation recurrences in patients by background mutability in order to find mutations which may be under selection in cancer. We showed that mutations with higher mutability values had higher observed recurrence frequency, especially in tumor suppressor genes. This trend was prominent for nonsense and silent mutations or mutations with neutral functional impact. In oncogenes, however, highly recurring mutations were characterized by relatively low mutability, resulting in an inversed U-shaped trend. Mutations not yet observed in any tumor had relatively low mutability values, indicating that background mutability might limit mutation occurrence. We compiled a dataset of missense mutations from 58 genes with experimentally validated functional and transforming impacts from various studies. We found that mutability of driver mutations was lower than that of passengers and consequently adjusting mutation recurrence frequency by mutability significantly improved ranking of mutations and driver mutation prediction. Even though no training on existing data was involved, our approach performed similarly or better to the state-of-the-art methods.


Assuntos
Códon/genética , Replicação do DNA/genética , Mutação/genética , Mutação/fisiologia , Neoplasias/genética , Biologia Computacional , Humanos , Oncogenes/genética
7.
Nucleic Acids Res ; 45(16): 9229-9243, 2017 Sep 19.
Artigo em Inglês | MEDLINE | ID: mdl-28934480

RESUMO

Nucleosomes are the most abundant protein-DNA complexes in eukaryotes that provide compaction of genomic DNA and are implicated in regulation of transcription, DNA replication and repair. The details of DNA positioning on the nucleosome and the DNA conformation can provide key regulatory signals. Hydroxyl-radical footprinting (HRF) of protein-DNA complexes is a chemical technique that probes nucleosome organization in solution with a high precision unattainable by other methods. In this work we propose an integrative modeling method for constructing high-resolution atomistic models of nucleosomes based on HRF experiments. Our method precisely identifies DNA positioning on nucleosome by combining HRF data for both DNA strands with the pseudo-symmetry constraints. We performed high-resolution HRF for Saccharomyces cerevisiae centromeric nucleosome of unknown structure and characterized it using our integrative modeling approach. Our model provides the basis for further understanding the cooperative engagement and interplay between Cse4p protein and the A-tracts important for centromere function.


Assuntos
Pegada de DNA/métodos , DNA/química , Modelos Moleculares , Nucleossomos/química , Algoritmos , Centrômero/química , Proteínas Cromossômicas não Histona , Clivagem do DNA , Proteínas de Ligação a DNA , Radical Hidroxila , Conformação de Ácido Nucleico , Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae
8.
Nucleic Acids Res ; 45(W1): W514-W522, 2017 07 03.
Artigo em Inglês | MEDLINE | ID: mdl-28472504

RESUMO

Much remains unknown about the progression and heterogeneity of mutational processes in different cancers and their diagnostic and clinical potential. A growing body of evidence supports mutation rate dependence on the local DNA sequence context for various types of mutations. We propose several tools for the analysis of cancer context-dependent mutations, which are implemented in an online computational framework MutaGene. The framework explores DNA context-dependent mutational patterns and underlying somatic cancer mutagenesis, analyzes mutational profiles of cancer samples, identifies the combinations of underlying mutagenic processes including those related to infidelity of DNA replication and repair machinery, and various other endogenous and exogenous mutagenic factors. As a result, the combination of mutagenic processes can be identified in any query sample with subsequent comparison to mutational profiles derived from malignant and benign samples. In addition, mutagen or cancer-specific mutational background models are applied to calculate expected DNA and protein site mutability to decouple relative contributions of mutagenesis and selection in carcinogenesis, thus elucidating the site-specific driving events in cancer. MutaGene is freely available at https://www.ncbi.nlm.nih.gov/projects/mutagene/.


Assuntos
Mutação , Neoplasias/genética , Software , Substituição de Aminoácidos , Análise Mutacional de DNA , Humanos , Internet , Mutagênese
9.
Nucleic Acids Res ; 44(W1): W494-501, 2016 07 08.
Artigo em Inglês | MEDLINE | ID: mdl-27150810

RESUMO

Proteins engage in highly selective interactions with their macromolecular partners. Sequence variants that alter protein binding affinity may cause significant perturbations or complete abolishment of function, potentially leading to diseases. There exists a persistent need to develop a mechanistic understanding of impacts of variants on proteins. To address this need we introduce a new computational method MutaBind to evaluate the effects of sequence variants and disease mutations on protein interactions and calculate the quantitative changes in binding affinity. The MutaBind method uses molecular mechanics force fields, statistical potentials and fast side-chain optimization algorithms. The MutaBind server maps mutations on a structural protein complex, calculates the associated changes in binding affinity, determines the deleterious effect of a mutation, estimates the confidence of this prediction and produces a mutant structural model for download. MutaBind can be applied to a large number of problems, including determination of potential driver mutations in cancer and other diseases, elucidation of the effects of sequence variants on protein fitness in evolution and protein design. MutaBind is available at http://www.ncbi.nlm.nih.gov/projects/mutabind/.


Assuntos
Internet , Mapeamento de Interação de Proteínas/métodos , Mapas de Interação de Proteínas/genética , Proteínas/química , Proteínas/metabolismo , Software , Algoritmos , Sítios de Ligação , Conjuntos de Dados como Assunto , Evolução Molecular , Aptidão Genética , Humanos , Simulação de Dinâmica Molecular , Neoplasias/genética , Ligação Proteica/genética , Proteínas/genética
10.
Int J Mol Sci ; 19(7)2018 Jul 20.
Artigo em Inglês | MEDLINE | ID: mdl-30037003

RESUMO

Cancer is a complex disease that is driven by genetic alterations. There has been a rapid development of genome-wide techniques during the last decade along with a significant lowering of the cost of gene sequencing, which has generated widely available cancer genomic data. However, the interpretation of genomic data and the prediction of the association of genetic variations with cancer and disease phenotypes still requires significant improvement. Missense mutations, which can render proteins non-functional and provide a selective growth advantage to cancer cells, are frequently detected in cancer. Effects caused by missense mutations can be pinpointed by in silico modeling, which makes it more feasible to find a treatment and reverse the effect. Specific human phenotypes are largely determined by stability, activity, and interactions between proteins and other biomolecules that work together to execute specific cellular functions. Therefore, analysis of missense mutations' effects on proteins and their complexes would provide important clues for identifying functionally important missense mutations, understanding the molecular mechanisms of cancer progression and facilitating treatment and prevention. Herein, we summarize the major computational approaches and tools that provide not only the classification of missense mutations as cancer drivers or passengers but also the molecular mechanisms induced by driver mutations. This review focuses on the discussion of annotation and prediction methods based on structural and biophysical data, analysis of somatic cancer missense mutations in 3D structures of proteins and their complexes, predictions of the effects of missense mutations on protein stability, protein-protein and protein-nucleic acid interactions, and assessment of conformational changes in protein conformations induced by mutations.


Assuntos
Mutação de Sentido Incorreto/genética , Neoplasias/genética , Animais , Biologia Computacional/métodos , Humanos , Neoplasias/metabolismo , Conformação Proteica , Estabilidade Proteica
11.
Biophys J ; 109(6): 1295-306, 2015 Sep 15.
Artigo em Inglês | MEDLINE | ID: mdl-26213149

RESUMO

Structures of protein complexes provide atomistic insights into protein interactions. Human proteins represent a quarter of all structures in the Protein Data Bank; however, available protein complexes cover less than 10% of the human proteome. Although it is theoretically possible to infer interactions in human proteins based on structures of homologous protein complexes, it is still unclear to what extent protein interactions and binding sites are conserved, and whether protein complexes from remotely related species can be used to infer interactions and binding sites. We considered biological units of protein complexes and clustered protein-protein binding sites into similarity groups based on their structure and sequence, which allowed us to identify unique binding sites. We showed that the growth rate of the number of unique binding sites in the Protein Data Bank was much slower than the growth rate of the number of structural complexes. Next, we investigated the evolutionary roots of unique binding sites and identified the major phyletic branches with the largest expansion in the number of novel binding sites. We found that many binding sites could be traced to the universal common ancestor of all cellular organisms, whereas relatively few binding sites emerged at the major evolutionary branching points. We analyzed the physicochemical properties of unique binding sites and found that the most ancient sites were the largest in size, involved many salt bridges, and were the most compact and least planar. In contrast, binding sites that appeared more recently in the evolution of eukaryotes were characterized by a larger fraction of polar and aromatic residues, and were less compact and more planar, possibly due to their more transient nature and roles in signaling processes.


Assuntos
Sítios de Ligação/genética , Evolução Molecular , Ligação Proteica/genética , Proteínas/genética , Proteínas/metabolismo , Animais , Humanos , Modelos Moleculares
12.
EMBO Rep ; 13(3): 266-71, 2012 Mar 01.
Artigo em Inglês | MEDLINE | ID: mdl-22261719

RESUMO

Although the identification of protein interactions by high-throughput (HTP) methods progresses at a fast pace, 'interactome' data sets still suffer from high rates of false positives and low coverage. To map the human protein interactome, we describe a new framework that uses experimental evidence on structural complexes, the atomic details of binding interfaces and evolutionary conservation. The structurally inferred interaction network is highly modular and more functionally coherent compared with experimental interaction networks derived from multiple literature citations. Moreover, structurally inferred and high-confidence HTP networks complement each other well, allowing us to construct a merged network to generate testable hypotheses and provide valuable experimental leads.


Assuntos
Complexos Multiproteicos/química , Mapeamento de Interação de Proteínas/métodos , Proteômica/métodos , Sítios de Ligação , Biologia Computacional/métodos , Bases de Dados Genéticas , Humanos , Ligação Proteica , Domínios e Motivos de Interação entre Proteínas , Software
13.
Nucleic Acids Res ; 40(Web Server issue): W242-8, 2012 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-22689646

RESUMO

Sites that show specific conservation patterns within subsets of proteins in a protein family are likely to be involved in the development of functional specificity. These sites, generally termed specificity determining sites (SDS), might play a crucial role in binding to a specific substrate or proteins. Identification of SDS through experimental techniques is a slow, difficult and tedious job. Hence, it is very important to develop efficient computational methods that can more expediently identify SDS. Herein, we present Specificity prediction using amino acids' Properties, Entropy and Evolution Rate (SPEER)-SERVER, a web server that predicts SDS by analyzing quantitative measures of the conservation patterns of protein sites based on their physico-chemical properties and the heterogeneity of evolutionary changes between and within the protein subfamilies. This web server provides an improved representation of results, adds useful input and output options and integrates a wide range of analysis and data visualization tools when compared with the original standalone version of the SPEER algorithm. Extensive benchmarking finds that SPEER-SERVER exhibits sensitivity and precision performance that, on average, meets or exceeds that of other currently available methods. SPEER-SERVER is available at http://www.hpppi.iicb.res.in/ss/.


Assuntos
Proteínas/química , Software , Algoritmos , Aminoácidos/química , Internet , Ligação Proteica , Alinhamento de Sequência , Análise de Sequência de Proteína , Interface Usuário-Computador
14.
Nucleic Acids Res ; 40(Database issue): D834-40, 2012 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-22102591

RESUMO

We have recently developed the Inferred Biomolecular Interaction Server (IBIS) and database, which reports, predicts and integrates different types of interaction partners and locations of binding sites in proteins based on the analysis of homologous structural complexes. Here, we highlight several new IBIS features and options. The server's webpage is now redesigned to allow users easier access to data for different interaction types. An entry page is added to give a quick summary of available results and to now accept protein sequence accessions. To elucidate the formation of protein complexes, not just binary interactions, IBIS currently presents an expandable interaction network. Previously, IBIS provided annotations for four different types of binding partners: proteins, small molecules, nucleic acids and peptides; in the current version a new protein-ion interaction type has been added. Several options provide easy downloads of IBIS data for all Protein Data Bank (PDB) protein chains and the results for each query. In this study, we show that about one-third of all RefSeq sequences can be annotated with IBIS interaction partners and binding sites. The IBIS server is available at http://www.ncbi.nlm.nih.gov/Structure/ibis/ibis.cgi and updated biweekly.


Assuntos
Bases de Dados de Proteínas , Mapeamento de Interação de Proteínas , Proteínas/química , Sítios de Ligação , Gráficos por Computador , Íons/química , Anotação de Sequência Molecular , Complexos Multiproteicos/química , Ácidos Nucleicos/química , Peptídeos/química , Análise de Sequência de Proteína , Integração de Sistemas , Interface Usuário-Computador
15.
Nucleic Acids Res ; 40(Database issue): D461-4, 2012 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-22135289

RESUMO

Close to 60% of protein sequences tracked in comprehensive databases can be mapped to a known three-dimensional (3D) structure by standard sequence similarity searches. Potentially, a great deal can be learned about proteins or protein families of interest from considering 3D structure, and to this day 3D structure data may remain an underutilized resource. Here we present enhancements in the Molecular Modeling Database (MMDB) and its data presentation, specifically pertaining to biologically relevant complexes and molecular interactions. MMDB is tightly integrated with NCBI's Entrez search and retrieval system, and mirrors the contents of the Protein Data Bank. It links protein 3D structure data with sequence data, sequence classification resources and PubChem, a repository of small-molecule chemical structures and their biological activities, facilitating access to 3D structure data not only for structural biologists, but also for molecular biologists and chemists. MMDB provides a complete set of detailed and pre-computed structural alignments obtained with the VAST algorithm, and provides visualization tools for 3D structure and structure/sequence alignment via the molecular graphics viewer Cn3D. MMDB can be accessed at http://www.ncbi.nlm.nih.gov/structure.


Assuntos
Bases de Dados de Proteínas , Modelos Moleculares , Conformação Proteica , Análise de Sequência de Proteína
16.
Nucleic Acids Res ; 40(Database issue): D13-25, 2012 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-22140104

RESUMO

In addition to maintaining the GenBank® nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources for the data in GenBank and other biological data made available through the NCBI Website. NCBI resources include Entrez, the Entrez Programming Utilities, MyNCBI, PubMed, PubMed Central (PMC), Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Primer-BLAST, COBALT, Splign, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, dbVar, Epigenomics, Genome and related tools, the Map Viewer, Model Maker, Evidence Viewer, Trace Archive, Sequence Read Archive, BioProject, BioSample, Retroviral Genotyping Tools, HIV-1/Human Protein Interaction Database, Gene Expression Omnibus (GEO), Probe, Online Mendelian Inheritance in Animals (OMIA), the Molecular Modeling Database (MMDB), the Conserved Domain Database (CDD), the Conserved Domain Architecture Retrieval Tool (CDART), Biosystems, Protein Clusters and the PubChem suite of small molecule databases. Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized data sets. All of these resources can be accessed through the NCBI home page at www.ncbi.nlm.nih.gov.


Assuntos
Bases de Dados como Assunto , Bases de Dados Genéticas , Bases de Dados de Proteínas , Expressão Gênica , Genômica , Internet , Modelos Moleculares , National Library of Medicine (U.S.) , Publicações Periódicas como Assunto , PubMed , Alinhamento de Sequência , Análise de Sequência de DNA , Análise de Sequência de Proteína , Análise de Sequência de RNA , Bibliotecas de Moléculas Pequenas , Estados Unidos
17.
J Mol Biol ; 436(16): 168644, 2024 Jun 05.
Artigo em Inglês | MEDLINE | ID: mdl-38848867

RESUMO

Next-generation pathogenicity predictors are designed to identify pathogenic mutations in genetic disorders but are increasingly used to detect driver mutations in cancer. Despite this, their suitability for cancer is not fully established. Here we have assessed the effectiveness of next-generation pathogenicity predictors when applied to cancer by using a comprehensive experimental benchmark of cancer driver and neutral mutations. Our findings indicate that state-of-the-art methods AlphaMissense and VARITY demonstrate commendable performance despite generally underperforming compared to cancer-specific methods. This is notable considering that these methods do not explicitly incorporate cancer-related data in their training and have made concerted efforts to prevent data leakage from the human-curated training and test sets. Nevertheless, it should be mentioned that a significant limitation of using pathogenicity predictors for cancer arises from their inability to detect cancer potential driver mutations specific for a particular cancer type.

18.
Elife ; 122024 Jan 31.
Artigo em Inglês | MEDLINE | ID: mdl-38293962

RESUMO

Wrapping of DNA into nucleosomes restricts accessibility to DNA and may affect the recognition of binding motifs by transcription factors. A certain class of transcription factors, the pioneer transcription factors, can specifically recognize their DNA binding sites on nucleosomes, initiate local chromatin opening, and facilitate the binding of co-factors in a cell-type-specific manner. For the majority of human pioneer transcription factors, the locations of their binding sites, mechanisms of binding, and regulation remain unknown. We have developed a computational method to predict the cell-type-specific ability of transcription factors to bind nucleosomes by integrating ChIP-seq, MNase-seq, and DNase-seq data with details of nucleosome structure. We have demonstrated the ability of our approach in discriminating pioneer from canonical transcription factors and predicted new potential pioneer transcription factors in H1, K562, HepG2, and HeLa-S3 cell lines. Last, we systematically analyzed the interaction modes between various pioneer transcription factors and detected several clusters of distinctive binding sites on nucleosomal DNA.


Assuntos
Nucleossomos , Fatores de Transcrição , Humanos , Nucleossomos/genética , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo , Cromatina , DNA/metabolismo , Sítios de Ligação
19.
Cell Rep ; 43(1): 113655, 2024 01 23.
Artigo em Inglês | MEDLINE | ID: mdl-38219146

RESUMO

Alterations in the exonuclease domain of DNA polymerase ε cause ultramutated cancers. These cancers accumulate AGA>ATA transversions; however, their genomic features beyond the trinucleotide motifs are obscure. We analyze the extended DNA context of ultramutation using whole-exome sequencing data from 524 endometrial and 395 colorectal tumors. We find that G>T transversions in POLE-mutant tumors predominantly affect sequences containing at least six consecutive purines, with a striking preference for certain positions within polypurine tracts. Using this signature, we develop a machine-learning classifier to identify tumors with hitherto unknown POLE drivers and validate two drivers, POLE-E978G and POLE-S461L, by functional assays in yeast. Unlike other pathogenic variants, the E978G substitution affects the polymerase domain of Pol ε. We further show that tumors with POLD1 drivers share the extended signature of POLE ultramutation. These findings expand the understanding of ultramutation mechanisms and highlight peculiar mutagenic properties of polypurine tracts in the human genome.


Assuntos
Neoplasias Colorretais , DNA Polimerase II , Humanos , DNA Polimerase II/genética , DNA Polimerase II/metabolismo , Mutação/genética , Mutagênese , Neoplasias Colorretais/patologia , DNA Polimerase III/genética , Sequenciamento do Exoma , Proteínas de Ligação a Poli-ADP-Ribose/genética
20.
Nucleic Acids Res ; 39(Database issue): D38-51, 2011 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-21097890

RESUMO

In addition to maintaining the GenBank® nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources for the data in GenBank and other biological data made available through the NCBI Web site. NCBI resources include Entrez, the Entrez Programming Utilities, MyNCBI, PubMed, PubMed Central (PMC), Entrez Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Primer-BLAST, COBALT, Electronic PCR, OrfFinder, Splign, ProSplign, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, dbVar, Epigenomics, Cancer Chromosomes, Entrez Genomes and related tools, the Map Viewer, Model Maker, Evidence Viewer, Trace Archive, Sequence Read Archive, Retroviral Genotyping Tools, HIV-1/Human Protein Interaction Database, Gene Expression Omnibus (GEO), Entrez Probe, GENSAT, Online Mendelian Inheritance in Man (OMIM), Online Mendelian Inheritance in Animals (OMIA), the Molecular Modeling Database (MMDB), the Conserved Domain Database (CDD), the Conserved Domain Architecture Retrieval Tool (CDART), IBIS, Biosystems, Peptidome, OMSSA, Protein Clusters and the PubChem suite of small molecule databases. Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized data sets. All of these resources can be accessed through the NCBI home page at www.ncbi.nlm.nih.gov.


Assuntos
Bases de Dados Genéticas , Bases de Dados de Proteínas , Expressão Gênica , Genômica , National Library of Medicine (U.S.) , Estrutura Terciária de Proteína , PubMed , Alinhamento de Sequência , Análise de Sequência de DNA , Análise de Sequência de RNA , Software , Integração de Sistemas , Estados Unidos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA