Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 26
Filter
Add more filters

Publication year range
1.
Nature ; 597(7874): 119-125, 2021 09.
Article in English | MEDLINE | ID: mdl-34433969

ABSTRACT

Meningiomas are the most common primary intracranial tumour in adults1. Patients with symptoms are generally treated with surgery as there are no effective medical therapies. The World Health Organization histopathological grade of the tumour and the extent of resection at surgery (Simpson grade) are associated with the recurrence of disease; however, they do not accurately reflect the clinical behaviour of all meningiomas2. Molecular classifications of meningioma that reliably reflect tumour behaviour and inform on therapies are required. Here we introduce four consensus molecular groups of meningioma by combining DNA somatic copy-number aberrations, DNA somatic point mutations, DNA methylation and messenger RNA abundance in a unified analysis. These molecular groups more accurately predicted clinical outcomes compared with existing classification schemes. Each molecular group showed distinctive and prototypical biology (immunogenic, benign NF2 wild-type, hypermetabolic and proliferative) that informed therapeutic options. Proteogenomic characterization reinforced the robustness of the newly defined molecular groups and uncovered highly abundant and group-specific protein targets that we validated using immunohistochemistry. Single-cell RNA sequencing revealed inter-individual variations in meningioma as well as variations in intrinsic expression programs in neoplastic cells that mirrored the biology of the molecular groups identified.


Subject(s)
Biomarkers, Tumor/metabolism , Meningioma/classification , Meningioma/metabolism , Proteogenomics , DNA Methylation , Data Analysis , Drug Discovery , Female , Gene Expression Regulation, Neoplastic , Humans , Immunohistochemistry , Male , Meningioma/drug therapy , Meningioma/genetics , Mutation , RNA-Seq , Reproducibility of Results , Single-Cell Analysis
2.
Am J Epidemiol ; 2024 Aug 27.
Article in English | MEDLINE | ID: mdl-39191651

ABSTRACT

Common genetic variation throughout the genome together with rare coding variants identified to date explain about a half of the inherited genetic component of epithelial ovarian cancer risk. It is likely that rare variation in the non-coding genome will explain some of the unexplained heritability, but identifying such variants is challenging. The primary problem is lack of statistical power to identifying individual risk variants by association as power is a function of sample size, effect size and allele frequency. Power can be increased by using burden tests which test for association of carriers of any variant in a specified genomic region. This has the effect of increasing the putative effect allele frequency. PAX8 is a transcription factor that plays a critical role in tumour progression, migration and invasion. Furthermore, regulatory elements proximal to target genes of PAX8 are enriched for common ovarian cancer risk variants. We hypothesised that rare variation in PAX8 binding sites are also associated with ovarian cancer risk, but unlikely to be associated with risk of breast, colorectal or endometrial cancer. We have used publicly available, whole-genome sequencing data from the UK 100,000 Genomes Project to evaluate the burden of rare variation in PAX8 binding sites across the genome. Data were available for 522 ovarian cancers, 2,984 breast cancers, 2,696 colorectal cancers, 836 endometrial cancers and 2253 non-cancer controls. Active binding sites were defined using data from multiple PAX8 and H3K27 ChIPseq experiments. We found no association between the burden of rare variation in PAX8 binding sites (defined in several ways) and risk of ovarian, breast or endometrial cancer. An apparent association with colorectal cancer was likely to be a technical artefact as a similar association was also detected for rare variation in random regions of the genome. Despite the null result this study provides a proof-of -principle for using burden testing to identify rare, non-coding germline genetic variation associated with disease. Larger sample sizes available from large-scale sequencing projects together with improved understanding of the function of the non-coding genome will increase the potential of similar studies in the future.

3.
Am J Hum Genet ; 107(4): 622-635, 2020 10 01.
Article in English | MEDLINE | ID: mdl-32946763

ABSTRACT

Quantifying the functional effects of complex disease risk variants can provide insights into mechanisms underlying disease biology. Genome-wide association studies have identified 39 regions associated with risk of epithelial ovarian cancer (EOC). The vast majority of these variants lie in the non-coding genome, where they likely function through interaction with gene regulatory elements. In this study we first estimated the heritability explained by known common low penetrance risk alleles for EOC. The narrow sense heritability (hg2) of EOC overall and high-grade serous ovarian cancer (HGSOCs) were estimated to be 5%-6%. Partitioned SNP heritability across broad functional categories indicated a significant contribution of regulatory elements to EOC heritability. We collated epigenomic profiling data for 77 cell and tissue types from Roadmap Epigenomics and ENCODE, and from H3K27Ac ChIP-seq data generated in 26 ovarian cancer and precursor-related cell and tissue types. We identified significant enrichment of risk single-nucleotide polymorphisms (SNPs) in active regulatory elements marked by H3K27Ac in HGSOCs. To further investigate how risk SNPs in active regulatory elements influence predisposition to ovarian cancer, we used motifbreakR to predict the disruption of transcription factor binding sites. We identified 469 candidate causal risk variants in H3K27Ac peaks that are predicted to significantly break transcription factor (TF) motifs. The most frequently broken motif was REST (p value = 0.0028), which has been reported as both a tumor suppressor and an oncogene. Overall, these systematic functional annotations with epigenomic data improve interpretation of EOC risk variants and shed light on likely cells of origin.


Subject(s)
Carcinoma, Ovarian Epithelial/genetics , Co-Repressor Proteins/genetics , Cystadenocarcinoma, Serous/genetics , Enhancer Elements, Genetic , Histones/genetics , Nerve Tissue Proteins/genetics , Ovarian Neoplasms/genetics , Alleles , Binding Sites , Carcinoma, Ovarian Epithelial/diagnosis , Carcinoma, Ovarian Epithelial/pathology , Chromosome Mapping , Co-Repressor Proteins/metabolism , Cystadenocarcinoma, Serous/diagnosis , Cystadenocarcinoma, Serous/pathology , Female , Genetic Predisposition to Disease , Genome, Human , Genome-Wide Association Study , Histones/metabolism , Humans , Inheritance Patterns , Nerve Tissue Proteins/metabolism , Ovarian Neoplasms/diagnosis , Ovarian Neoplasms/pathology , Penetrance , Polymorphism, Single Nucleotide , Risk
5.
RNA Biol ; 18(12): 2203-2217, 2021 12.
Article in English | MEDLINE | ID: mdl-34006179

ABSTRACT

RNA molecules function as messenger RNAs (mRNAs) that encode proteins and noncoding transcripts that serve as adaptor molecules, structural components, and regulators of genome organization and gene expression. Their function and regulation are largely mediated by RNA binding proteins (RBPs). Here we present RNA proximity labelling (RPL), an RNA-centric method comprising the endonuclease-deficient Type VI CRISPR-Cas protein dCas13b fused to engineered ascorbate peroxidase APEX2. RPL discovers target RNA proximal proteins in vivo via proximity-based biotinylation. RPL applied to U1 identified proteins involved in both U1 canonical and noncanonical functions. Profiling of poly(A) tail proximal proteins uncovered expected categories of RBPs and provided additional evidence for 5'-3' proximity and unexplored subcellular localizations of poly(A)+ RNA. Our results suggest that RPL allows rapid identification of target RNA binding proteins in native cellular contexts, and is expected to pave the way for discovery of novel RNA-protein interactions important for health and disease.


Subject(s)
Ascorbate Peroxidases/genetics , CRISPR-Associated Proteins/genetics , RNA-Binding Proteins/metabolism , RNA/metabolism , Biotinylation , CRISPR-Cas Systems , HEK293 Cells , Humans , Poly A , RNA/chemistry , RNA, Guide, Kinetoplastida/genetics , RNA, Small Nuclear/genetics , Recombinant Fusion Proteins/genetics , Staining and Labeling
6.
Brief Bioinform ; 18(6): 1033-1043, 2017 Nov 01.
Article in English | MEDLINE | ID: mdl-27567382

ABSTRACT

The protein side-chain packing problem (PSCPP) is an important subproblem of both protein structure prediction and protein design. During the past two decades, a large number of methods have been proposed to tackle this problem. These methods consist of three main components: a rotamer library, a scoring function and a search strategy. The average overall accuracy level obtained by these methods is approximately 87%. Whether a better accuracy level could be achieved remains to be answered. To address this question, we calculated the maximum accuracy level attainable using a simple rotamer library, independently of the energy function or the search method. Using 2883 different structures from the Protein Data Bank, we compared this accuracy level with the accuracy level of five state-of-the-art methods. These comparisons indicated that, for buried residues in the protein, we are already close to the best possible accuracy results. In addition, for exposed residues, we found that a significant gap exists between the possible improvement and the maximum accuracy level achievable with current methods. After determining that an improvement is possible, the next step is to understand what limitations are preventing us from obtaining such an improvement. Previous works on protein structure prediction and protein design have shown that scoring function inaccuracies may represent the main obstacle to achieving better results for these problems. To show that the same is true for the PSCPP, we evaluated the quality of two scoring functions used by some state-of-the-art algorithms. Our results indicate that neither of these scoring functions can guide the search method correctly, thereby reinforcing the idea that efforts to solve the PSCPP must also focus on developing better scoring functions.


Subject(s)
Algorithms , Computational Biology/methods , Protein Conformation , Proteins/chemistry , Databases, Protein , Humans , Proteins/metabolism
7.
Gynecol Oncol ; 153(2): 343-355, 2019 05.
Article in English | MEDLINE | ID: mdl-30898391

ABSTRACT

OBJECTIVE: Genome-wide association studies (GWASs) for epithelial ovarian cancer (EOC) have focused largely on populations of European ancestry. We aimed to identify common germline variants associated with EOC risk in Asian women. METHODS: Genotyping was performed as part of the OncoArray project. Samples with >60% Asian ancestry were included in the analysis. Genotyping was performed on 533,631 SNPs in 3238 Asian subjects diagnosed with invasive or borderline EOC and 4083 unaffected controls. After imputation, genotypes were available for 11,595,112 SNPs to identify associations. RESULTS: At chromosome 6p25.2, SNP rs7748275 was associated with risk of serous EOC (odds ratio [OR] = 1.34, P = 8.7 × 10-9) and high-grade serous EOC (HGSOC) (OR = 1.34, P = 4.3 × 10-9). SNP rs6902488 at 6p25.2 (r2 = 0.97 with rs7748275) lies in an active enhancer and is predicted to impact binding of STAT3, P300 and ELF1. We identified additional risk loci with low Bayesian false discovery probability (BFDP) scores, indicating they are likely to be true risk associations (BFDP <10%). At chromosome 20q11.22, rs74272064 was associated with HGSOC risk (OR = 1.27, P = 9.0 × 10-8). Overall EOC risk was associated with rs10260419 at chromosome 7p21.3 (OR = 1.33, P = 1.2 × 10-7) and rs74917072 at chromosome 2q37.3 (OR = 1.25, P = 4.7 × 10-7). At 2q37.3, expression quantitative trait locus analysis in 404 HGSOC tissues identified ESPNL as a putative candidate susceptibility gene (P = 1.2 × 10-7). CONCLUSION: While some risk loci were shared between East Asian and European populations, others were population-specific, indicating that the landscape of EOC risk in Asian women has both shared and unique features compared to women of European ancestry.


Subject(s)
Carcinoma, Ovarian Epithelial/genetics , Asian People/genetics , Base Sequence , Case-Control Studies , Female , Genetic Predisposition to Disease , Genome-Wide Association Study , Humans , Polymorphism, Single Nucleotide , Quantitative Trait Loci
8.
BMC Bioinformatics ; 19(Suppl 20): 506, 2018 Dec 21.
Article in English | MEDLINE | ID: mdl-30577740

ABSTRACT

BACKGROUND: Atomic details of protein-DNA complexes can provide insightful information for better understanding of the function and binding specificity of DNA binding proteins. In addition to experimental methods for solving protein-DNA complex structures, protein-DNA docking can be used to predict native or near-native complex models. A docking program typically generates a large number of complex conformations and predicts the complex model(s) based on interaction energies between protein and DNA. However, the prediction accuracy is hampered by current approaches to model assessment, especially when docking simulations fail to produce any near-native models. RESULTS: We present here a Support Vector Machine (SVM)-based approach for quality assessment of the predicted transcription factor (TF)-DNA complex models. Besides a knowledge-based protein-DNA interaction potential DDNA3, we applied several structural features that have been shown to play important roles in binding specificity between transcription factors and DNA molecules to quality assessment of complex models. To address the issue of unbalanced positive and negative cases in the training dataset, we applied hard-negative mining, an iterative training process that selects an initial training dataset by combining all of the positive cases and a random sample from the negative cases. Results show that the SVM model greatly improves prediction accuracy (84.2%) over two knowledge-based protein-DNA interaction potentials, orientation potential (60.8%) and DDNA3 (68.4%). The improvement is achieved through reducing the number of false positive predictions, especially for the hard docking cases, in which a docking algorithm fails to produce any near-native complex models. CONCLUSIONS: A learning-based SVM scoring model with structural features for specific protein-DNA binding and an atomic-level protein-DNA interaction potential DDNA3 significantly improves prediction accuracy of complex models by successfully identifying cases without near-native structural models.


Subject(s)
DNA/metabolism , Models, Molecular , Support Vector Machine , Transcription Factors/metabolism , Algorithms , DNA/chemistry , Protein Binding
9.
Proteins ; 84(8): 1147-61, 2016 08.
Article in English | MEDLINE | ID: mdl-27147539

ABSTRACT

DNA-binding proteins play critical roles in biological processes including gene expression, DNA packaging and DNA repair. They bind to DNA target sequences with different degrees of binding specificity, ranging from highly specific (HS) to nonspecific (NS). Alterations of DNA-binding specificity, due to either genetic variation or somatic mutations, can lead to various diseases. In this study, a comparative analysis of protein-DNA complex structures was carried out to investigate the structural features that contribute to binding specificity. Protein-DNA complexes were grouped into three general classes based on degrees of binding specificity: HS, multispecific (MS), and NS. Our results show a clear trend of structural features among the three classes, including amino acid binding propensities, simple and complex hydrogen bonds, major/minor groove and base contacts, and DNA shape. We found that aspartate is enriched in HS DNA binding proteins and predominately binds to a cytosine through a single hydrogen bond or two consecutive cytosines through bidentate hydrogen bonds. Aromatic residues, histidine and tyrosine, are highly enriched in the HS and MS groups and may contribute to specific binding through different mechanisms. To further investigate the role of protein flexibility in specific protein-DNA recognition, we analyzed the conformational changes between the bound and unbound states of DNA-binding proteins and structural variations. The results indicate that HS and MS DNA-binding domains have larger conformational changes upon DNA-binding and larger degree of flexibility in both bound and unbound states. Proteins 2016; 84:1147-1161. © 2016 Wiley Periodicals, Inc.


Subject(s)
Amino Acids/chemistry , DNA-Binding Proteins/chemistry , DNA/chemistry , Binding Sites , Hydrogen Bonding , Hydrophobic and Hydrophilic Interactions , Models, Molecular , Nucleic Acid Conformation , Protein Binding , Protein Interaction Domains and Motifs , Protein Structure, Secondary , Static Electricity , Thermodynamics
10.
Bioinformatics ; 29(3): 322-30, 2013 Feb 01.
Article in English | MEDLINE | ID: mdl-23220572

ABSTRACT

MOTIVATION: Computational modeling of protein-DNA complexes remains a challenging problem in structural bioinformatics. One of the key factors for a successful protein-DNA docking is a potential function that can accurately discriminate the near-native structures from decoy complexes and at the same time make conformational sampling more efficient. Here, we developed a novel orientation-dependent, knowledge-based, residue-level potential for improving transcription factor (TF)-DNA docking. RESULTS: We demonstrated the performance of this new potential in TF-DNA binding affinity prediction, discrimination of native protein-DNA complex from decoy structures, and most importantly in rigid TF-DNA docking. The rigid TF-DNA docking with the new orientation potential, on a benchmark of 38 complexes, successfully predicts 42% of the cases with root mean square deviations lower than 1 Å and 55% of the cases with root mean square deviations lower than 3 Å. The results suggest that docking with this new orientation-dependent, coarse-grained statistical potential can achieve high-docking accuracy and can serve as a crucial first step in multi-stage flexible protein-DNA docking. AVAILABILITY AND IMPLEMENTATION: The new potential is available at http://bioinfozen.uncc.edu/Protein_DNA_orientation_potential.tar.


Subject(s)
DNA/chemistry , Molecular Docking Simulation/methods , Transcription Factors/chemistry , DNA/metabolism , Knowledge Bases , Protein Binding , Transcription Factors/metabolism
11.
bioRxiv ; 2024 Aug 13.
Article in English | MEDLINE | ID: mdl-38405764

ABSTRACT

Genomics for rare disease diagnosis has advanced at a rapid pace due to our ability to perform "N-of-1" analyses on individual patients with ultra-rare diseases. The increasing sizes of ultra-rare disease cohorts internationally newly enables cohort-wide analyses for new discoveries, but well-calibrated statistical genetics approaches for jointly analyzing these patients are still under development.1,2 The Undiagnosed Diseases Network (UDN) brings multiple clinical, research and experimental centers under the same umbrella across the United States to facilitate and scale N-of-1 analyses. Here, we present the first joint analysis of whole genome sequencing data of UDN patients across the network. We introduce new, well-calibrated statistical methods for prioritizing disease genes with de novo recurrence and compound heterozygosity. We also detect pathways enriched with candidate and known diagnostic genes. Our computational analysis, coupled with a systematic clinical review, recapitulated known diagnoses and revealed new disease associations. We further release a software package, RaMeDiES, enabling automated cross-analysis of deidentified sequenced cohorts for new diagnostic and research discoveries. Gene-level findings and variant-level information across the cohort are available in a public-facing browser (https://dbmi-bgm.github.io/udn-browser/). These results show that N-of-1 efforts should be supplemented by a joint genomic analysis across cohorts.

12.
bioRxiv ; 2023 Apr 12.
Article in English | MEDLINE | ID: mdl-37090516

ABSTRACT

The transcription factors MECOM, PAX8, SOX17 and WT1 are candidate master regulators of high-grade serous 'ovarian' cancer (HGSC), yet their cooperative role in the hypothesized tissue of origin, the fallopian tube secretory epithelium (FTSEC) is unknown. We generated 26 epigenome (CUT&TAG, CUT&RUN, ATAC-seq and HiC) data sets and 24 profiles of RNA-seq transcription factor knock-down followed by RNA sequencing in FTSEC and HGSC models to define binding sites and gene sets regulated by these factors in cis and trans. This revealed that MECOM, PAX8, SOX17 and WT1 are lineage-enriched, super-enhancer associated master regulators whose cooperative DNA-binding patterns and target genes are re-wired during tumor development. All four TFs were indispensable for HGSC clonogenicity and survival but only depletion of PAX8 and WT1 impaired FTSEC cell survival. These four TFs were pharmacologically inhibited by transcriptional inhibitors only in HGSCs but not in FTSECs. Collectively, our data highlights that tumor-specific epigenetic remodeling is tightly related to MECOM, PAX8, SOX17 and WT1 activity and these transcription factors are targetable in a tumor-specific manner through transcriptional inhibitors.

13.
Nat Commun ; 14(1): 346, 2023 01 21.
Article in English | MEDLINE | ID: mdl-36681680

ABSTRACT

While the mutational and transcriptional landscapes of renal cell carcinoma (RCC) are well-known, the epigenome is poorly understood. We characterize the epigenome of clear cell (ccRCC), papillary (pRCC), and chromophobe RCC (chRCC) by using ChIP-seq, ATAC-Seq, RNA-seq, and SNP arrays. We integrate 153 individual data sets from 42 patients and nominate 50 histology-specific master transcription factors (MTF) to define RCC histologic subtypes, including EPAS1 and ETS-1 in ccRCC, HNF1B in pRCC, and FOXI1 in chRCC. We confirm histology-specific MTFs via immunohistochemistry including a ccRCC-specific TF, BHLHE41. FOXI1 overexpression with knock-down of EPAS1 in the 786-O ccRCC cell line induces transcriptional upregulation of chRCC-specific genes, TFCP2L1, ATP6V0D2, KIT, and INSRR, implicating FOXI1 as a MTF for chRCC. Integrating RCC GWAS risk SNPs with H3K27ac ChIP-seq and ATAC-seq data reveals that risk-variants are significantly enriched in allelically-imbalanced peaks. This epigenomic atlas in primary human samples provides a resource for future investigation.


Subject(s)
Carcinoma, Renal Cell , Kidney Neoplasms , Humans , Carcinoma, Renal Cell/pathology , Kidney Neoplasms/pathology , Epigenomics , Transcription Factors/genetics , Oncogenes , Forkhead Transcription Factors/genetics
14.
Life Sci Alliance ; 5(10)2022 10.
Article in English | MEDLINE | ID: mdl-35777959

ABSTRACT

Candidate causal risk variants from genome-wide association studies reside almost exclusively in noncoding regions of the genome and innovative approaches are necessary to understand their biological function. Multi-marker analysis of genomic annotation (MAGMA) is a widely used program that nominates candidate risk genes by mapping single-nucleotide polymorphism summary statistics from genome-wide association studies to gene bodies. We augmented MAGMA to create chromatin-MAGMA (chromMAGMA), a method to nominate candidate risk genes based on the presence of risk variants within noncoding regulatory elements (REs). We applied chromMAGMA to a genetic susceptibility dataset for epithelial ovarian cancer (EOC), a rare gynecologic malignancy characterized by high mortality. This identified 155 unique candidate EOC risk genes across five EOC histotypes; 83% (105/127) of high-grade serous ovarian cancer risk genes had not previously been implicated in this EOC histotype. Risk genes nominated by chromMAGMA converged on mRNA splicing and transcriptional dysregulation pathways. chromMAGMA is a pipeline that nominates candidate risk genes through a gene regulation-focused approach and helps interpret the biological mechanism of noncoding risk variants for complex diseases.


Subject(s)
Genome-Wide Association Study , Regulatory Sequences, Nucleic Acid , Chromatin , Female , Genomics , Humans , Ovary , Regulatory Sequences, Nucleic Acid/genetics
15.
Cell Rep Med ; 3(3): 100542, 2022 03 15.
Article in English | MEDLINE | ID: mdl-35492879

ABSTRACT

Endometriosis is associated with increased risk of epithelial ovarian cancers (EOCs). Using data from large endometriosis and EOC genome-wide association meta-analyses, we estimate the genetic correlation and evaluate the causal relationship between genetic liability to endometriosis and EOC histotypes, and identify shared susceptibility loci. We estimate a significant genetic correlation (rg) between endometriosis and clear cell (rg = 0.71), endometrioid (rg = 0.48), and high-grade serous (rg = 0.19) ovarian cancer, associations supported by Mendelian randomization analyses. Bivariate meta-analysis identified 28 loci associated with both endometriosis and EOC, including 19 with evidence for a shared underlying association signal. Differences in the shared risk suggest different underlying pathways may contribute to the relationship between endometriosis and the different histotypes. Functional annotation using transcriptomic and epigenomic profiles of relevant tissues/cells highlights several target genes. This comprehensive analysis reveals profound genetic overlap between endometriosis and EOC histotypes with valuable genomic targets for understanding the biological mechanisms linking the diseases.


Subject(s)
Endometriosis , Neoplasms, Glandular and Epithelial , Ovarian Neoplasms , Carcinoma, Ovarian Epithelial/genetics , Endometriosis/genetics , Female , Genome-Wide Association Study , Humans , Neoplasms, Glandular and Epithelial/complications , Ovarian Neoplasms/genetics
16.
Sci Signal ; 15(728): eabm2496, 2022 04 05.
Article in English | MEDLINE | ID: mdl-35380877

ABSTRACT

PAX8 is a master transcription factor that is essential during embryogenesis and promotes neoplastic growth. It is expressed by the secretory cells lining the female reproductive tract, and its deletion during development results in atresia of reproductive tract organs. Nearly all ovarian carcinomas express PAX8, and its knockdown results in apoptosis of ovarian cancer cells. To explore the role of PAX8 in these tissues, we purified the PAX8 protein complex from nonmalignant fallopian tube cells and high-grade serous ovarian carcinoma cell lines. We found that PAX8 was a member of a large chromatin remodeling complex and preferentially interacted with SOX17, another developmental transcription factor. Depleting either PAX8 or SOX17 from cancer cells altered the expression of factors involved in angiogenesis and functionally disrupted tubule and capillary formation in cell culture and mouse models. PAX8 and SOX17 in ovarian cancer cells promoted the secretion of angiogenic factors by suppressing the expression of SERPINE1, which encodes a proteinase inhibitor with antiangiogenic effects. The findings reveal a non-cell-autonomous function of these transcription factors in regulating angiogenesis in ovarian cancer.


Subject(s)
Ovarian Neoplasms , PAX8 Transcription Factor , SOXF Transcription Factors , Transcription Factors , Animals , Fallopian Tubes/metabolism , Fallopian Tubes/pathology , Female , HMGB Proteins/genetics , HMGB Proteins/metabolism , Humans , Mice , Neoplasm Grading , Ovarian Neoplasms/metabolism , PAX8 Transcription Factor/genetics , PAX8 Transcription Factor/metabolism , SOXF Transcription Factors/genetics , SOXF Transcription Factors/metabolism , Transcription Factors/metabolism
17.
BMC Struct Biol ; 11: 45, 2011 Nov 01.
Article in English | MEDLINE | ID: mdl-22044637

ABSTRACT

BACKGROUND: Structural insight from transcription factor-DNA (TF-DNA) complexes is of paramount importance to our understanding of the affinity and specificity of TF-DNA interaction, and to the development of structure-based prediction of TF binding sites. Yet the majority of the TF-DNA complexes remain unsolved despite the considerable experimental efforts being made. Computational docking represents a promising alternative to bridge the gap. To facilitate the study of TF-DNA docking, carefully designed benchmarks are needed for performance evaluation and identification of the strengths and weaknesses of docking algorithms. RESULTS: We constructed two benchmarks for flexible and rigid TF-DNA docking respectively using a unified non-redundant set of 38 test cases. The test cases encompass diverse fold families and are classified into easy and hard groups with respect to the degrees of difficulty in TF-DNA docking. The major parameters used to classify expected docking difficulty in flexible docking are the conformational differences between bound and unbound TFs and the interaction strength between TFs and DNA. For rigid docking in which the starting structure is a bound TF conformation, only interaction strength is considered. CONCLUSIONS: We believe these benchmarks are important for the development of better interaction potentials and TF-DNA docking algorithms, which bears important implications to structure-based prediction of transcription factor binding sites and drug design.


Subject(s)
DNA/metabolism , Transcription Factors/metabolism , Algorithms , Binding Sites , Computer Simulation , Protein Binding
18.
Pharmacol Ther ; 220: 107722, 2021 04.
Article in English | MEDLINE | ID: mdl-33137377

ABSTRACT

Transcription factors (TFs) are major contributors to cancer risk and somatic development. In preclinical and clinical studies, direct or indirect inhibition of TF-mediated oncogenic gene expression profiles have proven to be effective in many tumor types, highlighting this group of proteins as valuable therapeutic targets. In spite of this, our understanding of TFs in epithelial ovarian cancer (EOC) is relatively limited. EOC is a heterogeneous disease composed of five major histologic subtypes; high-grade serous, low-grade serous, endometrioid, clear cell and mucinous. Each histology is associated with unique clinical etiologies, sensitivity to therapies, and molecular signatures - including diverse transcriptional regulatory programs. While some TFs are shared across EOC subtypes, a set of TFs are expressed in a histotype-specific manner and likely explain part of the histologic diversity of EOC subtypes. Targeting TFs present with unique opportunities for development of novel precision medicine strategies for ovarian cancer. This article reviews the critical TFs in EOC subtypes and highlights the potential of exploiting TFs as biomarkers and therapeutic targets.


Subject(s)
Ovarian Neoplasms , Carcinoma, Ovarian Epithelial/drug therapy , Carcinoma, Ovarian Epithelial/genetics , Female , Gene Expression Regulation, Neoplastic , Humans , Ovarian Neoplasms/drug therapy , Ovarian Neoplasms/genetics , Transcription Factors/genetics
19.
Cell Rep ; 35(2): 108978, 2021 04 13.
Article in English | MEDLINE | ID: mdl-33852846

ABSTRACT

The human fallopian tube harbors the cell of origin for the majority of high-grade serous "ovarian" cancers (HGSCs), but its cellular composition, particularly the epithelial component, is poorly characterized. We perform single-cell transcriptomic profiling of around 53,000 individual cells from 12 primary fallopian specimens to map their major cell types. We identify 10 epithelial subpopulations with diverse transcriptional programs. Based on transcriptional signatures, we reconstruct a trajectory whereby secretory cells differentiate into ciliated cells via a RUNX3high intermediate. Computational deconvolution of advanced HGSCs identifies the "early secretory" population as a likely precursor state for the majority of HGSCs. Its signature comprises both epithelial and mesenchymal features and is enriched in mesenchymal-type HGSCs (p = 6.7 × 10-27), a group known to have particularly poor prognoses. This cellular and molecular compendium of the human fallopian tube in cancer-free women is expected to advance our understanding of the earliest stages of fallopian epithelial neoplasia.


Subject(s)
Core Binding Factor Alpha 3 Subunit/genetics , Endometriosis/genetics , Leiomyoma/genetics , PAX8 Transcription Factor/genetics , SOXF Transcription Factors/genetics , Transcriptome , Adult , Cell Differentiation , Cell Line, Tumor , Core Binding Factor Alpha 3 Subunit/metabolism , Endometriosis/metabolism , Endometriosis/pathology , Endometriosis/surgery , Epithelial Cells/metabolism , Epithelial Cells/pathology , Epithelial-Mesenchymal Transition , Fallopian Tubes/metabolism , Fallopian Tubes/pathology , Fallopian Tubes/surgery , Female , Gene Expression Profiling , Gene Expression Regulation, Neoplastic , Humans , Leiomyoma/metabolism , Leiomyoma/pathology , Leiomyoma/surgery , Middle Aged , PAX8 Transcription Factor/metabolism , SOXF Transcription Factors/metabolism , Signal Transduction , Single-Cell Analysis
20.
Sci Adv ; 7(48): eabf6123, 2021 Nov 26.
Article in English | MEDLINE | ID: mdl-34818047

ABSTRACT

Critical developmental "master transcription factors" (MTFs) can be subverted during tumorigenesis to control oncogenic transcriptional programs. Current approaches to identifying MTFs rely on ChIP-seq data, which is unavailable for many cancers. We developed the CaCTS (Cancer Core Transcription factor Specificity) algorithm to prioritize candidate MTFs using pan-cancer RNA sequencing data. CaCTS identified candidate MTFs across 34 tumor types and 140 subtypes including predictions for cancer types/subtypes for which MTFs are unknown, including e.g. PAX8, SOX17, and MECOM as candidates in ovarian cancer (OvCa). In OvCa cells, consistent with known MTF properties, these factors are required for viability, lie proximal to superenhancers, co-occupy regulatory elements globally, co-bind loci encoding OvCa biomarkers, and are sensitive to pharmacologic inhibition of transcription. Our predictions of MTFs, especially for tumor types with limited understanding of transcriptional drivers, pave the way to therapeutic targeting of MTFs in a broad spectrum of cancers.

SELECTION OF CITATIONS
SEARCH DETAIL