RESUMO
In recent years, the emphasis of scientific inquiry has shifted from whole-genome analyses to an understanding of cellular responses specific to tissue, developmental stage or environmental conditions. One of the central mechanisms underlying the diversity and adaptability of the contextual responses is alternative splicing (AS). It enables a single gene to encode multiple isoforms with distinct biological functions. However, to date, the functions of the vast majority of differentially spliced protein isoforms are not known. Integration of genomic, proteomic, functional, phenotypic and contextual information is essential for supporting isoform-based modeling and analysis. Such integrative proteogenomics approaches promise to provide insights into the functions of the alternatively spliced protein isoforms and provide high-confidence hypotheses to be validated experimentally. This manuscript provides a survey of the public databases supporting isoform-based biology. It also presents an overview of the potential global impact of AS on the human canonical gene functions, molecular interactions and cellular pathways.
Assuntos
Processamento Alternativo , Isoformas de Proteínas/metabolismo , Biologia Computacional , Bases de Dados de Proteínas , HumanosRESUMO
Lynx (http://lynx.ci.uchicago.edu) is a web-based database and a knowledge extraction engine. It supports annotation and analysis of high-throughput experimental data and generation of weighted hypotheses regarding genes and molecular mechanisms contributing to human phenotypes or conditions of interest. Since the last release, the Lynx knowledge base (LynxKB) has been periodically updated with the latest versions of the existing databases and supplemented with additional information from public databases. These additions have enriched the data annotations provided by Lynx and improved the performance of Lynx analytical tools. Moreover, the Lynx analytical workbench has been supplemented with new tools for reconstruction of co-expression networks and feature-and-network-based prioritization of genetic factors and molecular mechanisms. These developments facilitate the extraction of meaningful knowledge from experimental data and LynxKB. The Service Oriented Architecture provides public access to LynxKB and its analytical tools via user-friendly web services and interfaces.
Assuntos
Bases de Dados Genéticas , Medicina Integrativa , Bases de Conhecimento , Mineração de Dados , Redes Reguladoras de Genes , Genes , Humanos , Anotação de Sequência Molecular , FenótipoRESUMO
We have developed Lynx (http://lynx.ci.uchicago.edu)--a web-based database and a knowledge extraction engine, supporting annotation and analysis of experimental data and generation of weighted hypotheses on molecular mechanisms contributing to human phenotypes and disorders of interest. Its underlying knowledge base (LynxKB) integrates various classes of information from >35 public databases and private collections, as well as manually curated data from our group and collaborators. Lynx provides advanced search capabilities and a variety of algorithms for enrichment analysis and network-based gene prioritization to assist the user in extracting meaningful knowledge from LynxKB and experimental data, whereas its service-oriented architecture provides public access to LynxKB and its analytical tools via user-friendly web services and interfaces.
Assuntos
Bases de Dados Genéticas , Doença/genética , Fenótipo , Ferramenta de Busca , Transtorno Autístico/genética , Genes , Genômica , Humanos , Internet , Bases de Conhecimento , Convulsões/genética , Integração de SistemasRESUMO
Lynx is a web-based integrated systems biology platform that supports annotation and analysis of experimental data and generation of weighted hypotheses on molecular mechanisms contributing to human phenotypes and disorders of interest. Lynx has integrated multiple classes of biomedical data (genomic, proteomic, pathways, phenotypic, toxicogenomic, contextual and others) from various public databases as well as manually curated data from our group and collaborators (LynxKB). Lynx provides tools for gene list enrichment analysis using multiple functional annotations and network-based gene prioritization. Lynx provides access to the integrated database and the analytical tools via REST based Web Services (http://lynx.ci.uchicago.edu/webservices.html). This comprises data retrieval services for specific functional annotations, services to search across the complete LynxKB (powered by Lucene), and services to access the analytical tools built within the Lynx platform.
Assuntos
Doenças Genéticas Inatas/genética , Software , Bases de Dados Factuais , Genes , Humanos , Internet , Bases de Conhecimento , Biologia de SistemasRESUMO
Recent technological advances in genomics now allow producing biological data at unprecedented tera- and petabyte scales. Yet, the extraction of useful knowledge from this voluminous data presents a significant challenge to a scientific community. Efficient mining of vast and complex data sets for the needs of biomedical research critically depends on seamless integration of clinical, genomic, and experimental information with prior knowledge about genotype-phenotype relationships accumulated in a plethora of publicly available databases. Furthermore, such experimental data should be accessible to a variety of algorithms and analytical pipelines that drive computational analysis and data mining. Translational projects require sophisticated approaches that coordinate and perform various analytical steps involved in the extraction of useful knowledge from accumulated clinical and experimental data in an orderly semiautomated manner. It presents a number of challenges such as (1) high-throughput data management involving data transfer, data storage, and access control; (2) scalable computational infrastructure; and (3) analysis of large-scale multidimensional data for the extraction of actionable knowledge.We present a scalable computational platform based on crosscutting requirements from multiple scientific groups for data integration, management, and analysis. The goal of this integrated platform is to address the challenges and to support the end-to-end analytical needs of various translational projects.
Assuntos
Pesquisa Translacional Biomédica/métodos , Pesquisa Translacional Biomédica/tendências , Mineração de Dados/métodos , Mineração de Dados/tendências , Bases de Dados Genéticas/tendências , Genômica/métodos , Genômica/tendências , HumanosRESUMO
Kaposi's sarcoma-associated herpesvirus (KSHV) infection and latency-associated nuclear antigen (LANA-1) upregulate the multifunctional protein angiogenin (ANG). Our studies demonstrate that silencing ANG or inhibiting its nuclear translocation downregulates KSHV LANA-1 expression and ANG is necessary for KSHV latency, anti-apoptosis and angiogenesis (Sadagopan et al., J. Virol. 83:3342-3364, 2009; Sadagopan et al., J Virol. 85:2666-2685, 2011). Here we show that LANA-1 interacts with ANG and colocalizes in latently infected endothelial telomerase-immortalized human umbilical vein endothelial (TIVE-LTC) cells. Mass spectrometric analyses of TIVE-LTC proteins immunoprecipitated by anti-LANA-1 and ANG antibodies identified 28 common cellular proteins such as ribosomal proteins, structural proteins, tRNA synthetases, metabolic pathway enzymes, chaperons, transcription factors, antioxidants, and ubiquitin proteosome proteins. LANA-1 and ANG interaction with one of the proteins, annexin A2, was validated. Annexin A2 has been shown to play roles in cell proliferation, apoptosis, plasmin generation, exocytosis, endocytosis, and cytoskeleton reorganization. It is also known to associate with glycolytic enzyme 3-phosphoglyceratekinase in the primer recognition protein (PRP) complex that interacts with DNA polymerase α in the lagging strand of DNA during replication. A higher level of annexin A2 is expressed in KSHV+ but not in Epstein-Barr virus (EBV)+ B-lymphoma cell lines. Annexin A2 colocalized with several LANA-1 punctate spots in KSHV+ body cavity B-cell lymphoma (BCBL-1) cells. In triple-staining analyses, we observed annexin A2-ANG-LANA-1, annexin A2-ANG, and ANG-LANA-1 colocalizations. Annexin A2 appeared as punctate nuclear dots in LANA-1-positive TIVE-LTC cells. In LANA-1-negative TIVE-LTC cells, annexin A2 was detected predominately in the cytoplasm, with some nuclear spots, and colocalization with ANG was observed mostly in the cytoplasm. Annexin A2 coimmunoprecipitated with LANA-1 and ANG in TIVE-LTC and BCBL-1 cells and with ANG in 293T cells independent of LANA-1. This suggested that annexin A2 forms a complex with LANA-1 and ANG as well as a separate complex with ANG. Silencing annexin A2 in BCBL-1 cells resulted in significant cell death, downregulation of cell cycle-associated Cdk6 and of cyclin D, E, and A proteins, and downregulation of LANA-1 and ANG expression. No effect was seen in KSHVâ» lymphoma (BJAB and Ramos) and 293T cells. These studies suggest that LANA-1 association with annexin A2/ANG could be more important than ANG association with annexin A2, and KSHV probably uses annexin A2 to maintain the viability and cell cycle regulation of latently infected cells. Since the identified LANA-1- and ANG-interacting common cellular proteins are hitherto unknown to KSHV and ANG biology, this offers a starting point for further analysis of their roles in KSHV biology, which may lead to identification of potential therapeutic targets to control KSHV latency and associated malignancies.
Assuntos
Anexina A2/metabolismo , Antígenos Virais/metabolismo , Herpesvirus Humano 8/fisiologia , Proteínas Nucleares/metabolismo , Ribonuclease Pancreático/metabolismo , Latência Viral , Sequência de Bases , Western Blotting , Linhagem Celular , Primers do DNA , Imunofluorescência , Humanos , Imunoprecipitação , Reação em Cadeia da Polimerase em Tempo RealRESUMO
Data on long-term outcomes and biological drivers associated with depth of remission after BCL2 inhibition by venetoclax in the treatment of chronic lymphocytic leukemia (CLL) are limited. In this open-label parallel-group phase-3 study, 432 patients with previously untreated CLL were randomized (1:1) to receive either 1-year venetoclax-obinutuzumab (Ven-Obi, 216 patients) or chlorambucil-Obi (Clb-Obi, 216 patients) therapy (NCT02242942). The primary endpoint was investigator-assessed progression-free survival (PFS); secondary endpoints included minimal residual disease (MRD) and overall survival. RNA sequencing of CD19-enriched blood was conducted for exploratory post-hoc analyses. After a median follow-up of 65.4 months, PFS is significantly superior for Ven-Obi compared to Clb-Obi (Hazard ratio [HR] 0.35 [95% CI 0.26-0.46], p < 0.0001). At 5 years after randomization, the estimated PFS rate is 62.6% after Ven-Obi and 27.0% after Clb-Obi. In both arms, MRD status at the end of therapy is associated with longer PFS. MRD + ( ≥ 10-4) status is associated with increased expression of multi-drug resistance gene ABCB1 (MDR1), whereas MRD6 (< 10-6) is associated with BCL2L11 (BIM) expression. Inflammatory response pathways are enriched in MRD+ patient solely in the Ven-Obi arm. These data indicate sustained long-term efficacy of fixed-duration Ven-Obi in patients with previously untreated CLL. The distinct transcriptomic profile of MRD+ status suggests possible biological vulnerabilities.
Assuntos
Leucemia Linfocítica Crônica de Células B , Humanos , Leucemia Linfocítica Crônica de Células B/tratamento farmacológico , Leucemia Linfocítica Crônica de Células B/genética , Transcriptoma , Protocolos de Quimioterapia Combinada Antineoplásica/efeitos adversos , Clorambucila/uso terapêutico , Clorambucila/efeitos adversos , Compostos Bicíclicos Heterocíclicos com Pontes/uso terapêuticoRESUMO
RIF1 is a multifunctional protein that plays key roles in the regulation of DNA processing. During repair of DNA double-strand breaks (DSBs), RIF1 functions in the 53BP1-Shieldin pathway that inhibits resection of DNA ends to modulate the cellular decision on which repair pathway to engage. Under conditions of replication stress, RIF1 protects nascent DNA at stalled replication forks from degradation by the DNA2 nuclease. How these RIF1 activities are regulated at the post-translational level has not yet been elucidated. Here, we identified a cluster of conserved ATM/ATR consensus SQ motifs within the intrinsically disordered region (IDR) of mouse RIF1 that are phosphorylated in proliferating B lymphocytes. We found that phosphorylation of the conserved IDR SQ cluster is dispensable for the inhibition of DSB resection by RIF1, but is essential to counteract DNA2-dependent degradation of nascent DNA at stalled replication forks. Therefore, our study identifies a key molecular feature that enables the genome-protective function of RIF1 during DNA replication stress.
Assuntos
Quebras de DNA de Cadeia Dupla , Replicação do DNA , Animais , DNA/metabolismo , Reparo do DNA , Camundongos , Fosforilação , Proteínas de Ligação a Telômeros/genética , Proteínas de Ligação a Telômeros/metabolismoRESUMO
The establishment of protective humoral immunity is dependent on the ability of mature B cells to undergo antibody gene diversification while adjusting to the physiological stressors induced by activation with the antigen. Mature B cells diversify their antibody genes by class switch recombination (CSR) and somatic hypermutation (SHM), which are both dependent on efficient induction of activation-induced cytidine deaminase (AID). Here, we identified PDGFA-associated protein 1 (Pdap1) as an essential regulator of cellular homeostasis in mature B cells. Pdap1 deficiency leads to sustained expression of the integrated stress response (ISR) effector activating transcription factor 4 (Atf4) and induction of the ISR transcriptional program, increased cell death, and defective AID expression. As a consequence, loss of Pdap1 reduces germinal center B cell formation and impairs CSR and SHM. Thus, Pdap1 protects mature B cells against chronic ISR activation and ensures efficient antibody diversification by promoting their survival and optimal function.
Assuntos
Diversidade de Anticorpos , Linfócitos B/metabolismo , Genes de Imunoglobulinas/genética , Animais , Linfócitos B/imunologia , Proteína 9 Associada à CRISPR , Sistemas CRISPR-Cas , Morte Celular , Diferenciação Celular , Linhagem Celular , Feminino , Imunofluorescência , Edição de Genes , Regulação da Expressão Gênica , Masculino , Camundongos , Camundongos Endogâmicos C57BL , Camundongos TransgênicosRESUMO
Class switch recombination (CSR) is a DNA recombination reaction that diversifies the effector functions of antibodies. CSR occurs via the formation and non-homologous end joining (NHEJ) repair of programmed DNA double-strand breaks (DSBs) at the immunoglobulin heavy chain locus. The DNA repair factors 53BP1 and Rif1 promote NHEJ and CSR by protecting DSBs against resection. However, to what extent repression of DNA end resection contributes to CSR is unknown. Here, we show that B lymphocytes devoid of 53BP1-Rif1-dependent DSB end protection activity undergo robust CSR. Inactivation of specific sets of phospho-sites within 53BP1 N-terminal SQ/TQ motifs abrogates Rif1 recruitment and inhibition of resection but only mildly reduces CSR. Furthermore, mutations within 53BP1 oligomerization domain abolish CSR without substantially affecting DNA end processing. Thus, inhibition of DNA end resection does not correlate with CSR efficiency, indicating that regulation of DSB processing is not a key determinant step in CSR.
Assuntos
Reparo do DNA por Junção de Extremidades , Switching de Imunoglobulina , Proteína 1 de Ligação à Proteína Supressora de Tumor p53/fisiologia , Animais , Linfócitos B/imunologia , Quebras de DNA de Cadeia Dupla , Feminino , Humanos , Masculino , Camundongos , Proteínas de Ligação a Telômeros/metabolismoRESUMO
Identifying high-confidence candidate genes that are causative for disease phenotypes, from the large lists of variations produced by high-throughput genomics, can be both time-consuming and costly. The development of novel computational approaches, utilizing existing biological knowledge for the prioritization of such candidate genes, can improve the efficiency and accuracy of the biomedical data analysis. It can also reduce the cost of such studies by avoiding experimental validations of irrelevant candidates. In this study, we address this challenge by proposing a novel gene prioritization approach that ranks promising candidate genes that are likely to be involved in a disease or phenotype under study. This algorithm is based on the modified conditional random field (CRF) model that simultaneously makes use of both gene annotations and gene interactions, while preserving their original representation. We validated our approach on two independent disease benchmark studies by ranking candidate genes using network and feature information. Our results showed both high area under the curve (AUC) value (0.86), and more importantly high partial AUC (pAUC) value (0.1296), and revealed higher accuracy and precision at the top predictions as compared with other well-performed gene prioritization tools, such as Endeavour (AUC-0.82, pAUC-0.083) and PINTA (AUC-0.76, pAUC-0.066). We were able to detect more target genes (9/18/19/27) on top positions (1/5/10/20) compared to Endeavour (3/11/14/23) and PINTA (6/10/13/18). To demonstrate its usability, we applied our method to a case study for the prediction of molecular mechanisms contributing to intellectual disability and autism. Our approach was able to correctly recover genes related to both disorders and provide suggestions for possible additional candidates based on their rankings and functional annotations.
Assuntos
Transtorno do Espectro Autista/genética , Estudos de Associação Genética/métodos , Deficiência Intelectual/genética , Área Sob a Curva , Redes Reguladoras de Genes , Predisposição Genética para Doença , Humanos , Modelos Genéticos , Anotação de Sequência Molecular , Fenótipo , Curva ROCRESUMO
An essential step in the discovery of molecular mechanisms contributing to disease phenotypes and efficient experimental planning is the development of weighted hypotheses that estimate the functional effects of sequence variants discovered by high-throughput genomics. With the increasing specialization of the bioinformatics resources, creating analytical workflows that seamlessly integrate data and bioinformatics tools developed by multiple groups becomes inevitable. Here we present a case study of a use of the distributed analytical environment integrating four complementary specialized resources, namely the Lynx platform, VISTA RViewer, the Developmental Brain Disorders Database (DBDB), and the RaptorX server, for the identification of high-confidence candidate genes contributing to pathogenesis of spina bifida. The analysis resulted in prediction and validation of deleterious mutations in the SLC19A placental transporter in mothers of the affected children that causes narrowing of the outlet channel and therefore leads to the reduced folate permeation rate. The described approach also enabled correct identification of several genes, previously shown to contribute to pathogenesis of spina bifida, and suggestion of additional genes for experimental validations. The study demonstrates that the seamless integration of bioinformatics resources enables fast and efficient prioritization and characterization of genomic factors and molecular networks contributing to the phenotypes of interest.
Assuntos
Mutação , Proteína Carregadora de Folato Reduzido/genética , Disrafismo Espinal/genética , Criança , Feminino , Ácido Fólico/metabolismo , Genômica/métodos , Humanos , Modelos Moleculares , Gravidez , Conformação Proteica , Proteína Carregadora de Folato Reduzido/química , Proteína Carregadora de Folato Reduzido/metabolismo , Software , Disrafismo Espinal/metabolismoRESUMO
Infantile spasms (ISS) are an epilepsy disorder frequently associated with severe developmental outcome and have diverse genetic etiologies. We ascertained 11 subjects with ISS and novel copy number variants (CNVs) and combined these with a new cohort with deletion 1p36 and ISS, and additional published patients with ISS and other chromosomal abnormalities. Using bioinformatics tools, we analyzed the gene content of these CNVs for enrichment in pathways of pathogenesis. Several important findings emerged. First, the gene content was enriched for the gene regulatory network involved in ventral forebrain development. Second, genes in pathways of synaptic function were overrepresented, significantly those involved in synaptic vesicle transport. Evidence also suggested roles for GABAergic synapses and the postsynaptic density. Third, we confirm the association of ISS with duplication of 14q12 and maternally inherited duplication of 15q11q13, and report the association with duplication of 21q21. We also present a patient with ISS and deletion 7q11.3 not involving MAGI2. Finally, we provide evidence that ISS in deletion 1p36 may be associated with deletion of KLHL17 and expand the epilepsy phenotype in that syndrome to include early infantile epileptic encephalopathy. Several of the identified pathways share functional links, and abnormalities of forebrain synaptic growth and function may form a common biologic mechanism underlying both ISS and autism. This study demonstrates a novel approach to the study of gene content in subjects with ISS and copy number variation, and contributes further evidence to support specific pathways of pathogenesis.