Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 40
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
Hum Mutat ; 40(9): 1612-1622, 2019 09.
Artigo em Inglês | MEDLINE | ID: mdl-31241222

RESUMO

The availability of disease-specific genomic data is critical for developing new computational methods that predict the pathogenicity of human variants and advance the field of precision medicine. However, the lack of gold standards to properly train and benchmark such methods is one of the greatest challenges in the field. In response to this challenge, the scientific community is invited to participate in the Critical Assessment for Genome Interpretation (CAGI), where unpublished disease variants are available for classification by in silico methods. As part of the CAGI-5 challenge, we evaluated the performance of 18 submissions and three additional methods in predicting the pathogenicity of single nucleotide variants (SNVs) in checkpoint kinase 2 (CHEK2) for cases of breast cancer in Hispanic females. As part of the assessment, the efficacy of the analysis method and the setup of the challenge were also considered. The results indicated that though the challenge could benefit from additional participant data, the combined generalized linear model analysis and odds of pathogenicity analysis provided a framework to evaluate the methods submitted for SNV pathogenicity identification and for comparison to other available methods. The outcome of this challenge and the approaches used can help guide further advancements in identifying SNV-disease relationships.


Assuntos
Neoplasias da Mama/genética , Quinase do Ponto de Checagem 2/genética , Biologia Computacional/métodos , Hispânico ou Latino/genética , Polimorfismo de Nucleotídeo Único , Adulto , Idoso , Neoplasias da Mama/etnologia , Estudos de Casos e Controles , Simulação por Computador , Feminino , Predisposição Genética para Doença , Humanos , Modelos Lineares , Pessoa de Meia-Idade , Estados Unidos/etnologia , Sequenciamento do Exoma
2.
PLoS Comput Biol ; 14(11): e1006494, 2018 11.
Artigo em Inglês | MEDLINE | ID: mdl-30408027

RESUMO

Research in computational biology has given rise to a vast number of methods developed to solve scientific problems. For areas in which many approaches exist, researchers have a hard time deciding which tool to select to address a scientific challenge, as essentially all publications introducing a new method will claim better performance than all others. Not all of these claims can be correct. Equally, for this same reason, developers struggle to demonstrate convincingly that they created a new and superior algorithm or implementation. Moreover, the developer community often has difficulty discerning which new approaches constitute true scientific advances for the field. The obvious answer to this conundrum is to develop benchmarks-meaning standard points of reference that facilitate evaluating the performance of different tools-allowing both users and developers to compare multiple tools in an unbiased fashion.


Assuntos
Biologia Computacional/métodos , Algoritmos , Área Sob a Curva , Publicações
3.
PLoS Comput Biol ; 13(4): e1005428, 2017 04.
Artigo em Inglês | MEDLINE | ID: mdl-28426665

RESUMO

The fight against cancer is hindered by its highly heterogeneous nature. Genome-wide sequencing studies have shown that individual malignancies contain many mutations that range from those commonly found in tumor genomes to rare somatic variants present only in a small fraction of lesions. Such rare somatic variants dominate the landscape of genomic mutations in cancer, yet efforts to correlate somatic mutations found in one or few individuals with functional roles have been largely unsuccessful. Traditional methods for identifying somatic variants that drive cancer are 'gene-centric' in that they consider only somatic variants within a particular gene and make no comparison to other similar genes in the same family that may play a similar role in cancer. In this work, we present oncodomain hotspots, a new 'domain-centric' method for identifying clusters of somatic mutations across entire gene families using protein domain models. Our analysis confirms that our approach creates a framework for leveraging structural and functional information encapsulated by protein domains into the analysis of somatic variants in cancer, enabling the assessment of even rare somatic variants by comparison to similar genes. Our results reveal a vast landscape of somatic variants that act at the level of domain families altering pathways known to be involved with cancer such as protein phosphorylation, signaling, gene regulation, and cell metabolism. Due to oncodomain hotspots' unique ability to assess rare variants, we expect our method to become an important tool for the analysis of sequenced tumor genomes, complementing existing methods.


Assuntos
Biologia Computacional/métodos , Mutação/genética , Neoplasias/genética , Proteínas Oncogênicas/genética , Domínios Proteicos/genética , Bases de Dados de Proteínas , Fator de Crescimento Epidérmico/genética , Humanos , Proteínas Mitocondriais/genética , Modelos Moleculares , Proteínas Oncogênicas/classificação , Ligação Proteica , Proteínas ras/genética
4.
Hum Mutat ; 37(11): 1137-1143, 2016 11.
Artigo em Inglês | MEDLINE | ID: mdl-27406314

RESUMO

In silico methods for detecting functionally relevant genetic variants are important for identifying genetic markers of human inherited disease. Much research has focused on protein-coding variants since coding regions have well-defined physicochemical and functional properties. However, many bioinformatics tools are not applicable to variants outside coding regions. Here, we increase the classification performance of our regulatory single-nucleotide variant predictor (RSVP) for variants that cause regulatory abnormalities from an AUC of 0.90-0.97 by incorporating genomic regions identified by the ENCODE project into RSVP. RSVP is comparable to a recently published tool, Genome-Wide Annotation of Variants (GWAVA); both RSVP and GWAVA perform better on regulatory variants than a traditional variant predictor, combined annotation-dependent depletion (CADD). However, our method outperforms GWAVA on variants located at similar distances to the transcription start site as the positive set (AUC: 0.96) as compared with GWAVA (AUC: 0.71). Much of this disparity is due to RSVP's incorporation of features pertaining to the nearest gene (expression, GO terms, etc.), which are not included in GWAVA. Our findings hold out the promise of a framework for the assessment of all functional regulatory variants, providing a means to predict which rare or de novo variants are of pathogenic significance.


Assuntos
Biologia Computacional/métodos , Genômica/métodos , Polimorfismo de Nucleotídeo Único , Simulação por Computador , Predisposição Genética para Doença , Genoma Humano , Humanos , Aprendizado de Máquina
5.
J Biol Chem ; 290(35): 21642-51, 2015 Aug 28.
Artigo em Inglês | MEDLINE | ID: mdl-26160172

RESUMO

Mac-1 exhibits a unique inhibitory activity toward IL-13-induced JAK/STAT activation and thereby regulates macrophage to foam cell transformation. However, the underlying molecular mechanism is unknown. In this study, we report the identification of IL-13Rα1, a component of the IL-13 receptor (IL-13R), as a novel ligand of integrin Mac-1, using a co-evolution-based algorithm. Biochemical analyses demonstrated that recombinant IL-13Rα1 binds Mac-1 in a purified system and supports Mac-1-mediated cell adhesion. Co-immunoprecipitation experiments revealed that endogenous Mac-1 forms a complex with IL-13Rα1 in solution, and confocal fluorescence microscopy demonstrated that these two receptors co-localize with each other on the surface of macrophages. Moreover, we found that genetic inactivation of Mac-1 promotes IL-13-induced JAK/STAT activation in macrophages, resulting in enhanced polarization along the alternative activation pathway. Importantly, we observed that Mac-1(-/-) macrophages exhibit increased expression of foam cell differentiation markers including 15-lipoxygenase and lectin-type oxidized LDL receptor-1 both in vitro and in vivo. Indeed, we found that Mac-1(-/-)LDLR(-/-) mice develop significantly more foam cells than control LDLR(-/-) mice, using an in vivo model of foam cell formation. Together, our data establish for the first time a molecular mechanism by which Mac-1 regulates the signaling activity of IL-13 in macrophages. This newly identified IL-13Rα1/Mac-1-dependent pathway may offer novel targets for therapeutic intervention in the future.


Assuntos
Subunidade alfa1 de Receptor de Interleucina-13/metabolismo , Interleucina-13/metabolismo , Antígeno de Macrófago 1/metabolismo , Macrófagos/metabolismo , Animais , Biomarcadores/metabolismo , Diferenciação Celular , Membrana Celular/metabolismo , Polaridade Celular , Evolução Molecular , Células Espumosas/citologia , Células Espumosas/metabolismo , Inativação Gênica , Janus Quinases/metabolismo , Macrófagos/citologia , Camundongos Endogâmicos C57BL , Ligação Proteica , Proteínas Recombinantes/metabolismo , Fatores de Transcrição STAT/metabolismo , Soluções
6.
Brief Bioinform ; 13(4): 495-512, 2012 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-22247263

RESUMO

An international consortium released the first draft sequence of the human genome 10 years ago. Although the analysis of this data has suggested the genetic underpinnings of many diseases, we have not yet been able to fully quantify the relationship between genotype and phenotype. Thus, a major current effort of the scientific community focuses on evaluating individual predispositions to specific phenotypic traits given their genetic backgrounds. Many resources aim to identify and annotate the specific genes responsible for the observed phenotypes. Some of these use intra-species genetic variability as a means for better understanding this relationship. In addition, several online resources are now dedicated to collecting single nucleotide variants and other types of variants, and annotating their functional effects and associations with phenotypic traits. This information has enabled researchers to develop bioinformatics tools to analyze the rapidly increasing amount of newly extracted variation data and to predict the effect of uncharacterized variants. In this work, we review the most important developments in the field--the databases and bioinformatics tools that will be of utmost importance in our concerted effort to interpret the human variome.


Assuntos
Biologia Computacional/métodos , Variação Genética , Genoma , Bases de Dados Genéticas , Genótipo , Projeto Genoma Humano , Humanos , Fenótipo
7.
BMC Genomics ; 14 Suppl 3: S5, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-23819456

RESUMO

BACKGROUND: The body of disease mutations with known phenotypic relevance continues to increase and is expected to do so even faster with the advent of new experimental techniques such as whole-genome sequencing coupled with disease association studies. However, genomic association studies are limited by the molecular complexity of the phenotype being studied and the population size needed to have adequate statistical power. One way to circumvent this problem, which is critical for the study of rare diseases, is to study the molecular patterns emerging from functional studies of existing disease mutations. Current gene-centric analyses to study mutations in coding regions are limited by their inability to account for the functional modularity of the protein. Previous studies of the functional patterns of known human disease mutations have shown a significant tendency to cluster at protein domain positions, namely position-based domain hotspots of disease mutations. However, the limited number of known disease mutations remains the main factor hindering the advancement of mutation studies at a functional level. In this paper, we address this problem by incorporating mutations known to be disruptive of phenotypes in other species. Focusing on two evolutionarily distant organisms, human and yeast, we describe the first inter-species analysis of mutations of phenotypic relevance at the protein domain level. RESULTS: The results of this analysis reveal that phenotypic mutations from yeast cluster at specific positions on protein domains, a characteristic previously revealed to be displayed by human disease mutations. We found over one hundred domain hotspots in yeast with approximately 50% in the exact same domain position as known human disease mutations. CONCLUSIONS: We describe an analysis using protein domains as a framework for transferring functional information by studying domain hotspots in human and yeast and relating phenotypic changes in yeast to diseases in human. This first-of-a-kind study of phenotypically relevant yeast mutations in relation to human disease mutations demonstrates the utility of a multi-species analysis for advancing the understanding of the relationship between genetic mutations and phenotypic changes at the organismal level.


Assuntos
Biologia Computacional/métodos , Evolução Molecular , Doenças Genéticas Inatas/genética , Mutação/genética , Fenótipo , Humanos , Estrutura Terciária de Proteína/genética , Especificidade da Espécie , Leveduras
8.
PLoS Comput Biol ; 8(12): e1002819, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-23300410

RESUMO

Proteins do not function in isolation; it is their interactions with one another and also with other molecules (e.g. DNA, RNA) that mediate metabolic and signaling pathways, cellular processes, and organismal systems. Due to their central role in biological function, protein interactions also control the mechanisms leading to healthy and diseased states in organisms. Diseases are often caused by mutations affecting the binding interface or leading to biochemically dysfunctional allosteric changes in proteins. Therefore, protein interaction networks can elucidate the molecular basis of disease, which in turn can inform methods for prevention, diagnosis, and treatment. In this chapter, we will describe the computational approaches to predict and map networks of protein interactions and briefly review the experimental methods to detect protein interactions. We will describe the application of protein interaction networks as a translational approach to the study of human disease and evaluate the challenges faced by these approaches.


Assuntos
Doença , Ligação Proteica , Humanos , Mutação
9.
Biomedicines ; 11(4)2023 Apr 19.
Artigo em Inglês | MEDLINE | ID: mdl-37189830

RESUMO

The extracellular matrix (ECM) is earning an increasingly relevant role in many disease states and aging. The analysis of these disease states is possible with the GWAS and PheWAS methodologies, and through our analysis, we aimed to explore the relationships between polymorphisms in the compendium of ECM genes (i.e., matrisome genes) in various disease states. A significant contribution on the part of ECM polymorphisms is evident in various types of disease, particularly those in the core-matrisome genes. Our results confirm previous links to connective-tissue disorders but also unearth new and underexplored relationships with neurological, psychiatric, and age-related disease states. Through our analysis of the drug indications for gene-disease relationships, we identify numerous targets that may be repurposed for age-related pathologies. The identification of ECM polymorphisms and their contributions to disease will play an integral role in future therapeutic developments, drug repurposing, precision medicine, and personalized care.

10.
BMC Genomics ; 13 Suppl 4: S9, 2012 Jun 18.
Artigo em Inglês | MEDLINE | ID: mdl-22759657

RESUMO

BACKGROUND: Large-scale tumor sequencing projects are now underway to identify genetic mutations that drive tumor initiation and development. Most studies take a gene-based approach to identifying driver mutations, highlighting genes mutated in a large percentage of tumor samples as those likely to contain driver mutations. However, this gene-based approach usually does not consider the position of the mutation within the gene or the functional context the position of the mutation provides. Here we introduce a novel method for mapping mutations to distinct protein domains, not just individual genes, in which they occur, thus providing the functional context for how the mutation contributes to disease. Furthermore, aggregating mutations from all genes containing a specific protein domain enables the identification of mutations that are rare at the gene level, but that occur frequently within the specified domain. These highly mutated domains potentially reveal disruptions of protein function necessary for cancer development. RESULTS: We mapped somatic mutations from the protein coding regions of 100 colon adenocarcinoma tumor samples to the genes and protein domains in which they occurred, and constructed topographical maps to depict the "mutational landscapes" of gene and domain mutation frequencies. We found significant mutation frequency in a number of genes previously known to be somatically mutated in colon cancer patients including APC, TP53 and KRAS. In addition, we found significant mutation frequency within specific domains located in these genes, as well as within other domains contained in genes having low mutation frequencies. These domain "peaks" were enriched with functions important to cancer development including kinase activity, DNA binding and repair, and signal transduction. CONCLUSIONS: Using our method to create the domain landscapes of mutations in colon cancer, we were able to identify somatic mutations with high potential to drive cancer development. Interestingly, the majority of the genes involved have a low mutation frequency. Therefore, the method shows good potential for identifying rare driver mutations in current, large-scale tumor sequencing projects. In addition, mapping mutations to specific domains provides the necessary functional context for understanding how the mutations contribute to the disease, and may reveal novel or more refined gene and domain target regions for drug development.


Assuntos
Biologia Computacional/métodos , Neoplasias/genética , Neoplasias do Colo/genética , Humanos , Mutação/genética
11.
Brief Bioinform ; 11(1): 96-110, 2010 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-20007728

RESUMO

Over a 100 years ago, William Bateson provided, through his observations of the transmission of alkaptonuria in first cousin offspring, evidence of the application of Mendelian genetics to certain human traits and diseases. His work was corroborated by Archibald Garrod (Archibald AE. The incidence of alkaptonuria: a study in chemical individuality. Lancert 1902;ii:1616-20) and William Farabee (Farabee WC. Inheritance of digital malformations in man. In: Papers of the Peabody Museum of American Archaeology and Ethnology. Cambridge, Mass: Harvard University, 1905; 65-78), who recorded the familial tendencies of inheritance of malformations of human hands and feet. These were the pioneers of the hunt for disease genes that would continue through the century and result in the discovery of hundreds of genes that can be associated with different diseases. Despite many ground-breaking discoveries during the last century, we are far from having a complete understanding of the intricate network of molecular processes involved in diseases, and we are still searching for the cures for most complex diseases. In the last few years, new genome sequencing and other high-throughput experimental techniques have generated vast amounts of molecular and clinical data that contain crucial information with the potential of leading to the next major biomedical discoveries. The need to mine, visualize and integrate these data has motivated the development of several informatics approaches that can broadly be grouped in the research area of 'translational bioinformatics'. This review highlights the latest advances in the field of translational bioinformatics, focusing on the advances of computational techniques to search for and classify disease genes.


Assuntos
Biologia Computacional , Doenças Genéticas Inatas/genética , Predisposição Genética para Doença , Humanos
12.
Bioinformatics ; 27(3): 408-15, 2011 Feb 01.
Artigo em Inglês | MEDLINE | ID: mdl-21138947

RESUMO

MOTIVATION: A major goal of biomedical research in personalized medicine is to find relationships between mutations and their corresponding disease phenotypes. However, most of the disease-related mutational data are currently buried in the biomedical literature in textual form and lack the necessary structure to allow easy retrieval and visualization. We introduce a high-throughput computational method for the identification of relevant disease mutations in PubMed abstracts applied to prostate (PCa) and breast cancer (BCa) mutations. RESULTS: We developed the extractor of mutations (EMU) tool to identify mutations and their associated genes. We benchmarked EMU against MutationFinder--a tool to extract point mutations from text. Our results show that both methods achieve comparable performance on two manually curated datasets. We also benchmarked EMU's performance for extracting the complete mutational information and phenotype. Remarkably, we show that one of the steps in our approach, a filter based on sequence analysis, increases the precision for that task from 0.34 to 0.59 (PCa) and from 0.39 to 0.61 (BCa). We also show that this high-throughput approach can be extended to other diseases. DISCUSSION: Our method improves the current status of disease-mutation databases by significantly increasing the number of annotated mutations. We found 51 and 128 mutations manually verified to be related to PCa and Bca, respectively, that are not currently annotated for these cancer types in the OMIM or Swiss-Prot databases. EMU's retrieval performance represents a 2-fold improvement in the number of annotated mutations for PCa and BCa. We further show that our method can benefit from full-text analysis once there is an increase in Open Access availability of full-text articles. AVAILABILITY: Freely available at: http://bioinf.umbc.edu/EMU/ftp.


Assuntos
Algoritmos , Biologia Computacional/métodos , Bases de Dados Factuais , Armazenamento e Recuperação da Informação/métodos , Mutação Puntual/genética , Publicações , Humanos , Neoplasias/genética , Reprodutibilidade dos Testes , Software
13.
J Biomed Inform ; 45(5): 835-41, 2012 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-22683993

RESUMO

OBJECTIVES: To explore the notion of mutation-centric pharmacogenomic relation extraction and to evaluate our approach against reference pharmacogenomic relations. METHODS: From a corpus of MEDLINE abstracts relevant to genetic variation, we identify co-occurrences between drug mentions extracted using MetaMap and RxNorm, and genetic variants extracted by EMU. The recall of our approach is evaluated against reference relations curated manually in PharmGKB. We also reviewed a random sample of 180 relations in order to evaluate its precision. RESULTS: One crucial aspect of our strategy is the use of biological knowledge for identifying specific genetic variants in text, not simply gene mentions. On the 104 reference abstracts from PharmGKB, the recall of our mutation-centric approach is 33-46%. Applied to 282,000 abstracts from MEDLINE, our approach identifies pharmacogenomic relations in 4534 abstracts, with a precision of 65%. CONCLUSIONS: Compared to a relation-centric approach, our mutation-centric approach shows similar recall, but slightly lower precision. We show that both approaches have limited overlap in their results, but are complementary and can be used in combination. Rather than a solution for the automatic curation of pharmacogenomic knowledge, we see these high-throughput approaches as tools to assist biocurators in the identification of pharmacogenomic relations of interest from the published literature. This investigation also identified three challenging aspects of the extraction of pharmacogenomic relations, namely processing full-text articles, sequence validation of DNA variants and resolution of genetic variants to reference databases, such as dbSNP.


Assuntos
Mineração de Dados/métodos , Bases de Dados Genéticas , Mutação , Farmacogenética/métodos , Humanos , Bases de Conhecimento , MEDLINE
14.
Bioinformatics ; 26(19): 2458-9, 2010 Oct 01.
Artigo em Inglês | MEDLINE | ID: mdl-20685956

RESUMO

UNLABELLED: Domain mapping of disease mutations (DMDM) is a database in which each disease mutation can be displayed by its gene, protein or domain location. DMDM provides a unique domain-level view where all human coding mutations are mapped on the protein domain. To build DMDM, all human proteins were aligned to a database of conserved protein domains using a Hidden Markov Model-based sequence alignment tool (HMMer). The resulting protein-domain alignments were used to provide a domain location for all available human disease mutations and polymorphisms. The number of disease mutations and polymorphisms in each domain position are displayed alongside other relevant functional information (e.g. the binding and catalytic activity of the site and the conservation of that domain location). DMDM's protein domain view highlights molecular relationships among mutations from different diseases that might not be clearly observed with traditional gene-centric visualization tools. AVAILABILITY: Freely available at http://bioinf.umbc.edu/dmdm.


Assuntos
Biologia Computacional/métodos , Bases de Dados de Proteínas , Doença/genética , Mutação , Estrutura Terciária de Proteína/genética , Proteínas/genética , Humanos , Polimorfismo Genético , Alinhamento de Sequência
15.
Bioinformatics ; 26(14): 1708-13, 2010 Jul 15.
Artigo em Inglês | MEDLINE | ID: mdl-20505002

RESUMO

MOTIVATION: Since database retrieval is a fundamental operation, the measurement of retrieval efficacy is critical to progress in bioinformatics. This article points out some issues with current methods of measuring retrieval efficacy and suggests some improvements. In particular, many studies have used the pooled receiver operating characteristic for n irrelevant records (ROC(n)) score, the area under the ROC curve (AUC) of a 'pooled' ROC curve, truncated at n irrelevant records. Unfortunately, the pooled ROC(n) score does not faithfully reflect actual usage of retrieval algorithms. Additionally, a pooled ROC(n) score can be very sensitive to retrieval results from as little as a single query. METHODS: To replace the pooled ROC(n) score, we propose the Threshold Average Precision (TAP-k), a measure closely related to the well-known average precision in information retrieval, but reflecting the usage of E-values in bioinformatics. Furthermore, in addition to conditions previously given in the literature, we introduce three new criteria that an ideal measure of retrieval efficacy should satisfy. RESULTS: PSI-BLAST, GLOBAL, HMMER and RPS-BLAST provided examples of using the TAP-k and pooled ROC(n) scores to evaluate sequence retrieval algorithms. In particular, compelling examples using real data highlight the drawbacks of the pooled ROC(n) score, showing that it can produce evaluations skewing far from intuitive expectations. In contrast, the TAP-k satisfies most of the criteria desired in an ideal measure of retrieval efficacy. AVAILABILITY AND IMPLEMENTATION: The TAP-k web server and downloadable Perl script are freely available at http://www.ncbi.nlm.nih.gov/CBBresearch/Spouge/html.ncbi/tap/


Assuntos
Biologia Computacional/métodos , Armazenamento e Recuperação da Informação/métodos , Software , Bases de Dados Factuais , Internet , Curva ROC
16.
Hum Mutat ; 31(3): 335-46, 2010 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-20052762

RESUMO

An important challenge in translational bioinformatics is to understand how genetic variation gives rise to molecular changes at the protein level that can precipitate both monogenic and complex disease. To this end, we compiled datasets of human disease-associated amino acid substitutions (AAS) in the contexts of inherited monogenic disease, complex disease, functional polymorphisms with no known disease association, and somatic mutations in cancer, and compared them with respect to predicted functional sites in proteins. Using the sequence homology-based tool SIFT to estimate the proportion of deleterious AAS in each dataset, only complex disease AAS were found to be indistinguishable from neutral polymorphic AAS. Investigation of monogenic disease AAS predicted to be nondeleterious by SIFT were characterized by a significant enrichment for inherited AAS within solvent accessible residues, regions of intrinsic protein disorder, and an association with the loss or gain of various posttranslational modifications. Sites of structural and/or functional interest were therefore surmised to constitute useful additional features with which to identify the molecular disruptions caused by deleterious AAS. A range of bioinformatic tools, designed to predict structural and functional sites in protein sequences, were then employed to demonstrate that intrinsic biases exist in terms of the distribution of different types of human AAS with respect to specific structural, functional and pathological features. Our Web tool, designed to potentiate the functional profiling of novel AAS, has been made available at http://profile.mutdb.org/.


Assuntos
Biologia Computacional/métodos , Regulação Neoplásica da Expressão Gênica , Neoplasias/genética , Polimorfismo Genético , Alelos , Aminoácidos/química , Aminoácidos/genética , Perfilação da Expressão Gênica , Regulação da Expressão Gênica , Variação Genética , Glicosilação , Humanos , Internet , Mutação de Sentido Incorreto , Fosforilação , Análise de Sequência de Proteína
17.
Nucleic Acids Res ; 36(Database issue): D815-9, 2008 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-17827212

RESUMO

Understanding how genetic variation affects the molecular function of gene products is an emergent area of bioinformatic research. Here, we present updates to MutDB (http://www.mutdb.org), a tool aiming to aid bioinformatic studies by integrating publicly available databases of human genetic variation with molecular features and clinical phenotype data. MutDB, first developed in 2002, integrates annotated SNPs in dbSNP and amino acid substitutions in Swiss-Prot with protein structural information, links to scores that predict functional disruption and other useful annotations. Though these functional annotations are mainly focused on nonsynonymous SNPs, some information on other SNP types included in dbSNP is also provided. Additionally, we have developed a new functionality that facilitates KEGG pathway visualization of genes containing SNPs and a SNP query tool for visualizing and exporting sets of SNPs that share selected features based on certain filters.


Assuntos
Substituição de Aminoácidos , Bases de Dados Genéticas , Polimorfismo de Nucleotídeo Único , Doenças Genéticas Inatas/genética , Humanos , Internet , Mutação , Proteínas/química , Proteínas/genética , Software , Interface Usuário-Computador
18.
Syst Med (New Rochelle) ; 3(1): 22-35, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-32226924

RESUMO

The First International Conference in Systems and Network Medicine gathered together 200 global thought leaders, scientists, clinicians, academicians, industry and government experts, medical and graduate students, postdoctoral scholars and policymakers. Held at Georgetown University Conference Center in Washington D.C. on September 11-13, 2019, the event featured a day of pre-conference lectures and hands-on bioinformatic computational workshops followed by two days of deep and diverse scientific talks, panel discussions with eminent thought leaders, and scientific poster presentations. Topics ranged from: Systems and Network Medicine in Clinical Practice; the role of -omics technologies in Health Care; the role of Education and Ethics in Clinical Practice, Systems Thinking, and Rare Diseases; and the role of Artificial Intelligence in Medicine. The conference served as a unique nexus for interdisciplinary discovery and dialogue and fostered formation of new insights and possibilities for health care systems advances.

19.
Brief Bioinform ; 8(5): 333-46, 2007 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-17638813

RESUMO

The genomic era has been characterised by vast amounts of data from diverse sources, creating a need for new tools to extract biologically meaningful information. Bioinformatics is, for the most part, responding to that need. The sparseness of the genomic data associated with diseases, however, creates a new challenge. Understanding the complex interplay between genes and proteins requires integration of data from a wide variety of sources, i.e. gene expression, genetic linkage, protein interaction, and protein structure among others. Thus, computational tools have become critical for the integration, representation and visualization of heterogeneous biomedical data. Furthermore, several bioinformatics methods have been developed to formulate predictions about the functional role of genes and proteins, including their role in diseases. After an introduction to the complex interplay between proteins and genetic diseases, this review explores recent approaches to the understanding of the mechanisms of disease at the molecular level. Finally, because most known mechanisms leading to disease involve some form of protein interaction, this review focuses on the recent methodologies for understanding diseases through their underlying protein interactions. Recent contributions from genetics, protein structure and protein interaction network analyses to the understanding of diseases are discussed here.


Assuntos
Biomarcadores , Biologia Computacional/métodos , Predisposição Genética para Doença/genética , Testes Genéticos/métodos , Mapeamento de Interação de Proteínas/métodos , Proteínas/genética , Proteínas/metabolismo , Humanos
20.
Bioinformatics ; 24(16): i241-7, 2008 Aug 15.
Artigo em Inglês | MEDLINE | ID: mdl-18689832

RESUMO

MOTIVATION: Coding-region mutations in human genes are responsible for a diverse spectrum of diseases and phenotypes. Among lesions that have been studied extensively, there are insights into several of the biochemical functions disrupted by disease-causing mutations. Currently, there are more than 60 000 coding-region mutations associated with inherited disease catalogued in the Human Gene Mutation Database (HGMD, August 2007) and more than 70 000 polymorphic amino acid substitutions recorded in dbSNP (dbSNP, build 127). Understanding the mechanism and contribution these variants make to a clinical phenotype is a formidable problem. RESULTS: In this study, we investigate the role of phosphorylation in somatic cancer mutations and inherited diseases. Somatic cancer mutation datasets were shown to have a significant enrichment for mutations that cause gain or loss of phosphorylation when compared to our control datasets (putatively neutral nsSNPs and random amino acid substitutions). Of the somatic cancer mutations, those in kinase genes represent the most enriched set of mutations that disrupt phosphorylation sites, suggesting phosphorylation target site mutation is an active cause of phosphorylation deregulation. Overall, this evidence suggests both gain and loss of a phosphorylation site in a target protein may be important features for predicting cancer-causing mutations and may represent a molecular cause of disease for a number of inherited and somatic mutations.


Assuntos
Análise Mutacional de DNA/métodos , Proteínas de Neoplasias/genética , Neoplasias/genética , Fosfotransferases/genética , Polimorfismo de Nucleotídeo Único/genética , Variação Genética/genética , Humanos , Fosforilação
SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa