Pesquisa | Biblioteca Virtual em Saúde

1.

Folding the human proteome using BioNeMo: A fused dataset of structural models for machine learning purposes.

Hetmann, Michael; Parigger, Lena; Sirelkhatim, Hassan; Stern, Abraham; Krassnigg, Andreas; Gruber, Karl; Steinkellner, Georg; Ruau, David; Gruber, Christian C.

Sci Data ; 11(1): 591, 2024 Jun 06.

Artigo em Inglês | MEDLINE | ID: mdl-38844754

RESUMO

Human proteins are crucial players in both health and disease. Understanding their molecular landscape is a central topic in biological research. Here, we present an extensive dataset of predicted protein structures for 42,042 distinct human proteins, including splicing variants, derived from the UniProt reference proteome UP000005640. To ensure high quality and comparability, the dataset was generated by combining state-of-the-art modeling-tools AlphaFold 2, OpenFold, and ESMFold, provided within NVIDIA's BioNeMo platform, as well as homology modeling using Innophore's CavitomiX platform. Our dataset is offered in both unedited and edited formats for diverse research requirements. The unedited version contains structures as generated by the different prediction methods, whereas the edited version contains refinements, including a dataset of structures without low prediction-confidence regions and structures in complex with predicted ligands based on homologs in the PDB. We are confident that this dataset represents the most comprehensive collection of human protein structures available today, facilitating diverse applications such as structure-based drug design and the prediction of protein function and interactions.

Assuntos

Aprendizado de Máquina , Proteoma , Humanos , Dobramento de Proteína , Bases de Dados de Proteínas , Conformação Proteica , Modelos Moleculares

2.

A graphical model approach visualizes regulatory relationships between genome-wide transcription factor binding profiles.

Ng, Felicia S L; Ruau, David; Wernisch, Lorenz; Göttgens, Berthold.

Brief Bioinform ; 19(1): 162-173, 2018 01 01.

Artigo em Inglês | MEDLINE | ID: mdl-27780826

RESUMO

Integrated analysis of multiple genome-wide transcription factor (TF)-binding profiles will be vital to advance our understanding of the global impact of TF binding. However, existing methods for measuring similarity in large numbers of chromatin immunoprecipitation assays with sequencing (ChIP-seq), such as correlation, mutual information or enrichment analysis, are limited in their ability to display functionally relevant TF relationships. In this study, we propose the use of graphical models to determine conditional independence between TFs and showed that network visualization provides a promising alternative to distinguish 'direct' versus 'indirect' TF interactions. We applied four algorithms to measure 'direct' dependence to a compendium of 367 mouse haematopoietic TF ChIP-seq samples and obtained a consensus network known as a 'TF association network' where edges in the network corresponded to likely causal pairwise relationships between TFs. The 'TF association network' illustrates the role of TFs in developmental pathways, is reminiscent of combinatorial TF regulation, corresponds to known protein-protein interactions and indicates substantial TF-binding reorganization in leukemic cell types. With the rapid increase in TF ChIP-Seq data sets, the approach presented here will be a powerful tool to study transcriptional programmes across a wide range of biological systems.

Assuntos

Gráficos por Computador , Regulação da Expressão Gênica , Genoma , Células-Tronco Hematopoéticas/metabolismo , Leucemia/genética , Fatores de Transcrição/metabolismo , Algoritmos , Animais , Sítios de Ligação , Células Cultivadas , Imunoprecipitação da Cromatina , Biologia Computacional/métodos , Células-Tronco Hematopoéticas/citologia , Leucemia/patologia , Camundongos , Modelos Estatísticos , Ligação Proteica , Fatores de Transcrição/genética

3.

DNMT3A Loss Drives Enhancer Hypomethylation in FLT3-ITD-Associated Leukemias.

Yang, Liubin; Rodriguez, Benjamin; Mayle, Allison; Park, Hyun Jung; Lin, Xueqiu; Luo, Min; Jeong, Mira; Curry, Choladda V; Kim, Sang-Bae; Ruau, David; Zhang, Xiaotian; Zhou, Ting; Zhou, Michael; Rebel, Vivienne I; Challen, Grant A; Göttgens, Berthold; Lee, Ju-Seog; Rau, Rachel; Li, Wei; Goodell, Margaret A.

Cancer Cell ; 30(2): 363-365, 2016 08 08.

Artigo em Inglês | MEDLINE | ID: mdl-27505680

4.

DNMT3A Loss Drives Enhancer Hypomethylation in FLT3-ITD-Associated Leukemias.

Yang, Liubin; Rodriguez, Benjamin; Mayle, Allison; Park, Hyun Jung; Lin, Xueqiu; Luo, Min; Jeong, Mira; Curry, Choladda V; Kim, Sang-Bae; Ruau, David; Zhang, Xiaotian; Zhou, Ting; Zhou, Michael; Rebel, Vivienne I; Challen, Grant A; Gottgens, Berthold; Lee, Ju-Seog; Rau, Rachel; Li, Wei; Goodell, Margaret A.

Cancer Cell ; 29(6): 922-934, 2016 06 13.

Artigo em Inglês | MEDLINE | ID: mdl-27300438

RESUMO

DNMT3A, the gene encoding the de novo DNA methyltransferase 3A, is among the most frequently mutated genes in hematologic malignancies. However, the mechanisms through which DNMT3A normally suppresses malignancy development are unknown. Here, we show that DNMT3A loss synergizes with the FLT3 internal tandem duplication in a dose-influenced fashion to generate rapid lethal lymphoid or myeloid leukemias similar to their human counterparts. Loss of DNMT3A leads to reduced DNA methylation, predominantly at hematopoietic enhancer regions in both mouse and human samples. Myeloid and lymphoid diseases arise from transformed murine hematopoietic stem cells. Broadly, our findings support a role for DNMT3A as a guardian of the epigenetic state at enhancer regions, critical for inhibition of leukemic transformation.

Assuntos

DNA (Citosina-5-)-Metiltransferases/genética , Metilação de DNA , Leucemia/genética , Tirosina Quinase 3 Semelhante a fms/genética , Animais , DNA Metiltransferase 3A , Elementos Facilitadores Genéticos , Epigênese Genética , Regulação Neoplásica da Expressão Gênica , Técnicas de Inativação de Genes , Humanos , Camundongos , Mutação , Neoplasias Experimentais

5.

Data Science Solution to Event Prediction in Outsourced Clinical Trial Models.

Dalevi, Daniel; Lovick, Susan; Mann, Helen; Metcalfe, Paul D; Spencer, Stuart; Hollis, Sally; Ruau, David.

Stud Health Technol Inform ; 216: 1065, 2015.

Artigo em Inglês | MEDLINE | ID: mdl-26262364

RESUMO

Late phase clinical trials are regularly outsourced to a Contract Research Organisation (CRO) while the risk and accountability remain within the sponsor company. Many statistical tasks are delivered by the CRO and later revalidated by the sponsor. Here, we report a technological approach to standardised event prediction. We have built a dynamic web application around an R-package with the aim of delivering reliable event predictions, simplifying communication and increasing trust between the CRO and the in-house statisticians via transparency. Short learning curve, interactivity, reproducibility and data diagnostics are key here. The current implementation is motivated by time-to-event prediction in oncology. We demonstrate a clear benefit of standardisation for both parties. The tool can be used for exploration, communication, sensitivity analysis and generating standard reports. At this point we wish to present this tool and share some of the insights we have gained during the development.

Assuntos

Sistemas de Notificação de Reações Adversas a Medicamentos/organização & administração , Ensaios Clínicos como Assunto/estatística & dados numéricos , Monitoramento de Medicamentos/métodos , Efeitos Colaterais e Reações Adversas Relacionados a Medicamentos/epidemiologia , Registros Eletrônicos de Saúde/estatística & dados numéricos , Serviços Terceirizados/estatística & dados numéricos , Simulação por Computador , Efeitos Colaterais e Reações Adversas Relacionados a Medicamentos/diagnóstico , Registros Eletrônicos de Saúde/classificação , Humanos , Incidência , Modelos Estatísticos , Medição de Risco/métodos , Software , Reino Unido/epidemiologia

6.

CODEX: a next-generation sequencing experiment database for the haematopoietic and embryonic stem cell communities.

Sánchez-Castillo, Manuel; Ruau, David; Wilkinson, Adam C; Ng, Felicia S L; Hannah, Rebecca; Diamanti, Evangelia; Lombard, Patrick; Wilson, Nicola K; Gottgens, Berthold.

Nucleic Acids Res ; 43(Database issue): D1117-23, 2015 Jan.

Artigo em Inglês | MEDLINE | ID: mdl-25270877

RESUMO

CODEX (http://codex.stemcells.cam.ac.uk/) is a user-friendly database for the direct access and interrogation of publicly available next-generation sequencing (NGS) data, specifically aimed at experimental biologists. In an era of multi-centre genomic dataset generation, CODEX provides a single database where these samples are collected, uniformly processed and vetted. The main drive of CODEX is to provide the wider scientific community with instant access to high-quality NGS data, which, irrespective of the publishing laboratory, is directly comparable. CODEX allows users to immediately visualize or download processed datasets, or compare user-generated data against the database's cumulative knowledge-base. CODEX contains four types of NGS experiments: transcription factor chromatin immunoprecipitation coupled to high-throughput sequencing (ChIP-Seq), histone modification ChIP-Seq, DNase-Seq and RNA-Seq. These are largely encompassed within two specialized repositories, HAEMCODE and ESCODE, which are focused on haematopoiesis and embryonic stem cell samples, respectively. To date, CODEX contains over 1000 samples, including 221 unique TFs and 93 unique cell types. CODEX therefore provides one of the most complete resources of publicly available NGS data for the direct interrogation of transcriptional programmes that regulate cellular identity and fate in the context of mammalian development, homeostasis and disease.

Assuntos

Bases de Dados Genéticas , Células-Tronco Embrionárias/metabolismo , Células-Tronco Hematopoéticas/metabolismo , Sequenciamento de Nucleotídeos em Larga Escala , Animais , Imunoprecipitação da Cromatina , Hematopoese/genética , Histonas/metabolismo , Humanos , Internet , Camundongos , Análise de Sequência de DNA , Análise de Sequência de RNA , Software

7.

Constrained transcription factor spacing is prevalent and important for transcriptional control of mouse blood cells.

Ng, Felicia S L; Schütte, Judith; Ruau, David; Diamanti, Evangelia; Hannah, Rebecca; Kinston, Sarah J; Göttgens, Berthold.

Nucleic Acids Res ; 42(22): 13513-24, 2014 Dec 16.

Artigo em Inglês | MEDLINE | ID: mdl-25428352

RESUMO

Combinatorial transcription factor (TF) binding is essential for cell-type-specific gene regulation. However, much remains to be learned about the mechanisms of TF interactions, including to what extent constrained spacing and orientation of interacting TFs are critical for regulatory element activity. To examine the relative prevalence of the 'enhanceosome' versus the 'TF collective' model of combinatorial TF binding, a comprehensive analysis of TF binding site sequences in large scale datasets is necessary. We developed a motif-pair discovery pipeline to identify motif co-occurrences with preferential distance(s) between motifs in TF-bound regions. Utilizing a compendium of 289 mouse haematopoietic TF ChIP-seq datasets, we demonstrate that haematopoietic-related motif-pairs commonly occur with highly conserved constrained spacing and orientation between motifs. Furthermore, motif clustering revealed specific associations for both heterotypic and homotypic motif-pairs with particular haematopoietic cell types. We also showed that disrupting the spacing between motif-pairs significantly affects transcriptional activity in a well-known motif-pair-E-box and GATA, and in two previously unknown motif-pairs with constrained spacing-Ets and Homeobox as well as Ets and E-box. In this study, we provide evidence for widespread sequence-specific TF pair interaction with DNA that conforms to the 'enhanceosome' model, and furthermore identify associations between specific haematopoietic cell-types and motif-pairs.

Assuntos

Hematopoese/genética , Elementos Reguladores de Transcrição , Fatores de Transcrição/metabolismo , Transcrição Gênica , Animais , Sítios de Ligação , Células Sanguíneas/metabolismo , Imunoprecipitação da Cromatina , DNA/química , DNA/metabolismo , Camundongos , Motivos de Nucleotídeos , Análise de Sequência de DNA

8.

TRES predicts transcription control in embryonic stem cells.

Pooley, Christopher; Ruau, David; Lombard, Patrick; Gottgens, Berthold; Joshi, Anagha.

Bioinformatics ; 30(20): 2983-5, 2014 Oct 15.

Artigo em Inglês | MEDLINE | ID: mdl-24958811

RESUMO

SUMMARY: Unraveling transcriptional circuits controlling embryonic stem cell maintenance and fate has great potential for improving our understanding of normal development as well as disease. To facilitate this, we have developed a novel web tool called 'TRES' that predicts the likely upstream regulators for a given gene list. This is achieved by integrating transcription factor (TF) binding events from 187 ChIP-sequencing and ChIP-on-chip datasets in murine and human embryonic stem (ES) cells with over 1000 mammalian TF sequence motifs. Using 114 TF perturbation gene sets, as well as 115 co-expression clusters in ES cells, we validate the utility of this approach. AVAILABILITY AND IMPLEMENTATION: TRES is freely available at http://www.tres.roslin.ed.ac.uk. CONTACT: Anagha.Joshi@roslin.ed.ac.uk or bg200@cam.ac.uk SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Assuntos

Biologia Computacional/métodos , Células-Tronco Embrionárias/metabolismo , Regulação da Expressão Gênica , Transcrição Gênica , Animais , Imunoprecipitação da Cromatina , Humanos , Internet , Camundongos , Análise de Sequência com Séries de Oligonucleotídeos , Análise de Sequência , Fatores de Transcrição/metabolismo

9.

Disease risk factors identified through shared genetic architecture and electronic medical records.

Li, Li; Ruau, David J; Patel, Chirag J; Weber, Susan C; Chen, Rong; Tatonetti, Nicholas P; Dudley, Joel T; Butte, Atul J.

Sci Transl Med ; 6(234): 234ra57, 2014 Apr 30.

Artigo em Inglês | MEDLINE | ID: mdl-24786325

RESUMO

Genome-wide association studies have identified genetic variants for thousands of diseases and traits. We evaluated the relationships between specific risk factors (for example, blood cholesterol level) and diseases on the basis of their shared genetic architecture in a comprehensive human disease-single-nucleotide polymorphism association database (VARIMED), analyzing the findings from 8962 published association studies. Similarity between traits and diseases was statistically evaluated on the basis of their association with shared gene variants. We identified 120 disease-trait pairs that were statistically similar, and of these, we tested and validated five previously unknown disease-trait associations by searching electronic medical records (EMRs) from three independent medical centers for evidence of the trait appearing in patients within 1 year of first diagnosis of the disease. We validated that the mean corpuscular volume is elevated before diagnosis of acute lymphoblastic leukemia; both have associated variants in the gene IKZF1. Platelet count is decreased before diagnosis of alcohol dependence; both are associated with variants in the gene C12orf51. Alkaline phosphatase level is elevated in patients with venous thromboembolism; both share variants in ABO. Similarly, we found that prostate-specific antigen and serum magnesium levels were altered before the diagnosis of lung cancer and gastric cancer, respectively. Disease-trait associations identify traits that could serve as future prognostics, if validated through EMR and subsequent prospective trials.

Assuntos

Registros Eletrônicos de Saúde , Estudo de Associação Genômica Ampla/métodos , Humanos , Modelos Biológicos , Fatores de Risco

10.

Building an ENCODE-style data compendium on a shoestring.

Ruau, David; Ng, Felicia S L; Wilson, Nicola K; Hannah, Rebecca; Diamanti, Evangelia; Lombard, Patrick; Woodhouse, Steven; Göttgens, Berthold.

Nat Methods ; 10(10): 926, 2013 Oct.

Artigo em Inglês | MEDLINE | ID: mdl-24076986

Assuntos

Bases de Dados de Ácidos Nucleicos , Genoma , Camundongos/genética , Anotação de Sequência Molecular , Software , Animais , Internet

11.

Systematic identification of DNA variants associated with ultraviolet radiation using a novel Geographic-Wide Association Study (GeoWAS).

Hsu, Irving; Chen, Rong; Ramesh, Aditya; Corona, Erik; Kang, Hyunseok Peter; Ruau, David; Butte, Atul J.

BMC Med Genet ; 14: 62, 2013 Jun 20.

Artigo em Inglês | MEDLINE | ID: mdl-23786662

RESUMO

BACKGROUND: Long-term environmental variables are widely understood to play important roles in DNA variation. Previously, clinical studies examining the impacts of these variables on the human genome were localized to a single country, and used preselected DNA variants. Furthermore, clinical studies or surveys are either not available or difficult to carry out for developing countries. A systematic approach utilizing bioinformatics to identify associations among environmental variables, genetic variation, and diseases across various geographical locations is needed but has been lacking. METHODS: Using a novel Geographic-Wide Association Study (GeoWAS) methodology, we identified Single Nucleotide Polymorphisms (SNPs) in the Human Genome Diversity Project (HGDP) with population allele frequencies associated geographical ultraviolet radiation exposure, and then assessed the diseases known to be assigned with these SNPs. RESULTS: 2,857 radiation SNPs were identified from over 650,000 SNPs in 52 indigenous populations across the world. Using a quantitative disease-SNP database curated from 5,065 human genetic papers, we identified disease associations with those radiation SNPs. The correlation of the rs16891982 SNP in the SLC45A2 gene with melanoma was used as a case study for analysis of disease risk, and the results were consistent with the incidence and mortality rates of melanoma in published scientific literature. Finally, by analyzing the ontology of genes in which the radiation SNPs were significantly enriched, potential associations between SNPs and neurological disorders such as Alzheimer's disease were hypothesized. CONCLUSION: A systematic approach using GeoWAS has enabled us to identify DNA variation associated with ultraviolet radiation and their connections to diseases such as skin cancers. Our analyses have led to a better understating at the genetic level of why certain diseases are more predominant in specific geographical locations, due to the interactions between environmental variables such as ultraviolet radiation and the population types in those regions. The hypotheses proposed in GeoWAS can lead to future testing and interdisciplinary research.

Assuntos

Genoma Humano/efeitos da radiação , Polimorfismo de Nucleotídeo Único , Raios Ultravioleta/efeitos adversos , Antígenos de Neoplasias/genética , Bases de Dados Genéticas , Detecção Precoce de Câncer/métodos , Frequência do Gene , Estudos de Associação Genética/métodos , Genética Populacional/métodos , Geografia , Humanos , Melanoma/genética , Melanoma/patologia , Proteínas de Membrana Transportadoras/genética , Fenótipo , Grupos Raciais/genética , Fatores de Risco , Seleção Genética , Vitamina D/genética , Vitamina D/metabolismo

12.

Systematic identification of risk factors for Alzheimer's disease through shared genetic architecture and electronic medical records.

Li, Li; Ruau, David; Chen, Rong; Weber, Susan; Butte, Atul J.

Pac Symp Biocomput ; : 224-35, 2013.

Artigo em Inglês | MEDLINE | ID: mdl-23424127

RESUMO

Alzheimer's disease (AD) is one of the leading causes of death for older people in US with rapidly increasing incidence. AD irreversibly and progressively damages the brain, but there are treatments in clinical trials to potentially slow the development of AD. We hypothesize that the presence of clinical traits, sharing common genetic variants with AD, could be used as a non-invasive means to predict AD or trigger for administration of preventative therapeutics. We developed a method to compare the genetic architecture between AD and traits from prior GWAS studies. Six clinical traits were significantly associated with AD, capturing 5 known risk factors and 1 novel association: erythrocyte sedimentation rate (ESR). The association of ESR with AD was then validated using Electronic Medical Records (EMR) collected from Stanford Hospital and Clinics. We found that female patients and with abnormally elevated ESR were significantly associated with higher risk of AD diagnosis (OR: 1.85 [1.32-2.61], p=0.003), within 1 year prior to AD diagnosis (OR: 2.31 [1.06-5.01], p=0.032), and within 1 year after AD diagnosis (OR: 3.49 [1.93-6.31], p<0.0001). Additionally, significantly higher ESR values persist for all time courses analyzed. Our results suggest that ESR should be tested in a specific longitudinal study for association with AD diagnosis, and if positive, could be used as a prognostic marker.

Assuntos

Doença de Alzheimer/genética , Idoso , Doença de Alzheimer/sangue , Sedimentação Sanguínea , Estudos de Casos e Controles , Estudos de Coortes , Biologia Computacional , Bases de Dados Genéticas/estatística & dados numéricos , Registros Eletrônicos de Saúde/estatística & dados numéricos , Feminino , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla/estatística & dados numéricos , Humanos , Masculino , Polimorfismo de Nucleotídeo Único , Medicina de Precisão/estatística & dados numéricos , Prognóstico , Locos de Características Quantitativas , Fatores de Risco

13.

Quantifying multi-ethnic representation in genetic studies of high mortality diseases.

Chen, Rong; Dudley, Joel T; Ruau, David; Butte, Atul J.

AMIA Jt Summits Transl Sci Proc ; 2012: 11-8, 2012.

Artigo em Inglês | MEDLINE | ID: mdl-22779041

RESUMO

Most GWASs were performed using study populations with Caucasian ethnicity or ancestry, and findings from one ethnic subpopulation might not always translate to another. We curated 4,573 genetic studies on 763 human diseases and identified 3,461 disease-susceptible SNPs with genome-wide significance; only 10% of these had been validated in at least two different ethnic populations. SNPs for autoimmune diseases demonstrated the lowest percentage of cross-ethnicity validation. We used the mortality data from the Center for Disease Control and Prevention and identified 19 diseases killing over 10,000 Americans per year that were still lacking publications of even a single cross-ethnic SNP. Fifteen of these diseases had never been studied in large GWAS in non-Caucasian populations, including chronic liver diseases and cirrhosis, leukemia, and non-Hodgkin's lymphoma. Our results demonstrate that diseases killing most Americans are still lacking genetic studies across ethnicities.

14.

Integrative approach to pain genetics identifies pain sensitivity loci across diseases.

Ruau, David; Dudley, Joel T; Chen, Rong; Phillips, Nicholas G; Swan, Gary E; Lazzeroni, Laura C; Clark, J David; Butte, Atul J; Angst, Martin S.

PLoS Comput Biol ; 8(6): e1002538, 2012.

Artigo em Inglês | MEDLINE | ID: mdl-22685391

RESUMO

Identifying human genes relevant for the processing of pain requires difficult-to-conduct and expensive large-scale clinical trials. Here, we examine a novel integrative paradigm for data-driven discovery of pain gene candidates, taking advantage of the vast amount of existing disease-related clinical literature and gene expression microarray data stored in large international repositories. First, thousands of diseases were ranked according to a disease-specific pain index (DSPI), derived from Medical Subject Heading (MESH) annotations in MEDLINE. Second, gene expression profiles of 121 of these human diseases were obtained from public sources. Third, genes with expression variation significantly correlated with DSPI across diseases were selected as candidate pain genes. Finally, selected candidate pain genes were genotyped in an independent human cohort and prospectively evaluated for significant association between variants and measures of pain sensitivity. The strongest signal was with rs4512126 (5q32, ABLIM3, Pâ=â1.3×10â»¹°) for the sensitivity to cold pressor pain in males, but not in females. Significant associations were also observed with rs12548828, rs7826700 and rs1075791 on 8q22.2 within NCALD (Pâ=â1.7×10â»4, 1.8×10â»4, and 2.2×10â»4 respectively). Our results demonstrate the utility of a novel paradigm that integrates publicly available disease-specific gene expression data with clinical data curated from MEDLINE to facilitate the discovery of pain-relevant genes. This data-derived list of pain gene candidates enables additional focused and efficient biological studies validating additional candidates.

Assuntos

Dor/genética , Estudos de Coortes , Temperatura Baixa/efeitos adversos , Biologia Computacional , Bases de Dados Genéticas , Doença/genética , Feminino , Perfilação da Expressão Gênica , Estudo de Associação Genômica Ampla , Humanos , Medicina Integrativa , Proteínas com Domínio LIM/genética , Masculino , Modelos Genéticos , Neurocalcina/genética , Dor/etiologia , Biologia de Sistemas

15.

Type 2 diabetes risk alleles demonstrate extreme directional differentiation among human populations, compared to other diseases.

Chen, Rong; Corona, Erik; Sikora, Martin; Dudley, Joel T; Morgan, Alex A; Moreno-Estrada, Andres; Nilsen, Geoffrey B; Ruau, David; Lincoln, Stephen E; Bustamante, Carlos D; Butte, Atul J.

PLoS Genet ; 8(4): e1002621, 2012.

Artigo em Inglês | MEDLINE | ID: mdl-22511877

RESUMO

Many disease-susceptible SNPs exhibit significant disparity in ancestral and derived allele frequencies across worldwide populations. While previous studies have examined population differentiation of alleles at specific SNPs, global ethnic patterns of ensembles of disease risk alleles across human diseases are unexamined. To examine these patterns, we manually curated ethnic disease association data from 5,065 papers on human genetic studies representing 1,495 diseases, recording the precise risk alleles and their measured population frequencies and estimated effect sizes. We systematically compared the population frequencies of cross-ethnic risk alleles for each disease across 1,397 individuals from 11 HapMap populations, 1,064 individuals from 53 HGDP populations, and 49 individuals with whole-genome sequences from 10 populations. Type 2 diabetes (T2D) demonstrated extreme directional differentiation of risk allele frequencies across human populations, compared with null distributions of European-frequency matched control genomic alleles and risk alleles for other diseases. Most T2D risk alleles share a consistent pattern of decreasing frequencies along human migration into East Asia. Furthermore, we show that these patterns contribute to disparities in predicted genetic risk across 1,397 HapMap individuals, T2D genetic risk being consistently higher for individuals in the African populations and lower in the Asian populations, irrespective of the ethnicity considered in the initial discovery of risk alleles. We observed a similar pattern in the distribution of T2D Genetic Risk Scores, which are associated with an increased risk of developing diabetes in the Diabetes Prevention Program cohort, for the same individuals. This disparity may be attributable to the promotion of energy storage and usage appropriate to environments and inconsistent energy intake. Our results indicate that the differential frequencies of T2D risk alleles may contribute to the observed disparity in T2D incidence rates across ethnic populations.

Assuntos

Diabetes Mellitus Tipo 2/genética , Frequência do Gene , Predisposição Genética para Doença , Genética Populacional , Polimorfismo de Nucleotídeo Único/genética , Povo Asiático/genética , População Negra/genética , Frequência do Gene/genética , Genoma Humano , Estudo de Associação Genômica Ampla , Projeto HapMap , Haplótipos , Humanos , Desequilíbrio de Ligação , Fatores de Risco , População Branca/genética

16.

Sex differences in reported pain across 11,000 patients captured in electronic medical records.

Ruau, David; Liu, Linda Y; Clark, J David; Angst, Martin S; Butte, Atul J.

J Pain ; 13(3): 228-34, 2012 Mar.

Artigo em Inglês | MEDLINE | ID: mdl-22245360

RESUMO

UNLABELLED: Clinically recorded pain scores are abundant in patient health records but are rarely used in research. The use of this information could help improve clinical outcomes. For example, a recent report by the Institute of Medicine stated that ineffective use of clinical information contributes to undertreatment of patient subpopulations--especially women. This study used diagnosis-associated pain scores from a large hospital database to document sex differences in reported pain. We used de-identified electronic medical records from Stanford Hospital and Clinics for more than 72,000 patients. Each record contained at least 1 disease-associated pain score. We found over 160,000 pain scores in more than 250 primary diagnoses, and analyzed differences in disease-specific pain reported by men and women. After filtering for diagnoses with minimum encounter numbers, we found diagnosis-specific sex differences in reported pain. The most significant differences occurred in patients with disorders of the musculoskeletal, circulatory, respiratory and digestive systems, followed by infectious diseases, and injury and poisoning. We also discovered sex-specific differences in pain intensity in previously unreported diseases, including disorders of the cervical region, and acute sinusitis (P = .01, .017, respectively). Pain scores were collected during hospital encounters. No information about the use of pre-encounter over-the-counter medications was available. To our knowledge, this is the largest data-driven study documenting sex differences of disease-associated pain. It highlights the utility of electronic medical record data to corroborate and expand on results of smaller clinical studies. Our findings emphasize the need for future research examining the mechanisms underlying differences in pain. PERSPECTIVE: This article highlights the potential of electronic medical records to conduct large-scale pain studies. Our results are consistent with previous studies reporting pain differences between sexes and also suggest that clinicians should pay increased attention to this idea.

Assuntos

Dor , Caracteres Sexuais , Adulto , Registros Eletrônicos de Saúde , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Estudos Retrospectivos , Adulto Jovem

17.

Comparison of automated and human assignment of MeSH terms on publicly-available molecular datasets.

Ruau, David; Mbagwu, Michael; Dudley, Joel T; Krishnan, Vijay; Butte, Atul J.

J Biomed Inform ; 44 Suppl 1: S39-S43, 2011 Dec.

Artigo em Inglês | MEDLINE | ID: mdl-21420508

RESUMO

Publicly available molecular datasets can be used for independent verification or investigative repurposing, but depends on the presence, consistency and quality of descriptive annotations. Annotation and indexing of molecular datasets using well-defined controlled vocabularies or ontologies enables accurate and systematic data discovery, yet the majority of molecular datasets available through public data repositories lack such annotations. A number of automated annotation methods have been developed; however few systematic evaluations of the quality of annotations supplied by application of these methods have been performed using annotations from standing public data repositories. Here, we compared manually-assigned Medical Subject Heading (MeSH) annotations associated with experiments by data submitters in the PRoteomics IDEntification (PRIDE) proteomics data repository to automated MeSH annotations derived through the National Center for Biomedical Ontology Annotator and National Library of Medicine MetaMap programs. These programs were applied to free-text annotations for experiments in PRIDE. As many submitted datasets were referenced in publications, we used the manually curated MeSH annotations of those linked publications in MEDLINE as "gold standard". Annotator and MetaMap exhibited recall performance 3-fold greater than that of the manual annotations. We connected PRIDE experiments in a network topology according to shared MeSH annotations and found 373 distinct clusters, many of which were found to be biologically coherent by network analysis. The results of this study suggest that both Annotator and MetaMap are capable of annotating public molecular datasets with a quality comparable, and often exceeding, that of the actual data submitters, highlighting a continuous need to improve and apply automated methods to molecular datasets in public data repositories to maximize their value and utility.

Assuntos

Bases de Dados Factuais , Armazenamento e Recuperação da Informação/métodos , Medical Subject Headings , Humanos , Processamento de Linguagem Natural , Proteômica , Estados Unidos , Vocabulário Controlado

18.

Neural induction intermediates exhibit distinct roles of Fgf signaling.

Sterneckert, Jared; Stehling, Martin; Bernemann, Christof; Araúzo-Bravo, Marcos J; Greber, Boris; Gentile, Luca; Ortmeier, Claudia; Sinn, Martina; Wu, Guangming; Ruau, David; Zenke, Martin; Brintrup, Rhea; Klein, Diana C; Ko, Kinarm; Schöler, Hans R.

Stem Cells ; 28(10): 1772-81, 2010 Oct.

Artigo em Inglês | MEDLINE | ID: mdl-20715182

RESUMO

Formation of the neural plate is an intricate process in early mammalian embryonic development mediated by cells of the inner cell mass and involving a series of steps, including development of the epiblast. Here, we report on the creation of an embryonic stem (ES) cell-based system to isolate and identify neural induction intermediates with characteristics of epiblast cells and neural plate. We demonstrate that neural commitment requires prior differentiation of ES cells into epiblast cells that are indistinguishable from those derived from natural embryos. We also demonstrate that epiblast cells can be isolated and cultured as epiblast stem cell lines. Fgf signaling is shown to be required for the differentiation of ES cells into these epiblast cells. Fgf2, widely used for maintenance of both human ES cells and epiblast stem cells, inhibits formation of early neural cells by epiblast intermediates in a dose-dependent manner and is sufficient to promote transient self-renewal of epiblast stem cells. In contrast, Fgf8, the endogenous embryonic neural inducer, fails to promote epiblast self-renewal, but rather promotes more homogenous neural induction with transient self-renewal of early neural cells. Removal of Fgf signaling entirely from epiblast cells promotes rapid neural induction and subsequent neurogenesis. We conclude that Fgf signaling plays different roles during the differentiation of ES cells, with an initial requirement in epiblast formation and a subsequent role in self-renewal. Fgf2 and Fgf8 thus stimulate self-renewal in different cell types.

Assuntos

Células-Tronco Embrionárias/citologia , Células-Tronco Embrionárias/metabolismo , Fatores de Crescimento de Fibroblastos/farmacologia , Animais , Diferenciação Celular/efeitos dos fármacos , Diferenciação Celular/genética , Células Cultivadas , Feminino , Fator 2 de Crescimento de Fibroblastos/farmacologia , Fator 8 de Crescimento de Fibroblasto/farmacologia , Camadas Germinativas/citologia , Camadas Germinativas/efeitos dos fármacos , Humanos , Masculino , Camundongos , Placa Neural/citologia , Placa Neural/efeitos dos fármacos , Neurogênese/efeitos dos fármacos , Neurogênese/genética , Transdução de Sinais/efeitos dos fármacos

19.

Activated Notch1 target genes during embryonic cell differentiation depend on the cellular context and include lineage determinants and inhibitors.

Meier-Stiegen, Franziska; Schwanbeck, Ralf; Bernoth, Kristina; Martini, Simone; Hieronymus, Thomas; Ruau, David; Zenke, Martin; Just, Ursula.

PLoS One ; 5(7): e11481, 2010 Jul 08.

Artigo em Inglês | MEDLINE | ID: mdl-20628604

RESUMO

BACKGROUND: Notch receptor signaling controls developmental cell fates in a cell-context dependent manner. Although Notch signaling directly regulates transcription via the RBP-J/CSL DNA binding protein, little is known about the target genes that are directly activated by Notch in the respective tissues. METHODOLOGY/PRINCIPAL FINDINGS: To analyze how Notch signaling mediates its context dependent function(s), we utilized a Tamoxifen-inducible system to activate Notch1 in murine embryonic stem cells at different stages of mesodermal differentiation and performed global transcriptional analyses. We find that the majority of genes regulated by Notch1 are unique for the cell type and vary widely dependent on other signals. We further show that Notch1 signaling regulates expression of genes playing key roles in cell differentiation, cell cycle control and apoptosis in a context dependent manner. In addition to the known Notch1 targets of the Hes and Hey families of transcriptional repressors, Notch1 activates the expression of regulatory transcription factors such as Sox9, Pax6, Runx1, Myf5 and Id proteins that are critically involved in lineage decisions in the absence of protein synthesis. CONCLUSION/SIGNIFICANCE: We suggest that Notch signaling determines lineage decisions and expansion of stem cells by directly activating both key lineage specific transcription factors and their repressors (Id and Hes/Hey proteins) and propose a model by which Notch signaling regulates cell fate commitment and self renewal in dependence of the intrinsic and extrinsic cellular context.

Assuntos

Diferenciação Celular/fisiologia , Células-Tronco Embrionárias/citologia , Células-Tronco Embrionárias/metabolismo , Receptor Notch1/metabolismo , Animais , Apoptose/genética , Apoptose/fisiologia , Western Blotting , Ciclo Celular/genética , Ciclo Celular/fisiologia , Diferenciação Celular/genética , Linhagem Celular , Subunidade alfa 2 de Fator de Ligação ao Core/genética , Subunidade alfa 2 de Fator de Ligação ao Core/metabolismo , Proteínas de Ligação a DNA/genética , Proteínas de Ligação a DNA/metabolismo , Proteínas do Olho/genética , Proteínas do Olho/metabolismo , Proteínas de Homeodomínio/genética , Proteínas de Homeodomínio/metabolismo , Camundongos , Fator Regulador Miogênico 5/genética , Fator Regulador Miogênico 5/metabolismo , Análise de Sequência com Séries de Oligonucleotídeos , Fator de Transcrição PAX6 , Fatores de Transcrição Box Pareados/genética , Fatores de Transcrição Box Pareados/metabolismo , Receptor Notch1/genética , Proteínas Repressoras/genética , Proteínas Repressoras/metabolismo , Reação em Cadeia da Polimerase Via Transcriptase Reversa , Fatores de Transcrição SOX9/genética , Fatores de Transcrição SOX9/metabolismo

20.

Induction of pluripotency in adult unipotent germline stem cells.

Ko, Kinarm; Tapia, Natalia; Wu, Guangming; Kim, Jeong Beom; Bravo, Marcos J Araúzo; Sasse, Philipp; Glaser, Tamara; Ruau, David; Han, Dong Wook; Greber, Boris; Hausdörfer, Kirsten; Sebastiano, Vittorio; Stehling, Martin; Fleischmann, Bernd K; Brüstle, Oliver; Zenke, Martin; Schöler, Hans R.

Cell Stem Cell ; 5(1): 87-96, 2009 Jul 02.

Artigo em Inglês | MEDLINE | ID: mdl-19570517

RESUMO

Mouse and human stem cells with features similar to those of embryonic stem cells have been derived from testicular cells. Although pluripotent stem cells have been obtained from defined germline stem cells (GSCs) of mouse neonatal testis, only multipotent stem cells have been obtained so far from defined cells of mouse adult testis. In this study we describe a robust and reproducible protocol for obtaining germline-derived pluripotent stem (gPS) cells from adult unipotent GSCs. Pluripotency of gPS cells was confirmed by in vitro and in vivo differentiation, including germ cell contribution and transmission. As determined by clonal analyses, gPS cells indeed originate from unipotent GSCs. We propose that the conversion process requires a GSC culture microenvironment that depends on the initial number of plated GSCs and the length of culture time.

Assuntos

Células-Tronco Adultas/citologia , Técnicas de Cultura de Células/métodos , Células Germinativas/citologia , Células-Tronco Pluripotentes/citologia , Células-Tronco Adultas/metabolismo , Animais , Blastocisto/citologia , Diferenciação Celular , Células Cultivadas , Metilação de DNA , Desenvolvimento Embrionário , Feminino , Células Germinativas/crescimento & desenvolvimento , Masculino , Camundongos , Modelos Animais , Células-Tronco Pluripotentes/metabolismo , Nicho de Células-Tronco/citologia , Testículo/citologia

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA