Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 115
Filtrar
Mais filtros

Base de dados
Tipo de documento
Intervalo de ano de publicação
1.
Nat Immunol ; 24(2): 349-358, 2023 02.
Artigo em Inglês | MEDLINE | ID: mdl-36717723

RESUMO

The biology driving individual patient responses to severe acute respiratory syndrome coronavirus 2 infection remains ill understood. Here, we developed a patient-centric framework leveraging detailed longitudinal phenotyping data and covering a year after disease onset, from 215 infected individuals with differing disease severities. Our analyses revealed distinct 'systemic recovery' profiles, with specific progression and resolution of the inflammatory, immune cell, metabolic and clinical responses. In particular, we found a strong inter-patient and intra-patient temporal covariation of innate immune cell numbers, kynurenine metabolites and lipid metabolites, which highlighted candidate immunologic and metabolic pathways influencing the restoration of homeostasis, the risk of death and that of long COVID. Based on these data, we identified a composite signature predictive of systemic recovery, using a joint model on cellular and molecular parameters measured soon after disease onset. New predictions can be generated using the online tool http://shiny.mrc-bsu.cam.ac.uk/apps/covid-19-systemic-recovery-prediction-app , designed to test our findings prospectively.


Assuntos
COVID-19 , Humanos , SARS-CoV-2 , Síndrome de COVID-19 Pós-Aguda , Cinurenina , Assistência Centrada no Paciente
2.
Immunity ; 54(6): 1257-1275.e8, 2021 06 08.
Artigo em Inglês | MEDLINE | ID: mdl-34051148

RESUMO

The kinetics of the immune changes in COVID-19 across severity groups have not been rigorously assessed. Using immunophenotyping, RNA sequencing, and serum cytokine analysis, we analyzed serial samples from 207 SARS-CoV2-infected individuals with a range of disease severities over 12 weeks from symptom onset. An early robust bystander CD8+ T cell immune response, without systemic inflammation, characterized asymptomatic or mild disease. Hospitalized individuals had delayed bystander responses and systemic inflammation that was already evident near symptom onset, indicating that immunopathology may be inevitable in some individuals. Viral load did not correlate with this early pathological response but did correlate with subsequent disease severity. Immune recovery is complex, with profound persistent cellular abnormalities in severe disease correlating with altered inflammatory responses, with signatures associated with increased oxidative phosphorylation replacing those driven by cytokines tumor necrosis factor (TNF) and interleukin (IL)-6. These late immunometabolic and immune defects may have clinical implications.


Assuntos
Linfócitos T CD8-Positivos/imunologia , COVID-19/imunologia , COVID-19/virologia , Interações Hospedeiro-Patógeno/imunologia , Ativação Linfocitária/imunologia , SARS-CoV-2/imunologia , Biomarcadores , Linfócitos T CD8-Positivos/metabolismo , COVID-19/diagnóstico , COVID-19/genética , Citocinas/metabolismo , Suscetibilidade a Doenças , Perfilação da Expressão Gênica , Humanos , Mediadores da Inflamação/metabolismo , Estudos Longitudinais , Ativação Linfocitária/genética , Fosforilação Oxidativa , Fenótipo , Prognóstico , Espécies Reativas de Oxigênio/metabolismo , Índice de Gravidade de Doença , Transcriptoma
3.
Nature ; 583(7814): 96-102, 2020 07.
Artigo em Inglês | MEDLINE | ID: mdl-32581362

RESUMO

Most patients with rare diseases do not receive a molecular diagnosis and the aetiological variants and causative genes for more than half such disorders remain to be discovered1. Here we used whole-genome sequencing (WGS) in a national health system to streamline diagnosis and to discover unknown aetiological variants in the coding and non-coding regions of the genome. We generated WGS data for 13,037 participants, of whom 9,802 had a rare disease, and provided a genetic diagnosis to 1,138 of the 7,065 extensively phenotyped participants. We identified 95 Mendelian associations between genes and rare diseases, of which 11 have been discovered since 2015 and at least 79 are confirmed to be aetiological. By generating WGS data of UK Biobank participants2, we found that rare alleles can explain the presence of some individuals in the tails of a quantitative trait for red blood cells. Finally, we identified four novel non-coding variants that cause disease through the disruption of transcription of ARPC1B, GATA1, LRBA and MPL. Our study demonstrates a synergy by using WGS for diagnosis and aetiological discovery in routine healthcare.


Assuntos
Internacionalidade , Programas Nacionais de Saúde , Doenças Raras/diagnóstico , Doenças Raras/genética , Sequenciamento Completo do Genoma , Complexo 2-3 de Proteínas Relacionadas à Actina/genética , Proteínas Adaptadoras de Transdução de Sinal/genética , Alelos , Bases de Dados Factuais , Eritrócitos/metabolismo , Fator de Transcrição GATA1/genética , Humanos , Fenótipo , Locos de Características Quantitativas , Receptores de Trombopoetina/genética , Medicina Estatal , Reino Unido
4.
Am J Hum Genet ; 108(6): 983-1000, 2021 06 03.
Artigo em Inglês | MEDLINE | ID: mdl-33909991

RESUMO

We present EPISPOT, a fully joint framework which exploits large panels of epigenetic annotations as variant-level information to enhance molecular quantitative trait locus (QTL) mapping. Thanks to a purpose-built Bayesian inferential algorithm, EPISPOT accommodates functional information for both cis and trans actions, including QTL hotspot effects. It effectively couples simultaneous QTL analysis of thousands of genetic variants and molecular traits with hypothesis-free selection of biologically interpretable annotations which directly contribute to the QTL effects. This unified, epigenome-aided learning boosts statistical power and sheds light on the regulatory basis of the uncovered hits; EPISPOT therefore marks an essential step toward improving the challenging detection and functional interpretation of trans-acting genetic variants and hotspots. We illustrate the advantages of EPISPOT in simulations emulating real-data conditions and in a monocyte expression QTL study, which confirms known hotspots and finds other signals, as well as plausible mechanisms of action. In particular, by highlighting the role of monocyte DNase-I sensitivity sites from >150 epigenetic annotations, we clarify the mediation effects and cell-type specificity of major hotspots close to the lysozyme gene. Our approach forgoes the daunting and underpowered task of one-annotation-at-a-time enrichment analyses for prioritizing cis and trans QTL hits and is tailored to any transcriptomic, proteomic, or metabolomic QTL problem. By enabling principled epigenome-driven QTL mapping transcriptome-wide, EPISPOT helps progress toward a better functional understanding of genetic regulation.


Assuntos
Algoritmos , Simulação por Computador , Epigenoma , Modelos Genéticos , Mutação , Fenótipo , Locos de Características Quantitativas , Teorema de Bayes , Mapeamento Cromossômico , Humanos
5.
PLoS Med ; 20(11): e1004310, 2023 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-37922316

RESUMO

BACKGROUND: Multimorbidity, characterised by the coexistence of multiple chronic conditions in an individual, is a rising public health concern. While much of the existing research has focused on cross-sectional patterns of multimorbidity, there remains a need to better understand the longitudinal accumulation of diseases. This includes examining the associations between important sociodemographic characteristics and the rate of progression of chronic conditions. METHODS AND FINDINGS: We utilised electronic primary care records from 13.48 million participants in England, drawn from the Clinical Practice Research Datalink (CPRD Aurum), spanning from 2005 to 2020 with a median follow-up of 4.71 years (IQR: 1.78, 11.28). The study focused on 5 important chronic conditions: cardiovascular disease (CVD), type 2 diabetes (T2D), chronic kidney disease (CKD), heart failure (HF), and mental health (MH) conditions. Key sociodemographic characteristics considered include ethnicity, social and material deprivation, gender, and age. We employed a flexible spline-based parametric multistate model to investigate the associations between these sociodemographic characteristics and the rate of different disease transitions throughout multimorbidity development. Our findings reveal distinct association patterns across different disease transition types. Deprivation, gender, and age generally demonstrated stronger associations with disease diagnosis compared to ethnic group differences. Notably, the impact of these factors tended to attenuate with an increase in the number of preexisting conditions, especially for deprivation, gender, and age. For example, the hazard ratio (HR) (95% CI; p-value) for the association of deprivation with T2D diagnosis (comparing the most deprived quintile to the least deprived) is 1.76 ([1.74, 1.78]; p < 0.001) for those with no preexisting conditions and decreases to 0.95 ([0.75, 1.21]; p = 0.69) with 4 preexisting conditions. Furthermore, the impact of deprivation, gender, and age was typically more pronounced when transitioning from an MH condition. For instance, the HR (95% CI; p-value) for the association of deprivation with T2D diagnosis when transitioning from MH is 2.03 ([1.95, 2.12], p < 0.001), compared to transitions from CVD 1.50 ([1.43, 1.58], p < 0.001), CKD 1.37 ([1.30, 1.44], p < 0.001), and HF 1.55 ([1.34, 1.79], p < 0.001). A primary limitation of our study is that potential diagnostic inaccuracies in primary care records, such as underdiagnosis, overdiagnosis, or ascertainment bias of chronic conditions, could influence our results. CONCLUSIONS: Our results indicate that early phases of multimorbidity development could warrant increased attention. The potential importance of earlier detection and intervention of chronic conditions is underscored, particularly for MH conditions and higher-risk populations. These insights may have important implications for the management of multimorbidity.


Assuntos
Doenças Cardiovasculares , Diabetes Mellitus Tipo 2 , Insuficiência Cardíaca , Insuficiência Renal Crônica , Humanos , Multimorbidade , Diabetes Mellitus Tipo 2/diagnóstico , Diabetes Mellitus Tipo 2/epidemiologia , Estudos Transversais , Inglaterra/epidemiologia , Insuficiência Cardíaca/diagnóstico , Insuficiência Cardíaca/epidemiologia , Doença Crônica , Insuficiência Renal Crônica/diagnóstico , Insuficiência Renal Crônica/epidemiologia , Atenção Primária à Saúde
6.
Stat Sci ; 37(2): 183-206, 2022 May.
Artigo em Inglês | MEDLINE | ID: mdl-35664221

RESUMO

We present interoperability as a guiding framework for statistical modelling to assist policy makers asking multiple questions using diverse datasets in the face of an evolving pandemic response. Interoperability provides an important set of principles for future pandemic preparedness, through the joint design and deployment of adaptable systems of statistical models for disease surveillance using probabilistic reasoning. We illustrate this through case studies for inferring and characterising spatial-temporal prevalence and reproduction numbers of SARS-CoV-2 infections in England.

7.
BMC Med Inform Decis Mak ; 21(1): 281, 2021 10 12.
Artigo em Inglês | MEDLINE | ID: mdl-34641870

RESUMO

BACKGROUND: An Informatics Consult has been proposed in which clinicians request novel evidence from large scale health data resources, tailored to the treatment of a specific patient. However, the availability of such consultations is lacking. We seek to provide an Informatics Consult for a situation where a treatment indication and contraindication coexist in the same patient, i.e., anti-coagulation use for stroke prevention in a patient with both atrial fibrillation (AF) and liver cirrhosis. METHODS: We examined four sources of evidence for the effect of warfarin on stroke risk or all-cause mortality from: (1) randomised controlled trials (RCTs), (2) meta-analysis of prior observational studies, (3) trial emulation (using population electronic health records (N = 3,854,710) and (4) genetic evidence (Mendelian randomisation). We developed prototype forms to request an Informatics Consult and return of results in electronic health record systems. RESULTS: We found 0 RCT reports and 0 trials recruiting for patients with AF and cirrhosis. We found broad concordance across the three new sources of evidence we generated. Meta-analysis of prior observational studies showed that warfarin use was associated with lower stroke risk (hazard ratio [HR] = 0.71, CI 0.39-1.29). In a target trial emulation, warfarin was associated with lower all-cause mortality (HR = 0.61, CI 0.49-0.76) and ischaemic stroke (HR = 0.27, CI 0.08-0.91). Mendelian randomisation served as a drug target validation where we found that lower levels of vitamin K1 (warfarin is a vitamin K1 antagonist) are associated with lower stroke risk. A pilot survey with an independent sample of 34 clinicians revealed that 85% of clinicians found information on prognosis useful and that 79% thought that they should have access to the Informatics Consult as a service within their healthcare systems. We identified candidate steps for automation to scale evidence generation and to accelerate the return of results. CONCLUSION: We performed a proof-of-concept Informatics Consult for evidence generation, which may inform treatment decisions in situations where there is dearth of randomised trials. Patients are surprised to know that their clinicians are currently not able to learn in clinic from data on 'patients like me'. We identify the key challenges in offering such an Informatics Consult as a service.


Assuntos
Fibrilação Atrial , Acidente Vascular Cerebral , Anticoagulantes/uso terapêutico , Fibrilação Atrial/tratamento farmacológico , Humanos , Informática , Encaminhamento e Consulta , Acidente Vascular Cerebral/tratamento farmacológico , Resultado do Tratamento , Varfarina/uso terapêutico
8.
Biom J ; 63(2): 289-304, 2021 02.
Artigo em Inglês | MEDLINE | ID: mdl-33155717

RESUMO

In precision medicine, a common problem is drug sensitivity prediction from cancer tissue cell lines. These types of problems entail modelling multivariate drug responses on high-dimensional molecular feature sets in typically >1000 cell lines. The dimensions of the problem require specialised models and estimation methods. In addition, external information on both the drugs and the features is often available. We propose to model the drug responses through a linear regression with shrinkage enforced through a normal inverse Gaussian prior. We let the prior depend on the external information, and estimate the model and external information dependence in an empirical-variational Bayes framework. We demonstrate the usefulness of this model in both a simulated setting and in the publicly available Genomics of Drug Sensitivity in Cancer data.


Assuntos
Genômica , Preparações Farmacêuticas , Teorema de Bayes , Distribuição Normal , Medicina de Precisão
9.
Am J Hum Genet ; 101(1): 104-114, 2017 Jul 06.
Artigo em Inglês | MEDLINE | ID: mdl-28669401

RESUMO

We present a rapid and powerful inference procedure for identifying loci associated with rare hereditary disorders using Bayesian model comparison. Under a baseline model, disease risk is fixed across all individuals in a study. Under an association model, disease risk depends on a latent bipartition of rare variants into pathogenic and non-pathogenic variants, the number of pathogenic alleles that each individual carries, and the mode of inheritance. A parameter indicating presence of an association and the parameters representing the pathogenicity of each variant and the mode of inheritance can be inferred in a Bayesian framework. Variant-specific prior information derived from allele frequency databases, consequence prediction algorithms, or genomic datasets can be integrated into the inference. Association models can be fitted to different subsets of variants in a locus and compared using a model selection procedure. This procedure can improve inference if only a particular class of variants confers disease risk and can suggest particular disease etiologies related to that class. We show that our method, called BeviMed, is more powerful and informative than existing rare variant association methods in the context of dominant and recessive disorders. The high computational efficiency of our algorithm makes it feasible to test for associations in the large non-coding fraction of the genome. We have applied BeviMed to whole-genome sequencing data from 6,586 individuals with diverse rare diseases. We show that it can identify multiple loci involved in rare diseases, while correctly inferring the modes of inheritance, the likely pathogenic variants, and the variant classes responsible.


Assuntos
Predisposição Genética para Doença , Variação Genética , Estudo de Associação Genômica Ampla , Doenças Raras/genética , Cardiomiopatias/genética , Simulação por Computador , Loci Gênicos , Humanos , Síndromes de Imunodeficiência/genética , Peptídeos e Proteínas de Sinalização Intercelular , Deficiência Intelectual Ligada ao Cromossomo X/genética , Proteínas Nucleares/genética , Osteocondrodisplasias/genética , Doenças da Imunodeficiência Primária , Probabilidade , Doenças Retinianas/genética , Trombocitopenia/genética
10.
Biostatistics ; 20(1): 1-16, 2019 01 01.
Artigo em Inglês | MEDLINE | ID: mdl-29136109

RESUMO

Small area ecological studies are commonly used in epidemiology to assess the impact of area level risk factors on health outcomes when data are only available in an aggregated form. However, the resulting estimates are often biased due to unmeasured confounders, which typically are not available from the standard administrative registries used for these studies. Extra information on confounders can be provided through external data sets such as surveys or cohorts, where the data are available at the individual level rather than at the area level; however, such data typically lack the geographical coverage of administrative registries. We develop a framework of analysis which combines ecological and individual level data from different sources to provide an adjusted estimate of area level risk factors which is less biased. Our method (i) summarizes all available individual level confounders into an area level scalar variable, which we call ecological propensity score (EPS), (ii) implements a hierarchical structured approach to impute the values of EPS whenever they are missing, and (iii) includes the estimated and imputed EPS into the ecological regression linking the risk factors to the health outcome. Through a simulation study, we show that integrating individual level data into small area analyses via EPS is a promising method to reduce the bias intrinsic in ecological studies due to unmeasured confounders; we also apply the method to a real case study to evaluate the effect of air pollution on coronary heart disease hospital admissions in Greater London.


Assuntos
Bioestatística/métodos , Interpretação Estatística de Dados , Métodos Epidemiológicos , Pontuação de Propensão , Análise de Pequenas Áreas , Poluição do Ar/estatística & dados numéricos , Simulação por Computador , Doença das Coronárias/epidemiologia , Humanos , Londres , Admissão do Paciente/estatística & dados numéricos
11.
Biom J ; 62(7): 1650-1669, 2020 11.
Artigo em Inglês | MEDLINE | ID: mdl-32567714

RESUMO

Study designs where data have been aggregated by geographical areas are popular in environmental epidemiology. These studies are commonly based on administrative databases and, providing a complete spatial coverage, are particularly appealing to make inference on the entire population. However, the resulting estimates are often biased and difficult to interpret due to unmeasured confounders, which typically are not available from routinely collected data. We propose a framework to improve inference drawn from such studies exploiting information derived from individual-level survey data. The latter are summarized in an area-level scalar score by mimicking at ecological level the well-known propensity score methodology. The literature on propensity score for confounding adjustment is mainly based on individual-level studies and assumes a binary exposure variable. Here, we generalize its use to cope with area-referenced studies characterized by a continuous exposure. Our approach is based upon Bayesian hierarchical structures specified into a two-stage design: (i) geolocated individual-level data from survey samples are up-scaled at ecological level, then the latter are used to estimate a generalized ecological propensity score (EPS) in the in-sample areas; (ii) the generalized EPS is imputed in the out-of-sample areas under different assumptions about the missingness mechanisms, then it is included into the ecological regression, linking the exposure of interest to the health outcome. This delivers area-level risk estimates, which allow a fuller adjustment for confounding than traditional areal studies. The methodology is illustrated by using simulations and a case study investigating the risk of lung cancer mortality associated with nitrogen dioxide in England (UK).


Assuntos
Saúde Ambiental , Pontuação de Propensão , Teorema de Bayes , Inglaterra , Humanos , Neoplasias Pulmonares/mortalidade , Dióxido de Nitrogênio/efeitos adversos
12.
Am J Hum Genet ; 98(3): 490-499, 2016 Mar 03.
Artigo em Inglês | MEDLINE | ID: mdl-26924528

RESUMO

Rare genetic disorders, which can now be studied systematically with affordable genome sequencing, are often caused by high-penetrance rare variants. Such disorders are often heterogeneous and characterized by abnormalities spanning multiple organ systems ascertained with variable clinical precision. Existing methods for identifying genes with variants responsible for rare diseases summarize phenotypes with unstructured binary or quantitative variables. The Human Phenotype Ontology (HPO) allows composite phenotypes to be represented systematically but association methods accounting for the ontological relationship between HPO terms do not exist. We present a Bayesian method to model the association between an HPO-coded patient phenotype and genotype. Our method estimates the probability of an association together with an HPO-coded phenotype characteristic of the disease. We thus formalize a clinical approach to phenotyping that is lacking in standard regression techniques for rare disease research. We demonstrate the power of our method by uncovering a number of true associations in a large collection of genome-sequenced and HPO-coded cases with rare diseases.


Assuntos
Estudos de Associação Genética/métodos , Fenótipo , Doenças Raras/diagnóstico , Doenças Raras/genética , Actinina/genética , Proteínas Adaptadoras de Transdução de Sinal/genética , Teorema de Bayes , Bases de Dados Genéticas , Forminas , Fatores de Troca do Nucleotídeo Guanina/genética , Humanos , Modelos Logísticos , Modelos Genéticos
13.
Biometrics ; 75(4): 1288-1298, 2019 12.
Artigo em Inglês | MEDLINE | ID: mdl-31009060

RESUMO

Despite major methodological developments, Bayesian inference in Gaussian graphical models remains challenging in high dimension due to the tremendous size of the model space. This article proposes a method to infer the marginal and conditional independence structures between variables by multiple testing, which bypasses the exploration of the model space. Specifically, we introduce closed-form Bayes factors under the Gaussian conjugate model to evaluate the null hypotheses of marginal and conditional independence between variables. Their computation for all pairs of variables is shown to be extremely efficient, thereby allowing us to address large problems with thousands of nodes as required by modern applications. Moreover, we derive exact tail probabilities from the null distributions of the Bayes factors. These allow the use of any multiplicity correction procedure to control error rates for incorrect edge inclusion. We demonstrate the proposed approach on various simulated examples as well as on a large gene expression data set from The Cancer Genome Atlas.


Assuntos
Teorema de Bayes , Modelos Estatísticos , Distribuição Normal , Simulação por Computador , Perfilação da Expressão Gênica , Genes Neoplásicos , Genoma , Humanos
14.
PLoS Genet ; 12(3): e1005908, 2016 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-27015630

RESUMO

Genome-wide association studies (GWAS) have transformed our understanding of the genetics of complex traits such as autoimmune diseases, but how risk variants contribute to pathogenesis remains largely unknown. Identifying genetic variants that affect gene expression (expression quantitative trait loci, or eQTLs) is crucial to addressing this. eQTLs vary between tissues and following in vitro cellular activation, but have not been examined in the context of human inflammatory diseases. We performed eQTL mapping in five primary immune cell types from patients with active inflammatory bowel disease (n = 91), anti-neutrophil cytoplasmic antibody-associated vasculitis (n = 46) and healthy controls (n = 43), revealing eQTLs present only in the context of active inflammatory disease. Moreover, we show that following treatment a proportion of these eQTLs disappear. Through joint analysis of expression data from multiple cell types, we reveal that previous estimates of eQTL immune cell-type specificity are likely to have been exaggerated. Finally, by analysing gene expression data from multiple cell types, we find eQTLs not previously identified by database mining at 34 inflammatory bowel disease-associated loci. In summary, this parallel eQTL analysis in multiple leucocyte subsets from patients with active disease provides new insights into the genetic basis of immune-mediated diseases.


Assuntos
Vasculite Associada a Anticorpo Anticitoplasma de Neutrófilos/genética , Estudos de Associação Genética , Doenças Inflamatórias Intestinais/genética , Locos de Características Quantitativas/genética , Vasculite Associada a Anticorpo Anticitoplasma de Neutrófilos/imunologia , Vasculite Associada a Anticorpo Anticitoplasma de Neutrófilos/patologia , Feminino , Regulação da Expressão Gênica , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Humanos , Doenças Inflamatórias Intestinais/imunologia , Doenças Inflamatórias Intestinais/patologia , Masculino , Monócitos/imunologia , Monócitos/metabolismo , Neutrófilos/imunologia , Neutrófilos/metabolismo , Fenótipo , Linfócitos T/imunologia , Linfócitos T/metabolismo
15.
Bioinformatics ; 33(7): 1104-1106, 2017 04 01.
Artigo em Inglês | MEDLINE | ID: mdl-28062448

RESUMO

Summary: Ontologies are widely used constructs for encoding and analyzing biomedical data, but the absence of simple and consistent tools has made exploratory and systematic analysis of such data unnecessarily difficult. Here we present three packages which aim to simplify such procedures. The ontologyIndex package enables arbitrary ontologies to be read into R, supports representation of ontological objects by native R types, and provides a parsimonius set of performant functions for querying ontologies. ontologySimilarity and ontologyPlot extend ontologyIndex with functionality for straightforward visualization and semantic similarity calculations, including statistical routines. Availability and Implementation: ontologyIndex , ontologyPlot and ontologySimilarity are all available on the Comprehensive R Archive Network website under https://cran.r-project.org/web/packages/ . Contact: Daniel Greene dg333@cam.ac.uk. Supplementary information: Supplementary data are available at Bioinformatics online.


Assuntos
Ontologias Biológicas , Software
16.
Blood ; 127(23): 2903-14, 2016 06 09.
Artigo em Inglês | MEDLINE | ID: mdl-26912466

RESUMO

Macrothrombocytopenia (MTP) is a heterogeneous group of disorders characterized by enlarged and reduced numbers of circulating platelets, sometimes resulting in abnormal bleeding. In most MTP, this phenotype arises because of altered regulation of platelet formation from megakaryocytes (MKs). We report the identification of DIAPH1, which encodes the Rho-effector diaphanous-related formin 1 (DIAPH1), as a candidate gene for MTP using exome sequencing, ontological phenotyping, and similarity regression. We describe 2 unrelated pedigrees with MTP and sensorineural hearing loss that segregate with a DIAPH1 R1213* variant predicting partial truncation of the DIAPH1 diaphanous autoregulatory domain. The R1213* variant was linked to reduced proplatelet formation from cultured MKs, cell clustering, and abnormal cortical filamentous actin. Similarly, in platelets, there was increased filamentous actin and stable microtubules, indicating constitutive activation of DIAPH1. Overexpression of DIAPH1 R1213* in cells reproduced the cytoskeletal alterations found in platelets. Our description of a novel disorder of platelet formation and hearing loss extends the repertoire of DIAPH1-related disease and provides new insight into the autoregulation of DIAPH1 activity.


Assuntos
Proteínas Adaptadoras de Transdução de Sinal/genética , Perda Auditiva/genética , Mutação , Trombocitopenia/genética , Células A549 , Adolescente , Adulto , Idoso , Estudos de Casos e Controles , Células Cultivadas , Criança , Feminino , Forminas , Estudos de Associação Genética , Predisposição Genética para Doença , Células HEK293 , Perda Auditiva/complicações , Humanos , Masculino , Pessoa de Meia-Idade , Linhagem , Polimorfismo de Nucleotídeo Único , Síndrome , Trombocitopenia/complicações , Adulto Jovem
17.
PLoS Genet ; 11(6): e1005272, 2015 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-26106896

RESUMO

Identification of candidate causal variants in regions associated with risk of common diseases is complicated by linkage disequilibrium (LD) and multiple association signals. Nonetheless, accurate maps of these variants are needed, both to fully exploit detailed cell specific chromatin annotation data to highlight disease causal mechanisms and cells, and for design of the functional studies that will ultimately be required to confirm causal mechanisms. We adapted a Bayesian evolutionary stochastic search algorithm to the fine mapping problem, and demonstrated its improved performance over conventional stepwise and regularised regression through simulation studies. We then applied it to fine map the established multiple sclerosis (MS) and type 1 diabetes (T1D) associations in the IL-2RA (CD25) gene region. For T1D, both stepwise and stochastic search approaches identified four T1D association signals, with the major effect tagged by the single nucleotide polymorphism, rs12722496. In contrast, for MS, the stochastic search found two distinct competing models: a single candidate causal variant, tagged by rs2104286 and reported previously using stepwise analysis; and a more complex model with two association signals, one of which was tagged by the major T1D associated rs12722496 and the other by rs56382813. There is low to moderate LD between rs2104286 and both rs12722496 and rs56382813 (r2 ≃ 0:3) and our two SNP model could not be recovered through a forward stepwise search after conditioning on rs2104286. Both signals in the two variant model for MS affect CD25 expression on distinct subpopulations of CD4+ T cells, which are key cells in the autoimmune process. The results support a shared causal variant for T1D and MS. Our study illustrates the benefit of using a purposely designed model search strategy for fine mapping and the advantage of combining disease and protein expression data.


Assuntos
Teorema de Bayes , Mapeamento Cromossômico/métodos , Diabetes Mellitus Tipo 1/genética , Predisposição Genética para Doença , Esclerose Múltipla/genética , Algoritmos , Mapeamento Cromossômico/estatística & dados numéricos , Haplótipos , Humanos , Subunidade alfa de Receptor de Interleucina-2/genética , Desequilíbrio de Ligação , Polimorfismo de Nucleotídeo Único , Processos Estocásticos
18.
Genet Epidemiol ; 40(3): 188-201, 2016 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-27027514

RESUMO

Recently, large scale genome-wide association study (GWAS) meta-analyses have boosted the number of known signals for some traits into the tens and hundreds. Typically, however, variants are only analysed one-at-a-time. This complicates the ability of fine-mapping to identify a small set of SNPs for further functional follow-up. We describe a new and scalable algorithm, joint analysis of marginal summary statistics (JAM), for the re-analysis of published marginal summary statistics under joint multi-SNP models. The correlation is accounted for according to estimates from a reference dataset, and models and SNPs that best explain the complete joint pattern of marginal effects are highlighted via an integrated Bayesian penalized regression framework. We provide both enumerated and Reversible Jump MCMC implementations of JAM and present some comparisons of performance. In a series of realistic simulation studies, JAM demonstrated identical performance to various alternatives designed for single region settings. In multi-region settings, where the only multivariate alternative involves stepwise selection, JAM offered greater power and specificity. We also present an application to real published results from MAGIC (meta-analysis of glucose and insulin related traits consortium) - a GWAS meta-analysis of more than 15,000 people. We re-analysed several genomic regions that produced multiple significant signals with glucose levels 2 hr after oral stimulation. Through joint multivariate modelling, JAM was able to formally rule out many SNPs, and for one gene, ADCY5, suggests that an additional SNP, which transpired to be more biologically plausible, should be followed up with equal priority to the reported index.


Assuntos
Teorema de Bayes , Estudo de Associação Genômica Ampla/métodos , Polimorfismo de Nucleotídeo Único/genética , Adenilil Ciclases/genética , Algoritmos , Simulação por Computador , Jejum/metabolismo , Genômica , Glucose/metabolismo , Humanos , Insulina/metabolismo , Modelos Genéticos , Fenótipo
19.
Bioinformatics ; 32(4): 523-32, 2016 Feb 15.
Artigo em Inglês | MEDLINE | ID: mdl-26504141

RESUMO

MOTIVATION: Analysing the joint association between a large set of responses and predictors is a fundamental statistical task in integrative genomics, exemplified by numerous expression Quantitative Trait Loci (eQTL) studies. Of particular interest are the so-called ': hotspots ': , important genetic variants that regulate the expression of many genes. Recently, attention has focussed on whether eQTLs are common to several tissues, cell-types or, more generally, conditions or whether they are specific to a particular condition. RESULTS: We have implemented MT-HESS, a Bayesian hierarchical model that analyses the association between a large set of predictors, e.g. SNPs, and many responses, e.g. gene expression, in multiple tissues, cells or conditions. Our Bayesian sparse regression algorithm goes beyond ': one-at-a-time ': association tests between SNPs and responses and uses a fully multivariate model search across all linear combinations of SNPs, coupled with a model of the correlation between condition/tissue-specific responses. In addition, we use a hierarchical structure to leverage shared information across different genes, thus improving the detection of hotspots. We show the increase of power resulting from our new approach in an extensive simulation study. Our analysis of two case studies highlights new hotspots that would remain undetected by standard approaches and shows how greater prediction power can be achieved when several tissues are jointly considered. AVAILABILITY AND IMPLEMENTATION: C[Formula: see text] source code and documentation including compilation instructions are available under GNU licence at http://www.mrc-bsu.cam.ac.uk/software/.


Assuntos
Algoritmos , Teorema de Bayes , Regulação da Expressão Gênica , Redes Reguladoras de Genes , Inflamação/genética , Doenças Inflamatórias Intestinais/genética , Locos de Características Quantitativas/genética , Software , Animais , Diabetes Mellitus Tipo 1/genética , Genômica/métodos , Humanos , Modelos Teóricos , Especificidade de Órgãos , Polimorfismo de Nucleotídeo Único/genética , Linguagens de Programação , Ratos , Distribuição Tecidual
20.
Stat Sci ; 32(3): 385-404, 2017 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-28883686

RESUMO

Big Datasets are endemic, but are often notoriously difficult to analyse because of their size, heterogeneity and quality. The purpose of this paper is to open a discourse on the potential for modern decision theoretic optimal experimental design methods, which by their very nature have traditionally been applied prospectively, to improve the analysis of Big Data through retrospective designed sampling in order to answer particular questions of interest. By appealing to a range of examples, it is suggested that this perspective on Big Data modelling and analysis has the potential for wide generality and advantageous inferential and computational properties. We highlight current hurdles and open research questions surrounding efficient computational optimisation in using retrospective designs, and in part this paper is a call to the optimisation and experimental design communities to work together in the field of Big Data analysis.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA