Results 1 - 9 of 9
1.
Mol Plant Pathol ; 25(4): e13451, 2024 Apr.
Article in English | MEDLINE | ID: mdl-38590135

ABSTRACT

When compared with other phylogroups (PGs) of the Pseudomonas syringae species complex, P. syringae pv. syringae (Pss) strains within PG2 have a reduced repertoire of type III effectors (T3Es) but produce several phytotoxins. Effectors within the cherry pathogen Pss 9644 were grouped based on their frequency in strains from Prunus as the conserved effector locus (CEL) common to most P. syringae pathogens; a core of effectors common to PG2; a set of PRUNUS effectors common to cherry pathogens; and a FLEXIBLE set of T3Es. Pss 9644 also contains gene clusters for biosynthesis of toxins syringomycin, syringopeptin and syringolin A. After confirmation of virulence gene expression, mutants with a sequential series of T3E and toxin deletions were pathogenicity tested on wood, leaves and fruits of sweet cherry (Prunus avium) and leaves of ornamental cherry (Prunus incisa). The toxins had a key role in disease development in fruits but were less important in leaves and wood. An effectorless mutant retained some pathogenicity to fruit but not wood or leaves. Striking redundancy was observed amongst effector groups. The CEL effectors have important roles during the early stages of leaf infection and possibly acted synergistically with toxins in all tissues. Deletion of separate groups of T3Es had more effect in P. incisa than in P. avium. Mixed inocula were used to complement the toxin mutations in trans and indicated that strain mixtures may be important in the field. Our results highlight the niche-specific role of toxins in P. avium tissues and the complexity of effector redundancy in the pathogen Pss 9644.


Subjects
Prunus avium , Prunus , Virulence/genetics , Pseudomonas syringae , Prunus avium/metabolism , Fruit/metabolism , Mutation/genetics , Prunus/metabolism , Bacterial Proteins/genetics , Bacterial Proteins/metabolism
2.
Am J Hum Genet ; 109(5): 767-782, 2022 05 05.
Article in English | MEDLINE | ID: mdl-35452592

ABSTRACT

Mendelian randomization and colocalization are two statistical approaches that can be applied to summarized data from genome-wide association studies (GWASs) to understand relationships between traits and diseases. However, despite similarities in scope, they are different in their objectives, implementation, and interpretation, in part because they were developed to serve different scientific communities. Mendelian randomization assesses whether genetic predictors of an exposure are associated with the outcome and interprets an association as evidence that the exposure has a causal effect on the outcome, whereas colocalization assesses whether two traits are affected by the same or distinct causal variants. When considering genetic variants in a single genetic region, both approaches can be performed. While a positive colocalization finding typically implies a non-zero Mendelian randomization estimate, the reverse is not generally true: there are several scenarios which would lead to a non-zero Mendelian randomization estimate but lack evidence for colocalization. These include the existence of distinct but correlated causal variants for the exposure and outcome, which would violate the Mendelian randomization assumptions, and a lack of strong associations with the outcome. As colocalization was developed in the GWAS tradition, typically evidence for colocalization is concluded only when there is strong evidence for associations with both traits. In contrast, a non-zero estimate from Mendelian randomization can be obtained despite only nominally significant genetic associations with the outcome at the locus. In this review, we discuss how the two approaches can provide complementary information on potential therapeutic targets.
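
As a rough illustration of the point about nominally significant outcome associations, the sketch below computes a single-variant Mendelian randomization estimate with the Wald ratio. The abstract does not name an estimator, so the choice of the Wald ratio, the first-order standard error, the function names and the input numbers are all assumptions made for illustration.

```python
import math


def wald_ratio(beta_exposure, beta_outcome, se_outcome):
    """Single-variant Mendelian randomization estimate (Wald ratio).

    Assumes the variant-exposure association is estimated precisely, so the
    standard error uses only the first-order term se_outcome / |beta_exposure|.
    """
    if beta_exposure == 0:
        raise ValueError("variant must be associated with the exposure")
    estimate = beta_outcome / beta_exposure
    std_error = se_outcome / abs(beta_exposure)
    return estimate, std_error


def two_sided_p(estimate, std_error):
    """Two-sided p-value from a normal approximation."""
    z = estimate / std_error
    return math.erfc(abs(z) / math.sqrt(2))


# Hypothetical summary statistics for one variant (not taken from the review).
estimate, std_error = wald_ratio(beta_exposure=0.5, beta_outcome=0.1, se_outcome=0.05)
p_value = two_sided_p(estimate, std_error)
print(f"MR estimate = {estimate:.2f}, SE = {std_error:.2f}, p = {p_value:.3f}")
```

With these made-up inputs the outcome association is only nominally significant (z = 2), yet the ratio estimate is non-zero. A colocalization analysis of the same locus would typically require much stronger evidence for both traits before concluding that a causal variant is shared.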


Subjects
Genome-Wide Association Study , Mendelian Randomization Analysis , Causality , Humans , Phenotype
3.
Clin Transl Immunology ; 11(3): e1379, 2022.
Article in English | MEDLINE | ID: mdl-35284072

ABSTRACT

Objectives: Population-level measures of seropositivity are critical for understanding the epidemiology of an emerging pathogen, yet most antibody tests apply a strict cutoff for seropositivity that is not learnt in a data-driven manner, leading to uncertainty when classifying low-titer responses. To improve upon this, we evaluated cutoff-independent methods for their ability to assign likelihood of SARS-CoV-2 seropositivity to individual samples. Methods: Using robust ELISAs based on SARS-CoV-2 spike (S) and the receptor-binding domain (RBD), we profiled antibody responses in a group of SARS-CoV-2 PCR+ individuals (n = 138). Using these data, we trained probabilistic learners to assign likelihood of seropositivity to test samples of unknown serostatus (n = 5100), identifying a support vector machines-linear discriminant analysis learner (SVM-LDA) suited for this purpose. Results: In the training data from confirmed ancestral SARS-CoV-2 infections, 99% of participants had detectable anti-S and -RBD IgG in the circulation, with titers differing > 1000-fold between persons. In data of otherwise healthy individuals, 7.2% (n = 367) of samples were of uncertain serostatus, with values in the range of 3-6SD from the mean of pre-pandemic negative controls (n = 595). In contrast, SVM-LDA classified 6.4% (n = 328) of test samples as having a high likelihood (> 99% chance) of past infection, 4.5% (n = 230) to have a 50-99% likelihood, and 4.0% (n = 203) to have a 10-49% likelihood. As different probabilistic approaches were more consistent with each other than conventional SD-based methods, such tools allow for more statistically-sound seropositivity estimates in large cohorts. Conclusion: Probabilistic antibody testing frameworks can improve seropositivity estimates in populations with large titer variability.
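
For comparison with the probabilistic approach, the conventional SD-based rule mentioned in the Results can be sketched as follows. The 3 SD and 6 SD bands come from the abstract; treating values above 6 SD as positive and values below 3 SD as negative is an assumption, and the control readings and function names are hypothetical.

```python
from statistics import mean, stdev


def sd_bands(negative_controls, low=3.0, high=6.0):
    """Return the (lower, upper) cutoffs at mean + low*SD and mean + high*SD."""
    mu = mean(negative_controls)
    sd = stdev(negative_controls)  # sample standard deviation
    return mu + low * sd, mu + high * sd


def classify(value, lower, upper):
    """Three-way call: below 3 SD, within the 3-6 SD band, or above 6 SD."""
    if value < lower:
        return "negative"
    if value <= upper:
        return "uncertain"
    return "positive"  # assumption: anything above the upper band is positive


# Hypothetical optical-density readings for pre-pandemic negative controls.
controls = [0.08, 0.10, 0.12, 0.09, 0.11]
lower, upper = sd_bands(controls)
for sample in (0.12, 0.17, 0.50):
    print(sample, classify(sample, lower, upper))
```

A rule of this kind returns a hard label, so every sample in the 3-6 SD band is left unresolved. That band is where a learner that outputs a probability of past infection adds information.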

4.
Genet Epidemiol ; 45(3): 324-337, 2021 04.
Article in English | MEDLINE | ID: mdl-33369784

ABSTRACT

A transcriptome-wide association study (TWAS) attempts to identify disease associated genes by imputing gene expression into a genome-wide association study (GWAS) using an expression quantitative trait loci (eQTL) data set and then testing for associations with a trait of interest. Regulatory processes may be shared across related tissues and one natural extension of TWAS is harnessing cross-tissue correlation in gene expression to improve prediction accuracy. Here, we studied multi-tissue extensions of lasso regression and random forests (RF), joint lasso and RF-MTL (multi-task learning RF), respectively. We found that, on our chosen eQTL data set, multi-tissue methods were generally more accurate than their single-tissue counterparts, with RF-MTL performing the best. Simulations showed that these benefits generally translated into more associated genes identified, although highlighted that joint lasso had a tendency to erroneously identify genes in one tissue if there existed an eQTL signal for that gene in another. Applying the four methods to a type 1 diabetes GWAS, we found that multi-tissue methods found more unique associated genes for most of the tissues considered. We conclude that multi-tissue methods are competitive and, for some cell types, superior to single-tissue approaches and hold much promise for TWAS studies.


Subjects
Genome-Wide Association Study , Transcriptome , Humans , Genetic Models , Phenotype , Quantitative Trait Loci
5.
Nucleic Acids Res ; 48(6): 2866-2879, 2020 04 06.
Article in English | MEDLINE | ID: mdl-32112106

ABSTRACT

Identifying DNA cis-regulatory modules (CRMs) that control the expression of specific genes is crucial for deciphering the logic of transcriptional control. Natural genetic variation can point to the possible gene regulatory function of specific sequences through their allelic associations with gene expression. However, comprehensive identification of causal regulatory sequences in brute-force association testing without incorporating prior knowledge is challenging due to limited statistical power and effects of linkage disequilibrium. Sequence variants affecting transcription factor (TF) binding at CRMs have a strong potential to influence gene regulatory function, which provides a motivation for prioritizing such variants in association testing. Here, we generate an atlas of CRMs showing predicted allelic variation in TF binding affinity in human lymphoblastoid cell lines and test their association with the expression of their putative target genes inferred from Promoter Capture Hi-C and immediate linear proximity. We reveal >1300 CRM TF-binding variants associated with target gene expression, the majority of them undetected with standard association testing. A large proportion of CRMs showing associations with the expression of genes they contact in 3D localize to the promoter regions of other genes, supporting the notion of 'epromoters': dual-action CRMs with promoter and distal enhancer activity.


Subjects
Gene Expression Regulation , Genetic Promoter Regions , Transcription Factors/metabolism , Base Sequence , Binding Sites , Chromatin/metabolism , Reporter Genes , Protein Binding , Quantitative Trait Loci/genetics , Genetic Transcription
6.
Mach Learn ; 109(2): 251-277, 2020.
Article in English | MEDLINE | ID: mdl-32174648

ABSTRACT

In phenotype prediction the physical characteristics of an organism are predicted from knowledge of its genotype and environment. Such studies, often called genome-wide association studies, are of the highest societal importance, as they are of central importance to medicine, crop-breeding, etc. We investigated three phenotype prediction problems: one simple and clean (yeast), and the other two complex and real-world (rice and wheat). We compared standard machine learning methods; elastic net, ridge regression, lasso regression, random forest, gradient boosting machines (GBM), and support vector machines (SVM), with two state-of-the-art classical statistical genetics methods; genomic BLUP and a two-step sequential method based on linear regression. Additionally, using the clean yeast data, we investigated how performance varied with the complexity of the biological mechanism, the amount of observational noise, the number of examples, the amount of missing data, and the use of different data representations. We found that for almost all the phenotypes considered, standard machine learning methods outperformed the methods from classical statistical genetics. On the yeast problem, the most successful method was GBM, followed by lasso regression, and the two statistical genetics methods; with greater mechanistic complexity GBM was best, while in simpler cases lasso was superior. In the wheat and rice studies the best two methods were SVM and BLUP. The most robust method in the presence of noise, missing data, etc. was random forests. The classical statistical genetics method of genomic BLUP was found to perform well on problems where there was population structure. This suggests that standard machine learning methods need to be refined to include population structure information when this is present. 
We conclude that the application of machine learning methods to phenotype prediction problems holds great promise, but that determining which methods are likely to perform well on any given problem is elusive and non-trivial.

7.
Am J Hum Genet ; 105(6): 1076-1090, 2019 12 05.
Article in English | MEDLINE | ID: mdl-31679650

ABSTRACT

Cytokines are essential regulatory components of the immune system, and their aberrant levels have been linked to many disease states. Despite increasing evidence that cytokines operate in concert, many of the physiological interactions between cytokines, and the shared genetic architecture that underlies them, remain unknown. Here, we aimed to identify and characterize genetic variants with pleiotropic effects on cytokines. Using three population-based cohorts (n = 9,263), we performed multivariate genome-wide association studies (GWAS) for a correlation network of 11 circulating cytokines, then combined our results in meta-analysis. We identified a total of eight loci significantly associated with the cytokine network, of which two (PDGFRB and ABO) had not been detected previously. In addition, conditional analyses revealed a further four secondary signals at three known cytokine loci. Integration, through the use of Bayesian colocalization analysis, of publicly available GWAS summary statistics with the cytokine network associations revealed shared causal variants between the eight cytokine loci and other traits; in particular, cytokine network variants at the ABO, SERPINE2, and ZFPM2 loci showed pleiotropic effects on the production of immune-related proteins, on metabolic traits such as lipoprotein and lipid levels, on blood-cell-related traits such as platelet count, and on disease traits such as coronary artery disease and type 2 diabetes.


Subjects
Biomarkers/analysis , Cardiovascular Diseases/genetics , Cytokines/genetics , Genetic Pleiotropy , Genome-Wide Association Study , Single Nucleotide Polymorphism , Quantitative Trait Loci , Adolescent , Adult , Aged , Blood Proteins/genetics , Blood Proteins/immunology , Cardiovascular Diseases/immunology , Cardiovascular Diseases/pathology , Child , Cytokines/immunology , Female , Follow-Up Studies , Gene Regulatory Networks , Genetic Predisposition to Disease , Human Genome , Humans , Longitudinal Studies , Male , Middle Aged , Prognosis , Prospective Studies , Young Adult
8.
Nat Commun ; 10(1): 3216, 2019 07 19.
Article in English | MEDLINE | ID: mdl-31324808

ABSTRACT

Thousands of genetic variants are associated with human disease risk, but linkage disequilibrium (LD) hinders fine-mapping the causal variants. Both lack of power, and joint tagging of two or more distinct causal variants by a single non-causal SNP, lead to inaccuracies in fine-mapping, with stochastic search more robust than stepwise. We develop a computationally efficient multinomial fine-mapping (MFM) approach that borrows information between diseases in a Bayesian framework. We show that MFM has greater accuracy than single disease analysis when shared causal variants exist, and negligible loss of precision otherwise. MFM analysis of six immune-mediated diseases reveals causal variants undetected in individual disease analysis, including in IL2RA where we confirm functional effects of multiple causal variants using allele-specific expression in sorted CD4+ T cells from genotype-selected individuals. MFM has the potential to increase fine-mapping resolution in related diseases enabling the identification of associated cellular and molecular phenotypes.


Subjects
Autoimmunity/genetics , Genetic Association Studies/methods , Genetic Predisposition to Disease/genetics , Genome-Wide Association Study/methods , Genetic Models , Alleles , Bayes Theorem , CD4-Positive T-Lymphocytes , CTLA-4 Antigen/genetics , Chromosome Mapping , Gene Expression Regulation , Genotype , Humans , Interleukin-2 Receptor alpha Subunit/genetics , Linkage Disequilibrium , Phenotype , Single Nucleotide Polymorphism
9.
Front Plant Sci ; 7: 133, 2016.
Article in English | MEDLINE | ID: mdl-26904088

ABSTRACT

Perennial ryegrass (Lolium perenne L.) is one of the most widely grown forage grasses in temperate agriculture. In order to maintain and increase its usage as forage in livestock agriculture, there is a continued need for improvement in biomass yield, quality, disease resistance, and seed yield. Genetic gain for traits such as biomass yield has been relatively modest. This has been attributed to its long breeding cycle, and the necessity to use population based breeding methods. Thanks to recent advances in genotyping techniques there is increasing interest in genomic selection (GS), from which genomically estimated breeding values are derived. In this paper we compare the classical RRBLUP model with state-of-the-art machine learning techniques that should lend themselves easily to use in GS and demonstrate their application to predicting quantitative traits in a breeding population of L. perenne. Prediction accuracies varied from 0 to 0.59 depending on trait, prediction model and composition of the training population. The BLUP model produced the highest prediction accuracies for most traits and training populations. Forage quality traits had the highest accuracies compared to yield related traits. There appeared to be no clear pattern to the effect of the training population composition on the prediction accuracies. The heritability of the forage quality traits was generally higher than for the yield related traits, and could partly explain the difference in accuracy. Some population structure was evident in the breeding populations, and probably contributed to the varying effects of training population on the predictions. The average linkage disequilibrium between adjacent markers ranged from 0.121 to 0.215. A higher marker density and a larger training population closely related to the test population are likely to improve the prediction accuracy.
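
The average linkage disequilibrium between adjacent markers can be sketched as below. The abstract does not state which LD statistic was used; this sketch assumes the squared Pearson correlation (r²) between allele dosages coded 0/1/2, and the genotype data and function names are hypothetical.

```python
from statistics import mean


def r_squared(x, y):
    """Squared Pearson correlation between two markers' allele dosages."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0  # a monomorphic marker carries no LD information
    return (sxy * sxy) / (sxx * syy)


def mean_adjacent_ld(markers):
    """Average r^2 over consecutive marker pairs, given markers in map order.

    Needs at least two markers; statistics.mean raises StatisticsError otherwise.
    """
    return mean(r_squared(a, b) for a, b in zip(markers, markers[1:]))


# Hypothetical dosages: one list per marker, one entry per plant.
markers = [
    [0, 1, 2, 1, 0, 2],
    [0, 1, 2, 2, 0, 1],
    [2, 0, 1, 1, 2, 0],
]
print(round(mean_adjacent_ld(markers), 3))
```

Low average values such as those reported mean that neighbouring markers tag each other only weakly, which is consistent with the abstract's suggestion that a higher marker density should help prediction.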
