Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 54
Filtrar
Más filtros

Base de datos
Tipo del documento
Intervalo de año de publicación
1.
Genes (Basel) ; 15(9)2024 Sep 07.
Artículo en Inglés | MEDLINE | ID: mdl-39336765

RESUMEN

Gene-based association analysis is a powerful tool for identifying genes that explain trait variability. An essential step of this analysis is a conditional analysis. It aims to eliminate the influence of SNPs outside the gene, which are in linkage disequilibrium with intragenic SNPs. The popular conditional analysis method, GCTA-COJO, accounts for the influence of several top independently associated SNPs outside the gene, correcting the z statistics for intragenic SNPs. We suggest a new TauCOR method for conditional gene-based analysis using summary statistics. This method accounts the influence of the full regional polygenic background, correcting the genotype correlations between intragenic SNPs. As a result, the distribution of z statistics for intragenic SNPs becomes conditionally independent of distribution for extragenic SNPs. TauCOR is compatible with any gene-based association test. TauCOR was tested on summary statistics simulated under different scenarios and on real summary statistics for a 'gold standard' gene list from the Open Targets Genetics project. TauCOR proved to be effective in all modelling scenarios and on real data. The TauCOR's strategy showed comparable sensitivity and higher specificity and accuracy than GCTA-COJO on both simulated and real data. The method can be successfully used to improve the effectiveness of gene-based association analyses.


Asunto(s)
Desequilibrio de Ligamiento , Herencia Multifactorial , Polimorfismo de Nucleótido Simple , Polimorfismo de Nucleótido Simple/genética , Herencia Multifactorial/genética , Humanos , Estudio de Asociación del Genoma Completo/métodos , Modelos Genéticos , Genotipo
2.
Plant Cell Rep ; 43(6): 156, 2024 May 31.
Artículo en Inglés | MEDLINE | ID: mdl-38819495

RESUMEN

KEY MESSAGE: In current study candidate gene (261 genes) based association mapping on 144 pigeonpea accessions for flowering time and related traits and 29 MTAs producing eight superior haplotypes were identified. In the current study, we have conducted an association analysis for flowering-associated traits in a diverse pigeonpea mini-core collection comprising 144 accessions using the SNP data of 261 flowering-related genes. In total, 13,449 SNPs were detected in the current study, which ranged from 743 (ICP10228) to 1469 (ICP6668) among the individuals. The nucleotide diversity (0.28) and Watterson estimates (0.34) reflected substantial diversity, while Tajima's D (-0.70) indicated the abundance of rare alleles in the collection. A total of 29 marker trait associations (MTAs) were identified, among which 19 were unique to days to first flowering (DOF) and/or days to fifty percent flowering (DFF), 9 to plant height (PH), and 1 to determinate (Det) growth habit using 3 years of phenotypic data. Among these MTAs, six were common to DOF and/or DFF, and four were common to DOF/DFF along with the PH, reflecting their pleiotropic action. These 29 MTAs spanned 25 genes, among which 10 genes clustered in the protein-protein network analysis, indicating their concerted involvement in floral induction. Furthermore, we identified eight haplotypes, four of which regulate late flowering, while the remaining four regulate early flowering using the MTAs. Interestingly, haplotypes conferring late flowering (H001, H002, and H008) were found to be taller, while those involved in early flowering (H003) were shorter in height. The expression pattern of these genes, as inferred from the transcriptome data, also underpinned their involvement in floral induction. The haplotypes identified will be highly useful to the pigeonpea breeding community for haplotype-based breeding.


Asunto(s)
Cajanus , Flores , Haplotipos , Polimorfismo de Nucleótido Simple , Flores/genética , Flores/fisiología , Flores/crecimiento & desarrollo , Haplotipos/genética , Cajanus/genética , Cajanus/crecimiento & desarrollo , Polimorfismo de Nucleótido Simple/genética , Genes de Plantas/genética , Fenotipo , Regulación de la Expresión Génica de las Plantas , Estudios de Asociación Genética , Sitios de Carácter Cuantitativo/genética
3.
Genet Epidemiol ; 48(3): 103-113, 2024 04.
Artículo en Inglés | MEDLINE | ID: mdl-38317324

RESUMEN

Genome-wide association studies (GWAS) have led to rapid growth in detecting genetic variants associated with various phenotypes. Owing to a great number of publicly accessible GWAS summary statistics, and the difficulty in obtaining individual-level genotype data, many existing gene-based association tests have been adapted to require only GWAS summary statistics rather than individual-level data. However, these association tests are restricted to unrelated individuals and thus do not apply to family samples directly. Moreover, due to its flexibility and effectiveness, the linear mixed model has been increasingly utilized in GWAS to handle correlated data, such as family samples. However, it remains unknown how to perform gene-based association tests in family samples using the GWAS summary statistics estimated from the linear mixed model. In this study, we show that, when family size is negligible compared to the total sample size, the diagonal block structure of the kinship matrix makes it possible to approximate the correlation matrix of marginal Z scores by linkage disequilibrium matrix. Based on this result, current methods utilizing summary statistics for unrelated individuals can be directly applied to family data without any modifications. Our simulation results demonstrate that this proposed strategy controls the type 1 error rate well in various situations. Finally, we exemplify the usefulness of the proposed approach with a dental caries GWAS data set.


Asunto(s)
Caries Dental , Estudio de Asociación del Genoma Completo , Humanos , Estudio de Asociación del Genoma Completo/métodos , Polimorfismo de Nucleótido Simple , Modelos Genéticos , Fenotipo
4.
Genes (Basel) ; 14(12)2023 12 12.
Artículo en Inglés | MEDLINE | ID: mdl-38137023

RESUMEN

The identification of superior haplotypes and haplotype combinations is essential for haplotype-based breeding (HBB), which provides selection targets for genomics-assisted breeding. In this study, genotypes of 42 functional genes in rice were analyzed by targeted capture sequencing in a panel of 180 Indica rice accessions. In total, 69 SNPs/Indels in seven genes were detected to be associated with grain length (GL), grain width (GW), ratio of grain length-width (L/W) and thousand-grain weight (TGW) using candidate gene-based association analysis, including BG1 and GS3 for GL, GW5 for GW, BG1 and GW5 for L/W, and AET1, SNAC1, qTGW3, DHD1 and GW5 for TGW. Furthermore, two haplotypes were identified for each of the seven genes according to these associated SNPs/Indels, and the amount of genetic variation explained by different haplotypes ranged from 3.24% to 27.66%. Additionally, three, three and eight haplotype combinations for GL, L/W and TGW explained 25.38%, 5.5% and 22.49% of the total genetic variation for each trait, respectively. Further analysis showed that Minghui63 had the superior haplotype combination Haplotype Combination 4 (HC4) for TGW. The most interesting finding was that some widely used restorer lines derived from Minghui63 also have the superior haplotype combination HC4, and our breeding varieties and lines using the haplotype-specific marker panel also confirmed that the TGW of the lines was much higher than that of their sister lines without HC4, suggesting that TGW-HC4 is the superior haplotype combination for TGW and can be utilized in rice breeding.


Asunto(s)
Oryza , Oryza/genética , Haplotipos , Alelos , Fitomejoramiento , Grano Comestible/genética
5.
Res Sq ; 2023 Sep 14.
Artículo en Inglés | MEDLINE | ID: mdl-37790478

RESUMEN

Background: Despite thousands of variants identified by genome-wide association studies (GWAS) to be associated with autism spectrum disorder (ASD), it is unclear which mutations are causal because most are noncoding. Consequently, reliable diagnostic biomarkers are lacking. RNA-seq analysis captures biomolecular complexity that GWAS cannot by considering transcriptomic patterns. Therefore, integrating DNA and RNA testing may reveal causal genes and useful biomarkers for ASD. Methods: We performed gene-based association studies using an adaptive test method with GWAS summary statistics from two large Psychiatric Genomics Consortium (PGC) datasets (ASD2019: 18,382 cases and 27,969 controls; ASD2017: 6,197 cases and 7,377 controls). We also investigated differential expression for genes identified with the adaptive test using an RNA-seq dataset (GSE30573: 3 cases and 3 controls) and DESeq2. Results: We identified 5 genes significantly associated with ASD in ASD2019 (KIZ-AS1, p = 8.67×10- 10; KIZ, p = 1.16×10- 9; XRN2, p = 7.73×10- 9; SOX7, p = 2.22×10- 7; LOC101929229 (also known as PINX1-DT), p = 2.14×10- 6). Two of the five genes were replicated in ASD2017: SOX7 (p = 0.00087) and LOC101929229 (p = 0.009), and KIZ was close to the replication boundary of replication (p = 0.06). We identified significant expression differences for SOX7 (p = 0.0017, adjusted p = 0.0085), LOC101929229 (p = 5.83×10- 7, adjusted p = 1.18×10- 5), and KIZ (p = 0.00099, adjusted p = 0.0055). SOX7 encodes a transcription factor that regulates developmental pathways, alterations in which may contribute to ASD. Limitations: The limitation of the gene-based analysis is the reliance on a reference population for estimating linkage disequilibrium between variants. The similarity of this reference population to the population of study is crucial to the accuracy of many gene-based analyses, including those performed in this study. As a result, the extent of our findings is limited to European populations, as this was our reference of choice. Future work includes a tighter integration of DNA and RNA information as well as extensions to non-European populations that have been under-researched. Conclusions: These findings suggest that SOX7 and its related SOX family genes encode transcription factors that are critical to the downregulation of the canonical Wnt/ß-catenin signaling pathway, an important developmental signaling pathway, providing credence to the biologic plausibility of the association between gene SOX7 and autism spectrum disorder.

6.
Genet Epidemiol ; 47(7): 475-495, 2023 10.
Artículo en Inglés | MEDLINE | ID: mdl-37341229

RESUMEN

The risk of congenital heart defects (CHDs) may be influenced by maternal genes, fetal genes, and their interactions. Existing methods commonly test the effects of maternal and fetal variants one-at-a-time and may have reduced statistical power to detect genetic variants with low minor allele frequencies. In this article, we propose a gene-based association test of interactions for maternal-fetal genotypes (GATI-MFG) using a case-mother and control-mother design. GATI-MFG can integrate the effects of multiple variants within a gene or genomic region and evaluate the joint effect of maternal and fetal genotypes while allowing for their interactions. In simulation studies, GATI-MFG had improved statistical power over alternative methods, such as the single-variant test and functional data analysis (FDA) under various disease scenarios. We further applied GATI-MFG to a two-phase genome-wide association study of CHDs for the testing of both common variants and rare variants using 947 CHD case mother-infant pairs and 1306 control mother-infant pairs from the National Birth Defects Prevention Study (NBDPS). After Bonferroni adjustment for 23,035 genes, two genes on chromosome 17, TMEM107 (p = 1.64e-06) and CTC1 (p = 2.0e-06), were identified for significant association with CHD in common variants analysis. Gene TMEM107 regulates ciliogenesis and ciliary protein composition and was found to be associated with heterotaxy. Gene CTC1 plays an essential role in protecting telomeres from degradation, which was suggested to be associated with cardiogenesis. Overall, GATI-MFG outperformed the single-variant test and FDA in the simulations, and the results of application to NBDPS samples are consistent with existing literature supporting the association of TMEM107 and CTC1 with CHDs.


Asunto(s)
Estudio de Asociación del Genoma Completo , Cardiopatías Congénitas , Femenino , Humanos , Modelos Genéticos , Genotipo , Cardiopatías Congénitas/genética , Madres , Estudios de Casos y Controles
7.
Brief Bioinform ; 24(4)2023 07 20.
Artículo en Inglés | MEDLINE | ID: mdl-37332016

RESUMEN

Trans-ethnic genome-wide association studies have revealed that many loci identified in European populations can be reproducible in non-European populations, indicating widespread trans-ethnic genetic similarity. However, how to leverage such shared information more efficiently in association analysis is less investigated for traits in underrepresented populations. We here propose a statistical framework, trans-ethnic genetic risk score informed gene-based association mixed model (GAMM), by hierarchically modeling single-nucleotide polymorphism effects in the target population as a function of effects of the same trait in well-studied populations. GAMM powerfully integrates genetic similarity across distinct ancestral groups to enhance power in understudied populations, as confirmed by extensive simulations. We illustrate the usefulness of GAMM via the application to 13 blood cell traits (i.e. basophil count, eosinophil count, hematocrit, hemoglobin concentration, lymphocyte count, mean corpuscular hemoglobin, mean corpuscular hemoglobin concentration, mean corpuscular volume, monocyte count, neutrophil count, platelet count, red blood cell count and total white blood cell count) in Africans of the UK Biobank (n = 3204) while utilizing genetic overlap shared in Europeans (n = 746 667) and East Asians (n = 162 255). We discovered multiple new associated genes, which had otherwise been missed by existing methods, and revealed that the trans-ethnic information indirectly contributed much to the phenotypic variance. Overall, GAMM represents a flexible and powerful statistical framework of association analysis for complex traits in underrepresented populations by integrating trans-ethnic genetic similarity across well-studied populations, and helps attenuate health inequities in current genetics research for people of minority populations.


Asunto(s)
Estudio de Asociación del Genoma Completo , Modelos Genéticos , Herencia Multifactorial , Humanos , Estudio de Asociación del Genoma Completo/métodos , Hemoglobinas/genética , Herencia Multifactorial/genética , Polimorfismo de Nucleótido Simple/genética , Factores de Riesgo , Predisposición Genética a la Enfermedad/etnología , Predisposición Genética a la Enfermedad/genética , Células Sanguíneas , Reino Unido , Pueblo Africano/genética , Pueblos del Este de Asia/genética , Pueblo Europeo/genética
8.
Am J Hum Genet ; 110(1): 23-29, 2023 01 05.
Artículo en Inglés | MEDLINE | ID: mdl-36480927

RESUMEN

We present LDAK-GBAT, a tool for gene-based association testing using summary statistics from genome-wide association studies that is computationally efficient, produces well-calibrated p values, and is significantly more powerful than existing tools. LDAK-GBAT takes approximately 30 min to analyze imputed data (2.9M common, genic SNPs), requiring less than 10 Gb memory. It shows good control of type 1 error given an appropriate reference panel. Across 109 phenotypes (82 from the UK Biobank, 18 from the Million Veteran Program, and nine from the Psychiatric Genetics Consortium), LDAK-GBAT finds on average 19% (SE: 1%) more significant genes than the existing tool sumFREGAT-ACAT, with even greater gains in comparison with MAGMA, GCTA-fastBAT, sumFREGAT-SKAT-O, and sumFREGAT-PCA.


Asunto(s)
Pruebas Genéticas , Estudio de Asociación del Genoma Completo , Fenotipo , Polimorfismo de Nucleótido Simple/genética
9.
Int J Mol Sci ; 23(21)2022 Nov 04.
Artículo en Inglés | MEDLINE | ID: mdl-36362342

RESUMEN

Genome-wide association study (GWAS) of Juvenile idiopathic arthritis (JIA) suffers from low power due to limited sample size and the interpretation challenge due to most signals located in non-coding regions. Gene-level analysis could alleviate these issues. Using GWAS summary statistics, we performed two typical gene-level analysis of JIA, transcriptome-wide association studies (TWAS) using FUnctional Summary-based ImputatiON (FUSION) and gene-based analysis using eQTL Multi-marker Analysis of GenoMic Annotation (eMAGMA), followed by comprehensive enrichment analysis. Among 33 overlapped significant genes from these two methods, 11 were previously reported, including TYK2 (PFUSION = 5.12 × 10-6, PeMAGMA = 1.94 × 10-7 for whole blood), IL-6R (PFUSION = 8.63 × 10-7, PeMAGMA = 2.74 × 10-6 for cells EBV-transformed lymphocytes), and Fas (PFUSION = 5.21 × 10-5, PeMAGMA = 1.08 × 10-6 for muscle skeletal). Some newly plausible JIA-associated genes are also reported, including IL-27 (PFUSION = 2.10 × 10-7, PeMAGMA = 3.93 × 10-8 for Liver), LAT (PFUSION = 1.53 × 10-4, PeMAGMA = 4.62 × 10-7 for Artery Aorta), and MAGI3 (PFUSION = 1.30 × 10-5, PeMAGMA = 1.73 × 10-7 for Muscle Skeletal). Enrichment analysis further highlighted 4 Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways and 10 Gene Ontology (GO) terms. Our findings can benefit the understanding of genetic determinants and potential therapeutic targets for JIA.


Asunto(s)
Artritis Juvenil , Transcriptoma , Humanos , Estudio de Asociación del Genoma Completo/métodos , Artritis Juvenil/genética , ARN Mensajero/genética , Ontología de Genes , Predisposición Genética a la Enfermedad , Polimorfismo de Nucleótido Simple
10.
Front Plant Sci ; 13: 1015151, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-36226300

RESUMEN

Lead (Pb) is a highly toxic contaminant to living organisms and the environment. Excessive Pb in soils affects crop yield and quality, thus threatening human health via the food chain. Herein, we investigated Pb tolerance among a maize association panel using root bushiness (BSH) under Pb treatment as an indicator. Through a genome-wide association study of relative BSH, we identified four single nucleotide polymorphisms (SNPs) and 30 candidate genes associated with Pb tolerance in maize seedlings. Transcriptome analysis showed that four of the 30 genes were differentially responsive to Pb treatment between two maize lines with contrasting Pb tolerance. Among these, the ZmbZIP107 transcription factor was confirmed as the key gene controlling maize tolerance to Pb by using gene-based association studies. Two 5' UTR_variants in ZmbZIP107 affected its expression level and Pb tolerance among different maize lines. ZmbZIP107 protein was specifically targeted to the nucleus and ZmbZIP107 mRNA showed the highest expression in maize seedling roots among different tissues. Heterologous expression of ZmbZIP107 enhanced rice tolerance to Pb stress and decreased Pb absorption in the roots. Our study provided the basis for revelation of the molecular mechanism underlying Pb tolerance and contributed to cultivation of Pb-tolerant varieties in maize.

11.
Plants (Basel) ; 11(11)2022 Jun 02.
Artículo en Inglés | MEDLINE | ID: mdl-35684271

RESUMEN

Malate dehydrogenase (MDH) is widely present in nature and regulates plant growth and development, as well as playing essential roles, especially in abiotic stress responses. Nevertheless, there is no comprehensive knowledge to date on MDH family members in rice. In this study, a total of 12 MDH members in rice were identified through genome-wide analysis and divided into three groups on the basis of their phylogenetic relationship and protein-conserved motifs. Evolutionary analysis showed that MDH proteins from rice, maize and wheat shared a close phylogenetic relationship, and the MDH family was conserved in the long-term process of domestication. We identified two segmental duplication events involving four genes, which could be the major force driving the expansion of the OsMDH family. The expression profile, cis-regulatory elements and qRT-PCR results of these genes revealed that a few OsMDH showed high tissue specificity, almost all of which had stress response elements in the promoter region, and ten MDH members were significantly induced by salt stress. Through gene-based association analysis, we found a significant correlation between salt tolerance at the seedling stage and the genetic variation of OsMDH8.1 and OsMDH12.1. Additionally, we found that the polymorphism in the promoter region of OsMDH8.1 might be related to the salt tolerance of rice. This study aimed to provide valuable information on the functional study of the rice MDH gene family related to salt stress response and revealed that OsMDH8.1 might be an important gene for the cultivar improvement of salt tolerance in rice.

12.
J Transl Med ; 20(1): 227, 2022 05 14.
Artículo en Inglés | MEDLINE | ID: mdl-35568861

RESUMEN

BACKGROUND: Previous studies demonstrated a positive relationship between birthweight and breast cancer; however, inconsistent, sometimes even controversial, observations also emerged, and the nature of such relationship remains unknown. METHODS: Using summary statistics of birthweight and breast cancer, we assessed the fetal/maternal-specific genetic correlation between them via LDSC and prioritized fetal/maternal-specific pleiotropic genes through MAIUP. Relying on summary statistics we conducted Mendelian randomization (MR) to evaluate the fetal/maternal-specific origin of causal relationship between birthweight, age of menarche, age at menopause and breast cancer. RESULTS: With summary statistics we identified a positive genetic correlation between fetal-specific birthweight and breast cancer (rg = 0.123 and P = 0.013) as well as a negative but insignificant correlation between maternal-specific birthweight and breast cancer (rg = - 0.068, P = 0.206); and detected 84 pleiotropic genes shared by fetal-specific birthweight and breast cancer, 49 shared by maternal-specific birthweight and breast cancer. We also revealed fetal-specific birthweight indirectly influenced breast cancer risk in adulthood via the path of age of menarche or age at menopause in terms of MR-based mediation analysis. CONCLUSION: This study reveals that shared genetic foundation and causal mediation commonly drive the connection between the two traits, and that fetal/maternal-specific birthweight plays substantially distinct roles in such relationship. However, our work offers little supportive evidence for the fetal origins hypothesis of breast cancer originating in utero.


Asunto(s)
Neoplasias de la Mama , Estudio de Asociación del Genoma Completo , Adulto , Peso al Nacer/genética , Neoplasias de la Mama/genética , Causalidad , Femenino , Humanos , Análisis de la Aleatorización Mendeliana , Polimorfismo de Nucleótido Simple
13.
Elife ; 112022 04 12.
Artículo en Inglés | MEDLINE | ID: mdl-35412455

RESUMEN

Linkage disequilibrium and disease-associated variants in the non-coding regions make it difficult to distinguish the truly associated genes from the redundantly associated genes for complex diseases. In this study, we proposed a new conditional gene-based framework called eDESE that leveraged an improved effective chi-squared statistic to control the type I error rates and remove the redundant associations. eDESE initially performed the association analysis by mapping variants to genes according to their physical distance. We further demonstrated that the isoform-level eQTLs could be more powerful than the gene-level eQTLs in the association analysis using a simulation study. Then the eQTL-guided strategies, that is, mapping variants to genes according to their gene/isoform-level variant-gene cis-eQTLs associations, were also integrated with eDESE. We then applied eDESE to predict the potential susceptibility genes of schizophrenia and found that the potential susceptibility genes were enriched with many neuronal or synaptic signaling-related terms in the Gene Ontology knowledgebase and antipsychotics-gene interaction terms in the drug-gene interaction database (DGIdb). More importantly, seven potential susceptibility genes identified by eDESE were the target genes of multiple antipsychotics in DrugBank. Comparing the potential susceptibility genes identified by eDESE and other benchmark approaches (i.e., MAGMA and S-PrediXcan) implied that strategy based on the isoform-level eQTLs could be an important supplement for the other two strategies (physical distance and gene-level eQTLs). We have implemented eDESE in our integrative platform KGGSEE (http://pmglab.top/kggsee/#/) and hope that eDESE can facilitate the prediction of candidate susceptibility genes and isoforms for complex diseases in a multi-tissue context.


Asunto(s)
Predisposición Genética a la Enfermedad , Sitios de Carácter Cuantitativo , Esquizofrenia , Antipsicóticos/uso terapéutico , Predisposición Genética a la Enfermedad/genética , Estudio de Asociación del Genoma Completo , Humanos , Polimorfismo de Nucleótido Simple , Isoformas de Proteínas/genética , Sitios de Carácter Cuantitativo/genética , Esquizofrenia/genética
14.
Int J Mol Sci ; 23(3)2022 Jan 19.
Artículo en Inglés | MEDLINE | ID: mdl-35162989

RESUMEN

Grain shape is an important agronomic character of rice, which affects the appearance, processing, and the edible quality. Screening and identifying more new genes associated with grain shape is beneficial to further understanding the genetic basis of grain shape and provides more gene resources for genetic breeding. This study has a natural population containing 623 indica rice cultivars. Genome-wide association studies/GWAS of several traits related to grain shape (grain length/GL, grain width/GW, grain length to width ratio/GLWR, grain circumferences/GC, and grain size/grain area/GS) were conducted by combining phenotypic data from four environments and the second-generation resequencing data, which have identified 39 important Quantitative trait locus/QTLs. We analyzed the 39 QTLs using three methods: gene-based association analysis, haplotype analysis, and functional annotation and identified three cloned genes (GS3, GW5, OsDER1) and seven new candidate genes in the candidate interval. At the same time, to effectively utilize the genes in the grain shape-related gene bank, we have also analyzed the allelic combinations of the three cloned genes. Finally, the extreme allele combination corresponding to each trait was found through statistical analysis. This study's novel candidate genes and allele combinations will provide a valuable reference for future breeding work.


Asunto(s)
Oryza , Alelos , Grano Comestible/genética , Estudio de Asociación del Genoma Completo , Oryza/genética , Fitomejoramiento , Sitios de Carácter Cuantitativo
15.
Brief Bioinform ; 23(1)2022 01 17.
Artículo en Inglés | MEDLINE | ID: mdl-34571531

RESUMEN

Pleiotropy has important implication on genetic connection among complex phenotypes and facilitates our understanding of disease etiology. Genome-wide association studies provide an unprecedented opportunity to detect pleiotropic associations; however, efficient pleiotropy test methods are still lacking. We here consider pleiotropy identification from a methodological perspective of high-dimensional composite null hypothesis and propose a powerful gene-based method called MAIUP. MAIUP is constructed based on the traditional intersection-union test with two sets of independent P-values as input and follows a novel idea that was originally proposed under the high-dimensional mediation analysis framework. The key improvement of MAIUP is that it takes the composite null nature of pleiotropy test into account by fitting a three-component mixture null distribution, which can ultimately generate well-calibrated P-values for effective control of family-wise error rate and false discover rate. Another attractive advantage of MAIUP is its ability to effectively address the issue of overlapping subjects commonly encountered in association studies. Simulation studies demonstrate that compared with other methods, only MAIUP can maintain correct type I error control and has higher power across a wide range of scenarios. We apply MAIUP to detect shared associated genes among 14 psychiatric disorders with summary statistics and discover many new pleiotropic genes that are otherwise not identified if failing to account for the issue of composite null hypothesis testing. Functional and enrichment analyses offer additional evidence supporting the validity of these identified pleiotropic genes associated with psychiatric disorders. Overall, MAIUP represents an efficient method for pleiotropy identification.


Asunto(s)
Estudio de Asociación del Genoma Completo , Trastornos Mentales , Simulación por Computador , Pleiotropía Genética , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo/métodos , Humanos , Trastornos Mentales/genética , Fenotipo , Polimorfismo de Nucleótido Simple
16.
Clin Genet ; 101(1): 78-86, 2022 01.
Artículo en Inglés | MEDLINE | ID: mdl-34687547

RESUMEN

Genome-wide association study (GWAS) have identified more than 300 single nucleotide polymorphisms at 163 independent loci associated with coronary artery disease (CAD). However, there is no full understanding about the causal genes for CAD and the mechanisms of their action. We aimed to perform a post GWAS analysis to identify genes whose polymorphism may influence the risk of CAD. Using the UK Biobank GWAS summary statistics, we performed a gene-based association analysis. We found 63 genes significantly associated with CAD due to their within-gene polymorphisms. Many of these genes are well known. Some known CAD genes such as FURIN and SORT1 did not show the gene-based association because their variants had low GWAS signals or gene-based association was inflated by the strong GWAS signal outside the gene. For several known CAD genes, we demonstrated that their effects could be explained not only or not at all by their own variants but by the variants within the neighboring genes controlling their expression. Using several bioinformatics techniques, we suggested potential mechanisms underlying gene-CAD associations. Three genes, CDK19, NCALD, and ARHGEF12 were not previously associated with CAD. The role of these genes should be clarified in further studies.


Asunto(s)
Alelos , Biología Computacional/métodos , Enfermedad de la Arteria Coronaria/genética , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo/métodos , Bancos de Muestras Biológicas , Enfermedad de la Arteria Coronaria/diagnóstico , Bases de Datos Genéticas , Estudios de Asociación Genética , Genómica/métodos , Humanos , Fenotipo , Polimorfismo de Nucleótido Simple , Reino Unido
17.
Am J Hum Genet ; 109(1): 81-96, 2022 01 06.
Artículo en Inglés | MEDLINE | ID: mdl-34932938

RESUMEN

Large-scale gene sequencing studies for complex traits have the potential to identify causal genes with therapeutic implications. We performed gene-based association testing of blood lipid levels with rare (minor allele frequency < 1%) predicted damaging coding variation by using sequence data from >170,000 individuals from multiple ancestries: 97,493 European, 30,025 South Asian, 16,507 African, 16,440 Hispanic/Latino, 10,420 East Asian, and 1,182 Samoan. We identified 35 genes associated with circulating lipid levels; some of these genes have not been previously associated with lipid levels when using rare coding variation from population-based samples. We prioritize 32 genes in array-based genome-wide association study (GWAS) loci based on aggregations of rare coding variants; three (EVI5, SH2B3, and PLIN1) had no prior association of rare coding variants with lipid levels. Most of our associated genes showed evidence of association among multiple ancestries. Finally, we observed an enrichment of gene-based associations for low-density lipoprotein cholesterol drug target genes and for genes closest to GWAS index single-nucleotide polymorphisms (SNPs). Our results demonstrate that gene-based associations can be beneficial for drug target development and provide evidence that the gene closest to the array-based GWAS index SNP is often the functional gene for blood lipid levels.


Asunto(s)
Exoma , Variación Genética , Estudio de Asociación del Genoma Completo , Lípidos/sangre , Sistemas de Lectura Abierta , Alelos , Glucemia/genética , Estudios de Casos y Controles , Biología Computacional/métodos , Bases de Datos Genéticas , Diabetes Mellitus Tipo 2/genética , Diabetes Mellitus Tipo 2/metabolismo , Predisposición Genética a la Enfermedad , Genética de Población , Estudio de Asociación del Genoma Completo/métodos , Humanos , Metabolismo de los Lípidos/genética , Hígado/metabolismo , Hígado/patología , Anotación de Secuencia Molecular , Herencia Multifactorial , Fenotipo , Polimorfismo de Nucleótido Simple
18.
BMC Med ; 19(1): 314, 2021 12 13.
Artículo en Inglés | MEDLINE | ID: mdl-34895209

RESUMEN

BACKGROUND: Recent genome-wide association studies (GWASs) have revealed the polygenic nature of psychiatric disorders and discovered a few of single-nucleotide polymorphisms (SNPs) associated with multiple psychiatric disorders. However, the extent and pattern of pleiotropy among distinct psychiatric disorders remain not completely clear. METHODS: We analyzed 14 psychiatric disorders using summary statistics available from the largest GWASs by far. We first applied the cross-trait linkage disequilibrium score regression (LDSC) to estimate genetic correlation between disorders. Then, we performed a gene-based pleiotropy analysis by first aggregating a set of SNP-level associations into a single gene-level association signal using MAGMA. From a methodological perspective, we viewed the identification of pleiotropic associations across the entire genome as a high-dimensional problem of composite null hypothesis testing and utilized a novel method called PLACO for pleiotropy mapping. We ultimately implemented functional analysis for identified pleiotropic genes and used Mendelian randomization for detecting causal association between these disorders. RESULTS: We confirmed extensive genetic correlation among psychiatric disorders, based on which these disorders can be grouped into three diverse categories. We detected a large number of pleiotropic genes including 5884 associations and 2424 unique genes and found that differentially expressed pleiotropic genes were significantly enriched in pancreas, liver, heart, and brain, and that the biological process of these genes was remarkably enriched in regulating neurodevelopment, neurogenesis, and neuron differentiation, offering substantial evidence supporting the validity of identified pleiotropic loci. We further demonstrated that among all the identified pleiotropic genes there were 342 unique ones linked with 6353 drugs with drug-gene interaction which can be classified into distinct types including inhibitor, agonist, blocker, antagonist, and modulator. We also revealed causal associations among psychiatric disorders, indicating that genetic overlap and causality commonly drove the observed co-existence of these disorders. CONCLUSIONS: Our study is among the first large-scale effort to characterize gene-level pleiotropy among a greatly expanded set of psychiatric disorders and provides important insight into shared genetic etiology underlying these disorders. The findings would inform psychiatric nosology, identify potential neurobiological mechanisms predisposing to specific clinical presentations, and pave the way to effective drug targets for clinical treatment.


Asunto(s)
Estudio de Asociación del Genoma Completo , Trastornos Mentales , Pleiotropía Genética , Predisposición Genética a la Enfermedad , Humanos , Trastornos Mentales/genética , Fenotipo , Polimorfismo de Nucleótido Simple
19.
Proc Natl Acad Sci U S A ; 118(47)2021 11 23.
Artículo en Inglés | MEDLINE | ID: mdl-34799441

RESUMEN

Gene-based tests are valuable techniques for identifying genetic factors in complex traits. Here, we propose a gene-based testing framework that incorporates data on long-range chromatin interactions, several recent technical advances for region-based tests, and leverages the knockoff framework for synthetic genotype generation for improved gene discovery. Through simulations and applications to genome-wide association studies (GWAS) and whole-genome sequencing data for multiple diseases and traits, we show that the proposed test increases the power over state-of-the-art gene-based tests in the literature, identifies genes that replicate in larger studies, and can provide a more narrow focus on the possible causal genes at a locus by reducing the confounding effect of linkage disequilibrium. Furthermore, our results show that incorporating genetic variation in distal regulatory elements tends to improve power over conventional tests. Results for UK Biobank and BioBank Japan traits are also available in a publicly accessible database that allows researchers to query gene-based results in an easy fashion.


Asunto(s)
Cromatina , Pruebas Genéticas/métodos , Genotipo , Estudio de Asociación del Genoma Completo/métodos , Humanos , Japón , Desequilibrio de Ligamiento , Pulmón , Modelos Genéticos , Fenotipo , Sitios de Carácter Cuantitativo , Secuenciación Completa del Genoma/métodos
20.
Food Energy Secur ; 10(3): e292, 2021 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-34594548

RESUMEN

Trehalose 6-phosphate (T6P) signalling regulates carbon use and allocation and is a target to improve crop yields. However, the specific contributions of trehalose phosphate synthase (TPS) and trehalose phosphate phosphatase (TPP) genes to source- and sink-related traits remain largely unknown. We used enrichment capture sequencing on TPS and TPP genes to estimate and partition the genetic variation of yield-related traits in a spring wheat (Triticum aestivum) breeding panel specifically built to capture the diversity across the 75,000 CIMMYT wheat cultivar collection. Twelve phenotypes were correlated to variation in TPS and TPP genes including plant height and biomass (source), spikelets per spike, spike growth and grain filling traits (sink) which showed indications of both positive and negative gene selection. Individual genes explained proportions of heritability for biomass and grain-related traits. Three TPS1 homologues were particularly significant for trait variation. Epistatic interactions were found within and between the TPS and TPP gene families for both plant height and grain-related traits. Gene-based prediction improved predictive ability for grain weight when gene effects were combined with the whole-genome markers. Our study has generated a wealth of information on natural variation of TPS and TPP genes related to yield potential which confirms the role for T6P in resource allocation and in affecting traits such as grain number and size confirming other studies which now opens up the possibility of harnessing natural genetic variation more widely to better understand the contribution of native genes to yield traits for incorporation into breeding programmes.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA