Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 14 de 14
Filter
1.
Cell ; 186(10): 2044-2061, 2023 05 11.
Article in English | MEDLINE | ID: mdl-37172561

ABSTRACT

Phenotypic sex-based differences exist for many complex traits. In other cases, phenotypes may be similar, but underlying biology may vary. Thus, sex-aware genetic analyses are becoming increasingly important for understanding the mechanisms driving these differences. To this end, we provide a guide outlining the current best practices for testing various models of sex-dependent genetic effects in complex traits and disease conditions, noting that this is an evolving field. Insights from sex-aware analyses will not only teach us about the biology of complex traits but also aid in achieving the goals of precision medicine and health equity for all.


Subject(s)
Models, Genetic , Sex Characteristics , Animals , Female , Male , Multifactorial Inheritance , Phenotype , Quality Control , Genome-Wide Association Study , Guidelines as Topic , Gene-Environment Interaction , Humans
2.
Am J Respir Crit Care Med ; 206(10): 1271-1280, 2022 11 15.
Article in English | MEDLINE | ID: mdl-35822943

ABSTRACT

Rationale: Obstructive sleep apnea (OSA) is a common disorder associated with increased risk for cardiovascular disease, diabetes, and premature mortality. There is strong clinical and epidemiologic evidence supporting the importance of genetic factors influencing OSA but limited data implicating specific genes. Objectives: To search for rare variants contributing to OSA severity. Methods: Leveraging high-depth genomic sequencing data from the NHLBI Trans-Omics for Precision Medicine (TOPMed) program and imputed genotype data from multiple population-based studies, we performed linkage analysis in the CFS (Cleveland Family Study), followed by multistage gene-based association analyses in independent cohorts for apnea-hypopnea index (AHI) in a total of 7,708 individuals of European ancestry. Measurements and Main Results: Linkage analysis in the CFS identified a suggestive linkage peak on chromosome 7q31 (LOD = 2.31). Gene-based analysis identified 21 noncoding rare variants in CAV1 (Caveolin-1) associated with lower AHI after accounting for multiple comparisons (P = 7.4 × 10-8). These noncoding variants together significantly contributed to the linkage evidence (P < 10-3). Follow-up analysis revealed significant associations between these variants and increased CAV1 expression, and increased CAV1 expression in peripheral monocytes was associated with lower AHI (P = 0.024) and higher minimum overnight oxygen saturation (P = 0.007). Conclusions: Rare variants in CAV1, a membrane-scaffolding protein essential in multiple cellular and metabolic functions, are associated with higher CAV1 gene expression and lower OSA severity, suggesting a novel target for modulating OSA severity.


Subject(s)
Sleep Apnea, Obstructive , Humans , Caveolin 1/genetics , Sleep Apnea, Obstructive/genetics , Sequence Analysis, DNA , High-Throughput Nucleotide Sequencing
3.
BMC Genomics ; 23(1): 148, 2022 Feb 19.
Article in English | MEDLINE | ID: mdl-35183128

ABSTRACT

BACKGROUND: While large genome-wide association studies have identified nearly one thousand loci associated with variation in blood pressure, rare variant identification is still a challenge. In family-based cohorts, genome-wide linkage scans have been successful in identifying rare genetic variants for blood pressure. This study aims to identify low frequency and rare genetic variants within previously reported linkage regions on chromosomes 1 and 19 in African American families from the Trans-Omics for Precision Medicine (TOPMed) program. Genetic association analyses weighted by linkage evidence were completed with whole genome sequencing data within and across TOPMed ancestral groups consisting of 60,388 individuals of European, African, East Asian, Hispanic, and Samoan ancestries. RESULTS: Associations of low frequency and rare variants in RCN3 and multiple other genes were observed for blood pressure traits in TOPMed samples. The association of low frequency and rare coding variants in RCN3 was further replicated in UK Biobank samples (N = 403,522), and reached genome-wide significance for diastolic blood pressure (p = 2.01 × 10- 7). CONCLUSIONS: Low frequency and rare variants in RCN3 contributes blood pressure variation. This study demonstrates that focusing association analyses in linkage regions greatly reduces multiple-testing burden and improves power to identify novel rare variants associated with blood pressure traits.


Subject(s)
Genome-Wide Association Study , Precision Medicine , Blood Pressure/genetics , Genetic Linkage , Genetic Predisposition to Disease , Humans , Polymorphism, Single Nucleotide , Whole Genome Sequencing
4.
Am J Hum Genet ; 105(5): 1057-1068, 2019 11 07.
Article in English | MEDLINE | ID: mdl-31668705

ABSTRACT

Average arterial oxyhemoglobin saturation during sleep (AvSpO2S) is a clinically relevant measure of physiological stress associated with sleep-disordered breathing, and this measure predicts incident cardiovascular disease and mortality. Using high-depth whole-genome sequencing data from the National Heart, Lung, and Blood Institute (NHLBI) Trans-Omics for Precision Medicine (TOPMed) project and focusing on genes with linkage evidence on chromosome 8p23,1,2 we observed that six coding and 51 noncoding variants in a gene that encodes the GTPase-activating protein (DLC1) are significantly associated with AvSpO2S and replicated in independent subjects. The combined DLC1 association evidence of discovery and replication cohorts reaches genome-wide significance in European Americans (p = 7.9 × 10-7). A risk score for these variants, built on an independent dataset, explains 0.97% of the AvSpO2S variation and contributes to the linkage evidence. The 51 noncoding variants are enriched in regulatory features in a human lung fibroblast cell line and contribute to DLC1 expression variation. Mendelian randomization analysis using these variants indicates a significant causal effect of DLC1 expression in fibroblasts on AvSpO2S. Multiple sources of information, including genetic variants, gene expression, and methylation, consistently suggest that DLC1 is a gene associated with AvSpO2S.


Subject(s)
Chromosomes, Human, Pair 8/genetics , GTPase-Activating Proteins/genetics , Oxyhemoglobins/genetics , Sleep/genetics , Tumor Suppressor Proteins/genetics , Genetic Linkage/genetics , Genome-Wide Association Study , Humans , Whole Genome Sequencing/methods
5.
BMC Genomics ; 21(1): 476, 2020 Jul 11.
Article in English | MEDLINE | ID: mdl-32652930

ABSTRACT

BACKGROUND: Fitness epistasis, the interaction effect of genes at different loci on fitness, makes an important contribution to adaptive evolution. Although fitness interaction evidence has been observed in model organisms, it is more difficult to detect and remains poorly understood in human populations as a result of limited statistical power and experimental constraints. Fitness epistasis is inferred from non-independence between unlinked loci. We previously observed ancestral block correlation between chromosomes 4 and 6 in African Americans. The same approach fails when examining ancestral blocks on the same chromosome due to the strong confounding effect observed in a recently admixed population. RESULTS: We developed a novel approach to eliminate the bias caused by admixture linkage disequilibrium when searching for fitness epistasis on the same chromosome. We applied this approach in 16,252 unrelated African Americans and identified significant ancestral correlations in two pairs of genomic regions (P-value< 8.11 × 10- 7) on chromosomes 1 and 10. The ancestral correlations were not explained by population admixture. Historical African-European crossover events are reduced between pairs of epistatic regions. We observed multiple pairs of co-expressed genes shared by the two regions on each chromosome, including ADAR being co-expressed with IFI44 in almost all tissues and DARC being co-expressed with VCAM1, S1PR1 and ELTD1 in multiple tissues in the Genotype-Tissue Expression (GTEx) data. Moreover, the co-expressed gene pairs are associated with the same diseases/traits in the GWAS Catalog, such as white blood cell count, blood pressure, lung function, inflammatory bowel disease and educational attainment. CONCLUSIONS: Our analyses revealed two instances of fitness epistasis on chromosomes 1 and 10, and the findings suggest a potential approach to improving our understanding of adaptive evolution.


Subject(s)
Epistasis, Genetic , Genetic Fitness , Genome-Wide Association Study/methods , Black or African American/genetics , Chromosomes, Human, Pair 1/genetics , Chromosomes, Human, Pair 10/genetics , Computer Simulation , Humans , Linkage Disequilibrium , Polymorphism, Single Nucleotide , Receptors, G-Protein-Coupled/genetics
6.
PLoS Genet ; 13(3): e1006678, 2017 03.
Article in English | MEDLINE | ID: mdl-28346479

ABSTRACT

Many large genome-wide association studies (GWAS) have identified common blood pressure (BP) variants. However, most of the identified BP variants do not overlap with the linkage evidence observed from family studies. We thus hypothesize that multiple rare variants contribute to the observed linkage evidence. We performed linkage analysis using 517 individuals in 130 European families from the Cleveland Family Study (CFS) who have been genotyped on the Illumina OmniExpress Exome array. The largest linkage peak was observed on chromosome 16p13 (MLOD = 2.81) for systolic blood pressure (SBP). Follow-up conditional linkage and association analyses in the linkage region identified multiple rare, coding variants in RBFOX1 associated with reduced SBP. In a 17-member CFS family, carriers of the missense variant rs149974858 are normotensive despite being obese (average BMI = 60 kg/m2). Gene-based association test of rare variants using SKAT-O showed significant association with SBP (p-value = 0.00403) and DBP (p-value = 0.0258) in the CFS participants and the association was replicated in large independent replication studies (N = 57,234, p-value = 0.013 for SBP, 0.0023 for PP). RBFOX1 is expressed in brain tissues, the atrial appendage and left ventricle in the heart, and in skeletal muscle tissues, organs/tissues which are potentially related to blood pressure. Our study showed that associations of rare variants could be efficiently detected using family information.


Subject(s)
Blood Pressure/genetics , Genetic Predisposition to Disease/genetics , Polymorphism, Single Nucleotide , RNA Splicing Factors/genetics , Adult , Body Mass Index , Chromosomes, Human, Pair 16/genetics , Family Health , Female , Gene Expression , Gene Frequency , Genetic Linkage , Genetic Predisposition to Disease/ethnology , Genome-Wide Association Study/methods , Genotype , Humans , Male , Middle Aged , Pedigree , White People/genetics
7.
Hum Genet ; 138(2): 199-210, 2019 Feb.
Article in English | MEDLINE | ID: mdl-30671673

ABSTRACT

In this study, we investigated low-frequency and rare variants associated with blood pressure (BP) by focusing on a linkage region on chromosome 16p13. We used whole genome sequencing (WGS) data obtained through the NHLBI Trans-Omics for Precision Medicine (TOPMed) program on 395 Cleveland Family Study (CFS) European Americans (CFS-EA). By analyzing functional coding variants and non-coding rare variants with CADD score > 10 residing within the chromosomal region in families with linkage evidence, we observed 25 genes with nominal statistical evidence (burden or SKAT p < 0.05). One of the genes is RBFOX1, an evolutionarily conserved RNA-binding protein that regulates tissue-specific alternative splicing that we previously reported to be associated with BP using exome array data in CFS. After follow-up analysis of the 25 genes in ten independent TOPMed studies with individuals of European, African, and East Asian ancestry, and Hispanics (N = 29,988), we identified variants in SLX4 (p = 2.19 × 10-4) to be significantly associated with BP traits when accounting for multiple testing. We also replicated the associations previously reported for RBFOX1 (p = 0.007). Follow-up analysis with GTEx eQTL data shows SLX4 variants are associated with gene expression in coronary artery, multiple brain tissues, and right atrial appendage of the heart. Our study demonstrates that linkage analysis of family data can provide an efficient approach for detecting rare variants associated with complex traits in WGS data.


Subject(s)
Blood Pressure/genetics , Chromosomes, Human, Pair 16/genetics , Exome , Genetic Linkage , Genetic Variation , Genome, Human , High-Throughput Nucleotide Sequencing , Alternative Splicing/genetics , Female , Follow-Up Studies , Genome-Wide Association Study , Humans , Male , RNA Splicing Factors/genetics , Recombinases/genetics
8.
Int J Mol Sci ; 18(2)2017 Feb 15.
Article in English | MEDLINE | ID: mdl-28212287

ABSTRACT

Genomic medicine attempts to build individualized strategies for diagnostic or therapeutic decision-making by utilizing patients' genomic information. Big Data analytics uncovers hidden patterns, unknown correlations, and other insights through examining large-scale various data sets. While integration and manipulation of diverse genomic data and comprehensive electronic health records (EHRs) on a Big Data infrastructure exhibit challenges, they also provide a feasible opportunity to develop an efficient and effective approach to identify clinically actionable genetic variants for individualized diagnosis and therapy. In this paper, we review the challenges of manipulating large-scale next-generation sequencing (NGS) data and diverse clinical data derived from the EHRs for genomic medicine. We introduce possible solutions for different challenges in manipulating, managing, and analyzing genomic and clinical data to implement genomic medicine. Additionally, we also present a practical Big Data toolset for identifying clinically actionable genetic variants using high-throughput NGS data and EHRs.


Subject(s)
Data Mining , Electronic Health Records , Genomics , Precision Medicine , Cloud Computing , Computational Biology/methods , Data Mining/methods , Databases, Factual , Genetic Variation , Genomics/methods , High-Throughput Nucleotide Sequencing , Humans , Medical Informatics/methods , Precision Medicine/methods
9.
Hypertension ; 79(8): 1656-1667, 2022 08.
Article in English | MEDLINE | ID: mdl-35652341

ABSTRACT

BACKGROUND: The availability of whole-genome sequencing data in large studies has enabled the assessment of coding and noncoding variants across the allele frequency spectrum for their associations with blood pressure. METHODS: We conducted a multiancestry whole-genome sequencing analysis of blood pressure among 51 456 Trans-Omics for Precision Medicine and Centers for Common Disease Genomics program participants (stage-1). Stage-2 analyses leveraged array data from UK Biobank (N=383 145), Million Veteran Program (N=318 891), and Reasons for Geographic and Racial Differences in Stroke (N=10 643) participants, along with whole-exome sequencing data from UK Biobank (N=199 631) participants. RESULTS: Two blood pressure signals achieved genome-wide significance in meta-analyses of stage-1 and stage-2 single variant findings (P<5×10-8). Among them, a rare intergenic variant at novel locus, LOC100506274, was associated with lower systolic blood pressure in stage-1 (beta [SE]=-32.6 [6.0]; P=4.99×10-8) but not stage-2 analysis (P=0.11). Furthermore, a novel common variant at the known INSR locus was suggestively associated with diastolic blood pressure in stage-1 (beta [SE]=-0.36 [0.07]; P=4.18×10-7) and attained genome-wide significance in stage-2 (beta [SE]=-0.29 [0.03]; P=7.28×10-23). Nineteen additional signals suggestively associated with blood pressure in meta-analysis of single and aggregate rare variant findings (P<1×10-6 and P<1×10-4, respectively). DISCUSSION: We report one promising but unconfirmed rare variant for blood pressure and, more importantly, contribute insights for future blood pressure sequencing studies. Our findings suggest promise of aggregate analyses to complement single variant analysis strategies and the need for larger, diverse samples, and family studies to enable robust rare variant identification.


Subject(s)
Hypertension , Blood Pressure/genetics , Genome-Wide Association Study , Genomics , Humans , Hypertension/genetics , Polymorphism, Single Nucleotide , Precision Medicine
10.
Genome Med ; 11(1): 53, 2019 08 23.
Article in English | MEDLINE | ID: mdl-31443733

ABSTRACT

BACKGROUND: Clinical laboratories implement a variety of measures to classify somatic sequence variants and identify clinically significant variants to facilitate the implementation of precision medicine. To standardize the interpretation process, the Association for Molecular Pathology (AMP), American Society of Clinical Oncology (ASCO), and College of American Pathologists (CAP) published guidelines for the interpretation and reporting of sequence variants in cancer in 2017. These guidelines classify somatic variants using a four-tiered system with ten criteria. Even with the standardized guidelines, assessing clinical impacts of somatic variants remains to be tedious. Additionally, manual implementation of the guidelines may vary among professionals and may lack reproducibility when the supporting evidence is not documented in a consistent manner. RESULTS: We developed a semi-automated tool called "Variant Interpretation for Cancer" (VIC) to accelerate the interpretation process and minimize individual biases. VIC takes pre-annotated files and automatically classifies sequence variants based on several criteria, with the ability for users to integrate additional evidence to optimize the interpretation on clinical impacts. We evaluated VIC using several publicly available databases and compared with several predictive software programs. We found that VIC is time-efficient and conservative in classifying somatic variants under default settings, especially for variants with strong and/or potential clinical significance. Additionally, we also tested VIC on two cancer-panel sequencing datasets to show its effectiveness in facilitating manual interpretation of somatic variants. CONCLUSIONS: Although VIC cannot replace human reviewers, it will accelerate the interpretation process on somatic variants. VIC can also be customized by clinical laboratories to fit into their analytical pipelines to facilitate the laborious process of somatic variant interpretation. VIC is freely available at https://github.com/HGLab/VIC/ .


Subject(s)
Computational Biology , Genetic Predisposition to Disease , Genetic Variation , Neoplasms/genetics , Software , Alleles , Biomarkers, Tumor , Computational Biology/methods , Databases, Genetic , Gene Frequency , Genetic Testing , Germ-Line Mutation , Humans , Molecular Sequence Annotation , Neoplasms/diagnosis , Precision Medicine
12.
PLoS One ; 11(10): e0163912, 2016.
Article in English | MEDLINE | ID: mdl-27701450

ABSTRACT

Meta-analysis of single trait for multiple cohorts has been used for increasing statistical power in genome-wide association studies (GWASs). Although hundreds of variants have been identified by GWAS, these variants only explain a small fraction of phenotypic variation. Cross-phenotype association analysis (CPASSOC) can further improve statistical power by searching for variants that contribute to multiple traits, which is often relevant to pleiotropy. In this study, we performed CPASSOC analysis on the summary statistics from the Genetic Investigation of ANthropometric Traits (GIANT) consortium using a novel method recently developed by our group. Sex-specific meta-analysis data for height, body mass index (BMI), and waist-to-hip ratio adjusted for BMI (WHRadjBMI) from discovery phase of the GIANT consortium study were combined using CPASSOC for each trait as well as 3 traits together. The conventional meta-analysis results from the discovery phase data of GIANT consortium studies were used to compare with that from CPASSOC analysis. The CPASSOC analysis was able to identify 17 loci associated with anthropometric traits that were missed by conventional meta-analysis. Among these loci, 16 have been reported in literature by including additional samples and 1 is novel. We also demonstrated that CPASSOC is able to detect pleiotropic effects when analyzing multiple traits.


Subject(s)
Genome-Wide Association Study/methods , Polymorphism, Single Nucleotide , Quantitative Trait Loci , Anthropometry , Body Mass Index , Databases, Genetic , Female , Humans , Male , Models, Statistical , Multivariate Analysis , Waist-Hip Ratio
13.
PLoS One ; 11(9): e0162721, 2016.
Article in English | MEDLINE | ID: mdl-27685652

ABSTRACT

BACKGROUND: Many new biomedical research articles are published every day, accumulating rich information, such as genetic variants, genes, diseases, and treatments. Rapid yet accurate text mining on large-scale scientific literature can discover novel knowledge to better understand human diseases and to improve the quality of disease diagnosis, prevention, and treatment. RESULTS: In this study, we designed and developed an efficient text mining framework called SparkText on a Big Data infrastructure, which is composed of Apache Spark data streaming and machine learning methods, combined with a Cassandra NoSQL database. To demonstrate its performance for classifying cancer types, we extracted information (e.g., breast, prostate, and lung cancers) from tens of thousands of articles downloaded from PubMed, and then employed Naïve Bayes, Support Vector Machine (SVM), and Logistic Regression to build prediction models to mine the articles. The accuracy of predicting a cancer type by SVM using the 29,437 full-text articles was 93.81%. While competing text-mining tools took more than 11 hours, SparkText mined the dataset in approximately 6 minutes. CONCLUSIONS: This study demonstrates the potential for mining large-scale scientific articles on a Big Data infrastructure, with real-time update from new articles published daily. SparkText can be extended to other areas of biomedical research.

14.
PLoS One ; 11(12): e0167847, 2016.
Article in English | MEDLINE | ID: mdl-27930734

ABSTRACT

BACKGROUND: It is unclear whether and how whole-genome sequencing (WGS) data can be used to implement genomic medicine. Our objective is to retrospectively evaluate whether WGS can facilitate improving prevention and care for patients with susceptibility to cancer syndromes. METHODS AND FINDINGS: We analyzed genetic mutations in 60 autosomal dominant cancer-predisposition genes in 300 deceased patients with WGS data and nearly complete long-term (over 30 years) medical records. To infer biological insights from massive amounts of WGS data and comprehensive clinical data in a short period of time, we developed an in-house analysis pipeline within the SeqHBase software framework to quickly identify pathogenic or likely pathogenic variants. The clinical data of the patients who carried pathogenic and/or likely pathogenic variants were further reviewed to assess their clinical conditions using their lifetime EHRs. Among the 300 participants, 5 (1.7%) carried pathogenic or likely pathogenic variants in 5 cancer-predisposing genes: one in APC, BRCA1, BRCA2, NF1, and TP53 each. When assessing the clinical data, each of the 5 patients had one or more different types of cancers, fully consistent with their genetic profiles. Among these 5 patients, 2 died due to cancer while the others had multiple disorders later in their lifetimes; however, they may have benefited from early diagnosis and treatment for healthier lives, had the patients had genetic testing in their earlier lifetimes. CONCLUSIONS: We demonstrated a case study where the discovery of pathogenic or likely pathogenic germline mutations from population-wide WGS correlates with clinical outcome. The use of WGS may have clinical impacts to improve healthcare delivery.


Subject(s)
Electronic Health Records , Genes, Neoplasm , Genetic Predisposition to Disease , Genome, Human , Mutation , Neoplasms/genetics , Humans , Sequence Analysis, DNA/methods
SELECTION OF CITATIONS
SEARCH DETAIL