Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 64
Filtrar
1.
N Engl J Med ; 2024 May 19.
Artigo em Inglês | MEDLINE | ID: mdl-38767252

RESUMO

BACKGROUND: Adjustment for race is discouraged in lung-function testing, but the implications of adopting race-neutral equations have not been comprehensively quantified. METHODS: We obtained longitudinal data from 369,077 participants in the National Health and Nutrition Examination Survey, U.K. Biobank, the Multi-Ethnic Study of Atherosclerosis, and the Organ Procurement and Transplantation Network. Using these data, we compared the race-based 2012 Global Lung Function Initiative (GLI-2012) equations with race-neutral equations introduced in 2022 (GLI-Global). Evaluated outcomes included national projections of clinical, occupational, and financial reclassifications; individual lung-allocation scores for transplantation priority; and concordance statistics (C statistics) for clinical prediction tasks. RESULTS: Among the 249 million persons in the United States between 6 and 79 years of age who are able to produce high-quality spirometric results, the use of GLI-Global equations may reclassify ventilatory impairment for 12.5 million persons, medical impairment ratings for 8.16 million, occupational eligibility for 2.28 million, grading of chronic obstructive pulmonary disease for 2.05 million, and military disability compensation for 413,000. These potential changes differed according to race; for example, classifications of nonobstructive ventilatory impairment may change dramatically, increasing 141% (95% confidence interval [CI], 113 to 169) among Black persons and decreasing 69% (95% CI, 63 to 74) among White persons. Annual disability payments may increase by more than $1 billion among Black veterans and decrease by $0.5 billion among White veterans. GLI-2012 and GLI-Global equations had similar discriminative accuracy with regard to respiratory symptoms, health care utilization, new-onset disease, death from any cause, death related to respiratory disease, and death among persons on a transplant waiting list, with differences in C statistics ranging from -0.008 to 0.011. CONCLUSIONS: The use of race-based and race-neutral equations generated similarly accurate predictions of respiratory outcomes but assigned different disease classifications, occupational eligibility, and disability compensation for millions of persons, with effects diverging according to race. (Funded by the National Heart Lung and Blood Institute and the National Institute of Environmental Health Sciences.).

2.
Genome Res ; 2024 May 15.
Artigo em Inglês | MEDLINE | ID: mdl-38749656

RESUMO

Underrepresented populations are often excluded from genomic studies due in part to a lack of resources supporting their analyses. The 1000 Genomes Project (1kGP) and Human Genome Diversity Project (HGDP), which have recently been sequenced to high coverage, are valuable genomic resources because of the global diversity they capture and their open data sharing policies. Here, we harmonized a high quality set of 4,094 whole genomes from 80 populations in the HGDP and 1kGP with data from the Genome Aggregation Database (gnomAD) and identified over 153 million high-quality SNVs, indels, and SVs. We performed a detailed ancestry analysis of this cohort, characterizing population structure and patterns of admixture across populations, analyzing site frequency spectra, and measuring variant counts at global and subcontinental levels. We also demonstrate substantial added value from this dataset compared to the prior versions of the component resources, typically combined via liftOver and variant intersection; for example, we catalog millions of new genetic variants, mostly rare, compared to previous releases. In addition to unrestricted individual-level public release, we provide detailed tutorials for conducting many of the most common quality control steps and analyses with these data in a scalable cloud-computing environment and publicly release this new phased joint callset for use as a haplotype resource in phasing and imputation pipelines. This jointly called reference panel will serve as a key resource to support research of diverse ancestry populations.

3.
bioRxiv ; 2024 Apr 09.
Artigo em Inglês | MEDLINE | ID: mdl-38645052

RESUMO

Genomic scientists have long been promised cheaper DNA sequencing, but deep whole genomes are still costly, especially when considered for large cohorts in population-level studies. More affordable options include microarrays + imputation, whole exome sequencing (WES), or low-pass whole genome sequencing (WGS) + imputation. WES + array + imputation has recently been shown to yield 99% of association signals detected by WGS. However, a method free from ascertainment biases of arrays or the need for merging different data types that still benefits from deeper exome coverage to enhance novel coding variant detection does not exist. We developed a new, combined, "Blended Genome Exome" (BGE) in which a whole genome library is generated, an aliquot of that genome is amplified by PCR, the exome regions are selected and enriched, and the genome and exome libraries are combined back into a single tube for sequencing (33% exome, 67% genome). This creates a single CRAM with a low-coverage whole genome (2-3x) combined with a higher coverage exome (30-40x). This BGE can be used for imputing common variants throughout the genome as well as for calling rare coding variants. We tested this new method and observed >99% r 2 concordance between imputed BGE data and existing 30x WGS data for exome and genome variants. BGE can serve as a useful and cost-efficient alternative sequencing product for genomic researchers, requiring ten-fold less sequencing compared to 30x WGS without the need for complicated harmonization of array and sequencing data.

4.
Am J Hum Genet ; 111(5): 809-824, 2024 May 02.
Artigo em Inglês | MEDLINE | ID: mdl-38642557

RESUMO

Advancements in genomic technologies have shown remarkable promise for improving health trajectories. The Human Genome Project has catalyzed the integration of genomic tools into clinical practice, such as disease risk assessment, prenatal testing and reproductive genomics, cancer diagnostics and prognostication, and therapeutic decision making. Despite the promise of genomic technologies, their full potential remains untapped without including individuals of diverse ancestries and integrating social determinants of health (SDOHs). The NHGRI launched the 2020 Strategic Vision with ten bold predictions by 2030, including "individuals from ancestrally diverse backgrounds will benefit equitably from advances in human genomics." Meeting this goal requires a holistic approach that brings together genomic advancements with careful consideration to healthcare access as well as SDOHs to ensure that translation of genetics research is inclusive, affordable, and accessible and ultimately narrows rather than widens health disparities. With this prediction in mind, this review delves into the two paramount applications of genetic testing-reproductive genomics and precision oncology. When discussing these applications of genomic advancements, we evaluate current accessibility limitations, highlight challenges in achieving representativeness, and propose paths forward to realize the ultimate goal of their equitable applications.


Assuntos
Genômica , Medicina de Precisão , Humanos , Genômica/métodos , Medicina de Precisão/métodos , Genoma Humano , Testes Genéticos , Neoplasias/genética , Acessibilidade aos Serviços de Saúde
5.
Cell Genom ; 4(4): 100523, 2024 Apr 10.
Artigo em Inglês | MEDLINE | ID: mdl-38508198

RESUMO

Polygenic risk scores (PRSs) are an emerging tool to predict the clinical phenotypes and outcomes of individuals. We propose PRSmix, a framework that leverages the PRS corpus of a target trait to improve prediction accuracy, and PRSmix+, which incorporates genetically correlated traits to better capture the human genetic architecture for 47 and 32 diseases/traits in European and South Asian ancestries, respectively. PRSmix demonstrated a mean prediction accuracy improvement of 1.20-fold (95% confidence interval [CI], [1.10; 1.3]; p = 9.17 × 10-5) and 1.19-fold (95% CI, [1.11; 1.27]; p = 1.92 × 10-6), and PRSmix+ improved the prediction accuracy by 1.72-fold (95% CI, [1.40; 2.04]; p = 7.58 × 10-6) and 1.42-fold (95% CI, [1.25; 1.59]; p = 8.01 × 10-7) in European and South Asian ancestries, respectively. Compared to the previously cross-trait-combination methods with scores from pre-defined correlated traits, we demonstrated that our method improved prediction accuracy for coronary artery disease up to 3.27-fold (95% CI, [2.1; 4.44]; p value after false discovery rate (FDR) correction = 2.6 × 10-4). Our method provides a comprehensive framework to benchmark and leverage the combined power of PRS for maximal performance in a desired target population.


Assuntos
Doença da Artéria Coronariana , Osteopatia , Humanos , Herança Multifatorial/genética , Estratificação de Risco Genético , Benchmarking , Doença da Artéria Coronariana/diagnóstico
7.
Cell Rep Med ; 5(2): 101430, 2024 Feb 20.
Artigo em Inglês | MEDLINE | ID: mdl-38382466

RESUMO

Primary open-angle glaucoma (POAG), a leading cause of irreversible blindness globally, shows disparity in prevalence and manifestations across ancestries. We perform meta-analysis across 15 biobanks (of the Global Biobank Meta-analysis Initiative) (n = 1,487,441: cases = 26,848) and merge with previous multi-ancestry studies, with the combined dataset representing the largest and most diverse POAG study to date (n = 1,478,037: cases = 46,325) and identify 17 novel significant loci, 5 of which were ancestry specific. Gene-enrichment and transcriptome-wide association analyses implicate vascular and cancer genes, a fifth of which are primary ciliary related. We perform an extensive statistical analysis of SIX6 and CDKN2B-AS1 loci in human GTEx data and across large electronic health records showing interaction between SIX6 gene and causal variants in the chr9p21.3 locus, with expression effect on CDKN2A/B. Our results suggest that some POAG risk variants may be ancestry specific, sex specific, or both, and support the contribution of genes involved in programmed cell death in POAG pathogenesis.


Assuntos
Predisposição Genética para Doença , Glaucoma de Ângulo Aberto , Masculino , Feminino , Humanos , Predisposição Genética para Doença/genética , Glaucoma de Ângulo Aberto/genética , Glaucoma de Ângulo Aberto/epidemiologia , Polimorfismo de Nucleotídeo Único , Proliferação de Células , Biologia
9.
bioRxiv ; 2024 Feb 28.
Artigo em Inglês | MEDLINE | ID: mdl-36747613

RESUMO

Underrepresented populations are often excluded from genomic studies due in part to a lack of resources supporting their analyses. The 1000 Genomes Project (1kGP) and Human Genome Diversity Project (HGDP), which have recently been sequenced to high coverage, are valuable genomic resources because of the global diversity they capture and their open data sharing policies. Here, we harmonized a high quality set of 4,094 whole genomes from HGDP and 1kGP with data from the Genome Aggregation Database (gnomAD) and identified over 153 million high-quality SNVs, indels, and SVs. We performed a detailed ancestry analysis of this cohort, characterizing population structure and patterns of admixture across populations, analyzing site frequency spectra, and measuring variant counts at global and subcontinental levels. We also demonstrate substantial added value from this dataset compared to the prior versions of the component resources, typically combined via liftover and variant intersection; for example, we catalog millions of new genetic variants, mostly rare, compared to previous releases. In addition to unrestricted individual-level public release, we provide detailed tutorials for conducting many of the most common quality control steps and analyses with these data in a scalable cloud-computing environment and publicly release this new phased joint callset for use as a haplotype resource in phasing and imputation pipelines. This jointly called reference panel will serve as a key resource to support research of diverse ancestry populations.

10.
Nature ; 625(7993): 92-100, 2024 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-38057664

RESUMO

The depletion of disruptive variation caused by purifying natural selection (constraint) has been widely used to investigate protein-coding genes underlying human disorders1-4, but attempts to assess constraint for non-protein-coding regions have proved more difficult. Here we aggregate, process and release a dataset of 76,156 human genomes from the Genome Aggregation Database (gnomAD)-the largest public open-access human genome allele frequency reference dataset-and use it to build a genomic constraint map for the whole genome (genomic non-coding constraint of haploinsufficient variation (Gnocchi)). We present a refined mutational model that incorporates local sequence context and regional genomic features to detect depletions of variation. As expected, the average constraint for protein-coding sequences is stronger than that for non-coding regions. Within the non-coding genome, constrained regions are enriched for known regulatory elements and variants that are implicated in complex human diseases and traits, facilitating the triangulation of biological annotation, disease association and natural selection to non-coding DNA analysis. More constrained regulatory elements tend to regulate more constrained protein-coding genes, which in turn suggests that non-coding constraint can aid the identification of constrained genes that are as yet unrecognized by current gene constraint metrics. We demonstrate that this genome-wide constraint map improves the identification and interpretation of functional human genetic variation.


Assuntos
Genoma Humano , Genômica , Modelos Genéticos , Mutação , Humanos , Acesso à Informação , Bases de Dados Genéticas , Conjuntos de Dados como Assunto , Frequência do Gene , Genoma Humano/genética , Mutação/genética , Seleção Genética
12.
Nat Commun ; 14(1): 8297, 2023 Dec 14.
Artigo em Inglês | MEDLINE | ID: mdl-38097585

RESUMO

Smoking is the leading risk factor for chronic obstructive pulmonary disease (COPD) worldwide, yet many people who never smoke develop COPD. We perform a longitudinal analysis of COPD in the UK Biobank to derive and validate the Socioeconomic and Environmental Risk Score which captures additive and cumulative environmental, behavioral, and socioeconomic exposure risks beyond tobacco smoking. The Socioeconomic and Environmental Risk Score is more predictive of COPD than smoking status and pack-years. Individuals in the highest decile of the risk score have a greater risk for incident COPD compared to the remaining population. Never smokers in the highest decile of exposure risk are more likely to develop COPD than previous and current smokers in the lowest decile. In general, the prediction accuracy of the Social and Environmental Risk Score is lower in non-European populations. While smoking status is often considered in screening COPD, our finding highlights the importance of other non-smoking environmental and socioeconomic variables.


Assuntos
Doença Pulmonar Obstrutiva Crônica , Humanos , Doença Pulmonar Obstrutiva Crônica/epidemiologia , Doença Pulmonar Obstrutiva Crônica/etiologia , Fatores de Risco , Fumar/efeitos adversos , Fumar/epidemiologia
13.
medRxiv ; 2023 Oct 25.
Artigo em Inglês | MEDLINE | ID: mdl-37961173

RESUMO

Mass General Brigham, an integrated healthcare system based in the Greater Boston area of Massachusetts, annually serves 1.5 million patients. We established the Mass General Brigham Biobank (MGBB), encompassing 142,238 participants, to unravel the intricate relationships among genomic profiles, environmental context, and disease manifestations within clinical practice. In this study, we highlight the impact of ancestral diversity in the MGBB by employing population genetics, geospatial assessment, and association analyses of rare and common genetic variants. The population structures captured by the genetics mirror the sequential immigration to the Greater Boston area throughout American history, highlighting communities tied to shared genetic and environmental factors. Our investigation underscores the potency of unbiased, large-scale analyses in a healthcare-affiliated biobank, elucidating the dynamic interplay across genetics, immigration, structural geospatial factors, and health outcomes in one of the earliest American sites of European colonization.

14.
Cell Genom ; 3(10): 100408, 2023 Oct 11.
Artigo em Inglês | MEDLINE | ID: mdl-37868036

RESUMO

Polygenic risk scores (PRSs) developed from multi-ancestry genome-wide association studies (GWASs), PRSmulti, hold promise for improving PRS accuracy and generalizability across populations. To establish best practices for leveraging the increasing diversity of genomic studies, we investigated how various factors affect the performance of PRSmulti compared with PRSs constructed from single-ancestry GWASs (PRSsingle). Through extensive simulations and empirical analyses, we showed that PRSmulti overall outperformed PRSsingle in understudied populations, except when the understudied population represented a small proportion of the multi-ancestry GWAS. Furthermore, integrating PRSs based on local ancestry-informed GWASs and large-scale, European-based PRSs improved predictive performance in understudied African populations, especially for less polygenic traits with large-effect ancestry-enriched variants. Our work highlights the importance of diversifying genomic studies to achieve equitable PRS performance across ancestral populations and provides guidance for developing PRSs from multiple studies.

15.
medRxiv ; 2023 Apr 05.
Artigo em Inglês | MEDLINE | ID: mdl-37066248

RESUMO

Smoking is the leading risk factor for chronic obstructive pulmonary disease (COPD) worldwide, yet many people who never smoke develop COPD. We hypothesize that considering other socioeconomic and environmental factors can better predict and stratify the risk of COPD in both non-smokers and smokers. We performed longitudinal analysis of COPD in the UK Biobank to develop the Socioeconomic and Environmental Risk Score (SERS) which captures additive and cumulative environmental, behavioral, and socioeconomic exposure risks beyond tobacco smoking. We tested the ability of SERS to predict and stratify the risk of COPD in current, previous, and never smokers of European and non-European ancestries in comparison to a composite genome-wide polygenic risk score (PGS). We tested associations using Cox regression models and assessed the predictive performance of models using Harrell's C index. SERS (C index = 0.770, 95% CI 0.756 to 0.784) was more predictive of COPD than smoking status (C index = 0.738, 95% CI 0.724 to 0.752), pack-years (C index = 0.742, 95% CI 0.727 to 0.756). Compared to the remaining population, individuals in the highest decile of the SERS had hazard ratios (HR) = 7.24 (95% CI 6.51 to 8.05, P < 0.0001) for incident COPD. Never smokers in the highest decile of exposure risk were more likely to develop COPD than previous and current smokers in the lowest decile with HR=4.95 (95% CI 1.56 to 15.69, P=6.65×10-3) and 2.92 (95%CI 1.51 to 5.61, P=1.38×10-3), respectively. In general, the prediction accuracy of SERS was lower in the non-European populations compared to the European evaluation set. In addition to genetic factors, socioeconomic and environmental factors beyond smoking can predict and stratify COPD risk for both non- and smoking individuals. Smoking status is often considered in screening; other non-smoking environmental and non-genetic variables should be evaluated prospectively for their clinical utility.

16.
Hastings Cent Rep ; 53 Suppl 1: S2-S49, 2023 03.
Artigo em Inglês | MEDLINE | ID: mdl-37078667

RESUMO

In this consensus report by a diverse group of academics who conduct and/or are concerned about social and behavioral genomics (SBG) research, the authors recount the often-ugly history of scientific attempts to understand the genetic contributions to human behaviors and social outcomes. They then describe what the current science-including genomewide association studies and polygenic indexes-can and cannot tell us, as well as its risks and potential benefits. They conclude with a discussion of responsible behavior in the context of SBG research. SBG research that compares individuals within a group according to a "sensitive" phenotype requires extra attention to responsible conduct and to responsible communication about the research and its findings. SBG research (1) on sensitive phenotypes that (2) compares two or more groups defined by (a) race, (b) ethnicity, or (c) genetic ancestry (where genetic ancestry could easily be misunderstood as race or ethnicity) requires a compelling justification to be conducted, funded, or published. All authors agree that this justification at least requires a convincing argument that a study's design could yield scientifically valid results; some authors would additionally require the study to have a socially favorable risk-benefit profile.


Assuntos
Comunicação , Genômica , Humanos , Fenótipo , Responsabilidade Social
17.
HGG Adv ; 4(2): 100184, 2023 04 13.
Artigo em Inglês | MEDLINE | ID: mdl-36873096

RESUMO

African populations are vastly underrepresented in genetic studies but have the most genetic variation and face wide-ranging environmental exposures globally. Because systematic evaluations of genetic prediction had not yet been conducted in ancestries that span African diversity, we calculated polygenic risk scores (PRSs) in simulations across Africa and in empirical data from South Africa, Uganda, and the United Kingdom to better understand the generalizability of genetic studies. PRS accuracy improves with ancestry-matched discovery cohorts more than from ancestry-mismatched studies. Within ancestrally and ethnically diverse South African individuals, we find that PRS accuracy is low for all traits but varies across groups. Differences in African ancestries contribute more to variability in PRS accuracy than other large cohort differences considered between individuals in the United Kingdom versus Uganda. We computed PRS in African ancestry populations using existing European-only versus ancestrally diverse genetic studies; the increased diversity produced the largest accuracy gains for hemoglobin concentration and white blood cell count, reflecting large-effect ancestry-enriched variants in genes known to influence sickle cell anemia and the allergic response, respectively. Differences in PRS accuracy across African ancestries originating from diverse regions are as large as across out-of-Africa continental ancestries, requiring commensurate nuance.


Assuntos
Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Humanos , Predisposição Genética para Doença/genética , Herança Multifatorial/genética , Uganda/epidemiologia , Polimorfismo de Nucleotídeo Único
18.
medRxiv ; 2023 Mar 23.
Artigo em Inglês | MEDLINE | ID: mdl-36865265

RESUMO

Polygenic risk scores (PRS) are an emerging tool to predict the clinical phenotypes and outcomes of individuals. Validation and transferability of existing PRS across independent datasets and diverse ancestries are limited, which hinders the practical utility and exacerbates health disparities. We propose PRSmix, a framework that evaluates and leverages the PRS corpus of a target trait to improve prediction accuracy, and PRSmix+, which incorporates genetically correlated traits to better capture the human genetic architecture. We applied PRSmix to 47 and 32 diseases/traits in European and South Asian ancestries, respectively. PRSmix demonstrated a mean prediction accuracy improvement of 1.20-fold (95% CI: [1.10; 1.3]; P-value = 9.17 × 10-5) and 1.19-fold (95% CI: [1.11; 1.27]; P-value = 1.92 × 10-6), and PRSmix+ improved the prediction accuracy by 1.72-fold (95% CI: [1.40; 2.04]; P-value = 7.58 × 10-6) and 1.42-fold (95% CI: [1.25; 1.59]; P-value = 8.01 × 10-7) in European and South Asian ancestries, respectively. Compared to the previously established cross-trait-combination method with scores from pre-defined correlated traits, we demonstrated that our method can improve prediction accuracy for coronary artery disease up to 3.27-fold (95% CI: [2.1; 4.44]; P-value after FDR correction = 2.6 × 10-4). Our method provides a comprehensive framework to benchmark and leverage the combined power of PRS for maximal performance in a desired target population.

19.
Cell Genom ; 3(1): 100241, 2023 Jan 11.
Artigo em Inglês | MEDLINE | ID: mdl-36777179

RESUMO

Polygenic risk scores (PRSs) have been widely explored in precision medicine. However, few studies have thoroughly investigated their best practices in global populations across different diseases. We here utilized data from Global Biobank Meta-analysis Initiative (GBMI) to explore methodological considerations and PRS performance in 9 different biobanks for 14 disease endpoints. Specifically, we constructed PRSs using pruning and thresholding (P + T) and PRS-continuous shrinkage (CS). For both methods, using a European-based linkage disequilibrium (LD) reference panel resulted in comparable or higher prediction accuracy compared with several other non-European-based panels. PRS-CS overall outperformed the classic P + T method, especially for endpoints with higher SNP-based heritability. Notably, prediction accuracy is heterogeneous across endpoints, biobanks, and ancestries, especially for asthma, which has known variation in disease prevalence across populations. Overall, we provide lessons for PRS construction, evaluation, and interpretation using GBMI resources and highlight the importance of best practices for PRS in the biobank-scale genomics era.

20.
Am J Hum Genet ; 109(9): 1667-1679, 2022 09 01.
Artigo em Inglês | MEDLINE | ID: mdl-36055213

RESUMO

African populations are the most diverse in the world yet are sorely underrepresented in medical genetics research. Here, we examine the structure of African populations using genetic and comprehensive multi-generational ethnolinguistic data from the Neuropsychiatric Genetics of African Populations-Psychosis study (NeuroGAP-Psychosis) consisting of 900 individuals from Ethiopia, Kenya, South Africa, and Uganda. We find that self-reported language classifications meaningfully tag underlying genetic variation that would be missed with consideration of geography alone, highlighting the importance of culture in shaping genetic diversity. Leveraging our uniquely rich multi-generational ethnolinguistic metadata, we track language transmission through the pedigree, observing the disappearance of several languages in our cohort as well as notable shifts in frequency over three generations. We find suggestive evidence for the rate of language transmission in matrilineal groups having been higher than that for patrilineal ones. We highlight both the diversity of variation within Africa as well as how within-Africa variation can be informative for broader variant interpretation; many variants that are rare elsewhere are common in parts of Africa. The work presented here improves the understanding of the spectrum of genetic variation in African populations and highlights the enormous and complex genetic and ethnolinguistic diversity across Africa.


Assuntos
Variação Genética , Genética Populacional , África Austral , População Negra/genética , Estruturas Genéticas , Variação Genética/genética , Humanos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA