Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 29
Filtrar
Mais filtros

Base de dados
Tipo de documento
Intervalo de ano de publicação
1.
Cell ; 184(8): 2068-2083.e11, 2021 04 15.
Artigo em Inglês | MEDLINE | ID: mdl-33861964

RESUMO

Understanding population health disparities is an essential component of equitable precision health efforts. Epidemiology research often relies on definitions of race and ethnicity, but these population labels may not adequately capture disease burdens and environmental factors impacting specific sub-populations. Here, we propose a framework for repurposing data from electronic health records (EHRs) in concert with genomic data to explore the demographic ties that can impact disease burdens. Using data from a diverse biobank in New York City, we identified 17 communities sharing recent genetic ancestry. We observed 1,177 health outcomes that were statistically associated with a specific group and demonstrated significant differences in the segregation of genetic variants contributing to Mendelian diseases. We also demonstrated that fine-scale population structure can impact the prediction of complex disease risk within groups. This work reinforces the utility of linking genomic data to EHRs and provides a framework toward fine-scale monitoring of population health.


Assuntos
Etnicidade/genética , Saúde da População , Bases de Dados Genéticas , Registros Eletrônicos de Saúde , Genômica , Humanos , Autorrelato
2.
Cell ; 173(7): 1692-1704.e11, 2018 06 14.
Artigo em Inglês | MEDLINE | ID: mdl-29779949

RESUMO

Heritability is essential for understanding the biological causes of disease but requires laborious patient recruitment and phenotype ascertainment. Electronic health records (EHRs) passively capture a wide range of clinically relevant data and provide a resource for studying the heritability of traits that are not typically accessible. EHRs contain next-of-kin information collected via patient emergency contact forms, but until now, these data have gone unused in research. We mined emergency contact data at three academic medical centers and identified 7.4 million familial relationships while maintaining patient privacy. Identified relationships were consistent with genetically derived relatedness. We used EHR data to compute heritability estimates for 500 disease phenotypes. Overall, estimates were consistent with the literature and between sites. Inconsistencies were indicative of limitations and opportunities unique to EHR research. These analyses provide a validation of the use of EHRs for genetics and disease research.


Assuntos
Registros Eletrônicos de Saúde , Doenças Genéticas Inatas/genética , Algoritmos , Bases de Dados Factuais , Relações Familiares , Doenças Genéticas Inatas/patologia , Genótipo , Humanos , Linhagem , Fenótipo , Característica Quantitativa Herdável
3.
Nature ; 582(7811): 234-239, 2020 06.
Artigo em Inglês | MEDLINE | ID: mdl-32499652

RESUMO

On average, Peruvian individuals are among the shortest in the world1. Here we show that Native American ancestry is associated with reduced height in an ethnically diverse group of Peruvian individuals, and identify a population-specific, missense variant in the FBN1 gene (E1297G) that is significantly associated with lower height. Each copy of the minor allele (frequency of 4.7%) reduces height by 2.2 cm (4.4 cm in homozygous individuals). To our knowledge, this is the largest effect size known for a common height-associated variant. FBN1 encodes the extracellular matrix protein fibrillin 1, which is a major structural component of microfibrils. We observed less densely packed fibrillin-1-rich microfibrils with irregular edges in the skin of individuals who were homozygous for G1297 compared with individuals who were homozygous for E1297. Moreover, we show that the E1297G locus is under positive selection in non-African populations, and that the E1297 variant shows subtle evidence of positive selection specifically within the Peruvian population. This variant is also significantly more frequent in coastal Peruvian populations than in populations from the Andes or the Amazon, which suggests that short stature might be the result of adaptation to factors that are associated with the coastal environment in Peru.


Assuntos
Estatura/genética , Fibrilina-1/genética , Mutação de Sentido Incorreto , Seleção Genética , Feminino , Frequência do Gene , Estudo de Associação Genômica Ampla , Hereditariedade , Humanos , Indígenas Sul-Americanos/genética , Masculino , Microfibrilas/química , Microfibrilas/genética , Peru
4.
Nature ; 570(7762): 514-518, 2019 06.
Artigo em Inglês | MEDLINE | ID: mdl-31217584

RESUMO

Genome-wide association studies (GWAS) have laid the foundation for investigations into the biology of complex traits, drug development and clinical guidelines. However, the majority of discovery efforts are based on data from populations of European ancestry1-3. In light of the differential genetic architecture that is known to exist between populations, bias in representation can exacerbate existing disease and healthcare disparities. Critical variants may be missed if they have a low frequency or are completely absent in European populations, especially as the field shifts its attention towards rare variants, which are more likely to be population-specific4-10. Additionally, effect sizes and their derived risk prediction scores derived in one population may not accurately extrapolate to other populations11,12. Here we demonstrate the value of diverse, multi-ethnic participants in large-scale genomic studies. The Population Architecture using Genomics and Epidemiology (PAGE) study conducted a GWAS of 26 clinical and behavioural phenotypes in 49,839 non-European individuals. Using strategies tailored for analysis of multi-ethnic and admixed populations, we describe a framework for analysing diverse populations, identify 27 novel loci and 38 secondary signals at known loci, as well as replicate 1,444 GWAS catalogue associations across these traits. Our data show evidence of effect-size heterogeneity across ancestries for published GWAS associations, substantial benefits for fine-mapping using diverse cohorts and insights into clinical implications. In the United States-where minority populations have a disproportionately higher burden of chronic conditions13-the lack of representation of diverse populations in genetic research will result in inequitable access to precision medicine for those with the highest burden of disease. We strongly advocate for continued, large genome-wide efforts in diverse populations to maximize genetic discovery and reduce health disparities.


Assuntos
Povo Asiático/genética , População Negra/genética , Estudo de Associação Genômica Ampla/métodos , Hispânico ou Latino/genética , Grupos Minoritários , Herança Multifatorial/genética , Saúde da Mulher , Estatura/genética , Estudos de Coortes , Feminino , Genética Médica/métodos , Equidade em Saúde/tendências , Disparidades nos Níveis de Saúde , Humanos , Masculino , Estados Unidos
5.
Am J Hum Genet ; 108(11): 2099-2111, 2021 11 04.
Artigo em Inglês | MEDLINE | ID: mdl-34678161

RESUMO

The integration of genomic data into health systems offers opportunities to identify genomic factors underlying the continuum of rare and common disease. We applied a population-scale haplotype association approach based on identity-by-descent (IBD) in a large multi-ethnic biobank to a spectrum of disease outcomes derived from electronic health records (EHRs) and uncovered a risk locus for liver disease. We used genome sequencing and in silico approaches to fine-map the signal to a non-coding variant (c.2784-12T>C) in the gene ABCB4. In vitro analysis confirmed the variant disrupted splicing of the ABCB4 pre-mRNA. Four of five homozygotes had evidence of advanced liver disease, and there was a significant association with liver disease among heterozygotes, suggesting the variant is linked to increased risk of liver disease in an allele dose-dependent manner. Population-level screening revealed the variant to be at a carrier rate of 1.95% in Puerto Rican individuals, likely as the result of a Puerto Rican founder effect. This work demonstrates that integrating EHR and genomic data at a population scale can facilitate strategies for understanding the continuum of genomic risk for common diseases, particularly in populations underrepresented in genomic medicine.


Assuntos
Atenção à Saúde/organização & administração , Predisposição Genética para Doença , Hepatopatias/genética , Subfamília B de Transportador de Cassetes de Ligação de ATP/genética , Registros Eletrônicos de Saúde , Haplótipos , Heterozigoto , Hispânico ou Latino/genética , Homozigoto , Humanos , Porto Rico
6.
Clin Gastroenterol Hepatol ; 21(10): 2578-2587.e11, 2023 09.
Artigo em Inglês | MEDLINE | ID: mdl-36610497

RESUMO

BACKGROUND & AIMS: Genetic variants affecting liver disease risk vary among racial and ethnic groups. Hispanics/Latinos in the United States have a high prevalence of PNPLA3 I148M, which increases liver disease risk, and a low prevalence of HSD17B13 predicted loss-of-function (pLoF) variants, which reduce risk. Less is known about the prevalence of liver disease-associated variants among Hispanic/Latino subpopulations defined by country of origin and genetic ancestry. We evaluated the prevalence of HSD17B13 pLoF variants and PNPLA3 I148M, and their associations with quantitative liver phenotypes in Hispanic/Latino participants from an electronic health record-linked biobank in New York City. METHODS: This study included 8739 adult Hispanic/Latino participants of the BioMe biobank with genotyping and exome sequencing data. We estimated the prevalence of Hispanic/Latino individuals harboring HSD17B13 and PNPLA3 variants, stratified by genetic ancestry, and performed association analyses between variants and liver enzymes and Fibrosis-4 (FIB-4) scores. RESULTS: Individuals with ancestry from Ecuador and Mexico had the lowest frequency of HSD17B13 pLoF variants (10%/7%) and the highest frequency of PNPLA3 I148M (54%/65%). These ancestry groups had the highest outpatient alanine aminotransferase (ALT) and aspartate aminotransferase (AST) levels, and the largest proportion of individuals with a FIB-4 score greater than 2.67. HSD17B13 pLoF variants were associated with reduced ALT level (P = .002), AST level (P < .001), and FIB-4 score (P = .045). PNPLA3 I148M was associated with increased ALT level, AST level, and FIB-4 score (P < .001 for all). HSD17B13 pLoF variants mitigated the increase in ALT conferred by PNPLA3 I148M (P = .006). CONCLUSIONS: Variation in HSD17B13 and PNPLA3 variants across genetic ancestry groups may contribute to differential risk for liver fibrosis among Hispanic/Latino individuals.


Assuntos
Cirrose Hepática , Hepatopatia Gordurosa não Alcoólica , Humanos , Predisposição Genética para Doença , Hispânico ou Latino/genética , Cirrose Hepática/enzimologia , Cirrose Hepática/genética , Hepatopatia Gordurosa não Alcoólica/enzimologia , Hepatopatia Gordurosa não Alcoólica/genética , Polimorfismo de Nucleotídeo Único
7.
Am J Med Genet B Neuropsychiatr Genet ; 192(7-8): 171-182, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37334860

RESUMO

Fibromyalgia is a complex disease of unclear etiology that is complicated by difficulties in diagnosis, treatment, and clinical heterogeneity. To clarify this etiology, healthcare-based data are leveraged to assess the influences on fibromyalgia in several domains. Prevalence is less than 1% of females in our population register data, and about 1/10th that in males. Fibromyalgia often presents with co-occurring conditions including back pain, rheumatoid arthritis, and anxiety. More comorbidities are identified with hospital-associated biobank data, falling into three broad categories of pain-related, autoimmune, and psychiatric disorders. Selecting representative phenotypes with published genome-wide association results for polygenic scoring, we confirm genetic predispositions to psychiatric, pain sensitivity, and autoimmune conditions show associations with fibromyalgia, although these may differ by ancestry group. We conduct a genome-wide association analysis of fibromyalgia in biobank samples, which did not result in any genome-wide significant loci; further studies with increased sample size are necessary to identify specific genetic effects on fibromyalgia. Overall, fibromyalgia appears to have strong clinical and likely genetic links to several disease categories, and could usefully be understood as a composite manifestation of these etiological sources.


Assuntos
Artrite Reumatoide , Fibromialgia , Masculino , Feminino , Humanos , Fibromialgia/genética , Fibromialgia/diagnóstico , Fibromialgia/epidemiologia , Estudo de Associação Genômica Ampla , Dor/genética , Dor/complicações , Dor/diagnóstico , Comorbidade , Artrite Reumatoide/complicações , Artrite Reumatoide/diagnóstico , Artrite Reumatoide/epidemiologia
8.
Pharmacogenomics J ; 21(2): 174-189, 2021 04.
Artigo em Inglês | MEDLINE | ID: mdl-33168928

RESUMO

The emergence of genomic data in biobanks and health systems offers new ways to derive medically important phenotypes, including acute phenotypes occurring during inpatient clinical care. Here we study the genetic underpinnings of the rapid response to phenylephrine, an α1-adrenergic receptor agonist commonly used to treat hypotension during anesthesia and surgery. We quantified this response by extracting blood pressure (BP) measurements 5 min before and after the administration of phenylephrine. Based on this derived phenotype, we show that systematic differences exist between self-reported ancestry groups: European-Americans (EA; n = 1387) have a significantly higher systolic response to phenylephrine than African-Americans (AA; n = 1217) and Hispanic/Latinos (HA; n = 1713) (31.3% increase, p value < 6e-08 and 22.9% increase, p value < 5e-05 respectively), after adjusting for genetic ancestry, demographics, and relevant clinical covariates. We performed a genome-wide association study to investigate genetic factors underlying individual differences in this derived phenotype. We discovered genome-wide significant association signals in loci and genes previously associated with BP measured in ambulatory settings, and a general enrichment of association in these genes. Finally, we discovered two low frequency variants, present at ~1% in EAs and AAs, respectively, where patients carrying one copy of these variants show no phenylephrine response. This work demonstrates our ability to derive a quantitative phenotype suited for comparative statistics and genome-wide association studies from dense clinical and physiological measures captured for managing patients during surgery. We identify genetic variants underlying non response to phenylephrine, with implications for preemptive pharmacogenomic screening to improve safety during surgery.


Assuntos
Adrenérgicos/uso terapêutico , Fenilefrina/uso terapêutico , Negro ou Afro-Americano/genética , Pressão Sanguínea/efeitos dos fármacos , Pressão Sanguínea/genética , Feminino , Estudo de Associação Genômica Ampla/métodos , Genômica/métodos , Humanos , Masculino , Pessoa de Meia-Idade , Período Perioperatório/métodos , Fenótipo , Polimorfismo de Nucleotídeo Único/genética , População Branca/genética
9.
J Hepatol ; 72(6): 1070-1081, 2020 06.
Artigo em Inglês | MEDLINE | ID: mdl-32145261

RESUMO

BACKGROUND & AIMS: The Ile138Met variant (rs738409) in the PNPLA3 gene has the largest effect on non-alcoholic fatty liver disease (NAFLD), increasing the risk of progression to severe forms of liver disease. It remains unknown if the variant plays a role in age of NAFLD onset. We aimed to determine if rs738409 impacts on the age of NAFLD diagnosis. METHODS: We applied a novel natural language processing (NLP) algorithm to a longitudinal electronic health records (EHR) dataset of >27,000 individuals with genetic data from a multi-ethnic biobank, defining NAFLD cases (n = 1,703) and confirming controls (n = 8,119). We conducted i) a survival analysis to determine if age at diagnosis differed by rs738409 genotype, ii) a receiver operating characteristics analysis to assess the utility of the rs738409 genotype in discriminating NAFLD cases from controls, and iii) a phenome-wide association study (PheWAS) between rs738409 and 10,095 EHR-derived disease diagnoses. RESULTS: The PNPLA3 G risk allele was associated with: i) earlier age of NAFLD diagnosis, with the strongest effect in Hispanics (hazard ratio 1.33; 95% CI 1.15-1.53; p <0.0001) among whom a NAFLD diagnosis was 15% more likely in risk allele carriers vs. non-carriers; ii) increased NAFLD risk (odds ratio 1.61; 95% CI 1.349-1.73; p <0.0001), with the strongest effect among Hispanics (odds ratio 1.43; 95% CI 1.28-1.59; p <0.0001); iii) additional liver diseases in a PheWAS (p <4.95 × 10-6) where the risk variant also associated with earlier age of diagnosis. CONCLUSION: Given the role of the rs738409 in NAFLD diagnosis age, our results suggest that stratifying risk within populations known to have an enhanced risk of liver disease, such as Hispanic carriers of the rs738409 variant, would be effective in earlier identification of those who would benefit most from early NAFLD prevention and treatment strategies. LAY SUMMARY: Despite clear associations between the PNPLA3 rs738409 variant and elevated risk of progression from non-alcoholic fatty liver disease (NAFLD) to more severe forms of liver disease, it remains unknown if PNPLA3 rs738409 plays a role in the age of NAFLD onset. Herein, we found that this risk variant is associated with an earlier age of NAFLD and other liver disease diagnoses; an observation most pronounced in Hispanic Americans. We conclude that PNPLA3 rs738409 could be used to better understand liver disease risk within vulnerable populations and identify patients that may benefit from early prevention strategies.


Assuntos
Bancos de Espécimes Biológicos , Lipase/genética , Proteínas de Membrana/genética , Hepatopatia Gordurosa não Alcoólica/diagnóstico , Hepatopatia Gordurosa não Alcoólica/genética , Polimorfismo de Nucleotídeo Único , Adolescente , Adulto , Fatores Etários , Idoso , Idoso de 80 Anos ou mais , Alelos , Estudos de Casos e Controles , Criança , Pré-Escolar , Registros Eletrônicos de Saúde , Feminino , Frequência do Gene , Predisposição Genética para Doença , Genótipo , Hispânico ou Latino/genética , Humanos , Lactente , Recém-Nascido , Estimativa de Kaplan-Meier , Estudos Longitudinais , Masculino , Pessoa de Meia-Idade , Hepatopatia Gordurosa não Alcoólica/etnologia , Hepatopatia Gordurosa não Alcoólica/mortalidade , Adulto Jovem
10.
BMC Genet ; 20(1): 52, 2019 07 02.
Artigo em Inglês | MEDLINE | ID: mdl-31266448

RESUMO

BACKGROUND: Genetic diversity is known to confer survival advantage in many species across the tree of life. Here, we hypothesize that such pattern applies to humans as well and could be a result of higher fitness in individuals with higher genomic heterozygosity. RESULTS: We use healthy aging as a proxy for better health and fitness, and observe greater heterozygosity in healthy-aged individuals. Specifically, we find that only common genetic variants show significantly higher excess of heterozygosity in the healthy-aged cohort. Lack of difference in heterozygosity for low-frequency variants or disease-associated variants excludes the possibility of compensation for deleterious recessive alleles as a mechanism. In addition, coding SNPs with the highest excess of heterozygosity in the healthy-aged cohort are enriched in genes involved in extracellular matrix and glycoproteins, a group of genes known to be under long-term balancing selection. We also find that individual heterozygosity rate is a significant predictor of electronic health record (EHR)-based estimates of 10-year survival probability in men but not in women, accounting for several factors including age and ethnicity. CONCLUSIONS: Our results demonstrate that the genomic heterozygosity is associated with human healthspan, and that the relationship between higher heterozygosity and healthy aging could be explained by heterozygote advantage. Further characterization of this relationship will have important implications in aging-associated disease risk prediction.


Assuntos
Genoma Humano , Estudo de Associação Genômica Ampla , Genômica , Envelhecimento Saudável/genética , Heterozigoto , Alelos , Feminino , Frequência do Gene , Variação Genética , Estudo de Associação Genômica Ampla/métodos , Genômica/métodos , Humanos , Masculino , Polimorfismo de Nucleotídeo Único
11.
Hum Genet ; 137(10): 847-862, 2018 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-30317457

RESUMO

Primary open angle glaucoma (POAG) is a complex disease with a major genetic contribution. Its prevalence varies greatly among ethnic groups, and is up to five times more frequent in black African populations compared to Europeans. So far, worldwide efforts to elucidate the genetic complexity of POAG in African populations has been limited. We conducted a genome-wide association study in 1113 POAG cases and 1826 controls from Tanzanian, South African and African American study samples. Apart from confirming evidence of association at TXNRD2 (rs16984299; OR[T] 1.20; P = 0.003), we found that a genetic risk score combining the effects of the 15 previously reported POAG loci was significantly associated with POAG in our samples (OR 1.56; 95% CI 1.26-1.93; P = 4.79 × 10-5). By genome-wide association testing we identified a novel candidate locus, rs141186647, harboring EXOC4 (OR[A] 0.48; P = 3.75 × 10-8), a gene transcribing a component of the exocyst complex involved in vesicle transport. The low frequency and high degree of genetic heterogeneity at this region hampered validation of this finding in predominantly West-African replication sets. Our results suggest that established genetic risk factors play a role in African POAG, however, they do not explain the higher disease load. The high heterogeneity within Africans remains a challenge to identify the genetic commonalities for POAG in this ethnicity, and demands studies of extremely large size.


Assuntos
População Negra/genética , Loci Gênicos , Estudo de Associação Genômica Ampla , Glaucoma de Ângulo Aberto/genética , Tiorredoxina Redutase 2/genética , Proteínas de Transporte Vesicular/genética , Idoso , Idoso de 80 Anos ou mais , Feminino , Humanos , Masculino , Pessoa de Meia-Idade
12.
Pac Symp Biocomput ; 28: 121-132, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-36540970

RESUMO

Groups of distantly related individuals who share a short segment of their genome identical-by-descent (IBD) can provide insights about rare traits and diseases in massive biobanks using IBD mapping. Clustering algorithms play an important role in finding these groups accurately and at scale. We set out to analyze the fitness of commonly used, fast and scalable clustering algorithms for IBD mapping applications. We designed a realistic benchmark for local IBD graphs and utilized it to compare the statistical power of clustering algorithms via simulating 2.3 million clusters across 850 experiments. We found Infomap and Markov Clustering (MCL) community detection methods to have high statistical power in most of the scenarios. They yield a 30% increase in power compared to the current state-of-art approach, with a 3 orders of magnitude lower runtime. We also found that standard clustering metrics, such as modularity, cannot predict statistical power of algorithms in IBD mapping applications. We extend our findings to real datasets by analyzing the Population Architecture using Genomics and Epidemiology (PAGE) Study dataset with 51,000 samples and 2 million shared segments on Chromosome 1, resulting in the extraction of 39 million local IBD clusters. We demonstrate the power of our approach by recovering signals of rare genetic variation in the Whole-Exome Sequence data of 200,000 individuals in the UK Biobank. We provide an efficient implementation to enable clustering at scale for IBD mapping for various populations and scenarios.Supplementary Information: The code, along with supplementary methods and figures are available at https://github.com/roohy/localIBDClustering.


Assuntos
Algoritmos , Biologia Computacional , Humanos , Genômica , Análise por Conglomerados
13.
Front Genet ; 14: 1181167, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37600667

RESUMO

Peripheral artery disease (PAD) is a form of atherosclerotic cardiovascular disease, affecting ∼8 million Americans, and is known to have racial and ethnic disparities. PAD has been reported to have a significantly higher prevalence in African Americans (AAs) compared to non-Hispanic European Americans (EAs). Hispanic/Latinos (HLs) have been reported to have lower or similar rates of PAD compared to EAs, despite having a paradoxically high burden of PAD risk factors; however, recent work suggests prevalence may differ between sub-groups. Here, we examined a large cohort of diverse adults in the BioMe biobank in New York City. We observed the prevalence of PAD at 1.7% in EAs vs. 8.5% and 9.4% in AAs and HLs, respectively, and among HL sub-groups, the prevalence was found at 11.4% and 11.5% in Puerto Rican and Dominican populations, respectively. Follow-up analysis that adjusted for common risk factors demonstrated that Dominicans had the highest increased risk for PAD relative to EAs [OR = 3.15 (95% CI 2.33-4.25), p < 6.44 × 10-14]. To investigate whether genetic factors may explain this increased risk, we performed admixture mapping by testing the association between local ancestry and PAD in Dominican BioMe participants (N = 1,813) separately from European, African, and Native American (NAT) continental ancestry tracts. The top association with PAD was an NAT ancestry tract at chromosome 2q35 [OR = 1.96 (SE = 0.16), p < 2.75 × 10-05) with 22.6% vs. 12.9% PAD prevalence in heterozygous NAT tract carriers versus non-carriers, respectively. Fine-mapping at this locus implicated tag SNP rs78529201 located within a long intergenic non-coding RNA (lincRNA) LINC00607, a gene expression regulator of key genes related to thrombosis and extracellular remodeling of endothelial cells, suggesting a putative link of the 2q35 locus to PAD etiology. Efforts to reproduce the signal in other Hispanic cohorts were unsuccessful. In summary, we showed how leveraging health system data helped understand nuances of PAD risk across HL sub-groups and admixture mapping approaches elucidated a putative risk locus in a Dominican population.

14.
medRxiv ; 2023 Mar 29.
Artigo em Inglês | MEDLINE | ID: mdl-37034679

RESUMO

Peripheral artery disease (PAD) is a form of atherosclerotic cardiovascular disease, affecting ∼8 million Americans, and is known to have racial and ethnic disparities. PAD has been reported to have significantly higher prevalence in African Americans (AAs) compared to non-Hispanic European Americans (EAs). Hispanic/Latinos (HLs) have been reported to have lower or similar rates of PAD compared to EAs, despite having a paradoxically high burden of PAD risk factors, however recent work suggests prevalence may differ between sub-groups. Here we examined a large cohort of diverse adults in the Bio Me biobank in New York City (NYC). We observed the prevalence of PAD at 1.7% in EAs vs 8.5% and 9.4% in AAs and HLs, respectively; and among HL sub-groups, at 11.4% and 11.5% in Puerto Rican and Dominican populations, respectively. Follow-up analysis that adjusted for common risk factors demonstrated that Dominicans had the highest increased risk for PAD relative to EAs (OR=3.15 (95% CI 2.33-4.25), P <6.44×10 -14 ). To investigate whether genetic factors may explain this increased risk, we performed admixture mapping by testing the association between local ancestry (LA) and PAD in Dominican Bio Me participants (N=1,940) separately for European (EUR), African (AFR) and Native American (NAT) continental ancestry tracts. We identified a NAT ancestry tract at chromosome 2q35 that was significantly associated with PAD (OR=2.05 (95% CI 1.51-2.78), P <4.06×10 -6 ) with 22.5% vs 12.5% PAD prevalence in heterozygous NAT tract carriers versus non-carriers, respectively. Fine-mapping at this locus implicated tag SNP rs78529201 located within a long intergenic non-coding RNA (lincRNA) LINC00607 , a gene expression regulator of key genes related to thrombosis and extracellular remodeling of endothelial cells, suggesting a putative link of the 2q35 locus to PAD etiology. In summary, we showed how leveraging health systems data helped understand nuances of PAD risk across HL sub-groups and admixture mapping approaches elucidated a novel risk locus in a Dominican population.

15.
Nat Genet ; 55(4): 549-558, 2023 04.
Artigo em Inglês | MEDLINE | ID: mdl-36941441

RESUMO

Individuals of admixed ancestries (for example, African Americans) inherit a mosaic of ancestry segments (local ancestry) originating from multiple continental ancestral populations. This offers the unique opportunity of investigating the similarity of genetic effects on traits across ancestries within the same population. Here we introduce an approach to estimate correlation of causal genetic effects (radmix) across local ancestries and analyze 38 complex traits in African-European admixed individuals (N = 53,001) to observe very high correlations (meta-analysis radmix = 0.95, 95% credible interval 0.93-0.97), much higher than correlation of causal effects across continental ancestries. We replicate our results using regression-based methods from marginal genome-wide association study summary statistics. We also report realistic scenarios where regression-based methods yield inflated heterogeneity-by-ancestry due to ancestry-specific tagging of causal effects, and/or polygenicity. Our results motivate genetic analyses that assume minimal heterogeneity in causal effects by ancestry, with implications for the inclusion of ancestry-diverse individuals in studies.


Assuntos
Genética Populacional , Herança Multifatorial , Humanos , Herança Multifatorial/genética , Estudo de Associação Genômica Ampla/métodos , Grupos Raciais/genética , Negro ou Afro-Americano/genética , Polimorfismo de Nucleotídeo Único/genética
16.
Fam Cancer ; 21(2): 235-239, 2022 04.
Artigo em Inglês | MEDLINE | ID: mdl-33886068

RESUMO

CDH1 pathogenic variants confer a markedly elevated lifetime risk of developing diffuse gastric cancer (DGC) and lobular breast cancer (LBC). The aim of this study was to evaluate the prevalence and clinical impact of CDH1 pathogenic variants in the unselected and ancestrally diverse BioMe Biobank. We evaluated exome sequence data from 30,223 adult BioMe participants to identify CDH1 positive individuals, defined as those harboring a variant previously classified as pathogenic or likely pathogenic or a predicted loss-of-function variant in CDH1. We reviewed electronic health records and BioMe enrollment surveys for personal and family history of malignancy and evidence of prior clinical genetic testing. Using a genomics-first approach, we identified 6 CDH1 positive individuals in BioMe (~ 1 in 5000). CDH1 positive individuals had a median age of 42 years (range 35-62 years), all were non-European by self-report, and one was female. None had evidence of either a personal or family history of DGC or LBC. Our findings suggest a low risk of DGC and LBC in unselected patients harboring a pathogenic variant in CDH1. Knowledge of CDH1-related cancer risk in individuals with no personal or family history may better inform surveillance and prophylactic measures.


Assuntos
Antígenos CD , Caderinas , Mutação em Linhagem Germinativa , Neoplasias Gástricas , Adulto , Antígenos CD/genética , Caderinas/genética , Feminino , Predisposição Genética para Doença , Humanos , Masculino , Pessoa de Meia-Idade , Risco , Neoplasias Gástricas/epidemiologia , Neoplasias Gástricas/genética , Neoplasias Gástricas/patologia , Sequenciamento do Exoma
17.
Front Genet ; 12: 722602, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-34567074

RESUMO

Identity-by-descent (IBD), the detection of shared segments inherited from a common ancestor, is a fundamental concept in genomics with broad applications in the characterization and analysis of genomes. While historically the concept of IBD was extensively utilized through linkage analyses and in studies of founder populations, applications of IBD-based methods subsided during the genome-wide association study era. This was primarily due to the computational expense of IBD detection, which becomes increasingly relevant as the field moves toward the analysis of biobank-scale datasets that encompass individuals from highly diverse backgrounds. To address these computational barriers, the past several years have seen new methodological advances enabling IBD detection for datasets in the hundreds of thousands to millions of individuals, enabling novel analyses at an unprecedented scale. Here, we describe the latest innovations in IBD detection and describe opportunities for the application of IBD-based methods across a broad range of questions in the field of genomics.

18.
Nat Commun ; 12(1): 3546, 2021 06 10.
Artigo em Inglês | MEDLINE | ID: mdl-34112768

RESUMO

The ability to identify segments of genomes identical-by-descent (IBD) is a part of standard workflows in both statistical and population genetics. However, traditional methods for finding local IBD across all pairs of individuals scale poorly leading to a lack of adoption in very large-scale datasets. Here, we present iLASH, an algorithm based on similarity detection techniques that shows equal or improved accuracy in simulations compared to current leading methods and speeds up analysis by several orders of magnitude on genomic datasets, making IBD estimation tractable for millions of individuals. We apply iLASH to the PAGE dataset of ~52,000 multi-ethnic participants, including several founder populations with elevated IBD sharing, identifying IBD segments in ~3 minutes per chromosome compared to over 6 days for a state-of-the-art algorithm. iLASH enables efficient analysis of very large-scale datasets, as we demonstrate by computing IBD across the UK Biobank (~500,000 individuals), detecting 12.9 billion pairwise connections.


Assuntos
Genética Populacional/métodos , Genômica/métodos , Algoritmos , Simulação por Computador , Bases de Dados Genéticas , Genoma Humano , Haplótipos , Humanos , Linhagem , Polimorfismo de Nucleotídeo Único , Controle de Qualidade , Reino Unido/epidemiologia , Reino Unido/etnologia
19.
Genome Med ; 13(1): 17, 2021 02 05.
Artigo em Inglês | MEDLINE | ID: mdl-33546753

RESUMO

BACKGROUND: Population-based genomic screening has the predicted ability to reduce morbidity and mortality associated with medically actionable conditions. However, much research is needed to develop standards for genomic screening and to understand the perspectives of people offered this new testing modality. This is particularly true for non-European ancestry populations who are vastly underrepresented in genomic medicine research. Therefore, we implemented a pilot genomic screening program in the BioMe Biobank in New York City, where the majority of participants are of non-European ancestry. METHODS: We initiated genomic screening for well-established genes associated with hereditary breast and ovarian cancer syndrome (HBOC), Lynch syndrome (LS), and familial hypercholesterolemia (FH). We evaluated and included an additional gene (TTR) associated with hereditary transthyretin amyloidosis (hATTR), which has a common founder variant in African ancestry populations. We evaluated the characteristics of 74 participants who received results associated with these conditions. We also assessed the preferences of 7461 newly enrolled BioMe participants to receive genomic results. RESULTS: In the pilot genomic screening program, 74 consented participants received results related to HBOC (N = 26), LS (N = 6), FH (N = 8), and hATTR (N = 34). Thirty-three of 34 (97.1%) participants who received a result related to hATTR were self-reported African American/African (AA) or Hispanic/Latinx (HL), compared to 14 of 40 (35.0%) participants who received a result related to HBOC, LS, or FH. Among the 7461 participants enrolled after the BioMe protocol modification to allow the return of genomic results, 93.4% indicated that they would want to receive results. Younger participants, women, and HL participants were more likely to opt to receive results. CONCLUSIONS: The addition of TTR to a pilot genomic screening program meant that we returned results to a higher proportion of AA and HL participants, in comparison with genes traditionally included in genomic screening programs in the USA. We found that the majority of participants in a multi-ethnic biobank are interested in receiving genomic results for medically actionable conditions. These findings increase knowledge about the perspectives of diverse research participants on receiving genomic results and inform the broader implementation of genomic medicine in underrepresented patient populations.


Assuntos
Testes Genéticos , Genética Populacional , Adolescente , Adulto , Fatores Etários , Idoso , Idoso de 80 Anos ou mais , Etnicidade/genética , Feminino , Medicina Genômica , Humanos , Masculino , Pessoa de Meia-Idade , Projetos Piloto , Inquéritos e Questionários , Adulto Jovem
20.
J Child Neurol ; 36(2): 93-98, 2021 02.
Artigo em Inglês | MEDLINE | ID: mdl-32928027

RESUMO

OBJECTIVE: To describe a founder mutation effect and the clinical phenotype of homozygous FRRS1L c.737_739delGAG (p.Gly246del) variant in 15 children of Puerto Rican (Boricua) ancestry presenting with early infantile epileptic encephalopathy (EIEE-37) with prominent movement disorder. BACKGROUND: EIEE-37 is caused by biallelic loss of function variants in the FRRS1L gene, which is critical for AMPA-receptor function, resulting in intractable epilepsy and dyskinesia. METHODS: A retrospective, multicenter chart review of patients sharing the same homozygous FRRS1L (p.Gly246del) pathogenic variant identified by clinical genetic testing. Clinical information was collected regarding neurodevelopmental outcomes, neuroimaging, electrographic features and clinical response to antiseizure medications. RESULTS: Fifteen patients from 12 different families of Puerto Rican ancestry were homozygous for the FRRS1L (p.Gly246del) pathogenic variant, with ages ranging from 1 to 25 years. The onset of seizures was from 6 to 24 months. All had hypotonia, severe global developmental delay, and most had hyperkinetic involuntary movements. Developmental regression during the first year of life was common (86%). Electroencephalogram showed hypsarrhythmia in 66% (10/15), with many older children evolving into Lennox-Gastaut syndrome. Six patients demonstrated progressive volume loss and/or cerebellar atrophy on brain magnetic resonance imaging (MRI). CONCLUSIONS: We describe the largest cohort to date of patients with epileptic encephalopathy. We estimate that 0.76% of unaffected individuals of Puerto Rican ancestry carry this pathogenic variant due to a founder effect. Children homozygous for the FRRS1L (p.Gly246del) Boricua variant exhibit a very homogenous phenotype of early developmental regression and epilepsy, starting with infantile spasms and evolving into Lennox-Gastaut syndrome with hyperkinetic movement disorder.


Assuntos
Hispânico ou Latino/genética , Síndrome de Lennox-Gastaut/genética , Proteínas de Membrana/genética , Mutação/genética , Proteínas do Tecido Nervoso/genética , Espasmos Infantis/genética , Adolescente , Adulto , Criança , Pré-Escolar , Estudos de Coortes , Eletroencefalografia , Feminino , Hispânico ou Latino/estatística & dados numéricos , Humanos , Lactente , Masculino , Porto Rico , Estudos Retrospectivos , Espasmos Infantis/fisiopatologia , Adulto Jovem
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA