Results 1 - 20 of 140
1.
Cell ; 184(8): 2068-2083.e11, 2021 04 15.
Article in English | MEDLINE | ID: mdl-33861964

ABSTRACT

Understanding population health disparities is an essential component of equitable precision health efforts. Epidemiology research often relies on definitions of race and ethnicity, but these population labels may not adequately capture disease burdens and environmental factors impacting specific sub-populations. Here, we propose a framework for repurposing data from electronic health records (EHRs) in concert with genomic data to explore the demographic ties that can impact disease burdens. Using data from a diverse biobank in New York City, we identified 17 communities sharing recent genetic ancestry. We observed 1,177 health outcomes that were statistically associated with a specific group and demonstrated significant differences in the segregation of genetic variants contributing to Mendelian diseases. We also demonstrated that fine-scale population structure can impact the prediction of complex disease risk within groups. This work reinforces the utility of linking genomic data to EHRs and provides a framework toward fine-scale monitoring of population health.


Subject(s)
Ethnicity/genetics , Population Health , Genetic Databases , Electronic Health Records , Genomics , Humans , Self Report
2.
Cell ; 177(1): 58-69, 2019 03 21.
Article in English | MEDLINE | ID: mdl-30901549

ABSTRACT

Personalized medicine has largely been enabled by the integration of genomic and other data with electronic health records (EHRs) in the United States and elsewhere. Increased EHR adoption across various clinical settings and the establishment of EHR-linked population-based biobanks provide unprecedented opportunities for the types of translational and implementation research that drive personalized medicine. We review advances in the digitization of health information and the proliferation of genomic research in health systems and provide insights into emerging paths for the widespread implementation of personalized medicine.


Subject(s)
Electronic Health Records/trends , Precision Medicine/methods , Precision Medicine/trends , Genetic Testing , Human Genome/genetics , Genomics/methods , Genomics/trends , Humans , United States
3.
Cell ; 173(7): 1692-1704.e11, 2018 06 14.
Article in English | MEDLINE | ID: mdl-29779949

ABSTRACT

Heritability is essential for understanding the biological causes of disease but requires laborious patient recruitment and phenotype ascertainment. Electronic health records (EHRs) passively capture a wide range of clinically relevant data and provide a resource for studying the heritability of traits that are not typically accessible. EHRs contain next-of-kin information collected via patient emergency contact forms, but until now, these data have gone unused in research. We mined emergency contact data at three academic medical centers and identified 7.4 million familial relationships while maintaining patient privacy. Identified relationships were consistent with genetically derived relatedness. We used EHR data to compute heritability estimates for 500 disease phenotypes. Overall, estimates were consistent with the literature and between sites. Inconsistencies were indicative of limitations and opportunities unique to EHR research. These analyses provide a validation of the use of EHRs for genetics and disease research.


Subject(s)
Electronic Health Records , Inborn Genetic Diseases/genetics , Algorithms , Factual Databases , Family Relations , Inborn Genetic Diseases/pathology , Genotype , Humans , Pedigree , Phenotype , Heritable Quantitative Trait
4.
Nat Rev Genet ; 25(1): 8-25, 2024 Jan.
Article in English | MEDLINE | ID: mdl-37620596

ABSTRACT

Polygenic risk scores (PRSs) summarize the genetic predisposition of a complex human trait or disease and may become a valuable tool for advancing precision medicine. However, PRSs that are developed in populations of predominantly European genetic ancestries can increase health disparities due to poor predictive performance in individuals of diverse and complex genetic ancestries. We describe genetic and modifiable risk factors that limit the transferability of PRSs across populations and review the strengths and weaknesses of existing PRS construction methods for diverse ancestries. Developing PRSs that benefit global populations in research and clinical settings provides an opportunity for innovation and is essential for health equity.


Subject(s)
Genetic Predisposition to Disease , Humans , Risk Factors , Multifactorial Inheritance , Precision Medicine , Genome-Wide Association Study
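
The abstract above describes a polygenic risk score as a summary of genetic predisposition. A minimal sketch of the most common form, a weighted sum of effect-allele dosages, is shown below. The variant identifiers, weights, and the function name `polygenic_risk_score` are hypothetical and chosen only for illustration; the review itself covers several construction methods and does not prescribe this one.

```python
from typing import Mapping


def polygenic_risk_score(
    dosages: Mapping[str, float], weights: Mapping[str, float]
) -> float:
    """Return the weighted sum of effect-allele dosages.

    dosages: effect-allele count per variant for one individual (0 to 2).
    weights: per-allele effect size per variant (for example, a log odds ratio).
    Variants without a dosage are skipped (an assumption of this sketch).
    """
    return sum(w * dosages[v] for v, w in weights.items() if v in dosages)


# Hypothetical effect sizes and genotypes, for illustration only.
example_weights = {"variant_a": 0.12, "variant_b": -0.05, "variant_c": 0.30}
example_dosages = {"variant_a": 2, "variant_b": 1, "variant_c": 0}
example_score = polygenic_risk_score(example_dosages, example_weights)
```

Because the weights are usually estimated in one population, the same sum can predict poorly in another, which is the transferability problem the abstract raises.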
5.
Nature ; 616(7958): 755-763, 2023 04.
Article in English | MEDLINE | ID: mdl-37046083

ABSTRACT

Mutations in a diverse set of driver genes increase the fitness of haematopoietic stem cells (HSCs), leading to clonal haematopoiesis [1]. These lesions are precursors for blood cancers [2-6], but the basis of their fitness advantage remains largely unknown, partly owing to a paucity of large cohorts in which the clonal expansion rate has been assessed by longitudinal sampling. Here, to circumvent this limitation, we developed a method to infer the expansion rate from data from a single time point. We applied this method to 5,071 people with clonal haematopoiesis. A genome-wide association study revealed that a common inherited polymorphism in the TCL1A promoter was associated with a slower expansion rate in clonal haematopoiesis overall, but the effect varied by driver gene. Those carrying this protective allele exhibited markedly reduced growth rates or prevalence of clones with driver mutations in TET2, ASXL1, SF3B1 and SRSF2, but this effect was not seen in clones with driver mutations in DNMT3A. TCL1A was not expressed in normal or DNMT3A-mutated HSCs, but the introduction of mutations in TET2 or ASXL1 led to the expression of TCL1A protein and the expansion of HSCs in vitro. The protective allele restricted TCL1A expression and expansion of mutant HSCs, as did experimental knockdown of TCL1A expression. Forced expression of TCL1A promoted the expansion of human HSCs in vitro and mouse HSCs in vivo. Our results indicate that the fitness advantage of several commonly mutated driver genes in clonal haematopoiesis may be mediated by TCL1A activation.


Subject(s)
Clonal Hematopoiesis , Hematopoietic Stem Cells , Animals , Humans , Mice , Alleles , Clonal Hematopoiesis/genetics , Genome-Wide Association Study , Hematopoiesis/genetics , Hematopoietic Stem Cells/cytology , Hematopoietic Stem Cells/metabolism , Mutation , Genetic Promoter Regions
6.
Nature ; 604(7906): 437-446, 2022 04.
Article in English | MEDLINE | ID: mdl-35444317

ABSTRACT

The human reference genome is the most widely used resource in human genetics and is due for a major update. Its current structure is a linear composite of merged haplotypes from more than 20 people, with a single individual comprising most of the sequence. It contains biases and errors within a framework that does not represent global human genomic variation. A high-quality reference with global representation of common variants, including single-nucleotide variants, structural variants and functional elements, is needed. The Human Pangenome Reference Consortium aims to create a more sophisticated and complete human reference genome with a graph-based, telomere-to-telomere representation of global genomic diversity. Here we leverage innovations in technology, study design and global partnerships with the goal of constructing the highest-possible quality human pangenome reference. Our goal is to improve data representation and streamline analyses to enable routine assembly of complete diploid genomes. With attention to ethical frameworks, the human pangenome reference will contain a more accurate and diverse representation of global genomic variation, improve gene-disease association studies across populations, expand the scope of genomics research to the most repetitive and polymorphic regions of the genome, and serve as the ultimate genetic resource for future biomedical research and precision medicine.


Subject(s)
Human Genome , Genomics , Human Genome/genetics , Haplotypes/genetics , High-Throughput Nucleotide Sequencing , Humans , DNA Sequence Analysis
7.
Annu Rev Genomics Hum Genet ; 25(1): 77-104, 2024 Aug.
Article in English | MEDLINE | ID: mdl-38663087

ABSTRACT

The Human Genome Project was an enormous accomplishment, providing a foundation for countless explorations into the genetics and genomics of the human species. Yet for many years, the human genome reference sequence remained incomplete and lacked representation of human genetic diversity. Recently, two major advances have emerged to address these shortcomings: complete gap-free human genome sequences, such as the one developed by the Telomere-to-Telomere Consortium, and high-quality pangenomes, such as the one developed by the Human Pangenome Reference Consortium. Facilitated by advances in long-read DNA sequencing and genome assembly algorithms, complete human genome sequences resolve regions that have been historically difficult to sequence, including centromeres, telomeres, and segmental duplications. In parallel, pangenomes capture the extensive genetic diversity across populations worldwide. Together, these advances usher in a new era of genomics research, enhancing the accuracy of genomic analysis, paving the path for precision medicine, and contributing to deeper insights into human biology.


Subject(s)
Human Genome , Human Genome Project , Humans , Genetic Variation , Genomics/methods , DNA Sequence Analysis/methods , Telomere/genetics
8.
Am J Hum Genet ; 111(5): 990-995, 2024 05 02.
Article in English | MEDLINE | ID: mdl-38636510

ABSTRACT

Since genotype imputation was introduced, researchers have been relying on the estimated imputation quality from imputation software to perform post-imputation quality control (QC). However, this quality estimate (denoted as Rsq) performs less well for lower-frequency variants. We recently published MagicalRsq, a machine-learning-based imputation quality calibration, which leverages additional typed markers from the same cohort and outperforms Rsq as a QC metric. In this work, we extended the original MagicalRsq to allow cross-cohort model training and named the new model MagicalRsq-X. We removed the cohort-specific estimated minor allele frequency and included linkage disequilibrium scores and recombination rates as additional features. Leveraging whole-genome sequencing data from TOPMed, specifically participants in the BioMe, JHS, WHI, and MESA studies, we performed comprehensive cross-cohort evaluations for predominantly European and African ancestral individuals based on their inferred global ancestry with the 1000 Genomes and Human Genome Diversity Project data as reference. Our results suggest MagicalRsq-X outperforms Rsq in almost every setting, with 7.3%-14.4% improvement in squared Pearson correlation with true R², corresponding to 85-218 K variant gains. We further developed a metric to quantify the genetic distances of a target cohort relative to a reference cohort and showed that such metric largely explained the performance of MagicalRsq-X models. Finally, we found MagicalRsq-X saved up to 53 known genome-wide significant variants in one of the largest blood cell trait GWASs that would be missed using the original Rsq for QC. In conclusion, MagicalRsq-X shows superiority for post-imputation QC and benefits genetic studies by distinguishing well and poorly imputed lower-frequency variants.


Subject(s)
Gene Frequency , Genotype , Single Nucleotide Polymorphism , Software , Humans , Cohort Studies , Linkage Disequilibrium , Genome-Wide Association Study/methods , Human Genome , Quality Control , Machine Learning , Whole Genome Sequencing/standards , Whole Genome Sequencing/methods
9.
Am J Hum Genet ; 111(6): 999-1005, 2024 06 06.
Article in English | MEDLINE | ID: mdl-38688278

ABSTRACT

The differential performance of polygenic risk scores (PRSs) by group is one of the major ethical barriers to their clinical use. It is also one of the main practical challenges for any implementation effort. The social repercussions of how people are grouped in PRS research must be considered in communications with research participants, including return of results. Here, we outline the decisions faced and choices made by a large multi-site clinical implementation study returning PRSs to diverse participants in handling this issue of differential performance. Our approach to managing the complexities associated with the differential performance of PRSs serves as a case study that can help future implementers of PRSs to plot an anticipatory course in response to this issue.


Subject(s)
Genetic Predisposition to Disease , Multifactorial Inheritance , Humans , Multifactorial Inheritance/genetics , Risk Factors , Genome-Wide Association Study , Risk Assessment , Genetic Testing/methods , Genetic Risk Score
10.
Am J Hum Genet ; 110(11): 1853-1862, 2023 11 02.
Article in English | MEDLINE | ID: mdl-37875120

ABSTRACT

The heritability explained by local ancestry markers in an admixed population ($h_\gamma^2$) provides crucial insight into the genetic architecture of a complex disease or trait. Estimation of $h_\gamma^2$ can be susceptible to biases due to population structure in ancestral populations. Here, we present heritability estimation from admixture mapping summary statistics (HAMSTA), an approach that uses summary statistics from admixture mapping to infer heritability explained by local ancestry while adjusting for biases due to ancestral stratification. Through extensive simulations, we demonstrate that HAMSTA $h_\gamma^2$ estimates are approximately unbiased and are robust to ancestral stratification compared to existing approaches. In the presence of ancestral stratification, we show a HAMSTA-derived sampling scheme provides a calibrated family-wise error rate (FWER) of ∼5% for admixture mapping, unlike existing FWER estimation approaches. We apply HAMSTA to 20 quantitative phenotypes of up to 15,988 self-reported African American individuals in the Population Architecture using Genomics and Epidemiology (PAGE) study. We observe $\hat{h}_\gamma^2$ in the 20 phenotypes range from 0.0025 to 0.033 (mean $\hat{h}_\gamma^2 = 0.012 \pm 9.2 \times 10^{-4}$), which translates to $\hat{h}^2$ ranging from 0.062 to 0.85 (mean $\hat{h}^2 = 0.30 \pm 0.023$). Across these phenotypes we find little evidence of inflation due to ancestral population stratification in current admixture mapping studies (mean inflation factor of 0.99 ± 0.001). Overall, HAMSTA provides a fast and powerful approach to estimate genome-wide heritability and evaluate biases in test statistics of admixture mapping studies.


Subject(s)
Black or African American , Population Genetics , Humans , Chromosome Mapping , Phenotype , Single Nucleotide Polymorphism/genetics
11.
Am J Hum Genet ; 110(12): 2029-2041, 2023 Dec 07.
Article in English | MEDLINE | ID: mdl-38006881

ABSTRACT

Digital solutions are needed to support rapid increases in the application of genetic/genomic tests (GTs) in diverse clinical settings and patient populations. We developed GUÍA, a bilingual digital application that facilitates disclosure of GT results. The NYCKidSeq randomized controlled trial enrolled diverse children with neurologic, cardiac, and immunologic conditions who underwent GTs. The trial evaluated GUÍA's impact on understanding the GT results by randomizing families to results disclosure genetic counseling with GUÍA (intervention) or standard of care (SOC). Parents/legal guardians (participants) completed surveys at baseline, post-results disclosure, and 6 months later. Survey measures assessed the primary study outcomes of participants' perceived understanding of and confidence in explaining their child's GT results and the secondary outcome of objective understanding. The analysis included 551 diverse participants, 270 in the GUÍA arm and 281 in SOC. Participants in the GUÍA arm had significantly higher perceived understanding post-results (OR = 2.8, CI[1.004, 7.617], p = 0.049) and maintained higher objective understanding over time (OR = 1.1, CI[1.004, 1.127], p = 0.038) compared to SOC. There was no impact on perceived confidence. Hispanic/Latino(a) individuals in the GUÍA arm maintained higher perceived understanding (OR = 3.9, CI[1.603, 9.254], p = 0.003), confidence (OR = 2.7, CI[1.021, 7.277], p = 0.046), and objective understanding (OR = 1.1, CI[1.009, 1.212], p = 0.032) compared to SOC. This trial demonstrates that GUÍA positively impacts understanding of GT results in diverse parents of children with suspected genetic conditions and builds a case for utilizing GUÍA to deliver complex results. Continued development and evaluation of digital applications in diverse populations are critical for equitably scaling GT offerings in specialty clinics.


Subject(s)
Disclosure , Genetic Counseling , Child , Humans , Genetic Testing , Parents , Genomics
12.
Nature ; 577(7789): 179-189, 2020 01.
Article in English | MEDLINE | ID: mdl-31915397

ABSTRACT

A primary goal of human genetics is to identify DNA sequence variants that influence biomedical traits, particularly those related to the onset and progression of human disease. Over the past 25 years, progress in realizing this objective has been transformed by advances in technology, foundational genomic resources and analytical tools, and by access to vast amounts of genotype and phenotype data. Genetic discoveries have substantially improved our understanding of the mechanisms responsible for many rare and common diseases and driven development of novel preventative and therapeutic strategies. Medical innovation will increasingly focus on delivering care tailored to individual patterns of genetic predisposition.


Subject(s)
Genetic Variation , Animals , Genetic Testing , Genomics , Genotype , Humans , Phenotype , Rare Diseases/genetics
13.
Nature ; 582(7811): 234-239, 2020 06.
Article in English | MEDLINE | ID: mdl-32499652

ABSTRACT

On average, Peruvian individuals are among the shortest in the world [1]. Here we show that Native American ancestry is associated with reduced height in an ethnically diverse group of Peruvian individuals, and identify a population-specific, missense variant in the FBN1 gene (E1297G) that is significantly associated with lower height. Each copy of the minor allele (frequency of 4.7%) reduces height by 2.2 cm (4.4 cm in homozygous individuals). To our knowledge, this is the largest effect size known for a common height-associated variant. FBN1 encodes the extracellular matrix protein fibrillin 1, which is a major structural component of microfibrils. We observed less densely packed fibrillin-1-rich microfibrils with irregular edges in the skin of individuals who were homozygous for G1297 compared with individuals who were homozygous for E1297. Moreover, we show that the E1297G locus is under positive selection in non-African populations, and that the E1297 variant shows subtle evidence of positive selection specifically within the Peruvian population. This variant is also significantly more frequent in coastal Peruvian populations than in populations from the Andes or the Amazon, which suggests that short stature might be the result of adaptation to factors that are associated with the coastal environment in Peru.


Subject(s)
Body Height/genetics , Fibrillin-1/genetics , Missense Mutation , Genetic Selection , Female , Gene Frequency , Genome-Wide Association Study , Heredity , Humans , South American Indians/genetics , Male , Microfibrils/chemistry , Microfibrils/genetics , Peru
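
The per-allele effect reported in the abstract above (2.2 cm per copy, 4.4 cm in homozygotes, minor-allele frequency 4.7%) follows an additive model. The sketch below works through that arithmetic. The Hardy-Weinberg genotype frequencies are an assumption added here for illustration and are not stated in the abstract; the function names are likewise hypothetical.

```python
PER_ALLELE_EFFECT_CM = -2.2  # reported height change per copy of the minor allele
MINOR_ALLELE_FREQUENCY = 0.047  # reported minor-allele frequency (4.7%)


def expected_height_shift_cm(copies: int) -> float:
    """Additive model: the shift scales linearly with the number of copies."""
    if copies not in (0, 1, 2):
        raise ValueError("copies must be 0, 1, or 2")
    return copies * PER_ALLELE_EFFECT_CM


def hwe_genotype_frequencies(q: float) -> dict[int, float]:
    """Genotype frequencies by minor-allele count, assuming Hardy-Weinberg
    equilibrium (an assumption of this sketch, not a claim of the study)."""
    p = 1.0 - q
    return {0: p * p, 1: 2.0 * p * q, 2: q * q}


genotype_freqs = hwe_genotype_frequencies(MINOR_ALLELE_FREQUENCY)

# Average shift across the population: sum over genotypes of frequency times effect.
population_mean_shift_cm = sum(
    freq * expected_height_shift_cm(copies)
    for copies, freq in genotype_freqs.items()
)
```

Under these assumptions the population-average shift equals twice the allele frequency times the per-allele effect, so a large per-allele effect at a frequency of 4.7% still moves the population mean by only a fraction of a centimetre.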
14.
Am J Hum Genet ; 109(5): 871-884, 2022 05 05.
Article in English | MEDLINE | ID: mdl-35349783

ABSTRACT

Since 2005, genome-wide association (GWA) datasets have been largely biased toward sampling European ancestry individuals, and recent studies have shown that GWA results estimated from self-identified European individuals are not transferable to non-European individuals because of various confounding challenges. Here, we demonstrate that enrichment analyses that aggregate SNP-level association statistics at multiple genomic scales (from genes to genomic regions and pathways) have been underutilized in the GWA era and can generate biologically interpretable hypotheses regarding the genetic basis of complex trait architecture. We illustrate examples of the robust associations generated by enrichment analyses while studying 25 continuous traits assayed in 566,786 individuals from seven diverse self-identified human ancestries in the UK Biobank and the Biobank Japan as well as 44,348 admixed individuals from the PAGE consortium including cohorts of African American, Hispanic and Latin American, Native Hawaiian, and American Indian/Alaska Native individuals. We identify 1,000 gene-level associations that are genome-wide significant in at least two ancestry cohorts across these 25 traits as well as highly conserved pathway associations with triglyceride levels in European, East Asian, and Native Hawaiian cohorts.


Subject(s)
Genome-Wide Association Study , Single Nucleotide Polymorphism , Genome-Wide Association Study/methods , Humans , Multifactorial Inheritance , Phenotype , Single Nucleotide Polymorphism/genetics , Racial Groups
15.
Am J Hum Genet ; 109(9): 1653-1666, 2022 09 01.
Article in English | MEDLINE | ID: mdl-35981533

ABSTRACT

Understanding the genetic basis of human diseases and traits is dependent on the identification and accurate genotyping of genetic variants. Deep whole-genome sequencing (WGS), the gold standard technology for SNP and indel identification and genotyping, remains very expensive for most large studies. Here, we quantify the extent to which array genotyping followed by genotype imputation can approximate WGS in studies of individuals of African, Hispanic/Latino, and European ancestry in the US and of Finnish ancestry in Finland (a population isolate). For each study, we performed genotype imputation by using the genetic variants present on the Illumina Core, OmniExpress, MEGA, and Omni 2.5M arrays with the 1000G, HRC, and TOPMed imputation reference panels. Using the Omni 2.5M array and the TOPMed panel, ≥90% of bi-allelic single-nucleotide variants (SNVs) are well imputed (r² > 0.8) down to minor-allele frequencies (MAFs) of 0.14% in African, 0.11% in Hispanic/Latino, 0.35% in European, and 0.85% in Finnish ancestries. There was little difference in TOPMed-based imputation quality among the arrays with >700k variants. Individual-level imputation quality varied widely between and within the three US studies. Imputation quality also varied across genomic regions, producing regions where even common (MAF > 5%) variants were consistently not well imputed across ancestries. The extent to which array genotyping and imputation can approximate WGS therefore depends on reference panel, genotype array, sample ancestry, and genomic location. Imputation quality by variant or genomic region can be queried with our new tool, RsqBrowser, now deployed on the Michigan Imputation Server.


Subject(s)
High-Throughput Nucleotide Sequencing , Single Nucleotide Polymorphism , Gene Frequency/genetics , Genome-Wide Association Study , Genotype , Humans , Single Nucleotide Polymorphism/genetics , Whole Genome Sequencing
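
The abstract above calls a variant well imputed when r² exceeds 0.8. One common way to compute such a value, when sequenced genotypes are available as truth, is the squared Pearson correlation between imputed dosages and true genotypes across individuals. The sketch below assumes that definition; the function names are hypothetical and this is not the code behind RsqBrowser or the imputation software's own Rsq estimate.

```python
from typing import Sequence


def squared_pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Squared Pearson correlation between two equal-length samples."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("inputs must have equal length of at least 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        # A monomorphic variant has no variance, so r² is undefined.
        return float("nan")
    return (sxy * sxy) / (sxx * syy)


def is_well_imputed(
    imputed_dosages: Sequence[float],
    true_genotypes: Sequence[float],
    threshold: float = 0.8,
) -> bool:
    """Apply the r² > 0.8 cut-off quoted in the abstract (NaN counts as not well imputed)."""
    return squared_pearson(imputed_dosages, true_genotypes) > threshold
```

For a rare variant most individuals carry zero copies, so a handful of miscalled carriers can pull r² below the threshold; this is one reason imputation quality depends on minor-allele frequency.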
16.
Am J Hum Genet ; 109(6): 1175-1181, 2022 06 02.
Article in English | MEDLINE | ID: mdl-35504290

ABSTRACT

Current publicly available tools that allow rapid exploration of linkage disequilibrium (LD) between markers (e.g., HaploReg and LDlink) are based on whole-genome sequence (WGS) data from 2,504 individuals in the 1000 Genomes Project. Here, we present TOP-LD, an online tool to explore LD inferred with high-coverage (∼30×) WGS data from 15,578 individuals in the NHLBI Trans-Omics for Precision Medicine (TOPMed) program. TOP-LD provides a significant upgrade compared to current LD tools, as the TOPMed WGS data provide a more comprehensive representation of genetic variation than the 1000 Genomes data, particularly for rare variants and in the specific populations that we analyzed. For example, TOP-LD encompasses LD information for 150.3, 62.2, and 36.7 million variants for European, African, and East Asian ancestral samples, respectively, offering 2.6- to 9.1-fold increase in variant coverage compared to HaploReg 4.0 or LDlink. In addition, TOP-LD includes tens of thousands of structural variants (SVs). We demonstrate the value of TOP-LD in fine-mapping at the GGT1 locus associated with gamma glutamyltransferase in the African ancestry participants in UK Biobank. Beyond fine-mapping, TOP-LD can facilitate a wide range of applications that are based on summary statistics and estimates of LD. TOP-LD is freely available online.


Subject(s)
Genome-Wide Association Study , Precision Medicine , Asian People , Humans , Linkage Disequilibrium/genetics , Single Nucleotide Polymorphism/genetics , Whole Genome Sequencing
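
Tools such as the one described in the abstract above report linkage disequilibrium between pairs of markers. A minimal sketch of the standard r² statistic for two bi-allelic markers, computed from phased haplotypes, is given below. It assumes alleles are coded 0/1 and that both lists cover the same haplotypes in the same order; the function name `ld_r2` is hypothetical and this is not TOP-LD's implementation.

```python
from typing import Sequence


def ld_r2(hap_a: Sequence[int], hap_b: Sequence[int]) -> float:
    """r² between two bi-allelic markers from phased 0/1 haplotypes.

    D = P(AB) - P(A) * P(B), and r² = D² / (pA * (1 - pA) * pB * (1 - pB)).
    """
    n = len(hap_a)
    if n == 0 or n != len(hap_b):
        raise ValueError("haplotype lists must be non-empty and equal in length")
    p_a = sum(hap_a) / n
    p_b = sum(hap_b) / n
    p_ab = sum(1 for a, b in zip(hap_a, hap_b) if a == 1 and b == 1) / n
    d = p_ab - p_a * p_b
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        # At least one marker is monomorphic in this sample.
        return float("nan")
    return (d * d) / denom
```

Because the statistic depends on allele and haplotype frequencies, the same pair of variants can show different r² values in different ancestry groups, which is why LD resources are typically stratified by population.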
17.
Am J Hum Genet ; 109(4): 669-679, 2022 04 07.
Article in English | MEDLINE | ID: mdl-35263625

ABSTRACT

One mechanism by which genetic factors influence complex traits and diseases is altering gene expression. Direct measurement of gene expression in relevant tissues is rarely tenable; however, genetically regulated gene expression (GReX) can be estimated using prediction models derived from large multi-omic datasets. These approaches have led to the discovery of many gene-trait associations, but whether models derived from predominantly European ancestry (EA) reference panels can map novel associations in ancestrally diverse populations remains unclear. We applied PrediXcan to impute GReX in 51,520 ancestrally diverse Population Architecture using Genomics and Epidemiology (PAGE) participants (35% African American, 45% Hispanic/Latino, 10% Asian, and 7% Hawaiian) across 25 key cardiometabolic traits and relevant tissues to identify 102 novel associations. We then compared associations in PAGE to those in a random subset of 50,000 White British participants from UK Biobank (UKBB50k) for height and body mass index (BMI). We identified 517 associations across 47 tissues in PAGE but not UKBB50k, demonstrating the importance of diverse samples in identifying trait-associated GReX. We observed that variants used in PrediXcan models were either more or less differentiated across continental-level populations than matched-control variants depending on the specific population reflecting sampling bias. Additionally, variants from identified genes specific to either PAGE or UKBB50k analyses were more ancestrally differentiated than those in genes detected in both analyses, underlining the value of population-specific discoveries. This suggests that while EA-derived transcriptome imputation models can identify new associations in non-EA populations, models derived from closely matched reference panels may yield further insights. Our findings call for more diversity in reference datasets of tissue-specific gene expression.


Subject(s)
Cardiovascular Diseases , Genome-Wide Association Study , Genetic Predisposition to Disease , Humans , Life Style , Single Nucleotide Polymorphism , Transcriptome
18.
Nucleic Acids Res ; 51(D1): D1300-D1311, 2023 01 06.
Article in English | MEDLINE | ID: mdl-36350676

ABSTRACT

Large biobank-scale whole genome sequencing (WGS) studies are rapidly identifying a multitude of coding and non-coding variants. They provide an unprecedented resource for illuminating the genetic basis of human diseases. Variant functional annotations play a critical role in WGS analysis, result interpretation, and prioritization of disease- or trait-associated causal variants. Existing functional annotation databases have limited scope to perform online queries and functionally annotate the genotype data of large biobank-scale WGS studies. We develop the Functional Annotation of Variants Online Resources (FAVOR) to meet these pressing needs. FAVOR provides a comprehensive multi-faceted variant functional annotation online portal that summarizes and visualizes findings of all possible nine billion single nucleotide variants (SNVs) across the genome. It allows for rapid variant-, gene- and region-level queries of variant functional annotations. FAVOR integrates variant functional information from multiple sources to describe the functional characteristics of variants and facilitates prioritizing plausible causal variants influencing human phenotypes. Furthermore, we provide a scalable annotation tool, FAVORannotator, to functionally annotate large-scale WGS studies and efficiently store the genotype and their variant functional annotation data in a single file using the annotated Genomic Data Structure (aGDS) format, making downstream analysis more convenient. FAVOR and FAVORannotator are available at https://favor.genohub.org.


Subject(s)
Human Genome , Software , Humans , Molecular Sequence Annotation , Genomics , Genotype , Genetic Variation
19.
Am J Hum Genet ; 108(11): 2099-2111, 2021 11 04.
Article in English | MEDLINE | ID: mdl-34678161

ABSTRACT

The integration of genomic data into health systems offers opportunities to identify genomic factors underlying the continuum of rare and common disease. We applied a population-scale haplotype association approach based on identity-by-descent (IBD) in a large multi-ethnic biobank to a spectrum of disease outcomes derived from electronic health records (EHRs) and uncovered a risk locus for liver disease. We used genome sequencing and in silico approaches to fine-map the signal to a non-coding variant (c.2784-12T>C) in the gene ABCB4. In vitro analysis confirmed the variant disrupted splicing of the ABCB4 pre-mRNA. Four of five homozygotes had evidence of advanced liver disease, and there was a significant association with liver disease among heterozygotes, suggesting the variant is linked to increased risk of liver disease in an allele dose-dependent manner. Population-level screening revealed the variant to be at a carrier rate of 1.95% in Puerto Rican individuals, likely as the result of a Puerto Rican founder effect. This work demonstrates that integrating EHR and genomic data at a population scale can facilitate strategies for understanding the continuum of genomic risk for common diseases, particularly in populations underrepresented in genomic medicine.


Subject(s)
Delivery of Health Care/organization & administration , Genetic Predisposition to Disease , Liver Diseases/genetics , ATP Binding Cassette Transporter Subfamily B/genetics , Electronic Health Records , Haplotypes , Heterozygote , Hispanic or Latino/genetics , Homozygote , Humans , Puerto Rico
20.
Genet Med ; 26(1): 101011, 2024 Jan.
Article in English | MEDLINE | ID: mdl-37897232

ABSTRACT

PURPOSE: To better understand the effects of returning diagnostic sequencing results on clinical actions and economic outcomes for pediatric patients with suspected genetic disorders. METHODS: Longitudinal physician claims data after diagnostic sequencing were obtained for patients aged 0 to 21 years with neurologic, cardiac, and immunologic disorders with suspected genetic etiology. We assessed specialist consultation rates prompted by primary diagnostic results, as well as marginal effects on overall 18-month physician services and costs. RESULTS: We included data on 857 patients (median age: 9.6 years) with a median follow-up of 17.3 months after disclosure of diagnostic sequencing results. The likelihood of having ≥1 recommendation for specialist consultation in 155 patients with positive findings was high (72%) vs 23% in 443 patients with uncertain findings and 21% in 259 patients with negative findings (P < .001). Follow-through consultation occurred in 30%. Increases in 18-month physician services and costs following a positive finding diminished after multivariable adjustment. Also, no significant differences between those with uncertain and negative findings were demonstrated. CONCLUSION: Our study did not provide evidence for significant increases in downstream physician services and costs after returning positive or uncertain diagnostic sequencing findings. More large-scale longitudinal studies are needed to confirm these findings.


Subject(s)
Disclosure , Physicians , Humans , Child , Costs and Cost Analysis