Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 78
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
Bioinformatics ; 38(23): 5279-5287, 2022 11 30.
Artigo em Inglês | MEDLINE | ID: mdl-36222570

RESUMO

MOTIVATION: Despite the increasing evidence of utility of genomic medicine in clinical practice, systematically integrating genomic medicine information and knowledge into clinical systems with a high-level of consistency, scalability and computability remains challenging. A comprehensive terminology is required for relevant concepts and the associated knowledge model for representing relationships. In this study, we leveraged PharmGKB, a comprehensive pharmacogenomics (PGx) knowledgebase, to formulate a terminology for drug response phenotypes that can represent relationships between genetic variants and treatments. We evaluated coverage of the terminology through manual review of a randomly selected subset of 200 sentences extracted from genetic reports that contained concepts for 'Genes and Gene Products' and 'Treatments'. RESULTS: Results showed that our proposed drug response phenotype terminology could cover 96% of the drug response phenotypes in genetic reports. Among 18 653 sentences that contained both 'Genes and Gene Products' and 'Treatments', 3011 sentences were able to be mapped to a drug response phenotype in our proposed terminology, among which the most discussed drug response phenotypes were response (994), sensitivity (829) and survival (332). In addition, we were able to re-analyze genetic report context incorporating the proposed terminology and enrich our previously proposed PGx knowledge model to reveal relationships between genetic variants and treatments. In conclusion, we proposed a drug response phenotype terminology that enhanced structured knowledge representation of genomic medicine. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Medicina Genômica , Farmacogenética , Farmacogenética/métodos , Bases de Conhecimento , Fenótipo
2.
Genet Med ; 25(4): 100006, 2023 04.
Artigo em Inglês | MEDLINE | ID: mdl-36621880

RESUMO

PURPOSE: Assessing the risk of common, complex diseases requires consideration of clinical risk factors as well as monogenic and polygenic risks, which in turn may be reflected in family history. Returning risks to individuals and providers may influence preventive care or use of prophylactic therapies for those individuals at high genetic risk. METHODS: To enable integrated genetic risk assessment, the eMERGE (electronic MEdical Records and GEnomics) network is enrolling 25,000 diverse individuals in a prospective cohort study across 10 sites. The network developed methods to return cross-ancestry polygenic risk scores, monogenic risks, family history, and clinical risk assessments via a genome-informed risk assessment (GIRA) report and will assess uptake of care recommendations after return of results. RESULTS: GIRAs include summary care recommendations for 11 conditions, education pages, and clinical laboratory reports. The return of high-risk GIRA to individuals and providers includes guidelines for care and lifestyle recommendations. Assembling the GIRA required infrastructure and workflows for ingesting and presenting content from multiple sources. Recruitment began in February 2022. CONCLUSION: Return of a novel report for communicating monogenic, polygenic, and family history-based risk factors will inform the benefits of integrated genetic risk assessment for routine health care.


Assuntos
Genoma , Genômica , Humanos , Estudos Prospectivos , Genômica/métodos , Fatores de Risco , Medição de Risco
3.
BMC Med Inform Decis Mak ; 22(1): 23, 2022 01 28.
Artigo em Inglês | MEDLINE | ID: mdl-35090449

RESUMO

INTRODUCTION: Currently, one of the commonly used methods for disseminating electronic health record (EHR)-based phenotype algorithms is providing a narrative description of the algorithm logic, often accompanied by flowcharts. A challenge with this mode of dissemination is the potential for under-specification in the algorithm definition, which leads to ambiguity and vagueness. METHODS: This study examines incidents of under-specification that occurred during the implementation of 34 narrative phenotyping algorithms in the electronic Medical Record and Genomics (eMERGE) network. We reviewed the online communication history between algorithm developers and implementers within the Phenotype Knowledge Base (PheKB) platform, where questions could be raised and answered regarding the intended implementation of a phenotype algorithm. RESULTS: We developed a taxonomy of under-specification categories via an iterative review process between two groups of annotators. Under-specifications that lead to ambiguity and vagueness were consistently found across narrative phenotype algorithms developed by all involved eMERGE sites. DISCUSSION AND CONCLUSION: Our findings highlight that under-specification is an impediment to the accuracy and efficiency of the implementation of current narrative phenotyping algorithms, and we propose approaches for mitigating these issues and improved methods for disseminating EHR phenotyping algorithms.


Assuntos
Algoritmos , Registros Eletrônicos de Saúde , Genômica , Humanos , Bases de Conhecimento , Fenótipo
4.
J Biomed Inform ; 118: 103795, 2021 06.
Artigo em Inglês | MEDLINE | ID: mdl-33930535

RESUMO

Structured representation of clinical genetic results is necessary for advancing precision medicine. The Electronic Medical Records and Genomics (eMERGE) Network's Phase III program initially used a commercially developed XML message format for standardized and structured representation of genetic results for electronic health record (EHR) integration. In a desire to move towards a standard representation, the network created a new standardized format based upon Health Level Seven Fast Healthcare Interoperability Resources (HL7® FHIR®), to represent clinical genomics results. These new standards improve the utility of HL7® FHIR® as an international healthcare interoperability standard for management of genetic data from patients. This work advances the establishment of standards that are being designed for broad adoption in the current health information technology landscape.


Assuntos
Registros Eletrônicos de Saúde , Informática Médica , Genômica , Nível Sete de Saúde , Humanos , Medicina de Precisão
5.
Genet Med ; 22(11): 1821-1829, 2020 11.
Artigo em Inglês | MEDLINE | ID: mdl-32669677

RESUMO

PURPOSE: Secondary findings are typically offered in an all or none fashion when sequencing is used for clinical purposes. This study aims to describe the process of offering categorical and granular choices for results in a large research consortium. METHODS: Within the third phase of the electronic MEdical Records and GEnomics (eMERGE) Network, several sites implemented studies that allowed participants to choose the type of results they wanted to receive from a multigene sequencing panel. Sites were surveyed to capture the details of the implementation protocols and results of these choices. RESULTS: Across the ten eMERGE sites, 4664 participants including adolescents and adults were offered some type of choice. Categories of choices offered and methods for selecting categories varied. Most participants (94.5%) chose to learn all genetic results, while 5.5% chose subsets of results. Several sites allowed participants to change their choices at various time points, and 0.5% of participants made changes. CONCLUSION: Offering choices that include learning some results is important and should be a dynamic process to allow for changes in scientific knowledge, participant age group, and individual preference.


Assuntos
Registros Eletrônicos de Saúde , Genoma , Adolescente , Adulto , Genômica , Humanos , Grupos Populacionais , Inquéritos e Questionários
6.
Hum Mutat ; 39(11): 1690-1701, 2018 11.
Artigo em Inglês | MEDLINE | ID: mdl-30311374

RESUMO

Effective exchange of information about genetic variants is currently hampered by the lack of readily available globally unique variant identifiers that would enable aggregation of information from different sources. The ClinGen Allele Registry addresses this problem by providing (1) globally unique "canonical" variant identifiers (CAids) on demand, either individually or in large batches; (2) access to variant-identifying information in a searchable Registry; (3) links to allele-related records in many commonly used databases; and (4) services for adding links to information about registered variants in external sources. A core element of the Registry is a canonicalization service, implemented using in-memory sequence alignment-based index, which groups variant identifiers denoting the same nucleotide variant and assigns unique and dereferenceable CAids. More than 650 million distinct variants are currently registered, including those from gnomAD, ExAC, dbSNP, and ClinVar, including a small number of variants registered by Registry users. The Registry is accessible both via a web interface and programmatically via well-documented Hypertext Transfer Protocol (HTTP) Representational State Transfer Application Programming Interface (REST-APIs). For programmatic interoperability, the Registry content is accessible in the JavaScript Object Notation for Linked Data (JSON-LD) format. We present several use cases and demonstrate how the linked information may provide raw material for reasoning about variant's pathogenicity.


Assuntos
Bases de Dados Genéticas , Variação Genética/genética , Alelos , Humanos , Sistema de Registros , Software
7.
Hum Mutat ; 39(11): 1686-1689, 2018 11.
Artigo em Inglês | MEDLINE | ID: mdl-30311379

RESUMO

The Clinical Genome Resource (ClinGen)'s work to develop a knowledge base to support the understanding of genes and variants for use in precision medicine and research depends on robust, broadly applicable, and adaptable technical standards for sharing data and information. To forward this goal, ClinGen has joined with the Global Alliance for Genomics and Health (GA4GH) to support the development of open, freely-available technical standards and regulatory frameworks for secure and responsible sharing of genomic and health-related data. In its capacity as one of the 15 inaugural GA4GH "Driver Projects," ClinGen is providing input on the key standards needs of the global genomics community, and has committed to participate on GA4GH Work Streams to support the development of: (1) a standard model for computer-readable variant representation; (2) a data model for linking variant data to annotations; (3) a specification to enable sharing of genomic variant knowledge and associated clinical interpretations; and (4) a set of best practices for use of phenotype and disease ontologies. ClinGen's participation as a GA4GH Driver Project will provide a robust environment to test drive emerging genomic knowledge sharing standards and prove their utility among the community, while accelerating the construction of the ClinGen evidence base.


Assuntos
Genoma Humano/genética , Disseminação de Informação/métodos , Biologia Computacional , Bases de Dados Genéticas , Variação Genética , Genômica , Humanos , Medicina de Precisão
8.
Genet Med ; 19(2): 215-223, 2017 02.
Artigo em Inglês | MEDLINE | ID: mdl-27441996

RESUMO

INTRODUCTION: Reporting and sharing pharmacogenetic test results across clinical laboratories and electronic health records is a crucial step toward the implementation of clinical pharmacogenetics, but allele function and phenotype terms are not standardized. Our goal was to develop terms that can be broadly applied to characterize pharmacogenetic allele function and inferred phenotypes. MATERIALS AND METHODS: Terms currently used by genetic testing laboratories and in the literature were identified. The Clinical Pharmacogenetics Implementation Consortium (CPIC) used the Delphi method to obtain a consensus and agree on uniform terms among pharmacogenetic experts. RESULTS: Experts with diverse involvement in at least one area of pharmacogenetics (clinicians, researchers, genetic testing laboratorians, pharmacogenetics implementers, and clinical informaticians; n = 58) participated. After completion of five surveys, a consensus (>70%) was reached with 90% of experts agreeing to the final sets of pharmacogenetic terms. DISCUSSION: The proposed standardized pharmacogenetic terms will improve the understanding and interpretation of pharmacogenetic tests and reduce confusion by maintaining consistent nomenclature. These standard terms can also facilitate pharmacogenetic data sharing across diverse electronic health care record systems with clinical decision support.Genet Med 19 2, 215-223.


Assuntos
Testes Genéticos/normas , Farmacogenética/normas , Terminologia como Assunto , Alelos , Registros Eletrônicos de Saúde/normas , Humanos , Fenótipo , Inquéritos e Questionários
9.
J Biomed Inform ; 60: 84-94, 2016 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-26778834

RESUMO

Genomics is a promising tool that is becoming more widely available to improve the care and treatment of individuals. While there is much assertion, genomics will most certainly require the use of clinical decision support (CDS) to be fully realized in the routine clinical setting. The National Human Genome Research Institute (NHGRI) of the National Institutes of Health recently convened an in-person, multi-day meeting on this topic. It was widely recognized that there is a need to promote the innovation and development of resources for genomic CDS such as a CDS sandbox. The purpose of this study was to evaluate a proposed approach for such a genomic CDS sandbox among domain experts and potential users. Survey results indicate a significant interest and desire for a genomic CDS sandbox environment among domain experts. These results will be used to guide the development of a genomic CDS sandbox.


Assuntos
Biologia Computacional , Sistemas de Apoio a Decisões Clínicas , Genômica/métodos , Congressos como Assunto , Humanos , National Human Genome Research Institute (U.S.) , Software , Estados Unidos
10.
BMC Med Inform Decis Mak ; 15: 12, 2015 Feb 22.
Artigo em Inglês | MEDLINE | ID: mdl-25880555

RESUMO

BACKGROUND: Every year, hundreds of thousands of patients experience treatment failure or adverse drug reactions (ADRs), many of which could be prevented by pharmacogenomic testing. However, the primary knowledge needed for clinical pharmacogenomics is currently dispersed over disparate data structures and captured in unstructured or semi-structured formalizations. This is a source of potential ambiguity and complexity, making it difficult to create reliable information technology systems for enabling clinical pharmacogenomics. METHODS: We developed Web Ontology Language (OWL) ontologies and automated reasoning methodologies to meet the following goals: 1) provide a simple and concise formalism for representing pharmacogenomic knowledge, 2) finde errors and insufficient definitions in pharmacogenomic knowledge bases, 3) automatically assign alleles and phenotypes to patients, 4) match patients to clinically appropriate pharmacogenomic guidelines and clinical decision support messages and 5) facilitate the detection of inconsistencies and overlaps between pharmacogenomic treatment guidelines from different sources. We evaluated different reasoning systems and test our approach with a large collection of publicly available genetic profiles. RESULTS: Our methodology proved to be a novel and useful choice for representing, analyzing and using pharmacogenomic data. The Genomic Clinical Decision Support (Genomic CDS) ontology represents 336 SNPs with 707 variants; 665 haplotypes related to 43 genes; 22 rules related to drug-response phenotypes; and 308 clinical decision support rules. OWL reasoning identified CDS rules with overlapping target populations but differing treatment recommendations. Only a modest number of clinical decision support rules were triggered for a collection of 943 public genetic profiles. We found significant performance differences across available OWL reasoners. CONCLUSIONS: The ontology-based framework we developed can be used to represent, organize and reason over the growing wealth of pharmacogenomic knowledge, as well as to identify errors, inconsistencies and insufficient definitions in source data sets or individual patient data. Our study highlights both advantages and potential practical issues with such an ontology-based approach.


Assuntos
Ontologias Biológicas , Sistemas de Apoio a Decisões Clínicas , Efeitos Colaterais e Reações Adversas Relacionados a Medicamentos/prevenção & controle , Farmacogenética/métodos , Guias de Prática Clínica como Assunto , Medicina de Precisão/métodos , Inteligência Artificial , Tomada de Decisão Clínica , Humanos
11.
J Am Med Inform Assoc ; 31(2): 536-541, 2024 Jan 18.
Artigo em Inglês | MEDLINE | ID: mdl-38037121

RESUMO

OBJECTIVE: Given the importance AI in genomics and its potential impact on human health, the American Medical Informatics Association-Genomics and Translational Biomedical Informatics (GenTBI) Workgroup developed this assessment of factors that can further enable the clinical application of AI in this space. PROCESS: A list of relevant factors was developed through GenTBI workgroup discussions in multiple in-person and online meetings, along with review of pertinent publications. This list was then summarized and reviewed to achieve consensus among the group members. CONCLUSIONS: Substantial informatics research and development are needed to fully realize the clinical potential of such technologies. The development of larger datasets is crucial to emulating the success AI is achieving in other domains. It is important that AI methods do not exacerbate existing socio-economic, racial, and ethnic disparities. Genomic data standards are critical to effectively scale such technologies across institutions. With so much uncertainty, complexity and novelty in genomics and medicine, and with an evolving regulatory environment, the current focus should be on using these technologies in an interface with clinicians that emphasizes the value each brings to clinical decision-making.


Assuntos
Inteligência Artificial , Medicina , Humanos , Biologia Computacional , Genômica
12.
Nat Med ; 30(2): 480-487, 2024 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-38374346

RESUMO

Polygenic risk scores (PRSs) have improved in predictive performance, but several challenges remain to be addressed before PRSs can be implemented in the clinic, including reduced predictive performance of PRSs in diverse populations, and the interpretation and communication of genetic results to both providers and patients. To address these challenges, the National Human Genome Research Institute-funded Electronic Medical Records and Genomics (eMERGE) Network has developed a framework and pipeline for return of a PRS-based genome-informed risk assessment to 25,000 diverse adults and children as part of a clinical study. From an initial list of 23 conditions, ten were selected for implementation based on PRS performance, medical actionability and potential clinical utility, including cardiometabolic diseases and cancer. Standardized metrics were considered in the selection process, with additional consideration given to strength of evidence in African and Hispanic populations. We then developed a pipeline for clinical PRS implementation (score transfer to a clinical laboratory, validation and verification of score performance), and used genetic ancestry to calibrate PRS mean and variance, utilizing genetically diverse data from 13,475 participants of the All of Us Research Program cohort to train and test model parameters. Finally, we created a framework for regulatory compliance and developed a PRS clinical report for return to providers and for inclusion in an additional genome-informed risk assessment. The initial experience from eMERGE can inform the approach needed to implement PRS-based testing in diverse clinical settings.


Assuntos
Doença Crônica , Estratificação de Risco Genético , Saúde da População , Adulto , Criança , Humanos , Comunicação , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Fatores de Risco , Estados Unidos
13.
J Biomed Inform ; 46(4): 690-6, 2013 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-23727027

RESUMO

PharmGKB is a leading resource of high quality pharmacogenomics data that provides information about how genetic variations modulate an individual's response to drugs. PharmGKB contains information about genetic variations, pharmacokinetic and pharmacodynamic pathways, and the effect of variations on drug-related phenotypes. These relationships are represented using very general terms, however, and the precise semantic relationships among drugs, and diseases are not often captured. In this paper we develop a protocol to detect and disambiguate general clinical associations between drugs and diseases using more precise annotation terms from other data sources. PharmGKB provides very detailed clinical associations between genetic variants and drug response, including genotype-specific drug dosing guidelines, and this procedure will armGKB. The availability of more detailed data will help investigators to conduct more precise queries, such as finding particular diseases caused or treated by a specific drug. We first mapped drugs extracted from PharmGKB drug-disease relationships to those in the National Drug File Reference Terminology (NDF-RT) and to Structured Product Labels (SPLs). Specifically, we retrieved drug and disease role relationships describing and defining concepts according to their relationships with other concepts from NDF-RT. We also used the NCBO (National Center for Biomedical Ontology) annotator to annotate disease terms from the free text extracted from five SPL sections (indication, contraindication, ADE, precaution, and warning). Finally, we used the detailed drug and disease relationship information from NDF-RT and the SPLs to annotate and disambiguate the more general PharmGKB drug and disease associations.


Assuntos
Doença , Rotulagem de Medicamentos , Farmacogenética , Humanos , Farmacocinética , Farmacologia
14.
J Biomed Inform ; 46(2): 286-93, 2013 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-23201637

RESUMO

The Pharmacogenomics Research Network (PGRN) is a collaborative partnership of research groups funded by NIH to discover and understand how genome contributes to an individual's response to medication. Since traditional biomedical research studies and clinical trials are often conducted independently, common and standardized representations for data are seldom used. This leads to heterogeneity in data representation, which hinders data reuse, data integration and meta-analyses. This study demonstrates harmonization and semantic annotation work for pharmacogenomics data dictionaries collected from PGRN research groups. A semi-automated system was developed to support the harmonization/annotation process, which includes four individual steps, (1) pre-processing PGRN variables; (2) decomposing and normalizing variable descriptions; (3) semantically annotating words and phrases using controlled terminologies; (4) grouping PGRN variables into categories based on the annotation results and semantic types, for total 1514 PGRN variables. Our results demonstrate that there is a significant amount of variability in how pharmacogenomics data is represented and that additional standardization efforts are needed. This represents a critical first step toward identifying and creating data standards for pharmacogenomics studies.


Assuntos
Pesquisa Biomédica , Bases de Dados Factuais , Documentação/métodos , Aplicações da Informática Médica , Farmacogenética , Humanos , Semântica
15.
Stud Health Technol Inform ; 305: 398-401, 2023 Jun 29.
Artigo em Inglês | MEDLINE | ID: mdl-37387049

RESUMO

The National Institute of Health (NIH) Genetic Testing Registry (GTR) provides a variety of information about genetic tests such as relevant methods, conditions, and performing laboratories. This study mapped a subset of GTR data to the newly developed HL7®-FHIR® Genomic Study resource. Using open-source tools, a web application was developed to implement data mapping and provides many GTR test records as Genomic Study resources. The developed system demonstrates the feasibility of using open-source tools and the FHIR Genomic Study resource to represent publicly available genetic testing information. This study validates the overall design of the Genomic Study resource and proposes two enhancements to support additional data elements.


Assuntos
Genômica , Dados de Saúde Coletados Rotineiramente , Testes Genéticos , Laboratórios , Sistema de Registros
16.
J Pers Med ; 13(6)2023 May 31.
Artigo em Inglês | MEDLINE | ID: mdl-37373918

RESUMO

Familial Hypercholesterolemia (FH) is underdiagnosed in the United States. Clinical decision support (CDS) could increase FH detection once implemented in clinical workflows. We deployed CDS for FH at an academic medical center and sought clinician insights using an implementation survey. In November 2020, the FH CDS was deployed in the electronic health record at all Mayo Clinic sites in two formats: a best practice advisory (BPA) and an in-basket alert. Over three months, 104 clinicians participated in the survey (response rate 11.1%). Most clinicians (81%) agreed that CDS implementation was a good option for identifying FH patients; 78% recognized the importance of implementing the tool in practice, and 72% agreed it would improve early diagnosis of FH. In comparing the two alert formats, clinicians found the in-basket alert more acceptable (p = 0.036) and more feasible (p = 0.042) than the BPA. Overall, clinicians favored implementing the FH CDS in clinical practice and provided feedback that led to iterative refinement of the tool. Such a tool can potentially increase FH detection and optimize patient management.

17.
Pac Symp Biocomput ; 28: 383-394, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-36540993

RESUMO

As the diversity of genomic variation data increases with our growing understanding of the role of variation in health and disease, it is critical to develop standards for precise inter-system exchange of these data for research and clinical applications. The Global Alliance for Genomics and Health (GA4GH) Variation Representation Specification (VRS) meets this need through a technical terminology and information model for disambiguating and concisely representing variation concepts. Here we discuss the recent Genotype model in VRS, which may be used to represent the allelic composition of a genetic locus. We demonstrate the use of the Genotype model and the constituent Haplotype model for the precise and interoperable representation of pharmacogenomic diplotypes, HGVS variants, and VCF records using VRS and discuss how this can be leveraged to enable interoperable exchange and search operations between assayed variation and genomic knowledgebases.


Assuntos
Biologia Computacional , Variação Genética , Humanos , Bases de Dados Genéticas , Genômica , Genótipo
18.
Pac Symp Biocomput ; 28: 437-448, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-36540998

RESUMO

Polygenic risk scores (PRS) have led to enthusiasm for precision medicine. However, it is well documented that PRS do not generalize across groups differing in ancestry or sample characteristics e.g., age. Quantifying performance of PRS across different groups of study participants, using genome-wide association study (GWAS) summary statistics from multiple ancestry groups and sample sizes, and using different linkage disequilibrium (LD) reference panels may clarify which factors are limiting PRS transferability. To evaluate these factors in the PRS generation process, we generated body mass index (BMI) PRS (PRSBMI) in the Electronic Medical Records and Genomics (eMERGE) network (N=75,661). Analyses were conducted in two ancestry groups (European and African) and three age ranges (adult, teenagers, and children). For PRSBMI calculations, we evaluated five LD reference panels and three sets of GWAS summary statistics of varying sample size and ancestry. PRSBMI performance increased for both African and European ancestry individuals using cross-ancestry GWAS summary statistics compared to European-only summary statistics (6.3% and 3.7% relative R2 increase, respectively, pAfrican=0.038, pEuropean=6.26x10-4). The effects of LD reference panels were more pronounced in African ancestry study datasets. PRSBMI performance degraded in children; R2 was less than half of teenagers or adults. The effect of GWAS summary statistics sample size was small when modeled with the other factors. Additionally, the potential of using a PRS generated for one trait to predict risk for comorbid diseases is not well understood especially in the context of cross-ancestry analyses - we explored clinical comorbidities from the electronic health record associated with PRSBMI and identified significant associations with type 2 diabetes and coronary atherosclerosis. In summary, this study quantifies the effects that ancestry, GWAS summary statistic sample size, and LD reference panel have on PRS performance, especially in cross-ancestry and age-specific analyses.


Assuntos
Diabetes Mellitus Tipo 2 , Adulto , Adolescente , Criança , Humanos , Diabetes Mellitus Tipo 2/genética , Índice de Massa Corporal , Estudo de Associação Genômica Ampla , Predisposição Genética para Doença , Biologia Computacional , Fatores de Risco , Herança Multifatorial
19.
medRxiv ; 2023 Jun 05.
Artigo em Inglês | MEDLINE | ID: mdl-37333246

RESUMO

Polygenic risk scores (PRS) have improved in predictive performance supporting their use in clinical practice. Reduced predictive performance of PRS in diverse populations can exacerbate existing health disparities. The NHGRI-funded eMERGE Network is returning a PRS-based genome-informed risk assessment to 25,000 diverse adults and children. We assessed PRS performance, medical actionability, and potential clinical utility for 23 conditions. Standardized metrics were considered in the selection process with additional consideration given to strength of evidence in African and Hispanic populations. Ten conditions were selected with a range of high-risk thresholds: atrial fibrillation, breast cancer, chronic kidney disease, coronary heart disease, hypercholesterolemia, prostate cancer, asthma, type 1 diabetes, obesity, and type 2 diabetes. We developed a pipeline for clinical PRS implementation, used genetic ancestry to calibrate PRS mean and variance, created a framework for regulatory compliance, and developed a PRS clinical report. eMERGE's experience informs the infrastructure needed to implement PRS-based implementation in diverse clinical settings.

20.
BMC Bioinformatics ; 13: 164, 2012 Jul 15.
Artigo em Inglês | MEDLINE | ID: mdl-22793366

RESUMO

BACKGROUND: Identifying variants associated with complex human traits in high-dimensional data is a central goal of genome-wide association studies. However, complicated etiologies such as gene-gene interactions are ignored by the univariate analysis usually applied in these studies. Random Forests (RF) are a popular data-mining technique that can accommodate a large number of predictor variables and allow for complex models with interactions. RF analysis produces measures of variable importance that can be used to rank the predictor variables. Thus, single nucleotide polymorphism (SNP) analysis using RFs is gaining popularity as a potential filter approach that considers interactions in high-dimensional data. However, the impact of data dimensionality on the power of RF to identify interactions has not been thoroughly explored. We investigate the ability of rankings from variable importance measures to detect gene-gene interaction effects and their potential effectiveness as filters compared to p-values from univariate logistic regression, particularly as the data becomes increasingly high-dimensional. RESULTS: RF effectively identifies interactions in low dimensional data. As the total number of predictor variables increases, probability of detection declines more rapidly for interacting SNPs than for non-interacting SNPs, indicating that in high-dimensional data the RF variable importance measures are capturing marginal effects rather than capturing the effects of interactions. CONCLUSIONS: While RF remains a promising data-mining technique that extends univariate methods to condition on multiple variables simultaneously, RF variable importance measures fail to detect interaction effects in high-dimensional data in the absence of a strong marginal component, and therefore may not be useful as a filter technique that allows for interaction effects in genome-wide data.


Assuntos
Mineração de Dados , Estudo de Associação Genômica Ampla , Polimorfismo de Nucleotídeo Único , Genes , Humanos , Desequilíbrio de Ligação , Modelos Logísticos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA