Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 2 de 2
Filter
Add more filters

Database
Language
Publication year range
1.
Genet Epidemiol ; 43(1): 63-81, 2019 02.
Article in English | MEDLINE | ID: mdl-30298529

ABSTRACT

The Electronic Medical Records and Genomics (eMERGE) network is a network of medical centers with electronic medical records linked to existing biorepository samples for genomic discovery and genomic medicine research. The network sought to unify the genetic results from 78 Illumina and Affymetrix genotype array batches from 12 contributing medical centers for joint association analysis of 83,717 human participants. In this report, we describe the imputation of eMERGE results and methods to create the unified imputed merged set of genome-wide variant genotype data. We imputed the data using the Michigan Imputation Server, which provides a missing single-nucleotide variant genotype imputation service using the minimac3 imputation algorithm with the Haplotype Reference Consortium genotype reference set. We describe the quality control and filtering steps used in the generation of this data set and suggest generalizable quality thresholds for imputation and phenotype association studies. To test the merged imputed genotype set, we replicated a previously reported chromosome 6 HLA-B herpes zoster (shingles) association and discovered a novel zoster-associated loci in an epigenetic binding site near the terminus of chromosome 3 (3p29).


Subject(s)
Electronic Health Records , Genetic Predisposition to Disease , Genome-Wide Association Study , Herpes Zoster/genetics , Algorithms , Black People/genetics , Chromosomes, Human/genetics , Female , Haplotypes/genetics , Homozygote , Humans , Male , Phenotype , Polymorphism, Single Nucleotide/genetics , Principal Component Analysis , White People/genetics
2.
Stud Health Technol Inform ; 264: 1041-1045, 2019 Aug 21.
Article in English | MEDLINE | ID: mdl-31438083

ABSTRACT

Natural language processing (NLP) technologies have been successfully applied to cancer research by enabling automated phenotypic information extraction from narratives in electronic health records (EHRs) such as pathology reports; however, developing customized NLP solutions requires substantial effort. To facilitate the adoption of NLP in cancer research, we have developed a set of customizable modules for extracting comprehensive types of cancer-related information in pathology reports (e.g., tumor size, tumor stage, and biomarkers), by leveraging the existing CLAMP system, which provides user-friendly interfaces for building customized NLP solutions for individual needs. Evaluation using annotated data at Vanderbilt University Medical Center showed that CLAMP-Cancer could extract diverse types of cancer information with good F-measures (0.80-0.98). We then applied CLAMP-Cancer to an information extraction task at Mayo Clinic and showed that we can quickly build a customized NLP system with comparable performance with an existing system at Mayo Clinic. CLAMP-Cancer is freely available for academic use.


Subject(s)
Information Storage and Retrieval , Neoplasms , Electronic Health Records , Humans , Natural Language Processing , Research Report
SELECTION OF CITATIONS
SEARCH DETAIL