Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 54
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
J Immunol ; 210(5): 590-594, 2023 03 01.
Artigo em Inglês | MEDLINE | ID: mdl-36688686

RESUMO

LIGHT (homologous to lymphotoxins, exhibits inducible expression, and competes with HSV glycoprotein D for herpes virus entry mediator, a receptor expressed by T lymphocytes), encoded by the TNFSF14 gene, is a cytokine belonging to the TNF superfamily. On binding to its receptors, herpes virus entry mediator and lymphotoxin ß receptor, it activates inflammatory responses. We conducted this study to determine whether plasma LIGHT levels are elevated in Crohn's disease (CD) in a pediatric population with the aim of nominating this cytokine as a therapeutic target. We used a single-molecule immunoassay to determine the circulating levels of free LIGHT in plasma from pediatric patients with CD in our biobank (n = 183), a panel of healthy pediatric (n = 9) or adult (n = 22) reference samples, and pediatric biobank controls (n = 19). We performed correlational analyses between LIGHT levels and the clinical characteristics of the CD cohort, including age, Montreal classification, family history, medical/surgical therapy, and routine blood test parameters. LIGHT levels were greatly elevated in CD, with an average of 305 versus 32.4 pg/ml for controls from the biobank (p < 0.0001). The outside reference samples showed levels of 57 pg/ml in pediatric controls and 55 pg/ml in adults (p < 0.0001). We found a statistically significant correlation between white blood cell count and free LIGHT (p < 0.046). We conclude that free, soluble LIGHT is increased 5- to 10-fold in pediatric CD across an array of disease subtypes and characteristics.


Assuntos
Doença de Crohn , Adulto , Criança , Humanos , Citocinas , Contagem de Leucócitos , Linfotoxina-alfa
2.
Hum Mol Genet ; 31(22): 3769-3776, 2022 11 10.
Artigo em Inglês | MEDLINE | ID: mdl-35642741

RESUMO

Mental disorders present a global health concern and have limited treatment options. In today's medical practice, medications such as antidepressants are prescribed not only for depression but also for conditions such as anxiety and attention deficit hyperactivity disorder (ADHD). Therefore, identifying gene targets for specific disorders is important and offers improved precision. In this study, we performed a genetic analysis of six common mental disorders-ADHD, anxiety, depression, delays in mental development, intellectual disabilities (IDs) and speech/language disorder-in the ethnic minority of African Americans (AAs) using whole genome sequencing (WGS). WGS data were generated from blood-derived DNA from 4178 AA individuals, including 1384 patients with the diagnosis of at least one mental disorder. Mutation burden analysis was applied based on rare and deleterious mutations in the AA population between cases and controls, and further analyzed in the context of patients with single mental disorder diagnosis. Certain genes uncovered demonstrated significant P-values in mutation burden analysis. In addition, exclusive recurrences in specific type of disorder were scanned through gene-drug interaction databases to assess for availability of potential medications. We uncovered 15 genes harboring deleterious mutations, including 3-Hydroxy-3-Methylglutaryl-CoA Reductase (HMGCR) and Uronyl 2-Sulfotransferase (UST) for ADHD; Farnesyltransferase, CAAX Box, Beta (FNTB) for anxiety; Xin Actin Binding Repeat Containing 2 (XIRP2), Natriuretic Peptide C (NPPC), Serine/Threonine Kinase 33 (STK33), Pannexin 1 (PANX1) and Neurotensin (NTS) for depression; RUNX Family Transcription Factor 3 (RUNX3), Tachykinin Receptor 1 (TACR1) and NADH:Ubiquinone Oxidoreductase Core Subunit S7 (NDUFS7) for delays in mental development; Hepsin (HPN) for ID and Collagen Type VI Alpha 3 Chain (COL6A3), Damage Specific DNA Binding Protein 1 (DDB1) and NADH:Ubiquinone Oxidoreductase Subunit A11 (NDUFA11) for speech/language disorder. Taken together, we have established critical insights into the development of new precision medicine approaches for mental disorders in AAs.


Assuntos
Transtorno do Deficit de Atenção com Hiperatividade , Transtornos da Linguagem , Transtornos Mentais , Humanos , Negro ou Afro-Americano/genética , Etnicidade , NAD/genética , Ubiquinona/genética , Grupos Minoritários , Sequenciamento Completo do Genoma , Oxirredutases/genética , Mutação , Proteínas do Tecido Nervoso/genética , Conexinas/genética
3.
Mol Cancer ; 22(1): 126, 2023 08 05.
Artigo em Inglês | MEDLINE | ID: mdl-37543594

RESUMO

Children with birth defects (BD) express distinct clinical features that often have various medical consequences, one of which is predisposition to the development of cancers. Identification of the underlying genetic mechanisms related to the development of cancer in BD patients would allow for preventive measures. We performed a whole genome sequencing (WGS) study on blood-derived DNA samples from 1566 individuals without chromosomal anomalies, including 454 BD probands with at least one type of malignant tumors, 767 cancer-free BD probands, and 345 healthy individuals. Exclusive recurrent variants were identified in BD-cancer and BD-only patients and mapped to their corresponding genomic regions. We observed statistically significant overlaps for protein-coding/ncRNA with exclusive variants in exons, introns, ncRNAs, and 3'UTR regions. Exclusive exonic variants, especially synonymous variants, tend to occur in prior exons locus in BD-cancer children. Intronic variants close to splicing site (< 500 bp from exon) have little overlaps in BD-cancer and BD-only patients. Exonic variants in non-coding RNA (ncRNA) tend to occur in different ncRNAs exons regardless of the overlaps. Notably, genes with 5' UTR variants are almost mutually exclusive between the two phenotypes. In conclusion, we conducted the first genomic study to explore the impact of recurrent variants exclusive to the two distinguished clinical phenotypes under study, BD with or without cancer, demonstrating enrichment of selective protein-coding/ncRNAs differentially expressed between these two phenotypes, suggesting that selective genetic factors may underlie the molecular processes of pediatric cancer development in BD children.


Assuntos
Neoplasias , Splicing de RNA , Humanos , Mutação , Éxons , Genômica , Neoplasias/genética , Íntrons
4.
Mol Psychiatry ; 27(3): 1469-1478, 2022 03.
Artigo em Inglês | MEDLINE | ID: mdl-34997195

RESUMO

Mental disorders present a global health concern, while the diagnosis of mental disorders can be challenging. The diagnosis is even harder for patients who have more than one type of mental disorder, especially for young toddlers who are not able to complete questionnaires or standardized rating scales for diagnosis. In the past decade, multiple genomic association signals have been reported for mental disorders, some of which present attractive drug targets. Concurrently, machine learning algorithms, especially deep learning algorithms, have been successful in the diagnosis and/or labeling of complex diseases, such as attention deficit hyperactivity disorder (ADHD) or cancer. In this study, we focused on eight common mental disorders, including ADHD, depression, anxiety, autism, intellectual disabilities, speech/language disorder, delays in developments, and oppositional defiant disorder in the ethnic minority of African Americans. Blood-derived whole genome sequencing data from 4179 individuals were generated, including 1384 patients with the diagnosis of at least one mental disorder. The burden of genomic variants in coding/non-coding regions was applied as feature vectors in the deep learning algorithm. Our model showed ~65% accuracy in differentiating patients from controls. Ability to label patients with multiple disorders was similarly successful, with a hamming loss score less than 0.3, while exact diagnostic matches are around 10%. Genes in genomic regions with the highest weights showed enrichment of biological pathways involved in immune responses, antigen/nucleic acid binding, chemokine signaling pathway, and G-protein receptor activities. A noticeable fact is that variants in non-coding regions (e.g., ncRNA, intronic, and intergenic) performed equally well as variants in coding regions; however, unlike coding region variants, variants in non-coding regions do not express genomic hotspots whereas they carry much more narrow standard deviations, indicating they probably serve as alternative markers.


Assuntos
Transtorno do Deficit de Atenção com Hiperatividade , Aprendizado Profundo , Negro ou Afro-Americano/genética , Algoritmos , Transtorno do Deficit de Atenção com Hiperatividade/genética , Etnicidade , Humanos , Grupos Minoritários , Sequenciamento Completo do Genoma
5.
PLoS Genet ; 16(3): e1008684, 2020 03.
Artigo em Inglês | MEDLINE | ID: mdl-32226016

RESUMO

Lipid levels are important markers for the development of cardio-metabolic diseases. Although hundreds of associated loci have been identified through genetic association studies, the contribution of genetic factors to variation in lipids is not fully understood, particularly in U.S. minority groups. We performed genome-wide association analyses for four lipid traits in over 45,000 ancestrally diverse participants from the Population Architecture using Genomics and Epidemiology (PAGE) Study, followed by a meta-analysis with several European ancestry studies. We identified nine novel lipid loci, five of which showed evidence of replication in independent studies. Furthermore, we discovered one novel gene in a PrediXcan analysis, minority-specific independent signals at eight previously reported loci, and potential functional variants at two known loci through fine-mapping. Systematic examination of known lipid loci revealed smaller effect estimates in African American and Hispanic ancestry populations than those in Europeans, and better performance of polygenic risk scores based on minority-specific effect estimates. Our findings provide new insight into the genetic architecture of lipid traits and highlight the importance of conducting genetic studies in diverse populations in the era of precision medicine.


Assuntos
Lipídeos/sangue , Lipídeos/genética , Grupos Raciais/genética , Bases de Dados Genéticas , Feminino , Estudo de Associação Genômica Ampla/métodos , Genótipo , Humanos , Lipídeos/análise , Masculino , Metagenômica/métodos , Grupos Minoritários , Herança Multifatorial/genética , Fenótipo , Polimorfismo de Nucleotídeo Único/genética , Estados Unidos/epidemiologia
6.
J Allergy Clin Immunol ; 150(5): 1086-1096, 2022 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-35595084

RESUMO

BACKGROUND: Asthma is the most common chronic condition in children and the third leading cause of hospitalization in pediatrics. The genome-wide association study catalog reports 140 studies with genome-wide significance. A polygenic risk score (PRS) with predictive value across ancestries has not been evaluated for this important trait. OBJECTIVES: This study aimed to train and validate a PRS relying on genetic determinants for asthma to provide predictions for disease occurrence in pediatric cohorts of diverse ancestries. METHODS: This study applied a Bayesian regression framework method using the Trans-National Asthma Genetic Consortium genome-wide association study summary statistics to derive a multiancestral PRS score, used one Electronic Medical Records and Genomics (eMERGE) cohort as a training set, used a second independent eMERGE cohort to validate the score, and used the UK Biobank data to replicate the findings. A phenome-wide association study was performed using the PRS to identify shared genetic etiology with other phenotypes. RESULTS: The multiancestral asthma PRS was associated with asthma in the 2 pediatric validation datasets. Overall, the multiancestral asthma PRS has an area under the curve (AUC) of 0.70 (95% CI, 0.69-0.72) in the pediatric validation 1 and AUC of 0.66 (0.65-0.66) in the pediatric validation 2 datasets. We found significant discrimination across pediatric subcohorts of European (AUC, 95% CI, 0.60 and 0.66), African (AUC, 95% CI, 0.61 and 0.66), admixed American (AUC, 0.64 and 0.70), Southeast Asian (AUC, 0.65), and East Asian (AUC, 0.73) ancestry. Pediatric participants with the top 5% PRS had 2.80 to 5.82 increased odds of asthma compared to the bottom 5% across the training, validation 1, and validation 2 cohorts when adjusted for ancestry. Phenome-wide association study analysis confirmed the strong association of the identified PRS with asthma (odds ratio, 2.71, PFDR = 3.71 × 10-65) and related phenotypes. CONCLUSIONS: A multiancestral PRS for asthma based on Bayesian posterior genomic effect sizes identifies increased odds of pediatric asthma.


Assuntos
Asma , Estudo de Associação Genômica Ampla , Humanos , Criança , Estudo de Associação Genômica Ampla/métodos , Herança Multifatorial , Predisposição Genética para Doença , Polimorfismo de Nucleotídeo Único , Teorema de Bayes , Fatores de Risco , Asma/genética
7.
J Pediatr ; 248: 89-93, 2022 09.
Artigo em Inglês | MEDLINE | ID: mdl-35577121

RESUMO

OBJECTIVE: To evaluate Mendelian causes of neurodegenerative disorders in a cohort of pediatric patients. STUDY DESIGN: Patients enrolled in the Center for Applied Genomics Biobank at the Children's Hospital of Philadelphia with neurodegenerative symptoms were identified using an algorithm that consisted of including and excluding selected International Classification of Diseases, 9th and 10th edition codes. A manual chart review was then performed to abstract detailed clinical information. RESULTS: Of approximately 100 000 patients enrolled in the Center for Applied Genomics Biobank, 76 had a neurodegenerative phenotype. After chart review, 7 patients were excluded. Of the remaining 69 patients, 42 had a genetic diagnosis (60.9%) and 27 were undiagnosed (39.1%). There were 32 unique disorders. Common diagnoses included Rett syndrome, mitochondrial disorders, and neuronal ceroid lipofuscinoses. CONCLUSIONS: The disorders encountered in our cohort demonstrate the diverse diseases and pathophysiology that contribute to pediatric neurodegeneration. Establishing a diagnosis often informed clinical management, although curative treatment options are lacking. Many patients who underwent genetic evaluation remained undiagnosed, highlighting the importance of continued research efforts in this field.


Assuntos
Lipofuscinoses Ceroides Neuronais , Algoritmos , Criança , Estudos de Coortes , Hospitais Pediátricos , Humanos , Fenótipo
8.
Respir Res ; 23(1): 116, 2022 May 06.
Artigo em Inglês | MEDLINE | ID: mdl-35524249

RESUMO

BACKGROUND: Asthma is a complex condition largely attributed to the interactions among genes and environments as a heterogeneous phenotype. Obesity is significantly associated with asthma development, and genetic studies on obese vs. non-obese asthma are warranted. METHODS: To investigate asthma in the minority African American (AA) population with or without obesity, we performed a whole genome sequencing (WGS) study on blood-derived DNA of 4289 AA individuals, included 2226 asthma patients (1364 with obesity and 862 without obesity) and 2006 controls without asthma. The burden analysis of functional rare coding variants was performed by comparing asthma vs. controls and by stratified analysis of obese vs. non-obese asthma, respectively. RESULTS: Among the top 66 genes with P < 0.01 in the asthma vs. control analysis, stratified analysis by obesity showed inverse correlation of natural logarithm (LN) of P value between obese and non-obese asthma (r = - 0.757, P = 1.90E-13). Five genes previously reported in a genome-wide association study (GWAS) on asthma, including TSLP, SLC9A4, PSMB8, IGSF5, and IKZF4 were demonstrated association in the asthma vs. control analysis. The associations of IKZF4 and IGSF5 are only associated with obese asthma; and the association of SLC9A4 is only observed in non-obese asthma. In addition, the association of RSPH3 (the gene is related to primary ciliary dyskinesia) is observed in non-obese asthma. CONCLUSIONS: These findings highlight genetic heterogeneity between obese and non-obese asthma in patients of AA ancestry.


Assuntos
Asma , Estudo de Associação Genômica Ampla , Negro ou Afro-Americano/genética , Asma/diagnóstico , Asma/epidemiologia , Asma/genética , Heterogeneidade Genética , Predisposição Genética para Doença/genética , Humanos , Obesidade/diagnóstico , Obesidade/epidemiologia , Obesidade/genética , Polimorfismo de Nucleotídeo Único/genética
9.
Int J Obes (Lond) ; 45(1): 155-169, 2021 01.
Artigo em Inglês | MEDLINE | ID: mdl-32952152

RESUMO

BACKGROUND/OBJECTIVES: Melanocortin-4 receptor (MC4R) plays an essential role in food intake and energy homeostasis. More than 170 MC4R variants have been described over the past two decades, with conflicting reports regarding the prevalence and phenotypic effects of these variants in diverse cohorts. To determine the frequency of MC4R variants in large cohort of different ancestries, we evaluated the MC4R coding region for 20,537 eMERGE participants with sequencing data plus additional 77,454 independent individuals with genome-wide genotyping data at this locus. SUBJECTS/METHODS: The sequencing data were obtained from the eMERGE phase III study, in which multisample variant call format calls have been generated, curated, and annotated. In addition to penetrance estimation using body mass index (BMI) as a binary outcome, GWAS and PheWAS were performed using median BMI in linear regression analyses. All results were adjusted for principal components, age, sex, and sites of genotyping. RESULTS: Targeted sequencing data of MC4R revealed 125 coding variants in 1839 eMERGE participants including 30 unreported coding variants that were predicted to be functionally damaging. Highly penetrant unreported variants included (L325I, E308K, D298N, S270F, F261L, T248A, D111V, and Y80F) in which seven participants had obesity class III defined as BMI ≥ 40 kg/m2. In GWAS analysis, in addition to known risk haplotype upstream of MC4R (best variant rs6567160 (P = 5.36 × 10-25, Beta = 0.37), a novel rare haplotype was detected which was protective against obesity and encompassed the V103I variant with known gain-of-function properties (P = 6.23 × 10-08, Beta = -0.62). PheWAS analyses extended this protective effect of V103I to type 2 diabetes, diabetic nephropathy, and chronic renal failure independent of BMI. CONCLUSIONS: MC4R screening in a large eMERGE cohort confirmed many previous findings, extend the MC4R pleotropic effects, and discovered additional MC4R rare alleles that probably contribute to obesity.


Assuntos
Variação Genética/genética , Estudo de Associação Genômica Ampla , Obesidade , Receptor Tipo 4 de Melanocortina/genética , Adulto , Idoso , Índice de Massa Corporal , Estudos de Coortes , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Obesidade/epidemiologia , Obesidade/genética
10.
Genes Immun ; 20(7): 555-565, 2019 09.
Artigo em Inglês | MEDLINE | ID: mdl-30459343

RESUMO

Resting-state white blood cell (WBC) count is a marker of inflammation and immune system health. There is evidence that WBC count is not fixed over time and there is heterogeneity in WBC trajectory that is associated with morbidity and mortality. Latent class mixed modeling (LCMM) is a method that can identify unobserved heterogeneity in longitudinal data and attempts to classify individuals into groups based on a linear model of repeated measurements. We applied LCMM to repeated WBC count measures derived from electronic medical records of participants of the National Human Genetics Research Institute (NHRGI) electronic MEdical Record and GEnomics (eMERGE) network study, revealing two WBC count trajectory phenotypes. Advancing these phenotypes to GWAS, we found genetic associations between trajectory class membership and regions on chromosome 1p34.3 and chromosome 11q13.4. The chromosome 1 region contains CSF3R, which encodes the granulocyte colony-stimulating factor receptor. This protein is a major factor in neutrophil stimulation and proliferation. The association on chromosome 11 contain genes RNF169 and XRRA1; both involved in the regulation of double-strand break DNA repair.


Assuntos
Contagem de Leucócitos/métodos , Leucócitos/classificação , Adulto , Idoso , Bases de Dados Genéticas , Registros Eletrônicos de Saúde , Feminino , Estudo de Associação Genômica Ampla , Humanos , Análise de Classes Latentes , Masculino , Pessoa de Meia-Idade , Fenótipo , Polimorfismo de Nucleotídeo Único/genética , Proteínas/genética , Receptores de Fator Estimulador de Colônias/genética , Ubiquitina-Proteína Ligases/genética
11.
Circulation ; 138(22): 2469-2481, 2018 11 27.
Artigo em Inglês | MEDLINE | ID: mdl-30571344

RESUMO

BACKGROUND: Proteomic approaches allow measurement of thousands of proteins in a single specimen, which can accelerate biomarker discovery. However, applying these technologies to massive biobanks is not currently feasible because of the practical barriers and costs of implementing such assays at scale. To overcome these challenges, we used a "virtual proteomic" approach, linking genetically predicted protein levels to clinical diagnoses in >40 000 individuals. METHODS: We used genome-wide association data from the Framingham Heart Study (n=759) to construct genetic predictors for 1129 plasma protein levels. We validated the genetic predictors for 268 proteins and used them to compute predicted protein levels in 41 288 genotyped individuals in the Electronic Medical Records and Genomics (eMERGE) cohort. We tested associations for each predicted protein with 1128 clinical phenotypes. Lead associations were validated with directly measured protein levels and either low-density lipoprotein cholesterol or subclinical atherosclerosis in the MDCS (Malmö Diet and Cancer Study; n=651). RESULTS: In the virtual proteomic analysis in eMERGE, 55 proteins were associated with 89 distinct diagnoses at a false discovery rate q<0.1. Among these, 13 associations involved lipid (n=7) or atherosclerosis (n=6) phenotypes. We tested each association for validation in MDCS using directly measured protein levels. At Bonferroni-adjusted significance thresholds, levels of apolipoprotein E isoforms were associated with hyperlipidemia, and circulating C-type lectin domain family 1 member B and platelet-derived growth factor receptor-ß predicted subclinical atherosclerosis. Odds ratios for carotid atherosclerosis were 1.31 (95% CI, 1.08-1.58; P=0.006) per 1-SD increment in C-type lectin domain family 1 member B and 0.79 (0.66-0.94; P=0.008) per 1-SD increment in platelet-derived growth factor receptor-ß. CONCLUSIONS: We demonstrate a biomarker discovery paradigm to identify candidate biomarkers of cardiovascular and other diseases.


Assuntos
Biomarcadores/sangue , Doenças das Artérias Carótidas/diagnóstico , Estudo de Associação Genômica Ampla , Proteoma/análise , Adulto , Idoso , Idoso de 80 Anos ou mais , Doenças das Artérias Carótidas/genética , Feminino , Genótipo , Humanos , Lectinas Tipo C/análise , Masculino , Pessoa de Meia-Idade , Razão de Chances , Fenótipo , Polimorfismo de Nucleotídeo Único , Proteômica , Receptor beta de Fator de Crescimento Derivado de Plaquetas/sangue
12.
BMC Med ; 17(1): 135, 2019 07 17.
Artigo em Inglês | MEDLINE | ID: mdl-31311600

RESUMO

BACKGROUND: Non-alcoholic fatty liver disease (NAFLD) is a common chronic liver illness with a genetically heterogeneous background that can be accompanied by considerable morbidity and attendant health care costs. The pathogenesis and progression of NAFLD is complex with many unanswered questions. We conducted genome-wide association studies (GWASs) using both adult and pediatric participants from the Electronic Medical Records and Genomics (eMERGE) Network to identify novel genetic contributors to this condition. METHODS: First, a natural language processing (NLP) algorithm was developed, tested, and deployed at each site to identify 1106 NAFLD cases and 8571 controls and histological data from liver tissue in 235 available participants. These include 1242 pediatric participants (396 cases, 846 controls). The algorithm included billing codes, text queries, laboratory values, and medication records. Next, GWASs were performed on NAFLD cases and controls and case-only analyses using histologic scores and liver function tests adjusting for age, sex, site, ancestry, PC, and body mass index (BMI). RESULTS: Consistent with previous results, a robust association was detected for the PNPLA3 gene cluster in participants with European ancestry. At the PNPLA3-SAMM50 region, three SNPs, rs738409, rs738408, and rs3747207, showed strongest association (best SNP rs738409 p = 1.70 × 10- 20). This effect was consistent in both pediatric (p = 9.92 × 10- 6) and adult (p = 9.73 × 10- 15) cohorts. Additionally, this variant was also associated with disease severity and NAFLD Activity Score (NAS) (p = 3.94 × 10- 8, beta = 0.85). PheWAS analysis link this locus to a spectrum of liver diseases beyond NAFLD with a novel negative correlation with gout (p = 1.09 × 10- 4). We also identified novel loci for NAFLD disease severity, including one novel locus for NAS score near IL17RA (rs5748926, p = 3.80 × 10- 8), and another near ZFP90-CDH1 for fibrosis (rs698718, p = 2.74 × 10- 11). Post-GWAS and gene-based analyses identified more than 300 genes that were used for functional and pathway enrichment analyses. CONCLUSIONS: In summary, this study demonstrates clear confirmation of a previously described NAFLD risk locus and several novel associations. Further collaborative studies including an ethnically diverse population with well-characterized liver histologic features of NAFLD are needed to further validate the novel findings.


Assuntos
Hepatopatia Gordurosa não Alcoólica/genética , Adulto , Idoso , Índice de Massa Corporal , Estudos de Casos e Controles , Redes Comunitárias/organização & administração , Redes Comunitárias/estatística & dados numéricos , Progressão da Doença , Registros Eletrônicos de Saúde/organização & administração , Registros Eletrônicos de Saúde/estatística & dados numéricos , Feminino , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Genômica/organização & administração , Genômica/estatística & dados numéricos , Humanos , Lipase/genética , Masculino , Proteínas de Membrana/genética , Pessoa de Meia-Idade , Morbidade , Hepatopatia Gordurosa não Alcoólica/epidemiologia , Fenótipo , Polimorfismo de Nucleotídeo Único , Transdução de Sinais/genética
13.
J Biomed Inform ; 99: 103293, 2019 11.
Artigo em Inglês | MEDLINE | ID: mdl-31542521

RESUMO

BACKGROUND: Implementation of phenotype algorithms requires phenotype engineers to interpret human-readable algorithms and translate the description (text and flowcharts) into computable phenotypes - a process that can be labor intensive and error prone. To address the critical need for reducing the implementation efforts, it is important to develop portable algorithms. METHODS: We conducted a retrospective analysis of phenotype algorithms developed in the Electronic Medical Records and Genomics (eMERGE) network and identified common customization tasks required for implementation. A novel scoring system was developed to quantify portability from three aspects: Knowledge conversion, clause Interpretation, and Programming (KIP). Tasks were grouped into twenty representative categories. Experienced phenotype engineers were asked to estimate the average time spent on each category and evaluate time saving enabled by a common data model (CDM), specifically the Observational Medical Outcomes Partnership (OMOP) model, for each category. RESULTS: A total of 485 distinct clauses (phenotype criteria) were identified from 55 phenotype algorithms, corresponding to 1153 customization tasks. In addition to 25 non-phenotype-specific tasks, 46 tasks are related to interpretation, 613 tasks are related to knowledge conversion, and 469 tasks are related to programming. A score between 0 and 2 (0 for easy, 1 for moderate, and 2 for difficult portability) is assigned for each aspect, yielding a total KIP score range of 0 to 6. The average clause-wise KIP score to reflect portability is 1.37 ±â€¯1.38. Specifically, the average knowledge (K) score is 0.64 ±â€¯0.66, interpretation (I) score is 0.33 ±â€¯0.55, and programming (P) score is 0.40 ±â€¯0.64. 5% of the categories can be completed within one hour (median). 70% of the categories take from days to months to complete. The OMOP model can assist with vocabulary mapping tasks. CONCLUSION: This study presents firsthand knowledge of the substantial implementation efforts in phenotyping and introduces a novel metric (KIP) to measure portability of phenotype algorithms for quantifying such efforts across the eMERGE Network. Phenotype developers are encouraged to analyze and optimize the portability in regards to knowledge, interpretation and programming. CDMs can be used to improve the portability for some 'knowledge-oriented' tasks.


Assuntos
Registros Eletrônicos de Saúde/classificação , Informática Médica/métodos , Algoritmos , Genômica , Humanos , Fenótipo , Estudos Retrospectivos
14.
J Biomed Inform ; 96: 103253, 2019 08.
Artigo em Inglês | MEDLINE | ID: mdl-31325501

RESUMO

BACKGROUND: Implementing clinical phenotypes across a network is labor intensive and potentially error prone. Use of a common data model may facilitate the process. METHODS: Electronic Medical Records and Genomics (eMERGE) sites implemented the Observational Health Data Sciences and Informatics (OHDSI) Observational Medical Outcomes Partnership (OMOP) Common Data Model across their electronic health record (EHR)-linked DNA biobanks. Two previously implemented eMERGE phenotypes were converted to OMOP and implemented across the network. RESULTS: It was feasible to implement the common data model across sites, with laboratory data producing the greatest challenge due to local encoding. Sites were then able to execute the OMOP phenotype in less than one day, as opposed to weeks of effort to manually implement an eMERGE phenotype in their bespoke research EHR databases. Of the sites that could compare the current OMOP phenotype implementation with the original eMERGE phenotype implementation, specific agreement ranged from 100% to 43%, with disagreements due to the original phenotype, the OMOP phenotype, changes in data, and issues in the databases. Using the OMOP query as a standard comparison revealed differences in the original implementations despite starting from the same definitions, code lists, flowcharts, and pseudocode. CONCLUSION: Using a common data model can dramatically speed phenotype implementation at the cost of having to populate that data model, though this will produce a net benefit as the number of phenotype implementations increases. Inconsistencies among the implementations of the original queries point to a potential benefit of using a common data model so that actual phenotype code and logic can be shared, mitigating human error in reinterpretation of a narrative phenotype definition.


Assuntos
Transtorno do Deficit de Atenção com Hiperatividade/diagnóstico , Bases de Dados Factuais , Diabetes Mellitus Tipo 2/diagnóstico , Registros Eletrônicos de Saúde , Coleta de Dados , Humanos , Informática Médica , National Human Genome Research Institute (U.S.) , Estudos Observacionais como Assunto , Avaliação de Resultados em Cuidados de Saúde , Fenótipo , Projetos de Pesquisa , Software , Estados Unidos
15.
Hum Mol Genet ; 25(18): 4127-4142, 2016 09 15.
Artigo em Inglês | MEDLINE | ID: mdl-27559109

RESUMO

More than a million childhood diarrhoeal episodes occur worldwide each year, and in developed countries a considerable part of them are caused by viral infections. In this study, we aimed to search for genetic variants associated with diarrhoeal disease in young children by meta-analyzing genome-wide association studies, and to elucidate plausible biological mechanisms. The study was conducted in the context of the Early Genetics and Lifecourse Epidemiology (EAGLE) consortium. Data about diarrhoeal disease in two time windows (around 1 year of age and around 2 years of age) was obtained via parental questionnaires, doctor interviews or medical records. Standard quality control and statistical tests were applied to the 1000 Genomes imputed genotypic data. The meta-analysis (N = 5758) followed by replication (N = 3784) identified a genome-wide significant association between rs8111874 and diarrhoea at age 1 year. Conditional analysis suggested that the causal variant could be rs601338 (W154X) in the FUT2 gene. Children with the A allele, which results in a truncated FUT2 protein, had lower risk of diarrhoea. FUT2 participates in the production of histo-blood group antigens and has previously been implicated in the susceptibility to infections, including Rotavirus and Norovirus Gene-set enrichment analysis suggested pathways related to the histo-blood group antigen production, and the regulation of ion transport and blood pressure. Among others, the gastrointestinal tract, and the immune and neuro-secretory systems were detected as relevant organs. In summary, this genome-wide association meta-analysis suggests the implication of the FUT2 gene in diarrhoeal disease in young children from the general population.


Assuntos
Diarreia/genética , Fucosiltransferases/genética , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Alelos , Pré-Escolar , Diarreia/patologia , Feminino , Genótipo , Humanos , Lactente , Masculino , Polimorfismo de Nucleotídeo Único , Galactosídeo 2-alfa-L-Fucosiltransferase
16.
Pharmacogenet Genomics ; 28(11): 256-259, 2018 11.
Artigo em Inglês | MEDLINE | ID: mdl-30334910

RESUMO

Asthma is the leading chronic disease in children. Several studies have identified genetic biomarkers associated with susceptibility and severity in both adult and pediatric cases. In this study, we evaluated outcomes in 400 African American and European American pediatric cases all of whom were regular users of inhaled corticosteroids. Patients were stratified by genotype using two single nucleotide polymorphisms in the ß-2-adrenergic receptor (ADRB2) gene - rs1042713 and rs1042714, previously associated with asthma outcome. These correspond to nonsynonymous single nucleotide polymorphisms at positions 16 [arginine to glycine (Arg16Gly); rs1042713] and 27 [glutamic acid to glutamine (Glu27Gln); rs1042714], which are relatively common (minor allele frequencies ∼40-50%), and have been well characterized in asthma pharmacogenetics. We controlled for adherence to the National Heart, Lung and Blood Institute guidelines using deep mining of electronic health record data to determine treatment course. We found no significant effect for rs1042713 (Arg16Gly) but did identify an effect for rs1042714, where participants homozygous for Gln27 had increased exacerbations while taking inhaled corticosteroids in comparison with those who were either heterozygous or homozygous for Glu27. This is consistent with previous studies and demonstrates for the first time that the Glu27 variant in the ADRB2 gene is associated with increased frequencies of asthma exacerbations. Moreover, this study also lends an important proof-of-principle on how electronic health records linked to genotype can be efficiently and systematically mined to delineate health outcomes.


Assuntos
Corticosteroides/efeitos adversos , Asma/genética , Predisposição Genética para Doença , Receptores Adrenérgicos beta 2/genética , Adolescente , Corticosteroides/uso terapêutico , Adulto , Alelos , Asma/patologia , Criança , Registros Eletrônicos de Saúde , Feminino , Frequência do Gene , Genótipo , Heterozigoto , Humanos , Masculino , Farmacogenética , Polimorfismo de Nucleotídeo Único/genética , Adulto Jovem
17.
Hum Mol Genet ; 24(4): 1155-68, 2015 Feb 15.
Artigo em Inglês | MEDLINE | ID: mdl-25281659

RESUMO

Common genetic variants have been identified for adult height, but not much is known about the genetics of skeletal growth in early life. To identify common genetic variants that influence fetal skeletal growth, we meta-analyzed 22 genome-wide association studies (Stage 1; N = 28 459). We identified seven independent top single nucleotide polymorphisms (SNPs) (P < 1 × 10(-6)) for birth length, of which three were novel and four were in or near loci known to be associated with adult height (LCORL, PTCH1, GPR126 and HMGA2). The three novel SNPs were followed-up in nine replication studies (Stage 2; N = 11 995), with rs905938 in DC-STAMP domain containing 2 (DCST2) genome-wide significantly associated with birth length in a joint analysis (Stages 1 + 2; ß = 0.046, SE = 0.008, P = 2.46 × 10(-8), explained variance = 0.05%). Rs905938 was also associated with infant length (N = 28 228; P = 5.54 × 10(-4)) and adult height (N = 127 513; P = 1.45 × 10(-5)). DCST2 is a DC-STAMP-like protein family member and DC-STAMP is an osteoclast cell-fusion regulator. Polygenic scores based on 180 SNPs previously associated with human adult stature explained 0.13% of variance in birth length. The same SNPs explained 2.95% of the variance of infant length. Of the 180 known adult height loci, 11 were genome-wide significantly associated with infant length (SF3B4, LCORL, SPAG17, C6orf173, PTCH1, GDF5, ZNFX1, HHIP, ACAN, HLA locus and HMGA2). This study highlights that common variation in DCST2 influences variation in early growth and adult height.


Assuntos
Proteínas Adaptadoras de Transdução de Sinal/genética , Estatura/genética , Estudos de Associação Genética , Variação Genética , Proteínas de Membrana/genética , Proteínas Adaptadoras de Transdução de Sinal/metabolismo , Adulto , Fatores Etários , Alelos , Biologia Computacional , Bases de Dados Genéticas , Genótipo , Humanos , Recém-Nascido , Proteínas de Membrana/metabolismo , Fenótipo , Polimorfismo de Nucleotídeo Único , Locos de Características Quantitativas , Característica Quantitativa Herdável , Reprodutibilidade dos Testes
18.
J Immunol ; 195(4): 1599-607, 2015 Aug 15.
Artigo em Inglês | MEDLINE | ID: mdl-26188062

RESUMO

Food allergy is a significant public health concern, especially among children. Previous candidate gene studies suggested a few susceptibility loci for food allergy, but no study investigated the contribution of copy number variations (CNVs) to food allergy on a genome-wide scale. To investigate the genetics of food allergy, we performed CNV assessment using high-resolution genome-wide single nucleotide polymorphism arrays. CNV calls from a total of 357 cases with confirmed food allergy and 3980 controls were analyzed within a discovery cohort, followed by a replication analysis composed of 167 cases and 1573 controls. We identified that CNVs in CTNNA3 were significantly associated with food allergy in both the discovery cohort and the replication cohort. Of particular interest, CTNNA3 CNVs hit exons or intron regions rich in histone marker H3K4Me1. CNVs in a second gene (RBFOX1) showed a significant association (p = 7.35 × 10(-5)) with food allergy at the genome-wide level in our meta-analysis of the European ancestry cohorts. The presence of these CNVs was confirmed by quantitative PCR. Furthermore, knockdown of CTNNA3 resulted in upregulation of CD63 and CD203c in mononuclear cells upon PMA stimulation, suggesting a role in sensitization to allergen. We uncovered at least two plausible genes harboring CNV loci that are enriched in pediatric patients with food allergies. The novel gene candidates discovered in this study by genome-wide CNV analysis are compelling drug and diagnostic targets for food allergy.


Assuntos
Variações do Número de Cópias de DNA , Hipersensibilidade Alimentar/genética , Hipersensibilidade Alimentar/imunologia , Predisposição Genética para Doença , Proteínas de Ligação a RNA/genética , alfa Catenina/genética , Adolescente , Fatores Etários , Estudos de Casos e Controles , Criança , Pré-Escolar , Feminino , Deleção de Genes , Estudos de Associação Genética , Humanos , Masculino , Metanálise como Assunto , Pessoa de Meia-Idade , Polimorfismo de Nucleotídeo Único , Fatores de Processamento de RNA , RNA Interferente Pequeno , Reprodutibilidade dos Testes
19.
Neuroimage ; 124(Pt B): 1115-1119, 2016 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-25840117

RESUMO

The Philadelphia Neurodevelopmental Cohort (PNC) is a large-scale study of child development that combines neuroimaging, diverse clinical and cognitive phenotypes, and genomics. Data from this rich resource is now publicly available through the Database of Genotypes and Phenotypes (dbGaP). Here we focus on the data from the PNC that is available through dbGaP and describe how users can access this data, which is evolving to be a significant resource for the broader neuroscience community for studies of normal and abnormal neurodevelopment.


Assuntos
Encéfalo/anormalidades , Encéfalo/crescimento & desenvolvimento , Deficiências do Desenvolvimento/patologia , Deficiências do Desenvolvimento/psicologia , Disseminação de Informação , Sistema Nervoso/crescimento & desenvolvimento , Adolescente , Criança , Desenvolvimento Infantil , Cognição , Feminino , Genômica , Humanos , Internet , Masculino , Neuroimagem
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA