Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 18 de 18
Filtrar
1.
Nat Aging ; 2024 Jun 04.
Artículo en Inglés | MEDLINE | ID: mdl-38834882

RESUMEN

Clonal hematopoiesis of indeterminate potential (CHIP), whereby somatic mutations in hematopoietic stem cells confer a selective advantage and drive clonal expansion, not only correlates with age but also confers increased risk of morbidity and mortality. Here, we leverage genetically predicted traits to identify factors that determine CHIP clonal expansion rate. We used the passenger-approximated clonal expansion rate method to quantify the clonal expansion rate for 4,370 individuals in the National Heart, Lung, and Blood Institute (NHLBI) Trans-Omics for Precision Medicine (TOPMed) cohort and calculated polygenic risk scores for DNA methylation aging, inflammation-related measures and circulating protein levels. Clonal expansion rate was significantly associated with both genetically predicted and measured epigenetic clocks. No associations were identified with inflammation-related lab values or diseases and CHIP expansion rate overall. A proteome-wide search identified predicted circulating levels of myeloid zinc finger 1 and anti-Müllerian hormone as associated with an increased CHIP clonal expansion rate and tissue inhibitor of metalloproteinase 1 and glycine N-methyltransferase as associated with decreased CHIP clonal expansion rate. Together, our findings identify epigenetic and proteomic patterns associated with the rate of hematopoietic clonal expansion.

2.
Nat Commun ; 15(1): 3800, 2024 May 07.
Artículo en Inglés | MEDLINE | ID: mdl-38714703

RESUMEN

Clonal hematopoiesis (CH) is characterized by the acquisition of a somatic mutation in a hematopoietic stem cell that results in a clonal expansion. These driver mutations can be single nucleotide variants in cancer driver genes or larger structural rearrangements called mosaic chromosomal alterations (mCAs). The factors that influence the variations in mCA fitness and ultimately result in different clonal expansion rates are not well understood. We used the Passenger-Approximated Clonal Expansion Rate (PACER) method to estimate clonal expansion rate as PACER scores for 6,381 individuals in the NHLBI TOPMed cohort with gain, loss, and copy-neutral loss of heterozygosity mCAs. Our mCA fitness estimates, derived by aggregating per-individual PACER scores, were correlated (R2 = 0.49) with an alternative approach that estimated fitness of mCAs in the UK Biobank using population-level distributions of clonal fraction. Among individuals with JAK2 V617F clonal hematopoiesis of indeterminate potential or mCAs affecting the JAK2 gene on chromosome 9, PACER score was strongly correlated with erythrocyte count. In a cross-sectional analysis, genome-wide association study of estimates of mCA expansion rate identified a TCL1A locus variant associated with mCA clonal expansion rate, with suggestive variants in NRIP1 and TERT.


Asunto(s)
Aberraciones Cromosómicas , Hematopoyesis Clonal , Mosaicismo , Humanos , Hematopoyesis Clonal/genética , Masculino , Femenino , Estudio de Asociación del Genoma Completo , Janus Quinasa 2/genética , Telomerasa/genética , Telomerasa/metabolismo , Pérdida de Heterocigocidad , Estudios Transversales , Mutación , Persona de Mediana Edad , Células Madre Hematopoyéticas/metabolismo , Polimorfismo de Nucleótido Simple , Anciano
3.
medRxiv ; 2023 Oct 21.
Artículo en Inglés | MEDLINE | ID: mdl-37905118

RESUMEN

Clonal hematopoiesis (CH) is characterized by the acquisition of a somatic mutation in a hematopoietic stem cell that results in a clonal expansion. These driver mutations can be single nucleotide variants in cancer driver genes or larger structural rearrangements called mosaic chromosomal alterations (mCAs). The factors that influence the variations in mCA fitness and ultimately result in different clonal expansion rates are not well-understood. We used the Passenger-Approximated Clonal Expansion Rate (PACER) method to estimate clonal expansion rate for 6,381 individuals in the NHLBI TOPMed cohort with gain, loss, and copy-neutral loss of heterozygosity mCAs. Our estimates of mCA fitness were correlated (R 2 = 0.49) with an alternative approach that estimated fitness of mCAs in the UK Biobank using a theoretical probability distribution. Individuals with lymphoid-associated mCAs had a significantly higher white blood cell count and faster clonal expansion rate. In a cross-sectional analysis, genome-wide association study of estimates of mCA expansion rate identified TCL1A , NRIP1 , and TERT locus variants as modulators of mCA clonal expansion rate.

4.
bioRxiv ; 2023 Oct 24.
Artículo en Inglés | MEDLINE | ID: mdl-37745614

RESUMEN

The effects of genetic variation on complex traits act mainly through changes in gene regulation. Although many genetic variants have been linked to target genes in cis, the trans-regulatory cascade mediating their effects remains largely uncharacterized. Mapping trans-regulators based on natural genetic variation, including eQTL mapping, has been challenging due to small effects. Experimental perturbation approaches offer a complementary and powerful approach to mapping trans-regulators. We used CRISPR knockouts of 84 genes in primary CD4+ T cells to perturb an immune cell gene network, targeting both inborn error of immunity (IEI) disease transcription factors (TFs) and background TFs matched in constraint and expression level, but without a known immune disease association. We developed a novel Bayesian structure learning method called Linear Latent Causal Bayes (LLCB) to estimate the gene regulatory network from perturbation data and observed 211 directed edges among the genes which could not be detected in existing CD4+ trans-eQTL data. We used LLCB to characterize the differences between the IEI and background TFs, finding that the gene groups were highly interconnected, but that IEI TFs were much more likely to regulate immune cell specific pathways and immune GWAS genes. We further characterized nine coherent gene programs based on downstream effects of the TFs and linked these modules to regulation of GWAS genes, finding that canonical JAK-STAT family members are regulated by KMT2A, a global epigenetic regulator. These analyses reveal the trans-regulatory cascade from upstream epigenetic regulator to intermediate TFs to downstream effector cytokines and elucidate the logic linking immune GWAS genes to key signaling pathways.

5.
bioRxiv ; 2023 Jul 31.
Artículo en Inglés | MEDLINE | ID: mdl-37577640

RESUMEN

Due to the abundance of single cell RNA-seq data, a number of methods for predicting expression after perturbation have recently been published. Expression prediction methods are enticing because they promise to answer pressing questions in fields ranging from developmental genetics to cell fate engineering and because they are faster, cheaper, and higher-throughput than their experimental counterparts. However, the absolute and relative accuracy of these methods is poorly characterized, limiting their informed use, their improvement, and the interpretation of their predictions. To address these issues, we created a benchmarking platform that combines a panel of large-scale perturbation datasets with an expression forecasting software engine that encompasses or interfaces to current methods. We used our platform to systematically assess methods, parameters, and sources of auxiliary data. We found that uninformed baseline predictions, which were not always included in prior evaluations, yielded the same or better mean absolute error than benchmarked methods in all test cases. These results cast doubt on the ability of current expression forecasting methods to provide mechanistic insights or to rank hypotheses for experimental follow-up. However, given the rapid pace of innovation in the field, new approaches may yield more accurate expression predictions. Our platform will serve as a neutral benchmark to improve methods and to identify contexts in which expression prediction can succeed.

6.
Sci Adv ; 9(17): eabm4945, 2023 04 28.
Artículo en Inglés | MEDLINE | ID: mdl-37126548

RESUMEN

Nononcogenic somatic mutations are thought to be uncommon and inconsequential. To test this, we analyzed 43,693 National Heart, Lung and Blood Institute Trans-Omics for Precision Medicine blood whole genomes from 37 cohorts and identified 7131 non-missense somatic mutations that are recurrently mutated in at least 50 individuals. These recurrent non-missense somatic mutations (RNMSMs) are not clearly explained by other clonal phenomena such as clonal hematopoiesis. RNMSM prevalence increased with age, with an average 50-year-old having 27 RNMSMs. Inherited germline variation associated with RNMSM acquisition. These variants were found in genes involved in adaptive immune function, proinflammatory cytokine production, and lymphoid lineage commitment. In addition, the presence of eight specific RNMSMs associated with blood cell traits at effect sizes comparable to Mendelian genetic mutations. Overall, we found that somatic mutations in blood are an unexpectedly common phenomenon with ancestry-specific determinants and human health consequences.


Asunto(s)
Mutación de Línea Germinal , Hematopoyesis , Humanos , Persona de Mediana Edad , Mutación , Mutación Missense , Fenotipo
7.
Nature ; 616(7958): 755-763, 2023 04.
Artículo en Inglés | MEDLINE | ID: mdl-37046083

RESUMEN

Mutations in a diverse set of driver genes increase the fitness of haematopoietic stem cells (HSCs), leading to clonal haematopoiesis1. These lesions are precursors for blood cancers2-6, but the basis of their fitness advantage remains largely unknown, partly owing to a paucity of large cohorts in which the clonal expansion rate has been assessed by longitudinal sampling. Here, to circumvent this limitation, we developed a method to infer the expansion rate from data from a single time point. We applied this method to 5,071 people with clonal haematopoiesis. A genome-wide association study revealed that a common inherited polymorphism in the TCL1A promoter was associated with a slower expansion rate in clonal haematopoiesis overall, but the effect varied by driver gene. Those carrying this protective allele exhibited markedly reduced growth rates or prevalence of clones with driver mutations in TET2, ASXL1, SF3B1 and SRSF2, but this effect was not seen in clones with driver mutations in DNMT3A. TCL1A was not expressed in normal or DNMT3A-mutated HSCs, but the introduction of mutations in TET2 or ASXL1 led to the expression of TCL1A protein and the expansion of HSCs in vitro. The protective allele restricted TCL1A expression and expansion of mutant HSCs, as did experimental knockdown of TCL1A expression. Forced expression of TCL1A promoted the expansion of human HSCs in vitro and mouse HSCs in vivo. Our results indicate that the fitness advantage of several commonly mutated driver genes in clonal haematopoiesis may be mediated by TCL1A activation.


Asunto(s)
Hematopoyesis Clonal , Células Madre Hematopoyéticas , Animales , Humanos , Ratones , Alelos , Hematopoyesis Clonal/genética , Estudio de Asociación del Genoma Completo , Hematopoyesis/genética , Células Madre Hematopoyéticas/citología , Células Madre Hematopoyéticas/metabolismo , Mutación , Regiones Promotoras Genéticas
8.
Nat Commun ; 13(1): 5350, 2022 09 12.
Artículo en Inglés | MEDLINE | ID: mdl-36097025

RESUMEN

Age-related changes to the genome-wide DNA methylation (DNAm) pattern observed in blood are well-documented. Clonal hematopoiesis of indeterminate potential (CHIP), characterized by the age-related acquisition and expansion of leukemogenic mutations in hematopoietic stem cells (HSCs), is associated with blood cancer and coronary artery disease (CAD). Epigenetic regulators DNMT3A and TET2 are the two most frequently mutated CHIP genes. Here, we present results from an epigenome-wide association study for CHIP in 582 Cardiovascular Health Study (CHS) participants, with replication in 2655 Atherosclerosis Risk in Communities (ARIC) Study participants. We show that DNMT3A and TET2 CHIP have distinct and directionally opposing genome-wide DNAm association patterns consistent with their regulatory roles, albeit both promoting self-renewal of HSCs. Mendelian randomization analyses indicate that a subset of DNAm alterations associated with these two leading CHIP genes may promote the risk for CAD.


Asunto(s)
Hematopoyesis Clonal , Enfermedad de la Arteria Coronaria , Hematopoyesis Clonal/genética , Enfermedad de la Arteria Coronaria/genética , Metilación de ADN/genética , Hematopoyesis/genética , Células Madre Hematopoyéticas , Humanos
9.
Cell Genom ; 2(1)2022 Jan 12.
Artículo en Inglés | MEDLINE | ID: mdl-35530816

RESUMEN

Genetic studies on telomere length are important for understanding age-related diseases. Prior GWAS for leukocyte TL have been limited to European and Asian populations. Here, we report the first sequencing-based association study for TL across ancestrally-diverse individuals (European, African, Asian and Hispanic/Latino) from the NHLBI Trans-Omics for Precision Medicine (TOPMed) program. We used whole genome sequencing (WGS) of whole blood for variant genotype calling and the bioinformatic estimation of telomere length in n=109,122 individuals. We identified 59 sentinel variants (p-value <5×10-9) in 36 loci associated with telomere length, including 20 newly associated loci (13 were replicated in external datasets). There was little evidence of effect size heterogeneity across populations. Fine-mapping at OBFC1 indicated the independent signals colocalized with cell-type specific eQTLs for OBFC1 (STN1). Using a multi-variant gene-based approach, we identified two genes newly implicated in telomere length, DCLRE1B (SNM1B) and PARN. In PheWAS, we demonstrated our TL polygenic trait scores (PTS) were associated with increased risk of cancer-related phenotypes.

10.
Sci Adv ; 8(14): eabl6579, 2022 Apr 08.
Artículo en Inglés | MEDLINE | ID: mdl-35385311

RESUMEN

Human genetic studies support an inverse causal relationship between leukocyte telomere length (LTL) and coronary artery disease (CAD), but directionally mixed effects for LTL and diverse malignancies. Clonal hematopoiesis of indeterminate potential (CHIP), characterized by expansion of hematopoietic cells bearing leukemogenic mutations, predisposes both hematologic malignancy and CAD. TERT (which encodes telomerase reverse transcriptase) is the most significantly associated germline locus for CHIP in genome-wide association studies. Here, we investigated the relationship between CHIP, LTL, and CAD in the Trans-Omics for Precision Medicine (TOPMed) program (n = 63,302) and UK Biobank (n = 47,080). Bidirectional Mendelian randomization studies were consistent with longer genetically imputed LTL increasing propensity to develop CHIP, but CHIP then, in turn, hastens to shorten measured LTL (mLTL). We also demonstrated evidence of modest mediation between CHIP and CAD by mLTL. Our data promote an understanding of potential causal relationships across CHIP and LTL toward prevention of CAD.

11.
J Clin Invest ; 132(4)2022 02 15.
Artículo en Inglés | MEDLINE | ID: mdl-34990411

RESUMEN

BACKGROUNDCurative gene therapies for sickle cell disease (SCD) are currently undergoing clinical evaluation. The occurrence of myeloid malignancies in these trials has prompted safety concerns. Individuals with SCD are predisposed to myeloid malignancies, but the underlying causes remain undefined. Clonal hematopoiesis (CH) is a premalignant condition that also confers significant predisposition to myeloid cancers. While it has been speculated that CH may play a role in SCD-associated cancer predisposition, limited data addressing this issue have been reported.METHODSHere, we leveraged 74,190 whole-genome sequences to robustly study CH in SCD. Somatic mutation calling methods were used to assess CH in all samples and comparisons between individuals with and without SCD were performed.RESULTSWhile we had sufficient power to detect a greater than 2-fold increased rate of CH, we found no detectable variation in rate or clone properties between individuals affected by SCD and controls. The rate of CH in individuals with SCD was unaltered by hydroxyurea use.CONCLUSIONSWe did not observe an increased risk for acquiring detectable CH in SCD, at least as measured by whole-genome sequencing. These results should help guide ongoing efforts and further studies that seek to better define the risk factors underlying myeloid malignancy predisposition in SCD and help ensure that curative therapies can be more safely applied.FUNDINGNew York Stem Cell Foundation and the NIH.


Asunto(s)
Anemia de Células Falciformes/genética , Hematopoyesis Clonal/genética , Anemia de Células Falciformes/terapia , Femenino , Humanos , Masculino , Secuenciación Completa del Genoma
12.
Blood Cancer Discov ; 2(5): 500-517, 2021 09.
Artículo en Inglés | MEDLINE | ID: mdl-34568833

RESUMEN

Clonal hematopoiesis results from somatic mutations in cancer driver genes in hematopoietic stem cells. We sought to identify novel drivers of clonal expansion using an unbiased analysis of sequencing data from 84,683 persons and identified common mutations in the 5-methylcytosine reader, ZBTB33, as well as in YLPM1, SRCAP, and ZNF318. We also identified these mutations at low frequency in myelodysplastic syndrome patients. Zbtb33 edited mouse hematopoietic stem and progenitor cells exhibited a competitive advantage in vivo and increased genome-wide intron retention. ZBTB33 mutations potentially link DNA methylation and RNA splicing, the two most commonly mutated pathways in clonal hematopoiesis and MDS.


Asunto(s)
Hematopoyesis Clonal , Síndromes Mielodisplásicos , Animales , Hematopoyesis/genética , Células Madre Hematopoyéticas , Humanos , Ratones , Síndromes Mielodisplásicos/genética , Empalme del ARN/genética , Factores de Transcripción/genética
14.
J Am Heart Assoc ; 10(5): e018789, 2021 02.
Artículo en Inglés | MEDLINE | ID: mdl-33619969

RESUMEN

Background Presence of clonal hematopoiesis of indeterminate potential (CHIP) is associated with a higher risk of atherosclerotic cardiovascular disease, cancer, and mortality. The relationship between a healthy lifestyle and CHIP is unknown. Methods and Results This analysis included 8709 postmenopausal women (mean age, 66.5 years) enrolled in the WHI (Women's Health Initiative), free of cancer or cardiovascular disease, with deep-coverage whole genome sequencing data available. Information on lifestyle factors (body mass index, smoking, physical activity, and diet quality) was obtained, and a healthy lifestyle score was created on the basis of healthy criteria met (0 point [least healthy] to 4 points [most healthy]). CHIP was derived on the basis of a prespecified list of leukemogenic driver mutations. The prevalence of CHIP was 8.6%. A higher healthy lifestyle score was not associated with CHIP (multivariable-adjusted odds ratio [OR] [95% CI], 0.99 [0.80-1.23] and 1.13 [0.93-1.37]) for the upper (3 or 4 points) and middle category (2 points), respectively, versus referent (0 or 1 point). Across score components, a normal and overweight body mass index compared with obese was significantly associated with a lower odds for CHIP (OR, 0.71 [95% CI, 0.57-0.88] and 0.83 [95% CI, 0.68-1.01], respectively; P-trend 0.0015). Having never smoked compared with being a current smoker tended to be associated with lower odds for CHIP. Conclusions A healthy lifestyle, based on a composite score, was not related to CHIP among postmenopausal women. However, across individual lifestyle factors, having a normal body mass index was strongly associated with a lower prevalence of CHIP. These findings support the idea that certain healthy lifestyle factors are associated with a lower frequency of CHIP.


Asunto(s)
Enfermedades Cardiovasculares/etiología , Hematopoyesis Clonal/fisiología , ADN/genética , Estilo de Vida , Posmenopausia , Salud de la Mujer , Anciano , Enfermedades Cardiovasculares/epidemiología , Enfermedades Cardiovasculares/genética , Femenino , Frecuencia de los Genes , Humanos , Persona de Mediana Edad , Prevalencia , Estudios Retrospectivos , Estados Unidos/epidemiología
15.
PLoS Genet ; 16(11): e1009077, 2020 11.
Artículo en Inglés | MEDLINE | ID: mdl-33175840

RESUMEN

Phenotypes extracted from Electronic Health Records (EHRs) are increasingly prevalent in genetic studies. EHRs contain hundreds of distinct clinical laboratory test results, providing a trove of health data beyond diagnoses. Such lab data is complex and lacks a ubiquitous coding scheme, making it more challenging than diagnosis data. Here we describe the first large-scale cross-health system genome-wide association study (GWAS) of EHR-based quantitative laboratory-derived phenotypes. We meta-analyzed 70 lab traits matched between the BioVU cohort from the Vanderbilt University Health System and the Michigan Genomics Initiative (MGI) cohort from Michigan Medicine. We show high replication of known association for these traits, validating EHR-based measurements as high-quality phenotypes for genetic analysis. Notably, our analysis provides the first replication for 699 previous GWAS associations across 46 different traits. We discovered 31 novel associations at genome-wide significance for 22 distinct traits, including the first reported associations for two lab-based traits. We replicated 22 of these novel associations in an independent tranche of BioVU samples. The summary statistics for all association tests are freely available to benefit other researchers. Finally, we performed mirrored analyses in BioVU and MGI to assess competing analytic practices for EHR lab traits. We find that using the mean of all available lab measurements provides a robust summary value, but alternate summarizations can improve power in certain circumstances. This study provides a proof-of-principle for cross health system GWAS and is a framework for future studies of quantitative EHR lab traits.


Asunto(s)
Registros Electrónicos de Salud/estadística & datos numéricos , Estudios de Asociación Genética/métodos , Estudio de Asociación del Genoma Completo/métodos , Bancos de Muestras Biológicas , Estudios de Cohortes , Registros Electrónicos de Salud/tendencias , Genómica , Humanos , Michigan , Fenotipo , Polimorfismo de Nucleótido Simple/genética , Carácter Cuantitativo Heredable
16.
Nature ; 586(7831): 763-768, 2020 10.
Artículo en Inglés | MEDLINE | ID: mdl-33057201

RESUMEN

Age is the dominant risk factor for most chronic human diseases, but the mechanisms through which ageing confers this risk are largely unknown1. The age-related acquisition of somatic mutations that lead to clonal expansion in regenerating haematopoietic stem cell populations has recently been associated with both haematological cancer2-4 and coronary heart disease5-this phenomenon is termed clonal haematopoiesis of indeterminate potential (CHIP)6. Simultaneous analyses of germline and somatic whole-genome sequences provide the opportunity to identify root causes of CHIP. Here we analyse high-coverage whole-genome sequences from 97,691 participants of diverse ancestries in the National Heart, Lung, and Blood Institute Trans-omics for Precision Medicine (TOPMed) programme, and identify 4,229 individuals with CHIP. We identify associations with blood cell, lipid and inflammatory traits that are specific to different CHIP driver genes. Association of a genome-wide set of germline genetic variants enabled the identification of three genetic loci associated with CHIP status, including one locus at TET2 that was specific to individuals of African ancestry. In silico-informed in vitro evaluation of the TET2 germline locus enabled the identification of a causal variant that disrupts a TET2 distal enhancer, resulting in increased self-renewal of haematopoietic stem cells. Overall, we observe that germline genetic variation shapes haematopoietic stem cell function, leading to CHIP through mechanisms that are specific to clonal haematopoiesis as well as shared mechanisms that lead to somatic mutations across tissues.


Asunto(s)
Hematopoyesis Clonal/genética , Predisposición Genética a la Enfermedad , Genoma Humano/genética , Secuenciación Completa del Genoma , Adulto , África/etnología , Anciano , Anciano de 80 o más Años , Población Negra/genética , Autorrenovación de las Células/genética , Proteínas de Unión al ADN/genética , Dioxigenasas , Femenino , Mutación de Línea Germinal/genética , Células Madre Hematopoyéticas/citología , Células Madre Hematopoyéticas/metabolismo , Humanos , Péptidos y Proteínas de Señalización Intracelular/genética , Masculino , Persona de Mediana Edad , National Heart, Lung, and Blood Institute (U.S.) , Fenotipo , Medicina de Precisión , Proteínas Proto-Oncogénicas/genética , Proteínas de Motivos Tripartitos/genética , Estados Unidos , alfa Carioferinas/genética
17.
Genet Epidemiol ; 43(7): 800-814, 2019 10.
Artículo en Inglés | MEDLINE | ID: mdl-31433078

RESUMEN

The power of genetic association analyses can be increased by jointly meta-analyzing multiple correlated phenotypes. Here, we develop a meta-analysis framework, Meta-MultiSKAT, that uses summary statistics to test for association between multiple continuous phenotypes and variants in a region of interest. Our approach models the heterogeneity of effects between studies through a kernel matrix and performs a variance component test for association. Using a genotype kernel, our approach can test for rare-variants and the combined effects of both common and rare-variants. To achieve robust power, within Meta-MultiSKAT, we developed fast and accurate omnibus tests combining different models of genetic effects, functional genomic annotations, multiple correlated phenotypes, and heterogeneity across studies. In addition, Meta-MultiSKAT accommodates situations where studies do not share exactly the same set of phenotypes or have differing correlation patterns among the phenotypes. Simulation studies confirm that Meta-MultiSKAT can maintain the type-I error rate at the exome-wide level of 2.5 × 10-6 . Further simulations under different models of association show that Meta-MultiSKAT can improve the power of detection from 23% to 38% on average over single phenotype-based meta-analysis approaches. We demonstrate the utility and improved power of Meta-MultiSKAT in the meta-analyses of four white blood cell subtype traits from the Michigan Genomics Initiative (MGI) and SardiNIA studies.


Asunto(s)
Estudios de Asociación Genética , Metaanálisis como Asunto , Frecuencia de los Genes/genética , Genotipo , Humanos , Italia , Leucocitos/metabolismo , Modelos Genéticos , Mutación/genética , Fenotipo
18.
Genet Epidemiol ; 43(8): 980-995, 2019 12.
Artículo en Inglés | MEDLINE | ID: mdl-31452258

RESUMEN

Array genotyping is a cost-effective and widely used tool that enables assessment of up to millions of genetic markers in hundreds of thousands of individuals. Genotyping array data are typically highly accurate but sensitive to mixing of DNA samples from multiple individuals before or during genotyping. Contaminated samples can lead to genotyping errors and consequently cause false positive signals or reduce power of association analyses. Here, we propose a new method to identify contaminated samples and the sources of contamination within a genotyping batch. Through analysis of array intensity and genotype data from intentionally mixed samples and 22,366 samples of the Michigan Genomics Initiative, an ongoing biobank-based study, we show that our method can reliably estimate contamination. We also show that identifying sources of contamination can implicate problematic sample processing steps and guide process improvements. Compared to existing methods, our approach can estimate the proportion of contaminating DNA more accurately, eliminate the need for external databases of allele frequencies, and provide contamination estimates that are more robust to the ancestral origin of the contaminating sample.


Asunto(s)
Contaminación de ADN , Técnicas de Genotipaje , ADN , Frecuencia de los Genes , Marcadores Genéticos , Genómica/métodos , Genotipo , Técnicas de Genotipaje/métodos , Humanos , Polimorfismo de Nucleótido Simple
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA