RESUMO
BACKGROUND: Dilated cardiomyopathy (DCM) is genetically heterogeneous, with >100 purported disease genes tested in clinical laboratories. However, many genes were originally identified based on candidate-gene studies that did not adequately account for background population variation. Here we define the frequency of rare variation in 2538 patients with DCM across protein-coding regions of 56 commonly tested genes and compare this to both 912 confirmed healthy controls and a reference population of 60 706 individuals to identify clinically interpretable genes robustly associated with dominant monogenic DCM. METHODS: We used the TruSight Cardio sequencing panel to evaluate the burden of rare variants in 56 putative DCM genes in 1040 patients with DCM and 912 healthy volunteers processed with identical sequencing and bioinformatics pipelines. We further aggregated data from 1498 patients with DCM sequenced in diagnostic laboratories and the Exome Aggregation Consortium database for replication and meta-analysis. RESULTS: Truncating variants in TTN and DSP were associated with DCM in all comparisons. Variants in MYH7, LMNA, BAG3, TNNT2, TNNC1, PLN, ACTC1, NEXN, TPM1, and VCL were significantly enriched in specific patient subsets, with the last 2 genes potentially contributing primarily to early-onset forms of DCM. Overall, rare variants in these 12 genes potentially explained 17% of cases in the outpatient clinic cohort representing a broad range of adult patients with DCM and 26% of cases in the diagnostic referral cohort enriched in familial and early-onset DCM. Although the absence of a significant excess in other genes cannot preclude a limited role in disease, such genes have limited diagnostic value because novel variants will be uninterpretable and their diagnostic yield is minimal. CONCLUSIONS: In the largest sequenced DCM cohort yet described, we observe robust disease association with 12 genes, highlighting their importance in DCM and translating into high interpretability in diagnostic testing. The other genes analyzed here will need to be rigorously evaluated in ongoing curation efforts to determine their validity as Mendelian DCM genes but have limited value in diagnostic testing in DCM at present. This data will contribute to community gene curation efforts and will reduce erroneous and inconclusive findings in diagnostic testing.
Assuntos
Proteínas Reguladoras de Apoptose/genética , Cardiomiopatia Dilatada/genética , Predisposição Genética para Doença , Testes Genéticos , Proteínas Adaptadoras de Transdução de Sinal/genética , Adolescente , Adulto , Cardiomiopatia Dilatada/diagnóstico , Exoma/genética , Feminino , Heterogeneidade Genética , Humanos , Masculino , Adulto JovemRESUMO
PURPOSE: Accurate discrimination of benign and pathogenic rare variation remains a priority for clinical genome interpretation. State-of-the-art machine learning variant prioritization tools are imprecise and ignore important parameters defining gene-disease relationships, e.g., distinct consequences of gain-of-function versus loss-of-function variants. We hypothesized that incorporating disease-specific information would improve tool performance. METHODS: We developed a disease-specific variant classifier, CardioBoost, that estimates the probability of pathogenicity for rare missense variants in inherited cardiomyopathies and arrhythmias. We assessed CardioBoost's ability to discriminate known pathogenic from benign variants, prioritize disease-associated variants, and stratify patient outcomes. RESULTS: CardioBoost has high global discrimination accuracy (precision recall area under the curve [AUC] 0.91 for cardiomyopathies; 0.96 for arrhythmias), outperforming existing tools (4-24% improvement). CardioBoost obtains excellent accuracy (cardiomyopathies 90.2%; arrhythmias 91.9%) for variants classified with >90% confidence, and increases the proportion of variants classified with high confidence more than twofold compared with existing tools. Variants classified as disease-causing are associated with both disease status and clinical severity, including a 21% increased risk (95% confidence interval [CI] 11-29%) of severe adverse outcomes by age 60 in patients with hypertrophic cardiomyopathy. CONCLUSIONS: A disease-specific variant classifier outperforms state-of-the-art genome-wide tools for rare missense variants in inherited cardiac conditions ( https://www.cardiodb.org/cardioboost/ ), highlighting broad opportunities for improved pathogenicity prediction through disease specificity.
Assuntos
Cardiomiopatias , Mutação de Sentido Incorreto , Algoritmos , Área Sob a Curva , Cardiomiopatias/diagnóstico , Cardiomiopatias/genética , Humanos , Pessoa de Meia-Idade , Mutação de Sentido Incorreto/genética , VirulênciaRESUMO
BACKGROUND: Cancer therapy-induced cardiomyopathy (CCM) is associated with cumulative drug exposures and preexisting cardiovascular disorders. These parameters incompletely account for substantial interindividual susceptibility to CCM. We hypothesized that rare variants in cardiomyopathy genes contribute to CCM. METHODS: We studied 213 patients with CCM from 3 cohorts: retrospectively recruited adults with diverse cancers (n=99), prospectively phenotyped adults with breast cancer (n=73), and prospectively phenotyped children with acute myeloid leukemia (n=41). Cardiomyopathy genes, including 9 prespecified genes, were sequenced. The prevalence of rare variants was compared between CCM cohorts and The Cancer Genome Atlas participants (n=2053), healthy volunteers (n=445), and an ancestry-matched reference population. Clinical characteristics and outcomes were assessed and stratified by genotypes. A prevalent CCM genotype was modeled in anthracycline-treated mice. RESULTS: CCM was diagnosed 0.4 to 9 years after chemotherapy; 90% of these patients received anthracyclines. Adult patients with CCM had cardiovascular risk factors similar to the US population. Among 9 prioritized genes, patients with CCM had more rare protein-altering variants than comparative cohorts ( P≤1.98e-04). Titin-truncating variants (TTNtvs) predominated, occurring in 7.5% of patients with CCM versus 1.1% of The Cancer Genome Atlas participants ( P=7.36e-08), 0.7% of healthy volunteers ( P=3.42e-06), and 0.6% of the reference population ( P=5.87e-14). Adult patients who had CCM with TTNtvs experienced more heart failure and atrial fibrillation ( P=0.003) and impaired myocardial recovery ( P=0.03) than those without. Consistent with human data, anthracycline-treated TTNtv mice and isolated TTNtv cardiomyocytes showed sustained contractile dysfunction unlike wild-type ( P=0.0004 and P<0.002, respectively). CONCLUSIONS: Unrecognized rare variants in cardiomyopathy-associated genes, particularly TTNtvs, increased the risk for CCM in children and adults, and adverse cardiac events in adults. Genotype, along with cumulative chemotherapy dosage and traditional cardiovascular risk factors, improves the identification of patients who have cancer at highest risk for CCM. CLINICAL TRIAL REGISTRATION: URL: https://www.clinicaltrials.gov . Unique identifiers: NCT01173341; AAML1031; NCT01371981.
Assuntos
Antineoplásicos/efeitos adversos , Cardiomiopatias/induzido quimicamente , Cardiomiopatias/genética , Variação Genética/genética , Neoplasias/tratamento farmacológico , Neoplasias/genética , Adulto , Idoso , Animais , Cardiomiopatias/epidemiologia , Estudos de Coortes , Feminino , Variação Genética/efeitos dos fármacos , Humanos , Masculino , Camundongos , Camundongos Endogâmicos C57BL , Camundongos Transgênicos , Pessoa de Meia-Idade , Neoplasias/epidemiologia , Estudos Prospectivos , Estudos RetrospectivosRESUMO
PURPOSE: Internationally adopted variant interpretation guidelines from the American College of Medical Genetics and Genomics (ACMG) are generic and require disease-specific refinement. Here we developed CardioClassifier ( http://www.cardioclassifier.org ), a semiautomated decision-support tool for inherited cardiac conditions (ICCs). METHODS: CardioClassifier integrates data retrieved from multiple sources with user-input case-specific information, through an interactive interface, to support variant interpretation. Combining disease- and gene-specific knowledge with variant observations in large cohorts of cases and controls, we refined 14 computational ACMG criteria and created three ICC-specific rules. RESULTS: We benchmarked CardioClassifier on 57 expertly curated variants and show full retrieval of all computational data, concordantly activating 87.3% of rules. A generic annotation tool identified fewer than half as many clinically actionable variants (64/219 vs. 156/219, Fisher's P = 1.1 × 10-18), with important false positives, illustrating the critical importance of disease and gene-specific annotations. CardioClassifier identified putatively disease-causing variants in 33.7% of 327 cardiomyopathy cases, comparable with leading ICC laboratories. Through addition of manually curated data, variants found in over 40% of cardiomyopathy cases are fully annotated, without requiring additional user-input data. CONCLUSION: CardioClassifier is an ICC-specific decision-support tool that integrates expertly curated computational annotations with case-specific data to generate fast, reproducible, and interactive variant pathogenicity reports, according to best practice guidelines.
Assuntos
Anormalidades Cardiovasculares/genética , Testes Genéticos , Genoma Humano/genética , Software , Anormalidades Cardiovasculares/diagnóstico , Anormalidades Cardiovasculares/patologia , Biologia Computacional , Técnicas de Apoio para a Decisão , Genômica , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , MutaçãoRESUMO
AIM: Hypertrophic cardiomyopathy (HCM) exhibits genetic heterogeneity that is dominated by variation in eight sarcomeric genes. Genetic variation in a large number of non-sarcomeric genes has also been implicated in HCM but not formally assessed. Here we used very large case and control cohorts to determine the extent to which variation in non-sarcomeric genes contributes to HCM. METHODS AND RESULTS: We sequenced known and putative HCM genes in a new large prospective HCM cohort (n = 804) and analysed data alongside the largest published series of clinically genotyped HCM patients (n = 6179), previously published HCM cohorts and reference population samples from the exome aggregation consortium (ExAC, n = 60 706) to assess variation in 31 genes implicated in HCM. We found no significant excess of rare (minor allele frequency < 1:10 000 in ExAC) protein-altering variants over controls for most genes tested and conclude that novel variants in these genes are rarely interpretable, even for genes with previous evidence of co-segregation (e.g. ACTN2). To provide an aid for variant interpretation, we integrated HCM gene sequence data with aggregated pedigree and functional data and suggest a means of assessing gene pathogenicity in HCM using this evidence. CONCLUSION: We show that genetic variation in the majority of non-sarcomeric genes implicated in HCM is not associated with the condition, reinforce the fact that the sarcomeric gene variation is the primary cause of HCM known to date and underscore that the aetiology of HCM is unknown in the majority of patients.
Assuntos
Cardiomiopatia Hipertrófica/genética , Genes/genética , Sarcômeros/genética , Estudos de Casos e Controles , Feminino , Variação Genética , Humanos , Masculino , Mutação/genética , Estudos ProspectivosRESUMO
Epigenetic modifications such as DNA methylation play a key role in gene regulation and disease susceptibility. However, little is known about the genome-wide frequency, localization, and function of methylation variation and how it is regulated by genetic and environmental factors. We utilized the Multiple Tissue Human Expression Resource (MuTHER) and generated Illumina 450K adipose methylome data from 648 twins. We found that individual CpGs had low variance and that variability was suppressed in promoters. We noted that DNA methylation variation was highly heritable (h(2)median = 0.34) and that shared environmental effects correlated with metabolic phenotype-associated CpGs. Analysis of methylation quantitative-trait loci (metQTL) revealed that 28% of CpGs were associated with nearby SNPs, and when overlapping them with adipose expression quantitative-trait loci (eQTL) from the same individuals, we found that 6% of the loci played a role in regulating both gene expression and DNA methylation. These associations were bidirectional, but there were pronounced negative associations for promoter CpGs. Integration of metQTL with adipose reference epigenomes and disease associations revealed significant enrichment of metQTL overlapping metabolic-trait or disease loci in enhancers (the strongest effects were for high-density lipoprotein cholesterol and body mass index [BMI]). We followed up with the BMI SNP rs713586, a cg01884057 metQTL that overlaps an enhancer upstream of ADCY3, and used bisulphite sequencing to refine this region. Our results showed widespread population invariability yet sequence dependence on adipose DNA methylation but that incorporating maps of regulatory elements aid in linking CpG variation to gene regulation and disease risk in a tissue-dependent manner.
Assuntos
Tecido Adiposo , Metilação de DNA , Polimorfismo de Nucleotídeo Único , Sequências Reguladoras de Ácido Nucleico , Índice de Massa Corporal , Mapeamento Cromossômico , Epigenômica , Feminino , Perfilação da Expressão Gênica , Regulação da Expressão Gênica , Genoma Humano , Humanos , Hibridização Genética , Análise de Sequência com Séries de Oligonucleotídeos , Fenótipo , Locos de Características Quantitativas , Análise de Sequência de DNA , Sulfitos/metabolismo , Gêmeos/genéticaRESUMO
While there have been studies exploring regulatory variation in one or more tissues, the complexity of tissue-specificity in multiple primary tissues is not yet well understood. We explore in depth the role of cis-regulatory variation in three human tissues: lymphoblastoid cell lines (LCL), skin, and fat. The samples (156 LCL, 160 skin, 166 fat) were derived simultaneously from a subset of well-phenotyped healthy female twins of the MuTHER resource. We discover an abundance of cis-eQTLs in each tissue similar to previous estimates (858 or 4.7% of genes). In addition, we apply factor analysis (FA) to remove effects of latent variables, thus more than doubling the number of our discoveries (1,822 eQTL genes). The unique study design (Matched Co-Twin Analysis--MCTA) permits immediate replication of eQTLs using co-twins (93%-98%) and validation of the considerable gain in eQTL discovery after FA correction. We highlight the challenges of comparing eQTLs between tissues. After verifying previous significance threshold-based estimates of tissue-specificity, we show their limitations given their dependency on statistical power. We propose that continuous estimates of the proportion of tissue-shared signals and direct comparison of the magnitude of effect on the fold change in expression are essential properties that jointly provide a biologically realistic view of tissue-specificity. Under this framework we demonstrate that 30% of eQTLs are shared among the three tissues studied, while another 29% appear exclusively tissue-specific. However, even among the shared eQTLs, a substantial proportion (10%-20%) have significant differences in the magnitude of fold change between genotypic classes across tissues. Our results underline the need to account for the complexity of eQTL tissue-specificity in an effort to assess consequences of such variants for complex traits.
Assuntos
Tecido Adiposo/metabolismo , Genes Reguladores/genética , Locos de Características Quantitativas/genética , Pele/metabolismo , Linhagem Celular , Células Cultivadas , Interpretação Estatística de Dados , Feminino , Perfilação da Expressão Gênica , Genótipo , Humanos , Especificidade de Órgãos/genética , Fenótipo , GêmeosRESUMO
The immune system comprises multiple cell lineages and heterogeneous subsets found in blood and tissues throughout the body. While human immune responses differ between sites and over age, the underlying sources of variation remain unclear as most studies are limited to peripheral blood. Here, we took a systems approach to comprehensively profile RNA and surface protein expression of over 1.25 million immune cells isolated from blood, lymphoid organs, and mucosal tissues of 24 organ donors aged 20-75 years. We applied a multimodal classifier to annotate the major immune cell lineages (T cells, B cells, innate lymphoid cells, and myeloid cells) and their corresponding subsets across the body, leveraging probabilistic modeling to define bases for immune variations across donors, tissue, and age. We identified dominant tissue-specific effects on immune cell composition and function across lineages for lymphoid sites, intestines, and blood-rich tissues. Age-associated effects were intrinsic to both lineage and site as manifested by macrophages in mucosal sites, B cells in lymphoid organs, and T and NK cells in blood-rich sites. Our results reveal tissue-specific signatures of immune homeostasis throughout the body and across different ages. This information provides a basis for defining the transcriptional underpinnings of immune variation and potential associations with disease-associated immune pathologies across the human lifespan.
RESUMO
BACKGROUND: International guidelines for variant interpretation in Mendelian disease set stringent criteria to report a variant as (likely) pathogenic, prioritising control of false-positive rate over test sensitivity and diagnostic yield. Genetic testing is also more likely informative in individuals with well-characterised variants from extensively studied European-ancestry populations. Inherited cardiomyopathies are relatively common Mendelian diseases that allow empirical calibration and assessment of this framework. METHODS: We compared rare variants in large hypertrophic cardiomyopathy (HCM) cohorts (up to 6179 cases) to reference populations to identify variant classes with high prior likelihoods of pathogenicity, as defined by etiological fraction (EF). We analysed the distribution of variants using a bespoke unsupervised clustering algorithm to identify gene regions in which variants are significantly clustered in cases. RESULTS: Analysis of variant distribution identified regions in which variants are significantly enriched in cases and variant location was a better discriminator of pathogenicity than generic computational functional prediction algorithms. Non-truncating variant classes with an EF ≥ 0.95 were identified in five established HCM genes. Applying this approach leads to an estimated 14-20% increase in cases with actionable HCM variants, i.e. variants classified as pathogenic/likely pathogenic that might be used for predictive testing in probands' relatives. CONCLUSIONS: When found in a patient confirmed to have disease, novel variants in some genes and regions are empirically shown to have a sufficiently high probability of pathogenicity to support a "likely pathogenic" classification, even without additional segregation or functional data. This could increase the yield of high confidence actionable variants, consistent with the framework and recommendations of current guidelines. The techniques outlined offer a consistent and unbiased approach to variant interpretation for Mendelian disease genetic testing. We propose adaptations to ACMG/AMP guidelines to incorporate such evidence in a quantitative and transparent manner.
Assuntos
Cardiomiopatia Hipertrófica/genética , Testes Genéticos/normas , Mutação , Cardiomiopatia Hipertrófica/patologia , Humanos , Guias de Prática Clínica como AssuntoRESUMO
BACKGROUND: Alcoholic cardiomyopathy (ACM) is defined by a dilated and impaired left ventricle due to chronic excess alcohol consumption. It is largely unknown which factors determine cardiac toxicity on exposure to alcohol. OBJECTIVES: This study sought to evaluate the role of variation in cardiomyopathy-associated genes in the pathophysiology of ACM, and to examine the effects of alcohol intake and genotype on dilated cardiomyopathy (DCM) severity. METHODS: The authors characterized 141 ACM cases, 716 DCM cases, and 445 healthy volunteers. The authors compared the prevalence of rare, protein-altering variants in 9 genes associated with inherited DCM. They evaluated the effect of genotype and alcohol consumption on phenotype in DCM. RESULTS: Variants in well-characterized DCM-causing genes were more prevalent in patients with ACM than control subjects (13.5% vs. 2.9%; p = 1.2 ×10-5), but similar between patients with ACM and DCM (19.4%; p = 0.12) and with a predominant burden of titin truncating variants (TTNtv) (9.9%). Separately, we identified an interaction between TTN genotype and excess alcohol consumption in a cohort of DCM patients not meeting ACM criteria. On multivariate analysis, DCM patients with a TTNtv who consumed excess alcohol had an 8.7% absolute reduction in ejection fraction (95% confidence interval: -2.3% to -15.1%; p < 0.007) compared with those without TTNtv and excess alcohol consumption. The presence of TTNtv did not predict phenotype, outcome, or functional recovery on treatment in ACM patients. CONCLUSIONS: TTNtv represent a prevalent genetic predisposition for ACM, and are also associated with a worse left ventricular ejection fraction in DCM patients who consume alcohol above recommended levels. Familial evaluation and genetic testing should be considered in patients presenting with ACM.
Assuntos
Cardiomiopatia Alcoólica/etiologia , Cardiomiopatia Alcoólica/genética , Cardiotoxicidade/etiologia , Cardiotoxicidade/genética , Predisposição Genética para Doença/etiologia , Predisposição Genética para Doença/genética , Adulto , Idoso , Cardiomiopatia Alcoólica/diagnóstico , Cardiotoxicidade/diagnóstico , Estudos de Coortes , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Estudos Prospectivos , AutorrelatoRESUMO
Variants in NDUFB11, which encodes a structural component of complex I of the mitochondrial respiratory chain (MRC), were recently independently reported to cause histiocytoid cardiomyopathy (histiocytoid CM) and microphthalmia with linear skin defects syndrome (MLS syndrome). Here we report an additional case of histiocytoid CM, which carries a de novo nonsense variant in NDUFB11 (ENST00000276062.8: c.262C > T; p.[Arg88*]) identified using whole-exome sequencing (WES) of a family trio. An identical variant has been previously reported in association with MLS syndrome. The case we describe here lacked the diagnostic features of MLS syndrome, but a detailed clinical comparison of the two cases revealed significant phenotypic overlap. Heterozygous variants in HCCS (which encodes an important mitochondrially targeted protein) and COX7B, which, like NDUFB11, encodes a protein of the MRC, have also previously been identified in MLS syndrome including a case with features of both MLS syndrome and histiocytoid CM. However, a systematic review of WES data from previously published histiocytoid CM cases, alongside four additional cases presented here for the first time, did not identify any variants in these genes. We conclude that NDUFB11 variants play a role in the pathogenesis of both histiocytoid CM and MLS and that these disorders are allelic (genetically related).
RESUMO
Sequence-based variation in gene expression is a key driver of disease risk. Common variants regulating expression in cis have been mapped in many expression quantitative trait locus (eQTL) studies, typically in single tissues from unrelated individuals. Here, we present a comprehensive analysis of gene expression across multiple tissues conducted in a large set of mono- and dizygotic twins that allows systematic dissection of genetic (cis and trans) and non-genetic effects on gene expression. Using identity-by-descent estimates, we show that at least 40% of the total heritable cis effect on expression cannot be accounted for by common cis variants, a finding that reveals the contribution of low-frequency and rare regulatory variants with respect to both transcriptional regulation and complex trait susceptibility. We show that a substantial proportion of gene expression heritability is trans to the structural gene, and we identify several replicating trans variants that act predominantly in a tissue-restricted manner and may regulate the transcription of many genes.