ABSTRACT
While it is widely thought that de novo mutations (DNMs) occur randomly, we previously showed that some DNMs are enriched because they are positively selected in the testes of aging men. These "selfish" mutations cause disorders with a shared presentation of features, including exclusive paternal origin, significant increase of the father's age, and high apparent germline mutation rate. To date, all known selfish mutations cluster within the components of the RTK-RAS-MAPK signaling pathway, a critical modulator of testicular homeostasis. Here, we demonstrate the selfish nature of the SMAD4 DNMs causing Myhre syndrome (MYHRS). By analyzing 16 informative trios, we show that MYHRS-causing DNMs originated on the paternally derived allele in all cases. We document a statistically significant epidemiological paternal age effect of 6.3 years excess for fathers of MYHRS probands. We developed an ultra-sensitive assay to quantify spontaneous MYHRS-causing SMAD4 variants in sperm and show that pathogenic variants at codon 500 are found at elevated levels in the sperm of most men and exhibit a strong positive correlation with the donor's age, indicative of a high apparent germline mutation rate. Finally, we performed in vitro assays to validate the peculiar functional behavior of the clonally selected DNMs and explored the basis of the pathophysiology of the different SMAD4 sperm-enriched variants. Taken together, these data provide compelling evidence that SMAD4, a gene operating outside the canonical RAS-MAPK signaling pathway, is associated with selfish spermatogonial selection and raise the possibility that other genes/pathways are under positive selection in the aging human testis.
Subjects
Germ-Line Mutation , Intellectual Disability , Smad4 Protein , Humans , Male , Smad4 Protein/genetics , Intellectual Disability/genetics , Contracture/genetics , Adult , Facies , Spermatozoa/metabolism , Spermatozoa/pathology , Cryptorchidism/genetics , Growth Disorders/genetics , Congenital Hand Deformities/genetics , Genetic Selection , Alleles , Paternal Age , Testis/pathology , Testis/metabolism
ABSTRACT
BACKGROUND: Exercise-induced cardiac remodeling can be profound, resulting in clinical overlap with dilated cardiomyopathy, yet the significance of reduced ejection fraction (EF) in athletes is unclear. The aim is to assess the prevalence, clinical consequences, and genetic predisposition of reduced EF in athletes. METHODS: Young endurance athletes were recruited from elite training programs and underwent comprehensive cardiac phenotyping and genetic testing. Those with reduced EF using cardiac magnetic resonance imaging (defined as left ventricular EF <50%, or right ventricular EF <45%, or both) were compared with athletes with normal EF. A validated polygenic risk score for indexed left ventricular end-systolic volume (LVESVi-PRS), previously associated with dilated cardiomyopathy, was assessed. Clinical events were recorded over a mean of 4.4 years. RESULTS: Of the 281 elite endurance athletes (22±8 years, 79.7% male) undergoing comprehensive assessment, 44 of 281 (15.7%) had reduced left ventricular EF (N=12; 4.3%), right ventricular EF (N=14; 5.0%), or both (N=18; 6.4%). Reduced EF was associated with a higher burden of ventricular premature beats (13.6% versus 3.8% with >100 ventricular premature beats/24 h; P=0.008) and lower left ventricular global longitudinal strain (-17%±2% versus -19%±2%; P<0.001). Athletes with reduced EF had a higher mean LVESVi-PRS (0.57±0.13 versus 0.51±0.14; P=0.009) with athletes in the top decile of LVESVi-PRS having an 11-fold increase in the likelihood of reduced EF compared with those in the bottom decile (P=0.034). Male sex and higher LVESVi-PRS were the only significant predictors of reduced EF in a multivariate analysis that included age and fitness. During follow-up, no athletes developed symptomatic heart failure or arrhythmias. Two athletes died, 1 from trauma and 1 from sudden cardiac death, the latter having a reduced right ventricular EF and a LVESVi-PRS >95%. 
CONCLUSIONS: Reduced EF occurs in approximately 1 in 6 elite endurance athletes and is related to genetic predisposition in addition to exercise training. Genetic and imaging markers may help identify endurance athletes in whom scrutiny about long-term clinical outcomes may be appropriate. REGISTRATION: URL: https://www.anzctr.org.au/Trial/Registration/TrialReview.aspx?id=374976&isReview=true; Unique identifier: ACTRN12618000716268.
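The imaging definition used in the METHODS (left ventricular EF <50%, right ventricular EF <45%, or both) and the reported counts can be written out as a short Python sketch. This is an illustration only: the function name `classify_reduced_ef` and the category labels are hypothetical and are not taken from the study.

```python
def classify_reduced_ef(lvef: float, rvef: float) -> str:
    """Classify by the abstract's cut-offs; both EF values are in percent."""
    reduced_lv = lvef < 50.0  # left ventricular EF below 50%
    reduced_rv = rvef < 45.0  # right ventricular EF below 45%
    if reduced_lv and reduced_rv:
        return "both"
    if reduced_lv:
        return "LV only"
    if reduced_rv:
        return "RV only"
    return "normal"


# Counts reported in the RESULTS section of the abstract.
counts = {"LV only": 12, "RV only": 14, "both": 18}
total_athletes = 281

reduced_total = sum(counts.values())
prevalence_pct = round(100 * reduced_total / total_athletes, 1)
```

The three subgroup counts sum to the 44 athletes with reduced EF, and 44 of 281 corresponds to the reported 15.7%, or roughly 1 in 6.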
Subjects
Athletes , Dilated Cardiomyopathy , Stroke Volume , Humans , Male , Dilated Cardiomyopathy/genetics , Dilated Cardiomyopathy/physiopathology , Dilated Cardiomyopathy/diagnostic imaging , Female , Adult , Young Adult , Physical Endurance/genetics , Adolescent , Genetic Predisposition to Disease , Ventricular Remodeling , Left Ventricular Function
ABSTRACT
Single cell RNA sequencing of human full thickness Crohn's disease (CD) small bowel resection specimens was used to identify potential therapeutic targets for stricturing (S) CD. Using an unbiased approach, 16 cell lineages were assigned within 14,539 sequenced cells from patient-matched SCD and non-stricturing (NSCD) preparations. SCD and NSCD contained identical cell types. Amongst immune cells, B cells and plasma cells were selectively increased in SCD samples. B cell subsets suggested formation of tertiary lymphoid tissue in SCD and compared with NSCD there was an increase in IgG, and a decrease in IgA plasma cells, consistent with their potential role in CD fibrosis. Two Lumican-positive fibroblast subtypes were identified and subclassified based on expression of selectively enriched genes as fibroblast clusters (C) 12 and C9. Cells within these clusters expressed the profibrotic genes Decorin (C12) and JUN (C9). C9 cells expressed ACTA2; ECM genes COL4A1, COL4A2, COL15A1, COL6A3, COL18A1 and ADAMDEC1; LAMB1 and GREM1. GO and KEGG Biological terms showed extracellular matrix and stricture organization associated with C12 and C9, and regulation of WNT pathway genes with C9. Trajectory and differential gene analysis of C12 and C9 identified four sub-clusters. Intra sub-cluster gene analysis detected 13 co-regulated gene modules that aligned along predicted pseudotime trajectories. CXCL14 and ADAMDEC1 were key markers in module 1. Our findings support further investigation of fibroblast heterogeneity and interactions with local and circulating immune cells at earlier time points in fibrosis progression. Breaking these interactions by targeting one or other population may improve therapeutic management for SCD.
Subjects
B-Lymphocytes , Crohn Disease , Fibroblasts , Single-Cell Analysis , Humans , Crohn Disease/genetics , Crohn Disease/pathology , Crohn Disease/metabolism , Fibroblasts/metabolism , Fibroblasts/pathology , Single-Cell Analysis/methods , B-Lymphocytes/metabolism , B-Lymphocytes/immunology , B-Lymphocytes/pathology , Male , Female , Adult , Gene Expression Profiling
ABSTRACT
BACKGROUND: A wide range of tools are available for the detection of copy number variants (CNVs) from whole-genome sequencing (WGS) data. However, none of them focus on clinically relevant CNVs, such as those that are associated with known genetic syndromes. Such variants are often large in size, typically 1-5 Mb, but currently available CNV callers have been developed and benchmarked for the discovery of smaller variants. Thus, the ability of these programs to detect tens of real syndromic CNVs remains largely unknown. RESULTS: Here we present ConanVarvar, a tool which implements a complete workflow for the targeted analysis of large germline CNVs from WGS data. ConanVarvar comes with an intuitive R Shiny graphical user interface and annotates identified variants with information about 56 associated syndromic conditions. We benchmarked ConanVarvar and four other programs on a dataset containing real and simulated syndromic CNVs larger than 1 Mb. In comparison to other tools, ConanVarvar reports 10-30 times fewer false-positive variants without compromising sensitivity and is quicker to run, especially on large batches of samples. CONCLUSIONS: ConanVarvar is a useful instrument for primary analysis in disease sequencing studies, where large CNVs could be the cause of disease.
Subjects
DNA Copy Number Variations , Germ Cells , Whole Genome Sequencing , Workflow , High-Throughput Nucleotide Sequencing
ABSTRACT
Pre-B cell leukemia factor 1 (PBX1) is an essential developmental transcription factor, mutations in which have recently been associated with CAKUTHED syndrome, characterized by multiple congenital defects including congenital heart disease (CHD). During analysis of a whole-exome-sequenced cohort of heterogeneous CHD patients, we identified a de novo missense variant, PBX1:c.551G>C p.R184P, in a patient with tetralogy of Fallot with absent pulmonary valve and extra-cardiac phenotypes. Functional analysis of this variant by creating a CRISPR-Cas9 gene-edited mouse model revealed multiple congenital anomalies. Congenital heart defects (persistent truncus arteriosus and ventricular septal defect), hypoplastic lungs, hypoplastic/ectopic kidneys, aplastic adrenal glands and spleen, as well as atretic trachea and palate defects were observed in the homozygous mutant embryos at multiple stages of development. We also observed developmental anomalies in a proportion of heterozygous embryos, suggestive of a dominant mode of inheritance. Analysis of gene expression and protein levels revealed that although Pbx1 transcripts are higher in homozygotes, amounts of PBX1 protein are significantly decreased. Here, we have presented the first functional model of a missense PBX1 variant and provided strong evidence that p.R184P is disease-causal. Our findings also expand the phenotypic spectrum associated with pathogenic PBX1 variants in both humans and mice.
Subjects
CRISPR-Cas Systems/genetics , Congenital Heart Defects/genetics , Pre-B-Cell Leukemia Transcription Factor 1/genetics , Persistent Truncus Arteriosus/genetics , Adult , Animals , Animal Disease Models , Exome/genetics , Female , Congenital Heart Defects/pathology , Heterozygote , Humans , Infant , Male , Mice , Missense Mutation/genetics , Pedigree , Phenotype , Persistent Truncus Arteriosus/pathology , Exome Sequencing
ABSTRACT
Congenital heart disease (CHD) is the most common birth defect and brings with it significant mortality and morbidity. The application of exome and genome sequencing has greatly improved the rate of genetic diagnosis for CHD but the cause in the majority of cases remains uncertain. It is clear that genetics, as well as environmental influences, play roles in the aetiology of CHD. Here we address both these aspects of causation with respect to the Notch signalling pathway. In our CHD cohort, variants in core Notch pathway genes account for 20% of those that cause disease, a rate that did not increase with the inclusion of genes of the broader Notch pathway and its regulators. This is reinforced by case-control burden analysis where variants in Notch pathway genes are enriched in CHD patients. This enrichment is due to variation in NOTCH1. Functional analysis of some novel missense NOTCH1 and DLL4 variants in cultured cells demonstrates reduced signalling activity, allowing variant reclassification. Although loss-of-function variants in DLL4 are known to cause Adams-Oliver syndrome, this is the first report of a hypomorphic DLL4 allele as a cause of isolated CHD. Finally, we demonstrate a gene-environment interaction in mouse embryos between Notch1 heterozygosity and low oxygen- or anti-arrhythmic drug-induced gestational hypoxia, resulting in an increased incidence of heart defects. This implies that exposure to environmental insults such as hypoxia could explain variable expressivity and penetrance of observed CHD in families carrying Notch pathway variants.
Subjects
Gene-Environment Interaction , Genetic Predisposition to Disease , Genomics/methods , Congenital Heart Defects/pathology , Mutation , Notch1 Receptor/genetics , Animals , Case-Control Studies , Female , Congenital Heart Defects/etiology , Congenital Heart Defects/genetics , Humans , Male , Mice , Inbred C57BL Mice , Knockout Mice , Exome Sequencing
ABSTRACT
The genetic causes of multiple congenital anomalies are incompletely understood. Here, we report novel heterozygous predicted loss-of-function (LoF) and predicted damaging missense variants in the WW domain binding protein 11 (WBP11) gene in seven unrelated families with a variety of overlapping congenital malformations, including cardiac, vertebral, tracheo-esophageal, renal and limb defects. WBP11 encodes a component of the spliceosome with the ability to activate pre-messenger RNA splicing. We generated a Wbp11 null allele in mouse using CRISPR-Cas9 targeting. Wbp11 homozygous null embryos die prior to E8.5, indicating that Wbp11 is essential for development. Fewer Wbp11 heterozygous null mice are found than expected due to embryonic and postnatal death. Importantly, Wbp11 heterozygous null mice are small and exhibit defects in axial skeleton, kidneys and esophagus, similar to the affected individuals, supporting the role of WBP11 haploinsufficiency in the development of congenital malformations in humans. LoF WBP11 variants should be considered as a possible cause of VACTERL association as well as isolated Klippel-Feil syndrome, renal agenesis or esophageal atresia.
Subjects
Multiple Abnormalities/genetics , DNA-Binding Proteins/genetics , Haploinsufficiency/genetics , Kidney/metabolism , RNA Splicing Factors/genetics , Multiple Abnormalities/pathology , Anal Canal/abnormalities , Anal Canal/pathology , Animals , Esophagus/abnormalities , Esophagus/metabolism , Esophagus/pathology , Congenital Heart Defects/genetics , Congenital Heart Defects/pathology , Heterozygote , Humans , Kidney/abnormalities , Kidney/pathology , Congenital Limb Deformities/genetics , Congenital Limb Deformities/pathology , Loss of Function Mutation/genetics , Mice , RNA Splicing/genetics , Spine/abnormalities , Spine/pathology , Trachea/abnormalities , Trachea/pathology
ABSTRACT
Congenital heart disease (CHD) has a multifactorial aetiology, raising the possibility of an underlying genetic burden that predisposes to disease but also to variable expression, including variation in disease severity, and incomplete penetrance. The findings of this whole genome sequencing (WGS) study indicate that complex, critical CHD is distinct from other types of disease due to increased genetic burden in common variation, specifically among established CHD genes. Additionally, these findings highlight associations with regulatory genes and environmental "stressors" in the final presentation of disease.
Subjects
Congenital Heart Defects , Humans , Congenital Heart Defects/genetics
ABSTRACT
BACKGROUND: The most common cyanotic congenital heart disease (CHD) requiring management as a neonate is transposition of great arteries (TGA). Clinically, up to 50% of TGA patients develop some form of neurodevelopmental disability (NDD), thought to have a significant genetic component. A "ciliopathy" and links with laterality disorders have been proposed. This first report of whole genome sequencing in TGA sought to identify clinically relevant variants contributing to heart, brain and laterality defects. METHODS: Initial whole genome sequencing analyses on 100 TGA patients focussed on established disease genes related to CHD (n = 107), NDD (n = 659) and heterotaxy (n = 74). Single variant as well as copy number variant analyses were conducted. Variant pathogenicity was assessed using the American College of Medical Genetics and Genomics-Association for Molecular Pathology guidelines. RESULTS: Fifty-five putatively damaging variants were identified in established disease genes associated with CHD, NDD and heterotaxy; however, no clinically relevant variants could be attributed to disease. Notably, case-control analyses identified significantly more predicted-damaging, silent and total variants in TGA cases than healthy controls in established CHD genes (P < .001), NDD genes (P < .001) as well as across the three gene panels (P < .001). CONCLUSION: We present compelling evidence that the majority of TGA is not caused by monogenic rare variants and is most likely oligogenic and/or polygenic in nature, highlighting the complex genetic architecture and multifactorial influences on this CHD sub-type and its long-term sequelae. Assessment of variant burden in key heart, brain and/or laterality genes may be required to unravel the genetic contributions to TGA and related disabilities.
Subjects
Congenital Heart Defects , Transposition of Great Vessels , Arteries , Brain/diagnostic imaging , Congenital Heart Defects/genetics , Humans , Newborn Infant , Transposition of Great Vessels/genetics , Whole Genome Sequencing
ABSTRACT
Mosaic mutations present in the germline have important implications for reproductive risk and disease transmission. We previously demonstrated a phenomenon occurring in the male germline, whereby specific mutations arising spontaneously in stem cells (spermatogonia) lead to clonal expansion, resulting in elevated mutation levels in sperm over time. This process, termed "selfish spermatogonial selection," explains the high spontaneous birth prevalence and strong paternal age-effect of disorders such as achondroplasia and Apert, Noonan and Costello syndromes, with direct experimental evidence currently available for specific positions of six genes (FGFR2, FGFR3, RET, PTPN11, HRAS, and KRAS). We present a discovery screen to identify novel mutations and genes showing evidence of positive selection in the male germline, by performing massively parallel simplex PCR using RainDance technology to interrogate mutational hotspots in 67 genes (51.5 kb in total) in 276 biopsies of testes from five men (median age, 83 yr). Following ultradeep sequencing (about 16,000×), development of a low-frequency variant prioritization strategy, and targeted validation, we identified 61 distinct variants present at frequencies as low as 0.06%, including 54 variants not previously directly associated with selfish selection. The majority (80%) of variants identified have previously been implicated in developmental disorders and/or oncogenesis and include mutations in six newly associated genes (BRAF, CBL, MAP2K1, MAP2K2, RAF1, and SOS1), all of which encode components of the RAS-MAPK pathway and activate signaling. Our findings extend the link between mutations dysregulating the RAS-MAPK pathway and selfish selection, and show that the aging male germline is a repository for such deleterious mutations.
Subjects
Mitogen-Activated Protein Kinases/metabolism , Mutation , Signal Transduction , Testis/metabolism , ras Proteins/metabolism , Aged , Aged 80 and over , Genetic Variation , Humans , Male , Middle Aged
ABSTRACT
MOTIVATION: In 2018, Google published an innovative variant caller, DeepVariant, which converts pileups of sequence reads into images and uses a deep neural network to identify single-nucleotide variants and small insertion/deletions from next-generation sequencing data. This approach outperforms existing state-of-the-art tools. However, DeepVariant was designed to call variants within a single sample. In disease sequencing studies, the ability to examine a family trio (father-mother-affected child) provides greater power for disease mutation discovery. RESULTS: To further improve DeepVariant's variant calling accuracy in family-based sequencing studies, we have developed a family-based variant calling pipeline, dv-trio, which incorporates the trio information from the Mendelian genetic model into variant calling based on DeepVariant. AVAILABILITY AND IMPLEMENTATION: dv-trio is available via an open source BSD3 license at GitHub (https://github.com/VCCRI/dv-trio/). CONTACT: e.giannoulatou@victorchang.edu.au. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
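The Mendelian genetic model mentioned in the RESULTS can be illustrated with a minimal Python sketch of a trio consistency check. This is not dv-trio's implementation: the function name `is_mendelian_consistent` and the tuple encoding of genotypes are assumptions made for illustration, and the check covers only diploid autosomal sites.

```python
from itertools import product


def is_mendelian_consistent(father, mother, child):
    """Return True if the child's genotype can be formed by taking one
    allele from each parent.

    Genotypes are tuples of allele indices, e.g. (0, 1) for a heterozygous
    call (0 = reference allele, 1 = first alternate allele). This encoding
    is an assumption of the sketch.
    """
    # Every unordered pair of (one paternal allele, one maternal allele).
    possible = {tuple(sorted(pair)) for pair in product(father, mother)}
    return tuple(sorted(child)) in possible


# Both parents homozygous reference, child heterozygous:
# either a candidate de novo variant or a calling error.
candidate_de_novo = not is_mendelian_consistent((0, 0), (0, 0), (0, 1))

# Heterozygous father, homozygous-reference mother, heterozygous child:
# consistent with inheritance.
inherited = is_mendelian_consistent((0, 1), (0, 0), (0, 1))
```

A check like this flags sites where the three single-sample calls disagree with inheritance. How such sites are then resolved or re-scored is specific to each pipeline and is not shown here.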
Subjects
High-Throughput Nucleotide Sequencing , INDEL Mutation , Child , Humans , Mutation , Computer Neural Networks , Software
ABSTRACT
Transposed elements (TEs) have dramatically shaped evolution of the exon-intron structure and significantly contributed to morbidity, but how recent TE invasions into older TEs cooperate in generating new coding sequences is poorly understood. Employing an updated repository of new exon-intron boundaries induced by pathogenic mutations, termed DBASS, here we identify novel TE clusters that facilitated exon selection. To explore the extent to which such TE exons maintain RNA secondary structure of their progenitors, we carried out structural studies with a composite exon that was derived from a long terminal repeat (LTR78) and AluJ and was activated by a C > T mutation optimizing the 5' splice site. Using a combination of SHAPE, DMS and enzymatic probing, we show that the disease-causing mutation disrupted a conserved AluJ stem that evolved from helix 3.3 (or 5b) of 7SL RNA, liberating a primordial GC 5' splice site from the paired conformation for interactions with the spliceosome. The mutation also reduced flexibility of conserved residues in adjacent exon-derived loops of the central Alu hairpin, revealing a cross-talk between traditional and auxiliary splicing motifs that evolved from opposite termini of 7SL RNA and were approximated by Watson-Crick base-pairing already in organisms without spliceosomal introns. We also identify existing Alu exons activated by the same RNA rearrangement. Collectively, these results provide valuable TE exon models for studying formation and kinetics of pre-mRNA building blocks required for splice-site selection and will be useful for fine-tuning auxiliary splicing motifs and exon and intron size constraints that govern aberrant splice-site activation.
Subjects
DNA Transposable Elements , RNA Splice Sites , RNA Splicing , Alleles , Base Sequence , Exons , Gene Expression Regulation , Humans , Introns , Mutation , Nucleic Acid Conformation , RNA Sequence Analysis , Genetic Transcription
ABSTRACT
The burden of cardiovascular disease in women is being increasingly appreciated. Nevertheless, both clinicians and the general public are largely unaware that cardiovascular disease is the leading cause of death worldwide in women in all countries and that outcomes after a heart attack are worse for women than men. Of note, certain types of cardiovascular disease have a predilection for women, including spontaneous coronary artery dissection (SCAD) and fibromuscular dysplasia (FMD). Although uncommon, SCAD is being increasingly recognised as the cause of an acute coronary syndrome (ACS) and can recur. It is a potentially fatal, under-diagnosed condition that affects relatively young women, who often have few traditional risk factors, and is the commonest cause of a myocardial infarction associated with pregnancy. In contrast, FMD often remains silent but when manifested can also cause major sequelae, including renal infarction, stroke, cervical artery dissection and gut infarction. Here we provide an update on the diagnosis, aetiology and management of these important disorders that overwhelmingly affect women.
Subjects
Coronary Vessel Anomalies/etiology , Coronary Vessels/diagnostic imaging , Fibromuscular Dysplasia/complications , Vascular Diseases/congenital , Coronary Angiography , Coronary Vessel Anomalies/diagnosis , Female , Fibromuscular Dysplasia/diagnosis , Humans , Risk Factors , Vascular Diseases/diagnosis , Vascular Diseases/etiology
ABSTRACT
BACKGROUND: Congenital malformations can be manifested as combinations of phenotypes that co-occur more often than expected by chance. In many such cases, it has proved difficult to identify a genetic cause. We sought the genetic cause of cardiac, vertebral, and renal defects, among others, in unrelated patients. METHODS: We used genomic sequencing to identify potentially pathogenic gene variants in families in which a person had multiple congenital malformations. We tested the function of the variant by using assays of in vitro enzyme activity and by quantifying metabolites in patient plasma. We engineered mouse models with similar variants using the CRISPR (clustered regularly interspaced short palindromic repeats)-Cas9 system. RESULTS: Variants were identified in two genes that encode enzymes of the kynurenine pathway, 3-hydroxyanthranilic acid 3,4-dioxygenase (HAAO) and kynureninase (KYNU). Three patients carried homozygous variants predicting loss-of-function changes in the HAAO or KYNU proteins (HAAO p.D162*, HAAO p.W186*, or KYNU p.V57Efs*21). Another patient carried heterozygous KYNU variants (p.Y156* and p.F349Kfs*4). The mutant enzymes had greatly reduced activity in vitro. Nicotinamide adenine dinucleotide (NAD) is synthesized de novo from tryptophan through the kynurenine pathway. The patients had reduced levels of circulating NAD. Defects similar to those in the patients developed in the embryos of Haao-null or Kynu-null mice owing to NAD deficiency. In null mice, the prevention of NAD deficiency during gestation averted defects. CONCLUSIONS: Disruption of NAD synthesis caused a deficiency of NAD and congenital malformations in humans and mice. Niacin supplementation during gestation prevented the malformations in mice. (Funded by the National Health and Medical Research Council of Australia and others.).
Subjects
3-Hydroxyanthranilate 3,4-Dioxygenase/genetics , Congenital Abnormalities/genetics , Dietary Supplements , Hydrolases/genetics , NAD/deficiency , Niacin/therapeutic use , 3-Hydroxyanthranilate 3,4-Dioxygenase/metabolism , Anal Canal/abnormalities , Animals , Congenital Abnormalities/prevention & control , Animal Disease Models , Esophagus/abnormalities , Female , Congenital Heart Defects/genetics , Congenital Heart Defects/prevention & control , Humans , Hydrolases/metabolism , Kidney/abnormalities , Congenital Limb Deformities/genetics , Congenital Limb Deformities/prevention & control , Male , Mice , Knockout Mice , Mutation , NAD/biosynthesis , NAD/genetics , DNA Sequence Analysis , Spine/abnormalities , Trachea/abnormalities
ABSTRACT
MOTIVATION: In silico prediction tools are essential for identifying variants which create or disrupt cis-splicing motifs. However, there are limited options for genome-scale discovery of splice-altering variants. RESULTS: We have developed Spliceogen, a highly scalable pipeline integrating predictions from some of the individually best performing models for splice motif prediction: MaxEntScan, GeneSplicer, ESRseq and Branchpointer. AVAILABILITY AND IMPLEMENTATION: Spliceogen is available as a command line tool which accepts VCF/BED inputs and handles both single nucleotide variants (SNVs) and indels (https://github.com/VCCRI/Spliceogen). SNV databases with prediction scores are also available, covering all possible SNVs at all genomic positions within all Gencode-annotated multi-exon transcripts. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Subjects
RNA Splicing , Software , Exons , Genomics , INDEL Mutation
ABSTRACT
BACKGROUND: The human gut microbiome plays a critical role in the carcinogenesis of colorectal cancer (CRC). However, a comprehensive analysis of the interaction between the host and microbiome is still lacking. RESULTS: We found correlations between the change in abundance of microbial taxa, butyrate-related colonic metabolites, and methylation-associated host gene expression in colonic tumour mucosa tissues compared with the adjacent normal mucosa tissues. The increase of genus Fusobacterium abundance was correlated with a decrease in the level of 4-hydroxybutyric acid (4-HB) and expression of immune-related peptidase inhibitor 16 (PI16), Fc Receptor Like A (FCRLA) and Lymphocyte Specific Protein 1 (LSP1). The decrease in the abundance of another potentially 4-HB-associated genus, Prevotella 2, was also found to be correlated with the down-regulated expression of metallothionein 1M (MT1M). Additionally, the increase of glutamic acid-related family Halomonadaceae was correlated with the decreased expression of reelin (RELN). The decreased abundances of genus Paeniclostridium and genus Enterococcus were correlated with increased lactic acid level, and were also linked to the expression change of Phospholipase C Beta 1 (PLCB1) and Immunoglobulin Superfamily Member 9 (IGSF9), respectively. Interestingly, 4-HB, glutamic acid and lactic acid are all butyrate precursors, which may modify gene expression by epigenetic regulation such as DNA methylation. CONCLUSIONS: Our study identified associations between previously reported CRC-related microbial taxa, butyrate-related metabolites and DNA methylation-associated gene expression in tumour and normal colonic mucosa tissues from CRC patients, suggesting a possible mechanism for the role of the microbiome in the carcinogenesis of CRC. In addition, these findings offer insight into potential new biomarkers, therapeutic and/or prevention strategies for CRC.
Subjects
Colorectal Neoplasms/microbiology , Gastrointestinal Microbiome/physiology , Intestinal Mucosa/microbiology , Bacteria/classification , Bacteria/genetics , Bacteria/isolation & purification , Bacteria/metabolism , Butyrates/metabolism , Colon/metabolism , Colon/microbiology , Colon/pathology , Colorectal Neoplasms/genetics , Colorectal Neoplasms/metabolism , Colorectal Neoplasms/pathology , DNA Methylation , Genetic Epigenesis , Neoplastic Gene Expression Regulation , Humans , Intestinal Mucosa/metabolism , Metabolome , Reelin Protein , Transcriptome
ABSTRACT
Bacteremia (bacterial bloodstream infection) is a major cause of illness and death in sub-Saharan Africa but little is known about the role of human genetics in susceptibility. We conducted a genome-wide association study of bacteremia susceptibility in more than 5,000 Kenyan children as part of the Wellcome Trust Case Control Consortium 2 (WTCCC2). Both the blood-culture-proven bacteremia case subjects and healthy infants as controls were recruited from Kilifi, on the east coast of Kenya. Streptococcus pneumoniae is the most common cause of bacteremia in Kilifi and was thus the focus of this study. We identified an association between polymorphisms in a long intergenic non-coding RNA (lincRNA) gene (AC011288.2) and pneumococcal bacteremia and replicated the results in the same population (p combined = 1.69 × 10⁻⁹; OR = 2.47, 95% CI = 1.84-3.31). The susceptibility allele is African specific, derived rather than ancestral, and occurs at low frequency (2.7% in control subjects and 6.4% in case subjects). Our further studies showed AC011288.2 expression only in neutrophils, a cell type that is known to play a major role in pneumococcal clearance. Identification of this novel association will further focus research on the role of lincRNAs in human infectious disease.
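As a rough plausibility check, the reported allele frequencies (6.4% in case subjects, 2.7% in control subjects) can be turned into a crude allelic odds ratio. The Python sketch below is only an illustration: the function name `allelic_odds_ratio` is hypothetical, and the published OR of 2.47 comes from the study's combined association analysis, not from this calculation.

```python
def allelic_odds_ratio(freq_cases: float, freq_controls: float) -> float:
    """Crude allelic odds ratio from allele frequencies given as fractions."""
    odds_cases = freq_cases / (1.0 - freq_cases)
    odds_controls = freq_controls / (1.0 - freq_controls)
    return odds_cases / odds_controls


# Allele frequencies quoted in the abstract.
crude_or = allelic_odds_ratio(freq_cases=0.064, freq_controls=0.027)
```

The crude value is about 2.46, close to the reported estimate and well inside its 95% confidence interval of 1.84-3.31.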
Subjects
Bacteremia/genetics , Pneumococcal Pneumonia/genetics , Genetic Polymorphism/genetics , Long Noncoding RNA/genetics , Streptococcus pneumoniae/genetics , Adolescent , Bacteremia/microbiology , Bacteremia/pathology , Case-Control Studies , Child , Preschool Child , Genome-Wide Association Study , Humans , Infant , Newborn Infant , Kenya/epidemiology , Pneumococcal Pneumonia/microbiology , Pneumococcal Pneumonia/pathology , Risk Factors
ABSTRACT
PURPOSE: Congenital heart disease (CHD) affects up to 1% of live births. However, a genetic diagnosis is not made in most cases. The purpose of this study was to assess the outcomes of genome sequencing (GS) of a heterogeneous cohort of CHD patients. METHODS: Ninety-seven families with probands born with CHD requiring surgical correction were recruited for genome sequencing. At minimum, a proband-parents trio was sequenced per family. GS data were analyzed via a two-tiered method: application of a high-confidence gene screen (hcCHD), and comprehensive analysis. Identified variants were assessed for pathogenicity using the American College of Medical Genetics and Genomics-Association for Molecular Pathology (ACMG-AMP) guidelines. RESULTS: Clinically relevant genetic variants in known and emerging CHD genes were identified. The hcCHD screen identified a clinically actionable variant in 22% of families. Subsequent comprehensive analysis identified a clinically actionable variant in an additional 9% of families in genes with recent disease associations. Overall, this two-tiered approach provided a clinically relevant variant for 31% of families. CONCLUSIONS: Interrogating GS data using our two-tiered method allowed identification of variants with high clinical utility in a third of our heterogeneous cohort. However, association of emerging genes with CHD etiology, and development of novel technologies for variant assessment and interpretation, will increase diagnostic yield during future reassessment of our GS data.
Subjects
Genetic Testing/methods , Congenital Heart Defects/diagnosis , Congenital Heart Defects/genetics , Base Sequence/genetics , Chromosome Mapping/methods , Cohort Studies , Exome/genetics , Family , Female , Genetic Predisposition to Disease/genetics , Genetic Variation/genetics , Genomics/methods , High-Throughput Nucleotide Sequencing/methods , Humans , Male , Mutation/genetics , Parents , DNA Sequence Analysis/methods , Whole Genome Sequencing/methods
ABSTRACT
PURPOSE: We evaluated strategies for identifying disease-causing variants in genetic testing for dilated cardiomyopathy (DCM). METHODS: Cardiomyopathy gene panel testing was performed in 532 DCM patients and 527 healthy control subjects. Rare variants in 41 genes were stratified using variant-level and gene-level characteristics. RESULTS: A majority of DCM cases and controls carried rare protein-altering cardiomyopathy gene variants. Variant-level characteristics alone had limited discriminative value. Differentiation between groups was substantially improved by addition of gene-level information that incorporated ranking of genes based on literature evidence for disease association. The odds of DCM were increased nearly 9-fold for truncating variants or high-impact missense variants in the subset of 14 genes that had the strongest biological links to DCM (P < 0.0001). For some of these genes, DCM-associated variants appeared to be clustered in key protein functional domains. Multiple rare variants were present in many family probands; however, there was generally only one "driver" pathogenic variant that cosegregated with disease. CONCLUSION: Rare variants in cardiomyopathy genes can be effectively stratified by combining variant-level and gene-level information. Prioritization of genes based on their a priori likelihood of disease causation is a key factor in identifying clinically actionable variants in cardiac genetic testing.
Subjects
Dilated Cardiomyopathy/genetics , Genetic Testing , High-Throughput Nucleotide Sequencing , Rare Diseases/genetics , Dilated Cardiomyopathy/diagnosis , Dilated Cardiomyopathy/pathology , Female , Genetic Predisposition to Disease , Humans , Male , Middle Aged , Missense Mutation , Pedigree , Rare Diseases/diagnosis , Rare Diseases/pathology
ABSTRACT
BACKGROUND: Tissue genomic heterogeneity (t-HET) in patients with epithelial ovarian cancer (OVCA) is related to tissue plasticity, i.e., flexibility to adapt to adverse molecular environments. Here, we interrogated the presence and clinical relevance of OVCA t-HET. METHODS: We applied high-depth (>2000×) sequencing on 297 paraffin tissue samples (fallopian tubes, ovaries, intra-abdominal metastases) from 71 treatment-naïve patients who subsequently received first-line platinum-based chemotherapy. Based on tissue mutation patterns, we distinguished tissue genotypes into: no mutation (33/297 samples; 11.1%), stable (173; 58.2%) and unstable (91; 30.7%). We profiled genotypes per patient and assessed t-HET in 69 patients. Predicted pathogenic mutations refer to germline and/or tissues. RESULTS: Among all 71 patients, 46 (64.8%) had pathogenic BRCA1 mutations and 15 (21.7%) had BRCA1/2 disruption (i.e., pathogenic mutations with position-LOH). We classified 29 patients with t-HET (42%), all with pathogenic BRCA1; t-HET was observed in 64% with such mutations (p < 0.001). As opposed to non-t-HET, matched tissues in t-HET shared pathogenic BRCA1 (p < 0.001) but not BRCA2 and TP53. Germline BRCA1 mutations in tissues exhibited position-LOH; heterozygous status; or, partial loss of the inherited allele accompanied by additional clonal mutations. Patients with t-HET had worse outcome (log-rank p = 0.048 [progression-free]; p = 0.037 [overall survival]), including 12/15 patients with disrupted BRCA1/2 and 3 BRCA1 carriers with partial germline loss in tissues. CONCLUSIONS: Pathogenic BRCA1 mutations appear necessary but may not be sufficient for the establishment of t-HET. t-HET may be associated with worse outcome, including in patients with disrupted BRCA1/2, which is usually considered as a favourable marker. OVCA t-HET may need to be addressed for treatment decisions.